Research & Writing

Blog

Dispatches from the edge of chaos — on nonlinear dynamics, AI, emergence, and the mathematics of complex systems.

CNN

WaveNet: Architecture & How It Works

WaveNet is a deep generative model for raw audio waveforms that uses dilated causal convolutions to model long-range temporal dependencies, producing remarkably natural-sounding speech and revolutionizing

2 min read
Transformer

Whisper: Architecture & How It Works

Whisper is OpenAI's general-purpose speech recognition model that approaches human-level robustness and accuracy by training on 680,000 hours of weakly supervised audio data

2 min read