From the perceptron to the transformer — a ground-up exploration of how neural networks learn, predict, and generate, and what it actually means for a machine to understand.
Weights, activations, and backpropagation — the mathematical engine behind every neural network, explained from first principles.
The transformer architecture dissected — self-attention, positional encoding, and why this mechanism changed everything in AI.
How next-token prediction at billion-parameter scale gives rise to emergent reasoning — and why nobody fully understands why it works.