AI
An archive of posts in this category
| Date | Post |
|---|---|
| Jan 08, 2026 | Unigram toy model is surprisingly rich -- representation collapse, scaling laws, learning rate schedule |
| Jan 07, 2026 | Fine-tuning with sparse updates? A toy teacher-student Setup |
| Jan 06, 2026 | Multi-Head Cross Entropy Loss |
| Jan 05, 2026 | What's the difference -- (physics of) AI, physics, math and interpretability |
| Jan 04, 2026 | Representation anisotropy from nonlinear functions |
| Jan 03, 2026 | Training dynamics of A Single ReLU Neuron |
| Jan 02, 2026 | Physics of AI – How to Begin |
| Jan 01, 2026 | Physics of Feature Learning 1 – A Perspective from Nonlinearity |
| Dec 31, 2025 | Physics of AI Requires Mindset Shifts |
| Dec 25, 2025 | Achieving AGI Intelligently – Structure, Not Scale |
| May 27, 2024 | Philosophical thoughts on Kolmogorov-Arnold Networks |
| Jun 16, 2023 | A Good ML Theory is Like Physics -- A Physicist's Analysis of Grokking |