Feb 03, 2026 Depth 4 -- Flat directions (in weight space) are high frequency modes (in function space)
Feb 02, 2026 Depth 3 -- Fun facts about loss hessian eigenvalues
Jan 25, 2026 Optimization 3 / Depth 2 -- Adding Bias After ReLU
Jan 20, 2026 Depth 1 -- Understanding Pre-LN and Post-LN