Mar 17, 2026 Estimating structural information fraction of your dataset Mar 06, 2026 Memory 2 -- How many bits does each parameter store? An analysis of MLP Feb 09, 2026 Memory 1 -- How much do linear layers memorize?