Ziming Liu
  • about
  • blog
  • publications
  • media coverage
  • talk
  • When does Kimi's "Attention Residuals" work?

    6 min read   ·   March 16, 2026

    2026   ·   Physics-of-AI   Optimization   Depth   Data     ·   AI  

  • When does RandOpt work?

    7 min read   ·   March 15, 2026

    2026   ·   Physics-of-AI   Optimization     ·   AI  

  • Tokenization 1 -- Factorized tokenization

    4 min read   ·   March 14, 2026

    2026   ·   Physics-of-AI   Tokenization     ·   AI  

  • How to ground your ideas?

    6 min read   ·   March 13, 2026

    2026   ·   Physics-of-AI   Methodology     ·   AI  

  • A toy model of distillation

    6 min read   ·   March 12, 2026

    2026   ·   Physics-of-AI   Optimization     ·   AI  

  • Newer
  • 1
  • 2
  • 3
  • 4
  • 5
  • Older
© Copyright 2026 Ziming Liu. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.