Ziming Liu
  • about
  • blog
  • publications
  • media coverage
  • talk
  • Depth 3 -- Fun facts about loss hessian eigenvalues

    5 min read   ·   February 02, 2026

    2026   ·   Physics-of-AI   Depth     ·   AI  

  • Diffusion 2 -- Visualizing flow matching, temporal dynamics

    4 min read   ·   February 01, 2026

    2026   ·   Physics-of-AI   Diffusion     ·   AI  

  • Sparse attention 7 -- Stack of causal attention creates implicit positional embedding, and explaning "Loss in the middle"

    5 min read   ·   January 31, 2026

    2026   ·   Physics-of-AI   Sparse-attention     ·   AI  

  • Sparse attention 6 -- In-context Associative recall

    5 min read   ·   January 30, 2026

    2026   ·   Physics-of-AI   Sparse-attention     ·   AI  

  • MLP 2 -- Effective linearity, Generalized SiLU

    4 min read   ·   January 29, 2026

    2026   ·   Physics-of-AI   MLP   Non-linearity   Activation-function     ·   AI  

  • Newer
  • 5
  • 6
  • 7
  • 8
  • 9
  • Older
© Copyright 2026 Ziming Liu. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.