(I’ve been working on …)

Machine Learning

Graphics

Footnotes

  1. At the time of writing this, 5 records: incorporating Flash Attention 3, adding a better matrix sign function, speeding up Muon, improving weight decay, and adding batch size scheduling. ^