Benchmarks I'm watching now

Whenever a new model is released, I read the benchmarks. I haven’t seen a good list of benchmarks. I tend to see them referenced and then bookmark them.

Here’s my list. I keep a continually updated list of benchmarks at samek.fyi/benchmarks. I deleted the old list that used to live in this post so that I could link to the updated one instead.