Benchmarks I'm watching now
Whenever a new model is released, I read the benchmarks. I haven’t seen a good list of benchmarks. I tend to see them referenced and then bookmark them.
Here’s my list. I keep a continually updated list of benchmarks at samek.fyi/benchmarks. I deleted the old list that used to live in this post so that I could link to the updated one instead.