The app for independent voices

Looks like we got a new DeepSeek model over the holidays (again):

Basically pushes RLVR & self-refinement to gold-level scores on IMO 2025.

Coincidentally, I am currently working on a chapter on self-refinement, and this comes in handy as a nice, scaled-up case study.

Nov 29
at
3:08 PM
Relevant people

Log in or sign up

Join the most interesting and insightful discussions.