Writeup on Rubric-Based RL is out now:
Covers 15+ papers, the path from LLM-as-a-Judge to rubrics, and how we can use rubrics to extend RLVR beyond verifiable domains (with tons of tips / tricks from recent research). Hope it's helpful!