The app for independent voices

THE RLHF book

My book, Reinforcement Learning from Human Feedback, is wrapping up and going into final production (copyediting, making pretty, formatting, etc.). Shipping to you in 1-2 months!

It's a wonderful project to create a foundation of knowledge for the research communities that I love and operate in. It’s the book I wish I had when starting on…

Apr 9
at
6:19 PM
Relevant people

Log in or sign up

Join the most interesting and insightful discussions.