But whose human preferences are we training on? A generic thumbs-up? A vibes-based "which response was better"?
What if the reward function wasn't flattened human preference, but a comprehensive model of you? Your cognitive architecture. Your values. Your actual life.
That's what a Life Model enables. Not better AI for everyone. Better AI for each person specifically. Reinforcement learning where the signal isn't "good response." It's "good response for this mind."
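To make the shape of that concrete, here's a minimal Python sketch. Everything in it is hypothetical: `LifeModel`, its fields, and the keyword-matching heuristic are toy stand-ins for whatever a real per-person reward model would be. The point is only the signature change: the reward takes a person, not just a response.

```python
from dataclasses import dataclass, field

@dataclass
class LifeModel:
    """Hypothetical per-person model: weighted values, not an averaged preference."""
    values: dict[str, float] = field(default_factory=dict)  # e.g. {"directness": 0.9}

def generic_reward(response: str) -> float:
    # Flattened preference: one score for everyone (placeholder heuristic).
    return min(len(response) / 500, 1.0)

def personalized_reward(response: str, person: LifeModel) -> float:
    # Same response, scored against *this* person's weighted values.
    score = generic_reward(response)
    for value, weight in person.values.items():
        if value in response.lower():  # toy proxy for "reflects this value"
            score += weight
    return score

me = LifeModel(values={"directness": 0.9, "nuance": 0.4})
print(personalized_reward("A direct answer, with directness and nuance.", me))
```

Two people see the same response and the reward differs, because the signal is conditioned on the mind reading it.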
Scale that across thousands of unique models and you're not fine-tuning a product. You're building infrastructure that gets smarter about people.
That’s what I’m excited about!
Feb 27 at 10:00 PM