Frans Zdyb on Substack: "Great post! I highly recommend looking into the literature on assistance games, which is a specific proposal for how to get AI to infer our intentions, rather than optimize a prespecified reward. See eg https://arxiv.org/abs/1606.03137 I think this area will become very relevan…"

Trump's Approval Rating Hits All Time High; Stanford Economist Explains Why Tariffs Make America Poorer; Roomba Escapes The Matrix

New

∙

1m read

The Parnas Perspective

The Trump Administration Likely Violated a Court Order | Let's Break it Down

Encounter At Buc-ee's

The Thing Donald Trump Won't Forget

New

∙

5m watch

The app for independent voices

Frans Zdyb

Nov 9

Frans Zdyb

Great post! I highly recommend looking into the literature on assistance games, which is a specific proposal for how to get AI to infer our intentions, rather than optimize a prespecified reward. See eg arxiv.org/abs/1606.03137

I think this area will become very relevant very soon, not just from a safety perspective, but even just for expanding the set of tasks that AI can take on - as you mention, reward design is not a great strategy.

Nov 9

2:51 PM