This is one of the most interesting papers on self-improving agents for this year.
(bookmark this one)
Most self-improving AI systems hit the same wall: the mechanism that generates improvements is fixed and can't improve itself.
This new work from Meta and collaborators breaks through this limitation.
They introduce Hyperagents, self-referential agents where the self-improvement process itself is editable.
The DGM-Hyperagent combines a task agent and a meta agent into a single modifiable program, enabling metacognitive self-modification.
It autonomously discovers innovations like persistent memory and performance tracking, and these meta-improvements transfer across domains and compound across runs.
Why does it matter?
- On paper review, DGM-H improved from 0.0 to 0.710 test accuracy.
- On robotics reward design, it went from 0.060 to 0.372.
- Transfer hyperagents achieved 0.630 on Olympiad-level math grading in a domain they were never trained on.
This is a step toward AI systems that don't just find better solutions but continuously improve how they search for improvements.
Paper: arxiv.org/abs/2603.19461
Learn to build effective AI agents in our academy: academy.dair.ai