Nested learning and the illusion of depth. Recent theoretical work argues that much of what is attributed to depth in modern neural networks can be explained by nested optimization dynamics, challenging assumptions nelslindahl.substack.co…