
Good essay! It's also hard to figure out which modules are the right ones to move. I've been experimenting with evolving transformer architectures (github.com/strangeloopc…); the setup is rudimentary, but it can search over those kinds of knobs and score candidates against objectives.

The Enigmatic Multilayer Perceptrons in Transformers: Why They Have Become Targets For Optimization
Jan 24 at 4:58 AM