I remember reading a few research papers showing that training on one dataset helps the model improve on other tasks. But the point is that once a model is benchmaxxed for a specific benchmark, its score exaggerates what the model can do on real-world tasks.
Dec 11 at 4:35 PM