Chinese open-weight models might not be at the absolute frontier, but we learn a lot from them. I learn a lot more from the technical reports of DeepSeek and Kimi than dozens of pages of corporate speak in OpenAI and Anthropic model cards. (Gemma papers have some good details, to be fair.)
May 8
at
6:16 AM
Relevant people
Log in or sign up
Join the most interesting and insightful discussions.