Make money doing the work you believe in

Added a DeepSeek Sparse Attention (DSA) from-scratch implementation to my LLMs-from-scratch repo thanks to an awesome new reader contrib.

With motivation, overview, and GPT-style model reference implementation as standalone example code: github.com/rasbt/LLMs-f…

Happy weekend & tinkering!

May 23
at
7:00 PM
Relevant people

Log in or sign up

Join the most interesting and insightful discussions.