One skill every ML engineer has to master โ
๐ ๐ ๐ฆ๐๐๐๐ฒ๐บ ๐ฑ๐ฒ๐๐ถ๐ด๐ป
Yes. And do you know why?
Because good ML system design has NOT changed at all in the last 5 years.
And it won't.
๐ช๐ต๐ โ
Because any ML system is (and will always be) made of 3 types of programs (aka pipelines)
1๏ธโฃ โ ๐๐ฒ๐ฎ๐๐๐ฟ๐ฒ ๐ฝ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐ that transforms raw data into ML model features (e.g. vector embeddings) that are saved in a Feature Store or Vector DB.
2๏ธโฃโ ๐ง๐ฟ๐ฎ๐ถ๐ป๐ถ๐ป๐ด/๐๐ถ๐ป๐ฒ-๐๐๐ป๐ถ๐ป๐ด ๐ฝ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐ that read historical features from the Feature Store/Vector DB and generate a new model artifact, either by training from scratch or fine-tuning a base LLM. This model artifact is then pushed to a model registry.
3๏ธโฃ โ ๐๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ ๐ฝ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐ that load the model from the registry, and the input from the client app (for example a vector of numerical features, or a text prompt), generate a prediction (or a generation) and return it to the client app.
This is a universal blueprint, that together with CI/CD workflows (aka MLOps) helpx you build any ML system.
Now you can go and thank Jim Dowling for the idea ๐ก
----
Hi there! It's Pau Labarta Bajo ๐
Every day I share free, hands-on content, on production-grade ML, to help you build real-world ML products.
๐๐ผ๐น๐น๐ผ๐ ๐บ๐ฒ on Substack so you don't miss what's coming next
#mlops #machinelearning