Pau Labarta Bajo on Substack: "One skill every ML engineer has to master ↓ 𝗠𝗟 𝗦𝘆𝘀𝘁𝗲𝗺 𝗱𝗲𝘀𝗶𝗴𝗻 Yes. And do you know why? Because good ML system design has NOT changed at all in the last 5 years. And it won't. 𝗪𝗵𝘆 ❓ Because any ML system is (and will always be) made of 3 types of programs (…"

FRESH HELL Tina Brown's Diary

Meghan’s Buzzkill and a Murder Mystery Solved After 48 Years

New

∙

Malindi Minute: Black Authors, Buying Power, and Black Beauty

She’s Just the Tip of the Trump Administration’s Racist Iceberg

New

∙

7m read

The app for independent voices

Pau Labarta Bajo

Dec 16

Real-World Machine Learning

One skill every ML engineer has to master ↓

𝗠𝗟 𝗦𝘆𝘀𝘁𝗲𝗺 𝗱𝗲𝘀𝗶𝗴𝗻

Yes. And do you know why?

Because good ML system design has NOT changed at all in the last 5 years.

And it won't.

𝗪𝗵𝘆 ❓

Because any ML system is (and will always be) made of 3 types of programs (aka pipelines)

1️⃣ → 𝗙𝗲𝗮𝘁𝘂𝗿𝗲 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲𝘀 that transforms raw data into ML model features (e.g. vector embeddings) that are saved in a Feature Store or Vector DB.

2️⃣→ 𝗧𝗿𝗮𝗶𝗻𝗶𝗻𝗴/𝗙𝗶𝗻𝗲-𝘁𝘂𝗻𝗶𝗻𝗴 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲𝘀 that read historical features from the Feature Store/Vector DB and generate a new model artifact, either by training from scratch or fine-tuning a base LLM. This model artifact is then pushed to a model registry.

3️⃣ → 𝗜𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲𝘀 that load the model from the registry, and the input from the client app (for example a vector of numerical features, or a text prompt), generate a prediction (or a generation) and return it to the client app.

This is a universal blueprint, that together with CI/CD workflows (aka MLOps) helpx you build any ML system.

Now you can go and thank Jim Dowling for the idea 💡

----

Hi there! It's Pau Labarta Bajo 👋

Every day I share free, hands-on content, on production-grade ML, to help you build real-world ML products.

𝗙𝗼𝗹𝗹𝗼𝘄 𝗺𝗲 on Substack so you don't miss what's coming next

#mlops #machinelearning

Dec 16

1:17 PM