Make money doing the work you believe in

Netflix is processing over 5 MILLION records per second to track every move of a user across streaming, games, and ads. How? They abandoned traditional warehouses for a Real-Time Distributed Graph (RDG) built on Apache Flink. This is the blueprint for unifying fragmented, high-scale data. 

The key components of RDG:

  • Ingestion is fueled by Kafka

  • Data Stored in Avro format with schema in central registry

  • Filtering, enriching, deduping, transforming streams are all enabled by Flink

  • Publish the processed data (Nodes & Edges of the RDG) to Data Mesh for further consumption

netflixtechblog.com/how…

Dec 17
at
3:00 PM
Relevant people

Log in or sign up

Join the most interesting and insightful discussions.