Art of Distributed System Architecture Design
Apr 24, 2015 · Big Data
Pinterest Real-Time Data Pipeline Using Kafka, Spark, and MemSQL
Pinterest built a real‑time data pipeline that streams user engagement events through Apache Kafka into Spark Streaming, enriches them with location and category information, and persists the results in MemSQL to enable fast, SQL‑based analytics for its recommendation engine.
KafkaMemSQLPinterest
0 likes · 3 min read