
Youzan Big Data Technology Salon: Practices in Data Cost Governance, Apache Iceberg, Flink, and Data-Driven Growth

The Youzan Big Data Technology Salon brought together Youzan, NetEase and Didi to share practical approaches for cutting data‑infrastructure costs, building an Apache Iceberg‑based data lake, scaling Flink real‑time workloads, and creating a data‑driven growth platform that leverages tracking, A/B testing and analytics.

Youzan Coder

Big data technology has now reached ordinary companies. Although it is already widespread, it continues to iterate rapidly, with new solutions and application scenarios emerging constantly.

Over 8 years in e-commerce, Youzan has accumulated massive amounts of data. Processing that data brings constant challenges in stability and real-time performance. And because data is a core asset of the big data era, how to control its cost and let it drive growth is also a hot topic.

For this big data technology salon, Youzan joined NetEase and Didi to exchange the latest technical practices and ideas in the big data field. We hope the participants can learn from one another, build on each other's strengths, and bring that value back to their own businesses.

Data Cost Governance Practice at Youzan: Introduces how Youzan built a data cost system from scratch and saved millions in machine costs through a range of cost‑reduction measures. In practice, technical operations work has driven self‑directed cost reductions of more than 30%. The talk mainly covers cost calculation models, full‑domain cost bills, component technology transformation, and cost operation practices.

Building an Efficient Data Lake Based on Apache Iceberg: Introduces Iceberg's basic principles and the pain points it solves, how NetEase is building a complete ecosystem around Iceberg, and its plans for future community‑based development.

Flink Practice at Didi: Didi has a rich set of real‑time computing scenarios, ranging from real‑time monitoring to real‑time business and from data channels to real‑time data warehouses. It currently runs more than 10,000 Flink tasks, over 80% of them written in SQL, on a cluster of more than 1,000 nodes. The talk shares years of accumulated experience in cluster construction and engine optimization.

Building Youzan's Data‑Driven Growth System: Introduces how Youzan built its data‑driven growth system, including the design and implementation of its tracking platform, A/B testing system, and growth analysis platform, along with how growth hacking theory is applied in practice.

Tags: Flink, Didi, Apache Iceberg, Data Cost Governance, data-driven growth, NetEase
Written by

Youzan Coder

Official Youzan tech channel, delivering technical insights and occasional daily updates from the Youzan tech team.
