Big Data 7 min read

Beike Zhaofang's 秒X Real‑Time Analytics Platform: Architecture, Implementation, and Use Cases

The article details the design and deployment of the 秒X real‑time analytics platform at Beike Zhaofang, covering its background, Spark Streaming‑based architecture, fast configuration, data processing pipeline, monitoring, visualization, practical applications, and future development plans.

Beike Product & Technology
Beike Product & Technology
Beike Product & Technology
Beike Zhaofang's 秒X Real‑Time Analytics Platform: Architecture, Implementation, and Use Cases

Background: With the development of big data technology, real‑time, accurate and stable data processing has become a business requirement, leading Beike Zhaofang to build the “秒X” real‑time analytics platform for instant metric collection, monitoring and alerting.

Architecture: The platform uses Spark Streaming for low‑latency, high‑throughput stream processing, Druid.io for real‑time multidimensional OLAP analysis, and Elasticsearch for detailed data search, forming a pipeline that ingests data from Kafka, cleanses, stores, and serves it.

Fast configuration: Each Kafka topic is treated as a task; users configure parsing, field extraction, filtering, transformation, indexing and dimension/metric definitions through a UI, enabling rapid setup of new data sources.

Data processing flow: (1) Data collection via Kafka; (2) Real‑time cleaning and transformation with Spark Streaming; (3) Storage in Druid, HBase, Elasticsearch, Redis; (4) Applications such as dashboards, alerts, detailed queries and API services.

Monitoring and alerting: The platform aggregates metrics and performs fine‑grained checks, triggering SMS or email alerts when thresholds are breached, ensuring timely response to anomalies.

Visualization and retrieval: Users can build OLAP models, create charts, dashboards, and perform interactive Elasticsearch queries for detailed log inspection.

Use cases: Security department uses 秒X for domain‑level peak traffic, IP access monitoring and volume statistics; Kylin engine uses it to monitor query latency and IP query counts, with automatic alerts for queries exceeding 10 seconds.

Conclusion: 秒X has been stable in production, supporting multiple business units, and the team will continue to enhance real‑time capabilities, simplify configuration and improve data accuracy.

About the team: The Beike Zhaofang big‑data architecture team builds and maintains the company’s data storage, compute and streaming platforms, providing high‑performance OLAP engines and toolchains.

monitoringBig DataReal-time AnalyticsElasticsearchDruidSpark Streaming
Beike Product & Technology
Written by

Beike Product & Technology

As Beike's official product and technology account, we are committed to building a platform for sharing Beike's product and technology insights, targeting internet/O2O developers and product professionals. We share high-quality original articles, tech salon events, and recruitment information weekly. Welcome to follow us.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.