Tagged articles
559 articles
Page 5 of 6
DataFunTalk
DataFunTalk
Jul 25, 2022 · Big Data

Taobao Data Model Governance and Intelligent Modeling with DataWorks

This article summarizes Guo Jinshi's presentation on Taobao's data model governance, covering the current data landscape, identified problems, analysis of root causes, proposed governance solutions—including DataWorks intelligent modeling—and future plans, while also providing a Q&A session on practical implementation.

AlibabaBig DataDataWorks
0 likes · 13 min read
Taobao Data Model Governance and Intelligent Modeling with DataWorks
AntTech
AntTech
Jun 29, 2022 · Information Security

Data Confidentiality Era: Development and Security – Highlights from Wei Tao’s 2022 Big Data Summit Speech

Wei Tao, Vice President of Ant Group, outlined the transition to a data‑confidentiality era, emphasizing the need for privacy‑computing security grading, technical requirements, and industry collaboration to safely circulate data as a new production factor in the post‑2022 big data landscape.

Data SecurityPrivacy Computingconfidential data
0 likes · 11 min read
Data Confidentiality Era: Development and Security – Highlights from Wei Tao’s 2022 Big Data Summit Speech
政采云技术
政采云技术
Jun 21, 2022 · Big Data

Overview of the Traffic Domain and Its Data Governance Architecture

This document presents a comprehensive overview of the traffic domain in a data warehouse, covering its concepts, objectives, guiding principles, core and extension models, data quality, monitoring, scheduling, and operational practices to achieve a complete, accurate, efficient, low‑cost, and high‑value traffic data system while addressing massive data volume, consistency, and SLA challenges.

Big DataData WarehouseOperations
0 likes · 15 min read
Overview of the Traffic Domain and Its Data Governance Architecture
DataFunTalk
DataFunTalk
Jun 2, 2022 · Big Data

Data Governance Practices and Product Strategy at NetEase: Challenges, Solutions, and Future Plans

The article presents NetEase's internal data governance experience, outlining past challenges, current pain points, a comprehensive product strategy covering scope, value quantification, and feature implementation, and shares initial results and future plans to build an automated, end‑to‑end big‑data optimization platform.

cost optimizationdata governancedata quality
0 likes · 13 min read
Data Governance Practices and Product Strategy at NetEase: Challenges, Solutions, and Future Plans
Architect
Architect
May 25, 2022 · Big Data

Metadata Infrastructure and Governance in Bilibili's Data Platform

The article details how Bilibili built a unified metadata infrastructure—including a URN‑based model, collection pipelines, quality assurance, storage in TiDB/ES/HugeGraph, and query services—to support data discovery, lineage, impact analysis, and governance across its growing data platform.

Big DataData CatalogData Lineage
0 likes · 21 min read
Metadata Infrastructure and Governance in Bilibili's Data Platform
Bilibili Tech
Bilibili Tech
May 24, 2022 · Big Data

Metadata Infrastructure and Governance in Bilibili Data Platform

Bilibili’s data platform consolidates scattered metadata into a unified URN‑based model stored across TiDB, Elasticsearch, and HugeGraph, offering batch‑pull and embedded collection, flexible SQL‑like queries, comprehensive lineage mapping, and powering data‑map, lineage‑map, and impact‑analysis tools while planning expanded quality assurance and self‑service dictionaries.

Data LineageData PlatformSQL parsing
0 likes · 21 min read
Metadata Infrastructure and Governance in Bilibili Data Platform
Architects Research Society
Architects Research Society
May 17, 2022 · Information Security

Understanding Data Governance, Models, Policies, and Best Practices

The article explains data governance concepts, outlines four common governance models, details key policy elements such as availability, quality, integrity, usability, and security, and highlights the benefits, risks, and best‑practice recommendations for implementing effective data governance in organizations.

ComplianceData ManagementData Security
0 likes · 10 min read
Understanding Data Governance, Models, Policies, and Best Practices
DaTaobao Tech
DaTaobao Tech
May 13, 2022 · Big Data

Taobao Big Data Model Governance and DataWorks Co‑development

Taobao’s rapidly expanding technical data system faced naming inconsistencies, low table reuse, and costly, inefficient data usage, prompting a joint effort with DataWorks to digitize model evaluation, enforce standardized governance, deliver intelligent end‑to‑end modeling tools, and launch a development assistant, resulting in a health‑monitoring dashboard, upgraded data maps, and a roadmap for further automation and architecture refinement.

Big DataData PlatformDataWorks
0 likes · 12 min read
Taobao Big Data Model Governance and DataWorks Co‑development
Meituan Technology Team
Meituan Technology Team
May 12, 2022 · Operations

Systematic Data Governance Framework and Practices at Meituan Accommodation

The Meituan Accommodation data governance team shares how they evolved from ad‑hoc, single‑point fixes to a systematic, automated governance framework—covering management, standards, capability, execution, evaluation, and vision—using standardization, digitization, and systematization to achieve measurable quality, cost and efficiency gains across thousands of data assets.

DigitizationMeituanMetrics
0 likes · 33 min read
Systematic Data Governance Framework and Practices at Meituan Accommodation
vivo Internet Technology
vivo Internet Technology
Apr 20, 2022 · Big Data

Implementing Field Lineage in Spark SQL: A Technical Deep Dive

The article details how to add field‑lineage tracking to Spark SQL by creating a custom SparkSessionExtension that injects a check‑analysis rule and a parser, which capture INSERT statements, analyze the physical plan, and generate a JSON mapping of source‑to‑target fields for data governance.

Field LineageLogical PlanPhysical Plan
0 likes · 9 min read
Implementing Field Lineage in Spark SQL: A Technical Deep Dive
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 7, 2022 · Big Data

How Alibaba’s Big Data Model Governance Boosted Efficiency and Cut Costs

This article details Alibaba's large‑scale data model governance initiative, analyzing current data issues, presenting a comprehensive solution—including model digitization, public model sinking, productization, daily governance, and search‑enhancement—and outlining achieved results and future plans to further improve data quality, reuse, and operational efficiency.

DataWorksModel Scoringdata efficiency
0 likes · 12 min read
How Alibaba’s Big Data Model Governance Boosted Efficiency and Cut Costs
dbaplus Community
dbaplus Community
Mar 15, 2022 · Big Data

How to Build a Real‑Time Data Warehouse with Flink SQL: Architecture, Implementation, and Governance

This article explains the challenges of early real‑time data pipelines, introduces a layered real‑time warehouse architecture, provides step‑by‑step Flink SQL code for building a demo warehouse, and covers comprehensive data governance, quality metrics, lifecycle management, and naming conventions for production‑grade big‑data systems.

Flink SQLReal-time Data Warehousedata governance
0 likes · 60 min read
How to Build a Real‑Time Data Warehouse with Flink SQL: Architecture, Implementation, and Governance
BaiPing Technology
BaiPing Technology
Mar 14, 2022 · Big Data

Mastering DataWorks & MaxCompute: A Complete Guide to Big Data Architecture and Governance

DataWorks, Alibaba Cloud’s comprehensive PaaS platform, combined with the serverless MaxCompute data warehouse, offers an integrated solution for data integration, development, quality, and services, while detailed naming and layer conventions ensure scalable, maintainable big‑data architectures and effective governance across ODS, CDM, DWD, DWS, and ADS layers.

Big DataDataWorksMaxCompute
0 likes · 8 min read
Mastering DataWorks & MaxCompute: A Complete Guide to Big Data Architecture and Governance
政采云技术
政采云技术
Feb 8, 2022 · Industry Insights

Unlocking Enterprise Value with a Data Middle Platform: Architecture & Indicators

This article traces the evolution from traditional data warehouses to modern data lakes and data middle platforms, explains why siloed data development hampers efficiency, and details the architecture and indicator‑library design used by Zhengcaiyun to achieve unified, reusable data services.

Big DataData LakehouseData Middle Platform
0 likes · 14 min read
Unlocking Enterprise Value with a Data Middle Platform: Architecture & Indicators
DevOps
DevOps
Jan 26, 2022 · R&D Management

Digital R&D Management Capability Building for Financial Organizations

This article outlines the comprehensive architecture and key points for building digital R&D management capabilities in financial organizations, reviewing historical challenges, identifying four major pain points, proposing an overall framework, detailing twelve essential capabilities, and offering principles for effective implementation.

Financial IndustryR&D Managementcapability building
0 likes · 20 min read
Digital R&D Management Capability Building for Financial Organizations
DataFunTalk
DataFunTalk
Jan 24, 2022 · Big Data

MobTech Data Governance and Security Practices: Architecture, Implementation, and Financial Industry Use Cases

This article presents MobTech’s comprehensive data governance and security practices, covering the necessity of governance, its benefits, a full‑chain governance framework, specific challenges in the financial sector, the evolution of their integrated architecture, and detailed implementations of security, model, asset, monitoring, and quality management systems.

data governancedata qualityfinancial technology
0 likes · 21 min read
MobTech Data Governance and Security Practices: Architecture, Implementation, and Financial Industry Use Cases
DataFunSummit
DataFunSummit
Jan 23, 2022 · Big Data

MobTech's Integrated Data Governance Practices and Architecture

This article presents MobTech's comprehensive data governance and security practices, covering the necessity of governance, challenges in large‑scale data environments, the full‑link governance chain, modular architecture, and specific implementations for financial risk‑control scenarios.

Big DataData ArchitectureData Management
0 likes · 19 min read
MobTech's Integrated Data Governance Practices and Architecture
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 18, 2022 · Big Data

Data Warehouse Data Quality Measurement Standards

The article outlines four key dimensions for evaluating data warehouse data quality—correctness, completeness, timeliness, and consistency—explains common consistency issues such as differing metric values across models, cross‑dimensional aggregations, and real‑time versus batch calculations, and proposes organizational and review mechanisms to mitigate these problems.

Big DataConsistencyData Warehouse
0 likes · 9 min read
Data Warehouse Data Quality Measurement Standards
21CTO
21CTO
Jan 13, 2022 · Fundamentals

How to Achieve Data Maturity: Turning Data into a Strategic Product

The article explains why data maturity is essential for modern enterprises, defines its three pillars—people, tools, and readiness—shows how treating data as a product follows the same principles as great products, and outlines the four S (Speed, Scale, Simplicity, SQL) that guide a mature data ecosystem.

Big DataData Productdata governance
0 likes · 6 min read
How to Achieve Data Maturity: Turning Data into a Strategic Product
21CTO
21CTO
Jan 8, 2022 · Big Data

How Amazon’s Intelligent Lakehouse Redefines Big Data Architecture

The article examines Amazon’s Intelligent Lakehouse architecture, tracing its evolution from early data‑lake‑warehouse integrations to a modern, serverless, secure, and AI‑enhanced platform that unifies data storage, governance, and analytics to lower big‑data costs and boost agility.

Big DataData Warehousedata governance
0 likes · 12 min read
How Amazon’s Intelligent Lakehouse Redefines Big Data Architecture
Volcano Engine Developer Services
Volcano Engine Developer Services
Jan 4, 2022 · Big Data

How ByteDance Scales EB-Level Data: Architecture, BP Model & Real-Time Insights

ByteDance’s data platform, built over seven years, now handles exabyte-scale data and over 100 million TPS, using a hybrid “middle‑platform + Business Partner” model, custom engines like ClickHouse/ByteHouse, agile governance, and a suite of products to support internal and external businesses, illustrating large-scale big-data engineering practices.

Big DataByteDanceClickHouse
0 likes · 22 min read
How ByteDance Scales EB-Level Data: Architecture, BP Model & Real-Time Insights
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 4, 2022 · Big Data

Big Data Mastery Roadmap: Learning Path, Resources, Future Trends and Interview Guidance

This comprehensive guide outlines a step‑by‑step learning roadmap for aspiring big data professionals, covering fundamentals, programming languages, Linux, databases, distributed theory, networking, offline and real‑time computing, data governance, warehouses, toolchains, video/book recommendations, future industry trends, interview tips, and community resources.

Big DataDistributed SystemsInterview Tips
0 likes · 42 min read
Big Data Mastery Roadmap: Learning Path, Resources, Future Trends and Interview Guidance
dbaplus Community
dbaplus Community
Dec 22, 2021 · Fundamentals

How Xiaomi Built a Scalable Metadata Platform for Data Governance

This article details Xiaomi's end‑to‑end metadata platform, covering its three‑layer architecture, the evolution of full‑domain metadata, real‑time lineage, precise measurement, and how these capabilities enable data map, governance, cost control, and quality improvements for future business empowerment.

Xiaomidata governancedata quality
0 likes · 20 min read
How Xiaomi Built a Scalable Metadata Platform for Data Governance
DataFunSummit
DataFunSummit
Dec 22, 2021 · Big Data

Data Governance Practices and Experiences at NetEase Cloud Music

This article details NetEase Cloud Music's comprehensive data governance journey, covering data warehouse architecture, data standards, event tracking (埋点) governance, asset lifecycle management, and future automation plans, illustrating how systematic governance improves data quality, cost efficiency, and business insight.

Big DataData Warehousedata governance
0 likes · 21 min read
Data Governance Practices and Experiences at NetEase Cloud Music
Architects Research Society
Architects Research Society
Dec 21, 2021 · Fundamentals

Next-Generation Master Data Management (MDM): Architecture, Business Value, and Technical Challenges

This article explains master data management concepts, regulatory drivers, business benefits, key technical challenges, architectural trends such as graph databases and machine learning, and highlights leading vendors, providing a comprehensive overview for enterprises seeking modern MDM solutions.

AnalyticsBig DataGraph Database
0 likes · 9 min read
Next-Generation Master Data Management (MDM): Architecture, Business Value, and Technical Challenges
Architects Research Society
Architects Research Society
Dec 20, 2021 · Fundamentals

Common Misconceptions About Master Data Management (MDM)

The article explains common misconceptions about Master Data Management, emphasizing its enterprise-wide scope, the importance of data quality, governance, workflow, real‑time integration, and the need for organizational change management, while warning against treating MDM as a simple project.

MDMdata governancedata quality
0 likes · 8 min read
Common Misconceptions About Master Data Management (MDM)
Baidu Geek Talk
Baidu Geek Talk
Dec 20, 2021 · Mobile Development

Master Data Management: Concepts, Architecture, and Practical Implementation in Baidu Smart Mini Programs

The article outlines master data management concepts and maturity levels, then details Baidu Smart Mini Program’s practical architecture—spanning analysis, domain‑driven design, high‑availability services, transaction handling, caching, real‑time sync, and governance—that eliminates data silos, ensures consistency, and supports over 9,000 QPS with 99.99% SLA.

Baidu Mini ProgramsMaster Data Managementdata governance
0 likes · 16 min read
Master Data Management: Concepts, Architecture, and Practical Implementation in Baidu Smart Mini Programs
Ctrip Technology
Ctrip Technology
Dec 16, 2021 · Big Data

Data Standard Management Practices in Ctrip Vacation Data Governance

This article outlines Ctrip Vacation's data standard management approach, covering why standards are needed, the three‑element framework of scope, tools, and policies, and detailed practices for data integration, production change handling, metadata governance, portal dashboard standardization, and self‑service query templating.

Big DataData WarehouseMetadata Management
0 likes · 12 min read
Data Standard Management Practices in Ctrip Vacation Data Governance
DataFunSummit
DataFunSummit
Dec 14, 2021 · Big Data

Data Map: Background, Definition, and Youzan’s Practical Implementation

This article introduces the concept of a data map, explains its background and goals, describes Youzan’s end‑to‑end data‑map practice—including full data lineage, search, management, link analysis, impact estimation, and optimization—and concludes with a summary and future outlook.

Big DataData LineageData Management
0 likes · 16 min read
Data Map: Background, Definition, and Youzan’s Practical Implementation
DataFunTalk
DataFunTalk
Dec 10, 2021 · Big Data

Building and Evolving NetEase Yanxuan Real-Time Computing Platform: Architecture, SQLization, Serviceization, and Data Governance

This article details NetEase Yanxuan's real-time computing platform development from 2017 to present, covering its architecture, Flink‑SQL development environment, service‑oriented deployment, resource optimization, cloud‑native migration, comprehensive data governance, and future plans for stream‑batch integration and intelligent job diagnostics.

Big DataCloud NativeFlink
0 likes · 14 min read
Building and Evolving NetEase Yanxuan Real-Time Computing Platform: Architecture, SQLization, Serviceization, and Data Governance
DataFunSummit
DataFunSummit
Dec 10, 2021 · Big Data

Real‑Time Platform Construction at NetEase Yanxuan: Architecture, SQL‑Based Streaming, Serviceization, and Data Governance

This article details NetEase Yanxuan's evolution of a real‑time data platform from 2017 to present, covering background, current scale, layered architecture, Flink‑SQL development IDE, service‑oriented task execution, resource‑optimizing deployment modes, cloud‑native migration, comprehensive data governance, and future batch‑stream integration plans.

Big DataCloud NativeFlink
0 likes · 15 min read
Real‑Time Platform Construction at NetEase Yanxuan: Architecture, SQL‑Based Streaming, Serviceization, and Data Governance
IT Architects Alliance
IT Architects Alliance
Dec 8, 2021 · Industry Insights

6 Proven Strategies to Modernize Your Cloud Data Warehouse

This article outlines six practical strategies—identifying bottlenecks, empowering data engineers, adopting distributed management, creating data contracts, embracing diverse perspectives, and streamlining workflows—to help organizations leverage cloud data warehouses more efficiently and drive better business intelligence outcomes.

Cloud ComputingData Warehousebusiness intelligence
0 likes · 8 min read
6 Proven Strategies to Modernize Your Cloud Data Warehouse
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 28, 2021 · Big Data

OneData Methodology: Building a Unified Data Warehouse Architecture and Governance Framework

This article presents the OneData methodology for designing, standardizing, and governing a data warehouse, detailing background challenges, goals, industry references, core concepts, unified business and design consolidation, data modeling layers, naming conventions, data quality controls, and the resulting operational improvements and business value.

Big DataData WarehouseOnedata
0 likes · 20 min read
OneData Methodology: Building a Unified Data Warehouse Architecture and Governance Framework
DataFunTalk
DataFunTalk
Nov 27, 2021 · Big Data

iQIYI Data Middle Platform: Architecture, Data Governance Practices, and Future Plans

The article details iQIYI’s data middle platform architecture and its comprehensive data governance practices, covering platform overview, data flow, unified standards, metadata management, production quality assurance, and future AI‑driven enhancements, illustrating how centralized data services improve reliability, efficiency, and security.

Big DataData Securitydata governance
0 likes · 27 min read
iQIYI Data Middle Platform: Architecture, Data Governance Practices, and Future Plans
AntTech
AntTech
Nov 26, 2021 · Information Security

Achieving “Computable but Not Identifiable”: Balancing Personal Data Protection and Industry Development with Trusted Computing

The article examines how the Personal Information Protection Law creates a new authorization framework and introduces the “computable but not identifiable” concept, arguing that trusted‑computing technologies and controlled environments can reconcile strict privacy safeguards with the data‑driven needs of AI and other industries.

Artificial IntelligenceInformation Securitydata anonymization
0 likes · 10 min read
Achieving “Computable but Not Identifiable”: Balancing Personal Data Protection and Industry Development with Trusted Computing
Baidu Geek Talk
Baidu Geek Talk
Nov 24, 2021 · Big Data

Building Big Data Infrastructure at Baidu Aifanfan: Architecture Practices and Lessons Learned

At Baidu Aifanfan, the data team built a unified real‑time and offline big‑data platform—leveraging Watt, Bigpipe, Fengge, AFS and Palo within Lambda/Kappa patterns and a fast‑slow parallel rollout—that cut OLAP query latency from 18 minutes to under 15 seconds, enabled self‑service analytics, and standardized metrics across 15 agile teams.

Apache DorisBig Data ArchitectureData Warehouse
0 likes · 23 min read
Building Big Data Infrastructure at Baidu Aifanfan: Architecture Practices and Lessons Learned
21CTO
21CTO
Nov 23, 2021 · Information Security

Dynamic Data Security: Unlocking Data Value and Protecting Privacy in Banking

In a recent statement, ICBC’s CTO emphasizes that data, as a crucial production factor, derives its core value during use, urging dynamic data security and personal information protection, cross‑institution collaboration, regulated data markets, and safe cross‑border flows to support a healthy digital economy.

Data SecurityPrivacy Computingcross‑border data
0 likes · 3 min read
Dynamic Data Security: Unlocking Data Value and Protecting Privacy in Banking
DataFunTalk
DataFunTalk
Oct 27, 2021 · Big Data

Data Value System and Cockpit Construction: A Case Study from CITIC Bank

This article explains how CITIC Bank's software development center built a data value system and management cockpit, detailing business objectives, overall architecture, digital management methodology, implementation steps, and real‑world usage to support the bank's digital transformation.

Big Databanking analyticsdata governance
0 likes · 16 min read
Data Value System and Cockpit Construction: A Case Study from CITIC Bank
DataFunSummit
DataFunSummit
Oct 26, 2021 · Big Data

Data Value System and Cockpit Construction: A Case Study from CITIC Bank

This article presents a comprehensive overview of CITIC Bank's data value system and cockpit construction, detailing business objectives, overall planning, digital management framework, methodology, implementation cases, and current usage, illustrating how data-driven analytics support the bank's digital transformation.

Big DataData CockpitData Value
0 likes · 17 min read
Data Value System and Cockpit Construction: A Case Study from CITIC Bank
High Availability Architecture
High Availability Architecture
Oct 25, 2021 · Big Data

iQIYI Data Governance Practices: Event Tracking (Pingback) Governance and Application

The article details iQIYI's comprehensive data governance initiative for event tracking (Pingback), covering definitions, timing, quality requirements, governance challenges, standardized specifications, coordinate management, testing and gray‑release processes, upgrade workflows, and data security measures that together reduced event volume by 40% and cut resource consumption in half.

AnalyticsBig Datadata governance
0 likes · 16 min read
iQIYI Data Governance Practices: Event Tracking (Pingback) Governance and Application
DataFunTalk
DataFunTalk
Oct 25, 2021 · Big Data

Building a Multi‑Dimensional Analysis System at Baixin Bank: Practices and Insights

This article details Baixin Bank’s multi‑dimensional analysis framework, covering the bank’s business model, data accuracy, completeness and usability requirements, the design of indicator and analysis systems, ladder‑style service concepts, user‑product‑enterprise scenario modeling, and the implementation of self‑service data products and governance processes.

BIMulti-dimensional Analyticsbanking
0 likes · 20 min read
Building a Multi‑Dimensional Analysis System at Baixin Bank: Practices and Insights
iQIYI Technical Product Team
iQIYI Technical Product Team
Oct 15, 2021 · Industry Insights

How iQIYI Streamlined Event Tracking: A Deep Dive into Data Governance

This article details iQIYI's comprehensive data‑governance practice for event tracking, covering the definition of pingback, the need for governance, the governance framework, coordinate management, gray‑data handling, and the upgrade process that reduced tracking volume by 40% while cutting resource consumption in half.

AnalyticsBig Datadata governance
0 likes · 17 min read
How iQIYI Streamlined Event Tracking: A Deep Dive into Data Governance
iQIYI Technical Product Team
iQIYI Technical Product Team
Oct 9, 2021 · Big Data

iQIYI Data Quality Monitoring: Exploration and Practice

At iTech Salon, iQIYI’s Peng Tao outlined a three‑layer data‑quality monitoring framework—pingback, middle, and business report layers—detailing anomaly‑detection techniques such as thresholds, statistical, correlation and Prophet forecasting, and announced future plans for intelligent rule generation and automated attribution to pinpoint root causes.

Rule Enginedata governancedata quality
0 likes · 11 min read
iQIYI Data Quality Monitoring: Exploration and Practice
IT Architects Alliance
IT Architects Alliance
Sep 12, 2021 · Industry Insights

Data Warehouse vs. Database: Core Differences and Building a Data Platform

This article explains what a data warehouse is, contrasts it with traditional databases, outlines how to design and build a data warehouse—including model selection, topic domain division, bus matrix, layered architecture, and data governance—then expands to the concept of a data middle platform and its distinction from data lakes and big‑data platforms.

Big DataData PlatformData Warehouse
0 likes · 18 min read
Data Warehouse vs. Database: Core Differences and Building a Data Platform
WecTeam
WecTeam
Sep 10, 2021 · Mobile Development

Boost Build Speed 35%: Swift‑ObjC Mixed Compilation & ByteDance Data Governance

This week’s WecTeam Front‑end Weekly spotlights two technical deep‑dives: a Swift‑Objective‑C mixed‑compilation technique that slashes build times by 35%, and ByteDance’s large‑scale data‑tracking governance framework that underpins its trillion‑plus real‑time analytics pipeline.

ByteDanceCompilation OptimizationMobile Development
0 likes · 2 min read
Boost Build Speed 35%: Swift‑ObjC Mixed Compilation & ByteDance Data Governance
DevOps
DevOps
Sep 6, 2021 · Operations

Huawei's Digital Transformation Practice: Management, Process, and Technology Evolution

This article presents Huawei's extensive digital transformation journey, detailing the continuous management system reforms, strategic shifts across multiple industries, data governance challenges, and practical initiatives such as cloud platforms, intelligent supply chains, and customer‑centric digital experiences that together illustrate how large enterprises can achieve sustainable growth through digitalization.

Cloud ComputingEnterprise Managementdata governance
0 likes · 33 min read
Huawei's Digital Transformation Practice: Management, Process, and Technology Evolution
dbaplus Community
dbaplus Community
Aug 31, 2021 · Big Data

How Meituan Waimai Built and Evolved Its Massive Data Warehouse from V1 to V3

This article details Meituan Waimai's data warehouse evolution—covering business context, four‑layer architecture, Spark‑based ETL, successive V1.0, V2.0, and V3.0 redesigns, data governance practices, resource‑optimization tactics, security measures, and future road‑maps—illustrated with diagrams and concrete technical choices.

Data SecurityETLResource Optimization
0 likes · 24 min read
How Meituan Waimai Built and Evolved Its Massive Data Warehouse from V1 to V3
DataFunTalk
DataFunTalk
Aug 30, 2021 · Fundamentals

20 Practical Strategies for Effective Data Governance

Effective data governance hinges on leadership commitment, clear policies, skilled teams, and integration into business processes, and this article outlines twenty actionable strategies—from securing executive support and embedding rules in systems to fostering data quality, visualization, and sustainable operations—to guide organizations toward successful governance.

Leadershipdata governancedata quality
0 likes · 8 min read
20 Practical Strategies for Effective Data Governance
DataFunSummit
DataFunSummit
Aug 22, 2021 · Big Data

Evolution and Optimization of Meituan Waimai Offline Data Warehouse: Architecture, ETL, Modeling, Governance, and Future Plans

This article details the historical development, architectural layers, ETL migration to Spark, data modeling standards, governance processes, resource optimization, security measures, and future roadmap of Meituan Waimai's offline data warehouse, illustrating how the team addressed scalability and efficiency challenges.

Big DataData WarehouseETL
0 likes · 21 min read
Evolution and Optimization of Meituan Waimai Offline Data Warehouse: Architecture, ETL, Modeling, Governance, and Future Plans
21CTO
21CTO
Aug 17, 2021 · Fundamentals

How Traditional Enterprises Can Master Digital Transformation: A 14th Five-Year Blueprint

The article explains that digital transformation for traditional companies requires a mindset shift beyond mere analysis of status and environment, and offers a PPT detailing the 14th Five-Year Plan, transformation roadmap, data governance, and AI application to guide enterprises through this strategic overhaul.

14th Five-Year PlanAIEnterprise Strategy
0 likes · 2 min read
How Traditional Enterprises Can Master Digital Transformation: A 14th Five-Year Blueprint
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 3, 2021 · Big Data

Inside ByteDance’s Traffic Platform: Powering Trillions of Real‑Time Events

This article, compiled from a Volcano Engine meetup, explains how ByteDance’s unified traffic platform designs, governs, and processes massive event‑tracking data in real time, covering embedding content solutions, link architecture, dynamic processing engines, and data‑governance practices that support trillions of daily events.

Big DataReal-time Processingdata engineering
0 likes · 16 min read
Inside ByteDance’s Traffic Platform: Powering Trillions of Real‑Time Events
IT Architects Alliance
IT Architects Alliance
Jul 31, 2021 · Big Data

Alibaba's Data Platform Evolution: Four Stages, Core Challenges, and Future Trends

The article outlines Alibaba's twelve‑year journey building a data middle‑platform, detailing four development stages, the four major technical challenges faced, and emerging trends such as lake‑warehouse integration, autonomous data‑warehouse operation, natural‑language query, and AI‑driven data engineering.

AlibabaData Middle Platformdata governance
0 likes · 17 min read
Alibaba's Data Platform Evolution: Four Stages, Core Challenges, and Future Trends
Architects' Tech Alliance
Architects' Tech Alliance
Jul 29, 2021 · Big Data

Alibaba's Data Platform Evolution: Four Stages, Core Challenges, and Future Trends

The article outlines Alibaba's twelve‑year journey of building a data middle platform, describing four development stages, the technical challenges faced, and emerging trends such as lake‑warehouse integration, autonomous data‑warehouse operation, natural‑language query, and AI engineering.

Artificial IntelligenceCloud ComputingData Middle Platform
0 likes · 17 min read
Alibaba's Data Platform Evolution: Four Stages, Core Challenges, and Future Trends
ITPUB
ITPUB
Jul 7, 2021 · Big Data

How NetEase Cloud Music Scaled Its Data Warehouse for Billion‑User Traffic

This article details NetEase Cloud Music's journey of redesigning its data warehouse and governance processes to support over a billion monthly active users, covering pain points, standardization, shared services, self‑service tools, and the resulting improvements in data quality, latency, and operational efficiency.

AnalyticsData PlatformReal-time Processing
0 likes · 19 min read
How NetEase Cloud Music Scaled Its Data Warehouse for Billion‑User Traffic
Architect
Architect
Jul 1, 2021 · Big Data

Data Governance Practices at Meituan Hotel Travel Platform

This article presents a comprehensive case study of Meituan's hotel‑travel data governance, covering the background, challenges, strategic goals, standardized processes, technical systems, cost and security optimizations, measurable outcomes, and future plans for automated governance.

Big DataData SecurityMeituan
0 likes · 29 min read
Data Governance Practices at Meituan Hotel Travel Platform
360 Tech Engineering
360 Tech Engineering
Jun 25, 2021 · Big Data

Introducing ULTRON: A Real‑Time Data Warehouse Platform Powered by FlinkSQL

ULTRON is a one‑stop real‑time data‑warehouse development platform built on FlinkSQL that unifies data integration, asset management, cluster deployment, modeling, ETL, OLAP analysis and governance, addressing the limitations of traditional batch‑oriented warehouses and simplifying streaming data workflows for developers.

FlinkSQLReal-time Data WarehouseStreaming
0 likes · 13 min read
Introducing ULTRON: A Real‑Time Data Warehouse Platform Powered by FlinkSQL
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 21, 2021 · Big Data

What Is a Big Data Platform and How to Design Its Architecture?

This article explains what a big data platform is, outlines its seven‑component overall architecture, details the technical stack from data sources to applications, and describes the key subsystems such as catalog management, data integration, governance, storage, processing, sharing, development, and analysis.

Distributed Systemsdata governancedata integration
0 likes · 11 min read
What Is a Big Data Platform and How to Design Its Architecture?
58 Tech
58 Tech
Jun 9, 2021 · Big Data

Designing and Implementing a Unified Data Metric System for 58 Commercial Data Team

This article explains how 58's commercial data team built a comprehensive data metric system—from identifying common metric definition issues to establishing a domain‑driven hierarchy, distinguishing atomic and derived metrics, implementing a unified metric management platform, and providing APIs and examples for querying and visualizing metrics.

Big DataJavaSQL
0 likes · 17 min read
Designing and Implementing a Unified Data Metric System for 58 Commercial Data Team
Big Data Technology & Architecture
Big Data Technology & Architecture
Jun 6, 2021 · Big Data

Understanding Data Warehouses: Concepts, Architecture, Modeling, and Governance

This article provides a comprehensive overview of data warehouses, explaining their purpose, differences from databases, OLTP vs OLAP, traditional versus internet data warehouse models, layered architecture, modeling theories, metric dictionaries, date dimensions, naming conventions, data governance, and incremental synchronization techniques with practical SQL examples.

Big DataETLSQL
0 likes · 24 min read
Understanding Data Warehouses: Concepts, Architecture, Modeling, and Governance
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 1, 2021 · Fundamentals

How Huawei Built a Comprehensive Data Governance Framework for Digital Transformation

Huawei’s 2017 digital‑transformation vision led to a five‑step data‑governance blueprint that evolved through two phases, defining a detailed data‑classification framework, structured and unstructured data management methods, metadata governance, and compliance‑driven external data handling to support enterprise‑wide intelligent operations.

data classificationdata governancemetadata
0 likes · 20 min read
How Huawei Built a Comprehensive Data Governance Framework for Digital Transformation
Efficient Ops
Efficient Ops
May 30, 2021 · Operations

How Intelligent Operations Are Redefining IT Management – Key Takeaways from the 2021 GOPS Conference

The 2021 GOPS Global Operations Conference in Shenzhen highlighted the shift toward intelligent, AI‑driven IT operations, presenting practical solutions, a three‑principle six‑step framework, and four core capabilities that help enterprises digitize, govern, and automate their operational data for higher efficiency.

IT OperationsIntelligent Operationsaiops
0 likes · 7 min read
How Intelligent Operations Are Redefining IT Management – Key Takeaways from the 2021 GOPS Conference
IT Architects Alliance
IT Architects Alliance
May 25, 2021 · Big Data

How Modern Data Middle Platforms Power Real‑Time and Offline Analytics

This article provides a comprehensive technical overview of data middle platforms, covering data aggregation, offline and real‑time development, smart operations, data asset management, governance, service layers, platform implementations, warehouse layering, and key differences between offline and real‑time data warehouses.

Big DataData PlatformData Warehouse
0 likes · 26 min read
How Modern Data Middle Platforms Power Real‑Time and Offline Analytics
Architects Research Society
Architects Research Society
May 23, 2021 · Big Data

Data Architecture Trends: From Chaos to an Organized Era – Insights from Anthony J. Algmin

The article reviews Anthony J. Algmin’s reflections on past data‑architecture predictions, current hot topics such as cloud, AI/ML, data governance, and real‑time analytics, and forecasts future trends including metadata management, blockchain, and the evolving role of data architects within enterprises.

Artificial IntelligenceBig DataData Architecture
0 likes · 13 min read
Data Architecture Trends: From Chaos to an Organized Era – Insights from Anthony J. Algmin
ITFLY8 Architecture Home
ITFLY8 Architecture Home
May 22, 2021 · Fundamentals

How Meituan Scaled Data Governance: Practical Lessons for Enterprise Data Management

This article outlines Meituan's journey in data governance, detailing the challenges of data quality, cost, security, standardization and efficiency, and presenting a three‑stage roadmap—passive, proactive, and automated governance—along with concrete technical and organizational solutions.

Data ArchitectureInformation Securitydata governance
0 likes · 9 min read
How Meituan Scaled Data Governance: Practical Lessons for Enterprise Data Management
Programmer DD
Programmer DD
May 22, 2021 · Big Data

What Is a Data Lake? Origins, Architecture, and How It Powers Modern Big Data

This article explains the concept of a data lake—its origin in 2011, how it differs from traditional databases and data warehouses, its core characteristics such as raw data storage, on‑demand computing, and schema‑on‑read, as well as its advantages, challenges, architectural components, and future outlook within the big‑data ecosystem.

Big DataData ArchitectureETL
0 likes · 20 min read
What Is a Data Lake? Origins, Architecture, and How It Powers Modern Big Data
Big Data Technology & Architecture
Big Data Technology & Architecture
May 19, 2021 · Big Data

Comprehensive Guide to Data Governance: Metadata, Data Quality, Standards, and Asset Management

This article provides an extensive overview of data governance in the big‑data era, covering common pitfalls, the role of metadata, data quality management, data standardization, and data asset management, and offers practical recommendations for organizations to implement effective governance practices.

Big DataData Asset Managementdata governance
0 likes · 42 min read
Comprehensive Guide to Data Governance: Metadata, Data Quality, Standards, and Asset Management
Big Data Technology & Architecture
Big Data Technology & Architecture
May 15, 2021 · Big Data

One‑Stop Big Data Platform Construction: Practices from WeBank, Beike, and iQIYI

This article shares practical notes on building a one‑stop big data platform, outlining essential functions such as data extraction, cleaning, storage, analysis, governance, and security, and presents implementation case studies from WeBank, Beike, and iQIYI to illustrate real‑world architectures and solutions.

Big DataData Platformcase study
0 likes · 8 min read
One‑Stop Big Data Platform Construction: Practices from WeBank, Beike, and iQIYI
Big Data Technology & Architecture
Big Data Technology & Architecture
May 11, 2021 · Big Data

Data Quality: Dimensions, Rules, and Constraints

The article explains the importance of data quality in the big data era, defines key quality dimensions such as completeness, uniqueness, validity, consistency, accuracy, timeliness, and credibility, and details how each dimension can be measured and enforced through specific constraints and validation rules.

Big DataConsistencyaccuracy
0 likes · 9 min read
Data Quality: Dimensions, Rules, and Constraints
Architecture Digest
Architecture Digest
May 7, 2021 · Big Data

Comprehensive Overview of Data Middle Platform Architecture and Practices

This article provides a detailed introduction to data middle platform concepts, covering data aggregation, ingestion tools, offline and real‑time development, data governance, service layers, monitoring, and deployment patterns, illustrating how enterprises build unified data ecosystems across various industries.

Big DataData PlatformData Warehouse
0 likes · 25 min read
Comprehensive Overview of Data Middle Platform Architecture and Practices
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 22, 2021 · Big Data

Debunking Common Misconceptions About Data Lakes

This article debunks eight common misconceptions about data lakes, explains why they are not mutually exclusive with data warehouses, clarifies that they are not limited to Hadoop or raw data only, and provides practical tips for building flexible, secure, and business‑driven data lake solutions.

AnalyticsBig DataCloud Services
0 likes · 21 min read
Debunking Common Misconceptions About Data Lakes
Meituan Technology Team
Meituan Technology Team
Apr 15, 2021 · Big Data

Data Governance Practices at Meituan Hotel & Travel Platform

Meituan’s hotel‑travel platform tackled exploding data‑quality, cost, efficiency, and security issues by establishing a full‑link governance framework—standardized processes, a Data Management Committee, and unified “One Model, One Logic, One Service, One Portal” systems—that cut per‑unit costs by ~40%, boosted engineer productivity over 60%, eliminated major security incidents, and set the stage for autonomous, AI‑driven data governance.

Big DataData SecurityMeituan
0 likes · 32 min read
Data Governance Practices at Meituan Hotel & Travel Platform
Efficient Ops
Efficient Ops
Mar 31, 2021 · Operations

Uncovering Digital Risks in DevOps: Safeguarding Your Digital Transformation

This article examines how result‑oriented DevOps drives digital transformation while exposing digital risks—from missing high‑level test scenarios and broken security data links to insufficient user‑experience foresight—and outlines strategies for data governance, risk mitigation, and effective decision‑support across the enterprise.

DevOpsdata governancedigital transformation
0 likes · 12 min read
Uncovering Digital Risks in DevOps: Safeguarding Your Digital Transformation
dbaplus Community
dbaplus Community
Mar 17, 2021 · Big Data

How We Cut PBs of Waste and Optimized HDFS with Tiered Storage and Cloud Migration

This article details a three‑part technical sharing that covers cost governance for offline Hadoop clusters, a large‑scale data‑center migration with architecture upgrades, and a tiered storage strategy using EC and COS to reduce storage costs and improve performance in a cloud‑native big‑data environment.

Big Data MigrationCOSCloud Native
0 likes · 10 min read
How We Cut PBs of Waste and Optimized HDFS with Tiered Storage and Cloud Migration
Baidu Intelligent Testing
Baidu Intelligent Testing
Mar 10, 2021 · Artificial Intelligence

End-to-End Consistency Assurance for Click‑Through Rate Models: Methodology, Implementation, and Reporting

This article presents a comprehensive model quality assurance framework for click‑through‑rate (CTR) prediction, detailing the challenges of data and logic inconsistency, defining consistency goals, describing a full‑stack verification pipeline—including online data capture, offline sample alignment, multi‑stage q‑value comparison, and automated reporting—and sharing practical deployment experiences and results.

CTRMachine Learningdata governance
0 likes · 19 min read
End-to-End Consistency Assurance for Click‑Through Rate Models: Methodology, Implementation, and Reporting
Suning Technology
Suning Technology
Mar 3, 2021 · Big Data

How Can China Build a Secure, Free Data Sharing Ecosystem?

The article examines China's push for free public data sharing, highlighting policy directives, the need for top‑level design, security standards, and education to create a unified, safe data‑governance framework that fuels the digital economy.

Big DataDigital Economydata governance
0 likes · 6 min read
How Can China Build a Secure, Free Data Sharing Ecosystem?
DataFunTalk
DataFunTalk
Feb 23, 2021 · Big Data

Meituan Hotel & Travel Data Governance: Journey, Practices, and Future Directions

This article outlines Meituan's hotel‑travel data governance evolution, describing the key quality, cost, security, standardization and efficiency challenges faced as the business scaled, and detailing the organizational, technical, metric, service and product‑entry solutions implemented to achieve systematic, measurable, and automated data governance.

Big DataData Securitydata governance
0 likes · 19 min read
Meituan Hotel & Travel Data Governance: Journey, Practices, and Future Directions
Yanxuan Tech Team
Yanxuan Tech Team
Feb 5, 2021 · Big Data

How NetEase Yanxuan Built a Robust Data Task Governance System in 2020

This article details NetEase Yanxuan's 2020 initiative to improve data task governance, describing identified pain points, the pre‑mid‑post framework for model, baseline, and incident handling, and the resulting products, processes, and future plans for a more reliable data warehouse.

Baseline ManagementData WarehouseTask Operations
0 likes · 27 min read
How NetEase Yanxuan Built a Robust Data Task Governance System in 2020

NetEase Yanxuan Data Task Governance Practice: Pre‑, In‑, and Post‑Operation Strategies

NetEase Yanxuan tackled data‑task governance by establishing pre‑operation guarantees, baseline‑driven in‑operation controls, and post‑operation interventions, delivering stable task output, reduced alarms, lineage awareness, rapid incident recovery, and reusable best‑practice products that earned the 2020 Technology Sharing Co‑building Award.

Baseline ManagementBig DataTask Operation
0 likes · 25 min read
NetEase Yanxuan Data Task Governance Practice: Pre‑, In‑, and Post‑Operation Strategies
21CTO
21CTO
Jan 25, 2021 · Big Data

Understanding Data Lakes vs. Data Warehouses: A Complete Guide

This article provides a comprehensive overview of data lakes and data warehouses, explaining their definitions, architectures, differences, and practical use cases, while also covering related concepts such as OLTP/OLAP, ETL processes, data governance, and modern lakehouse solutions.

Data Warehousedata governancedata lake
0 likes · 95 min read
Understanding Data Lakes vs. Data Warehouses: A Complete Guide