Tag

Schema Management

2 views collected around this technical thread.

DataFunSummit
DataFunSummit
May 4, 2025 · Big Data

Iceberg Table Format Practice in Huawei Terminal Cloud

This article explains how Huawei's terminal cloud adopts the Apache Iceberg table format to efficiently manage large-scale datasets, detailing its architecture, feature engineering, merge operations, LSM-based storage, schema versioning, AB testing support, catalog enhancements, and future roadmap for full lifecycle data governance.

Data LakeHuawei CloudIceberg
0 likes · 13 min read
Iceberg Table Format Practice in Huawei Terminal Cloud
Tencent Database Technology
Tencent Database Technology
Jan 19, 2023 · Databases

Atomic DDL Implementation in MySQL 8.0: Metadata Management and DDL Log Mechanisms

This article provides a comprehensive technical analysis of MySQL 8.0's atomic DDL implementation, detailing the transition from fragmented FRM-based metadata to a unified InnoDB Data Dictionary, and explaining how the DDL Log mechanism ensures crash-safe physical file operations during schema modifications.

Atomic DDLDDL LogData Dictionary
0 likes · 17 min read
Atomic DDL Implementation in MySQL 8.0: Metadata Management and DDL Log Mechanisms
DataFunTalk
DataFunTalk
Sep 3, 2021 · Big Data

Building an Exabyte‑Scale Data Lake with Apache Hudi at ByteDance: Architecture, Design Choices, and Performance Optimizations

This article details ByteDance's implementation of an exabyte‑scale data lake using Apache Hudi, covering scenario requirements, engine selection, functional support, schema management, extensive performance tuning, and future directions, while also noting recruitment opportunities within the team.

Apache HudiByteDanceData Lake
0 likes · 9 min read
Building an Exabyte‑Scale Data Lake with Apache Hudi at ByteDance: Architecture, Design Choices, and Performance Optimizations