Tag

Fileset

1 views collected around this technical thread.

Bilibili Tech
Bilibili Tech
Dec 17, 2024 · Big Data

Apache Gravitino: Metadata Management Practices and Production Experience at Bilibili

Bilibili adopted Apache Gravitino as a unified metadata platform that decouples consumers, consolidates schemas and Fileset‑based unstructured data across heterogeneous sources, cuts metadata and storage costs, resolves inconsistencies, boosts Hive Metastore performance, and enables features such as Iceberg branching and future AI‑centric governance.

Apache GravitinoBig DataData Governance
0 likes · 20 min read
Apache Gravitino: Metadata Management Practices and Production Experience at Bilibili
DataFunSummit
DataFunSummit
Dec 6, 2024 · Artificial Intelligence

Xiaomi AI Data Management Platform: Design, Implementation, and Practice

This article presents the background, design principles, architecture, and practical deployment of Xiaomi's AI Data Management Platform, highlighting how unified cataloging, Fileset integration, and notebook‑based development address AI data governance, cost reduction, and workflow efficiency for both structured and non‑structured data.

AI dataData GovernanceFileset
0 likes · 15 min read
Xiaomi AI Data Management Platform: Design, Implementation, and Practice