Tag

Storage Format

1 views collected around this technical thread.

DataFunSummit
DataFunSummit
Jun 19, 2024 · Big Data

Apache Hudi from Zero to One: Introduction to Hudi’s Storage Format (Part 1)

This article introduces Apache Hudi’s storage format, explaining the table layout, metadata and data file organization, the naming conventions of timeline actions, and the trade‑offs between Copy‑on‑Write and Merge‑on‑Read table types for transactional data lakes.

Apache HudiFile LayoutStorage Format
0 likes · 8 min read
Apache Hudi from Zero to One: Introduction to Hudi’s Storage Format (Part 1)
Big Data Technology Architecture
Big Data Technology Architecture
Aug 13, 2020 · Databases

Deep Dive into Apache Druid V1 Storage Format: Index Structures and Disk Layout

This article provides a detailed analysis of Apache Druid V1's column‑oriented storage format, covering dimension dictionaries, variable‑length encoded values, bitmap inverted indexes, array handling, and the physical metadata layout that enables sub‑second OLAP queries on massive datasets.

Apache DruidOLAPStorage Format
0 likes · 8 min read
Deep Dive into Apache Druid V1 Storage Format: Index Structures and Disk Layout