How to Migrate 100 Billion ClickHouse Rows to Doris: Three Practical Strategies

When a ClickHouse cluster needed to be decommissioned, the author evaluated three migration approaches—using Doris' ClickHouse catalog, exporting to files with Broker/Stream Load, and leveraging Spark—to move roughly 100 billion rows to Doris, comparing their complexity, reliability, and performance.

CatalogClickHouseSQL

0 likes · 9 min read

How to Migrate 100 Billion ClickHouse Rows to Doris: Three Practical Strategies

Big Data Technology & Architecture

Sep 19, 2022 · Big Data

Apache Iceberg Table and Catalog Configuration Guide for Hadoop

This article outlines the configuration settings for Apache Iceberg tables and catalogs on Hadoop, covering read and write properties, combine behavior for small HDFS files, reserved table properties, catalog lock options, and Hive Metastore connector Hadoop settings, supplemented with illustrative screenshots.

Big DataCatalogHadoop

0 likes · 3 min read

Apache Iceberg Table and Catalog Configuration Guide for Hadoop

Big Data Technology & Architecture

Apr 15, 2022 · Big Data

Configuring Flink SQL Client with Iceberg: Catalogs, DDL, Data Insertion and Query

This guide explains how to set up the Flink SQL client to work with Apache Iceberg, covering Scala version requirements, downloading and deploying Iceberg jars, configuring Hive and HDFS catalogs, creating databases and tables, performing insert and overwrite operations, and querying data in both batch and streaming modes.

Big DataCatalogFlink

0 likes · 18 min read

Configuring Flink SQL Client with Iceberg: Catalogs, DDL, Data Insertion and Query

Big Data Technology Architecture

Jul 15, 2021 · Big Data

Building Data Lake Solutions with Iceberg and Object Storage: Architecture, Write/Read Processes, and Storage Optimization

This article presents a comprehensive overview of using Apache Iceberg with object storage to construct scalable data lake solutions, covering lake architecture, Iceberg table organization, Flink‑based write and read workflows, catalog abstractions, object storage versus HDFS comparisons, append‑upload and atomic‑commit challenges, a demonstration setup, and ideas for storage optimization.

CatalogFlinkIceberg

0 likes · 16 min read

Building Data Lake Solutions with Iceberg and Object Storage: Architecture, Write/Read Processes, and Storage Optimization

Dual-Track Product Journal

May 7, 2020 · Product Management

Mastering E‑Commerce Product Management: From SPU & SKU to Full Lifecycle

This comprehensive guide explains the core concepts of e‑commerce product systems—including SPU and SKU definitions, their relationships, and the complete product management workflow covering publishing, editing, inventory, payment, logistics, and status transitions—providing essential knowledge for building robust online retail platforms.

CatalogSPUe-commerce

0 likes · 11 min read

Mastering E‑Commerce Product Management: From SPU & SKU to Full Lifecycle

21CTO

Jan 24, 2018 · Product Management

Designing a Scalable E‑Commerce Product Model: From SPU to SKU

This article walks through the evolution of e‑commerce product modeling, explaining how categories, brands, attributes, SPU and SKU concepts combine to form a flexible system that supports browsing, filtering, and detailed product pages while keeping the architecture maintainable.

CatalogSPUattribute design

0 likes · 10 min read

Designing a Scalable E‑Commerce Product Model: From SPU to SKU

How to Migrate 100 Billion ClickHouse Rows to Doris: Three Practical Strategies