Tag

partitioning

1 views collected around this technical thread.

Java Tech Enthusiast
Java Tech Enthusiast
May 10, 2025 · Databases

Efficient Strategies for Deleting Massive Data from Large Database Tables

The article outlines why deleting tens of millions of rows in one statement harms performance and reliability, then presents safe, efficient tactics—such as indexed batch deletes, partition drops, creating a new table, or using TRUNCATE—plus pre‑deletion planning, post‑delete validation, monitoring, and space reclamation.

BackupDatabaseSQL
0 likes · 11 min read
Efficient Strategies for Deleting Massive Data from Large Database Tables
IT Services Circle
IT Services Circle
May 6, 2025 · Databases

Efficient Strategies for Deleting Massive Data from Large Tables

This article explains the risks of bulk deletions on million‑row tables and provides a step‑by‑step guide—including pre‑deletion planning, indexing, backup, batch deletion, partitioning, table recreation, TRUNCATE, and post‑deletion verification—to safely and quickly remove large volumes of data while minimizing lock contention, log growth, CPU spikes, replication lag, and rollback complications.

BackupBatch DeleteSQL
0 likes · 11 min read
Efficient Strategies for Deleting Massive Data from Large Tables
IT Services Circle
IT Services Circle
Oct 1, 2024 · Databases

Effectiveness of Adding an Index on a Status Column in a Tens‑of‑Millions Row MySQL Table

This article explains how adding an index to a status column in a tens‑of‑millions‑row MySQL table affects query performance, covering the basic index lookup process, extreme cases where full scans are chosen, selectivity, covering indexes, composite indexes, partitioning, and using EXPLAIN to verify execution plans.

DatabaseEXPLAINMySQL
0 likes · 9 min read
Effectiveness of Adding an Index on a Status Column in a Tens‑of‑Millions Row MySQL Table
Architecture Digest
Architecture Digest
Sep 23, 2024 · Databases

Evaluating and Optimizing Large MySQL Tables: Partitioning, Sharding, and Hot‑Cold Archiving Strategies

This article explains how to assess MySQL table size using table capacity, disk usage, and instance resources, describes why large tables degrade performance, and presents three practical solutions—partitioning, sharding (horizontal/vertical), and hot‑cold data archiving—along with their trade‑offs and implementation tips.

B+ TreeDatabase OptimizationHotColdArchiving
0 likes · 15 min read
Evaluating and Optimizing Large MySQL Tables: Partitioning, Sharding, and Hot‑Cold Archiving Strategies
IT Xianyu
IT Xianyu
Aug 26, 2024 · Big Data

Hive Data Warehouse: Modeling, Partitioning, and ID‑Mapping for User Profiles

This article explains how Hive serves as a data‑warehouse layer for user‑profile tagging, covering data‑warehouse fundamentals, fact‑and‑dimension modeling, partitioned storage, label aggregation, and ID‑mapping techniques with practical Hive DDL/DML examples.

Big DataData WarehouseETL
0 likes · 11 min read
Hive Data Warehouse: Modeling, Partitioning, and ID‑Mapping for User Profiles
System Architect Go
System Architect Go
Aug 22, 2024 · Fundamentals

Replication and Partitioning Mechanisms in Redis, Kafka, and Elasticsearch

This article examines the replication and partitioning designs of Redis, Kafka, and Elasticsearch, drawing on concepts from Designing Data‑Intensive Applications to illustrate core distributed‑system principles, common challenges, and practical configuration options.

DDIAElasticsearchKafka
0 likes · 7 min read
Replication and Partitioning Mechanisms in Redis, Kafka, and Elasticsearch
Architect
Architect
Mar 29, 2024 · Databases

Understanding Database Sharding and Partitioning with ShardingSphere

This article explains the fundamentals of database sharding and partitioning, why they are needed for massive data volumes, various vertical and horizontal splitting strategies, routing algorithms, client vs. proxy deployment models, and the practical challenges and solutions when using ShardingSphere 5.x.

Horizontal SplittingMySQLShardingSphere
0 likes · 17 min read
Understanding Database Sharding and Partitioning with ShardingSphere
Architect's Guide
Architect's Guide
Feb 4, 2024 · Databases

Evaluating Large MySQL Table Size and Optimizing Query Performance with Partitioning, Sharding, and Archiving

This article explains how to assess MySQL table size through table capacity, disk usage, and instance resources, illustrates the impact of B+‑tree index depth on query speed, and presents three practical solutions—table partitioning, database sharding, and hot‑cold data archiving—to improve performance.

B+ TreeDatabase OptimizationMySQL
0 likes · 15 min read
Evaluating Large MySQL Table Size and Optimizing Query Performance with Partitioning, Sharding, and Archiving
Aikesheng Open Source Community
Aikesheng Open Source Community
Oct 25, 2023 · Databases

Understanding and Optimizing Indexes in OceanBase

This article explains the concepts of local and global indexes in OceanBase, demonstrates how to create tables and indexes, compares six practical scenarios—including no index, local index, partitioned queries, unique and global indexes—and provides performance metrics to guide database optimization.

Database OptimizationGlobal IndexIndexing
0 likes · 8 min read
Understanding and Optimizing Indexes in OceanBase
Weimob Technology Center
Weimob Technology Center
Oct 13, 2023 · Big Data

Optimizing StarRocks Tables: Design Tips, Real‑World Cases and Monitoring Strategies

This article explains how to design efficient StarRocks tables with proper field types, partitioning and bucketing, compares update and primary‑key models, presents real‑world cases of memory and tablet issues, provides a complete table‑creation example, and outlines comprehensive monitoring metrics to keep the analytical data warehouse performant and stable.

Big DataStarRocksanalytics
0 likes · 25 min read
Optimizing StarRocks Tables: Design Tips, Real‑World Cases and Monitoring Strategies
Sanyou's Java Diary
Sanyou's Java Diary
Jun 26, 2023 · Big Data

Master Kafka Interview Questions: Architecture, Partitioning, and Reliability Explained

This article provides a comprehensive overview of Kafka, covering its core architecture, message queue models, communication process, partition selection, consumer groups, rebalancing strategies, partition assignment algorithms, reliability guarantees, replica synchronization, and reasons for removing Zookeeper in newer versions.

Consumer GroupInterviewKafka
0 likes · 20 min read
Master Kafka Interview Questions: Architecture, Partitioning, and Reliability Explained
Architect
Architect
Jan 18, 2023 · Databases

Design and Architecture of Bilibili's High‑Performance KV Storage System

This article presents the background, overall architecture, partitioning strategies, raft‑based replication, binlog support, multi‑active deployment, bulk‑load mechanisms, storage‑engine optimizations, load‑balancing policies, and failure‑detection & recovery techniques of a high‑reliability, high‑throughput key‑value store used at Bilibili.

Bulk LoadKV storageLoad Balancing
0 likes · 22 min read
Design and Architecture of Bilibili's High‑Performance KV Storage System
Code Ape Tech Column
Code Ape Tech Column
Jan 12, 2023 · Databases

Implementing Monthly Sharding with MySQL Stored Procedures and SpringBoot‑MyBatis

This article presents a practical guide to implementing monthly sharding for high‑volume billing data using MySQL stored procedures, dynamic table creation, and SpringBoot‑MyBatis integration, covering schema design, code examples, and query strategies for inserting, retrieving, and managing partitioned tables.

Dynamic TableMyBatisMySQL
0 likes · 34 min read
Implementing Monthly Sharding with MySQL Stored Procedures and SpringBoot‑MyBatis
Architecture Digest
Architecture Digest
Jan 2, 2023 · Databases

Database Sharding and Partitioning Strategy for High‑Volume Order Systems

This article explains how to classify massive order data into hot and cold segments, store them in MySQL, Elasticsearch and Hive respectively, and implement sharding and partitioning at both table and database levels using modulo and hash calculations to achieve scalable performance for billions of orders.

DatabaseElasticsearchHive
0 likes · 8 min read
Database Sharding and Partitioning Strategy for High‑Volume Order Systems
Architecture Digest
Architecture Digest
Oct 30, 2022 · Backend Development

High‑Availability Architecture for a Large‑Scale Membership System

This article details the design and implementation of a high‑availability, high‑performance membership system that serves billions of users across multiple platforms, covering Elasticsearch dual‑center clusters, traffic‑isolated three‑cluster setups, Redis caching strategies, MySQL dual‑center partitioning, and advanced flow‑control and degradation mechanisms.

ElasticsearchHigh AvailabilityLoad Balancing
0 likes · 18 min read
High‑Availability Architecture for a Large‑Scale Membership System
Architect
Architect
Aug 26, 2022 · Big Data

Deep Dive into Kafka Architecture: Topics, Partitions, Replication, Consumers, and Transactions

This article provides a comprehensive overview of Kafka's architecture, covering topics, partitions, replication, producer and consumer workflows, offset management, rebalancing, delivery semantics, exactly‑once guarantees, transaction handling, file organization, and key configuration settings.

Distributed MessagingKafkaReplication
0 likes · 17 min read
Deep Dive into Kafka Architecture: Topics, Partitions, Replication, Consumers, and Transactions
Wukong Talks Architecture
Wukong Talks Architecture
Aug 9, 2022 · Big Data

Kafka Basics: 15 Key Questions and In‑Depth Answers

This comprehensive guide covers Kafka’s core concepts, architecture, Zookeeper role, producer sending modes, partitioning strategies, replica types, message deletion, performance optimizations, consumer models, offset management, and best‑practice recommendations for scaling and ensuring ordered delivery in distributed streaming systems.

Big DataConsumerKafka
0 likes · 31 min read
Kafka Basics: 15 Key Questions and In‑Depth Answers