Tag

read optimization

0 views collected around this technical thread.

360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Oct 31, 2024 · Backend Development

Boosting Ozone Block Reads with gRPC Streaming: Up to 30% Faster

This article explains how a gRPC bidirectional streaming read method was added to Ozone to reduce chunk‑by‑chunk request gaps, describes the client‑side implementation, presents single‑ and multi‑threaded performance tests showing roughly 30% faster reads, and outlines future enhancements such as pre‑fetching.

OzonePerformanceStreaming
0 likes · 7 min read
Boosting Ozone Block Reads with gRPC Streaming: Up to 30% Faster
DataFunSummit
DataFunSummit
Jun 28, 2024 · Big Data

Apache Hudi from Zero to One – Part 2: Reading Process and Query Types (Spark Example)

This article explains how Apache Hudi integrates with Spark to read data, detailing the Spark‑SQL planning stages, the Spark‑Hudi read workflow, and the four main Hudi query types—snapshot, read‑optimized, time‑travel, and incremental—along with example SQL commands and code snippets.

Apache HudiBig DataData Lake
0 likes · 11 min read
Apache Hudi from Zero to One – Part 2: Reading Process and Query Types (Spark Example)
Big Data Technology Architecture
Big Data Technology Architecture
Jun 16, 2021 · Big Data

HBase Read and Write Performance Optimization Guide

This guide details practical server‑side and client‑side techniques for improving HBase read and write throughput, covering rowkey design, BlockCache configuration, HFile management, compaction tuning, scan cache sizing, bulkload usage, WAL policies, and SSD storage options.

Big DataDatabase TuningHBase
0 likes · 8 min read
HBase Read and Write Performance Optimization Guide
Big Data Technology Architecture
Big Data Technology Architecture
Apr 15, 2021 · Databases

HBase Read Performance Optimization: Best Practices and Tuning Guide

This article presents a comprehensive, practical guide to diagnosing and optimizing HBase read latency, covering common issues such as Full GC, region‑in‑transition, low write throughput, and high read delay, and offering client‑side, server‑side, column‑family, and HDFS tuning recommendations.

Big DataHBasePerformance Tuning
0 likes · 15 min read
HBase Read Performance Optimization: Best Practices and Tuning Guide
Youzan Coder
Youzan Coder
Feb 20, 2019 · Databases

HBase Read Path Analysis

The article first outlines HBase’s overall architecture and core components, then details the end‑to‑end read path—from client request routing to RegionServer processing, data organization and filtering—and finally presents practical client‑ and server‑side optimizations such as heterogeneous storage, HDFS short‑circuit, hedged reads, high‑availability reads, and warm‑up failure fixes, illustrated with Youzan’s production cluster.

HBaseTechnical Guidedistributed systems
0 likes · 17 min read
HBase Read Path Analysis