Tag

Distributed SQL

1 views collected around this technical thread.

DataFunTalk
DataFunTalk
Feb 12, 2023 · Big Data

Optimizing Bilibili Presto Cluster Query Performance with Alluxio and Local Cache

This article presents a comprehensive technical overview of Bilibili's Presto cluster architecture, the challenges of query performance on Hadoop, and the systematic optimizations—including Alluxio integration, local cache mechanisms, multi‑active coordinators, label‑based scheduling, and real‑time penalties—that together improve availability, stability, and latency for large‑scale analytics workloads.

AlluxioBig DataCache
0 likes · 23 min read
Optimizing Bilibili Presto Cluster Query Performance with Alluxio and Local Cache
Didi Tech
Didi Tech
Oct 9, 2020 · Big Data

Presto at Didi: Architecture, Optimizations, and Operational Experience

At Didi, Presto has been the default ad‑hoc and Hive‑SQL engine for over three years, serving 6,000 users, processing 2‑3 PB daily and 30‑35 trillion rows, with mixed and dedicated clusters, migration to PrestoSQL 340, extensive Hive compatibility, label‑based isolation, a native Druid connector, usability and stability enhancements, and JVM‑level performance optimizations, while planning further resource‑saving upgrades.

Big DataDistributed SQLDruid Connector
0 likes · 17 min read
Presto at Didi: Architecture, Optimizations, and Operational Experience
IT Architects Alliance
IT Architects Alliance
Aug 31, 2020 · Databases

Distributed SQL: Features, Core Characteristics, and Cloud-Native Requirements

The article explains the evolution of database architecture toward distributed SQL, outlining its seven core features such as scalability, consistency, elasticity, geo‑replication, SQL support, data locality, and multi‑cloud capability, while also describing essential database functions and practical considerations for cloud‑native deployments.

Cloud DatabasesDatabase ArchitectureDistributed SQL
0 likes · 9 min read
Distributed SQL: Features, Core Characteristics, and Cloud-Native Requirements
360 Tech Engineering
360 Tech Engineering
Jul 18, 2019 · Databases

Principles and Practices of Apache Doris: Architecture, Key Technologies, and Real‑World Use Cases

This article presents a comprehensive overview of Apache Doris, covering its positioning as a distributed MPP analytical database, core architecture with FE and BE nodes, key technologies such as vectorized execution and materialized views, integration with Kafka and Elasticsearch, additional features, roadmap, and detailed case studies from Baidu Statistics and Meituan, illustrating its practical deployment and performance characteristics.

Apache DorisData WarehouseDistributed SQL
0 likes · 25 min read
Principles and Practices of Apache Doris: Architecture, Key Technologies, and Real‑World Use Cases