Big Data Technology & Architecture
Author

Big Data Technology & Architecture

Wang Zhiwu, a big data expert, dedicated to sharing big data technology.

1.0k
Articles
0
Likes
424
Views
0
Comments
Recent Articles

Latest from Big Data Technology & Architecture

100 recent articles max
Big Data Technology & Architecture
Big Data Technology & Architecture
Dec 10, 2025 · Big Data

What’s New in Apache Spark 4.0? Deep Dive into 2025 Core Updates

The 2025 release of Apache Spark 4.0 brings a comprehensive overhaul—including default ANSI SQL mode, full SQL scripting support, a new Real‑Time streaming mode, adaptive query execution, dynamic memory management, and GPU‑accelerated MLlib—significantly boosting performance, reliability, and developer productivity across big‑data workloads.

Apache SparkBig DataGPU Acceleration
0 likes · 9 min read
What’s New in Apache Spark 4.0? Deep Dive into 2025 Core Updates
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 28, 2025 · Big Data

What’s New in Apache Paimon 2025? Core Performance, AI Integration & Real‑Time Lakehouse Updates

The 2025 Apache Paimon release brings major performance boosts, AI‑centric multimodal storage, deeper streaming‑batch integration, and broader engine compatibility, detailing query and write optimizations, memory management tweaks, and a unified lake format for structured and unstructured data.

AI integrationApache PaimonBig Data
0 likes · 6 min read
What’s New in Apache Paimon 2025? Core Performance, AI Integration & Real‑Time Lakehouse Updates
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 27, 2025 · Artificial Intelligence

Rule‑Based NLQ vs LLMs: How ChatBI’s MQL Engine Delivers Precise BI Queries

The article explains how the rule‑based NLQ component of ChatBI replaces large language models with a detailed dictionary‑driven architecture, using a custom Metrics Query Language (MQL) to transform natural‑language business questions into accurate SQL, highlighting its stability, low cost, transparency, and limitations compared to LLM solutions.

Data QueryLLM comparisonMQL
0 likes · 12 min read
Rule‑Based NLQ vs LLMs: How ChatBI’s MQL Engine Delivers Precise BI Queries
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 17, 2025 · Big Data

Flink 2025 Updates: Disaggregated State, AI Agents, and SQL Enhancements

The 2025 Flink release introduces a disaggregated state management architecture for cloud‑native elasticity, AI‑driven Flink Agents with LLM, Memory and Tool support, Delta Join and VARIANT type for semi‑structured data, adaptive batch execution, incremental checkpoints, high‑speed network optimizations, and new SQL and Process Table Functions, reshaping real‑time analytics.

Disaggregated StateFlinkSQL Enhancements
0 likes · 8 min read
Flink 2025 Updates: Disaggregated State, AI Agents, and SQL Enhancements
Big Data Technology & Architecture
Big Data Technology & Architecture
Oct 30, 2025 · Backend Development

What’s New in Apache Kafka 4.1? Core Features and Architecture Changes Explained

Apache Kafka 4.1.0 introduces native queue semantics, a new Streams rebalancing protocol, multi‑version Connect plugins, a revamped consumer‑group protocol, enhanced transaction safety, and numerous client, monitoring, and security improvements, offering a comprehensive upgrade over the 4.0 release.

KafkaStreamingdistributed-systems
0 likes · 6 min read
What’s New in Apache Kafka 4.1? Core Features and Architecture Changes Explained
Big Data Technology & Architecture
Big Data Technology & Architecture
Oct 13, 2025 · Databases

Apache Doris 3.1 Unveiled: Variant, Index, and Lakehouse Boosts

The Apache Doris 3.1 release strengthens lake‑house capabilities with major upgrades to the VARIANT data type, vertical compaction, inverted index storage, new tokenizers, enhanced materialized view support for Iceberg/Paimon/Hudi, and numerous query‑performance optimizations such as faster partition pruning and dynamic partition clipping, offering smoother handling of thousands of columns and large‑scale semi‑structured data.

Apache DorisLakehouseVARIANT
0 likes · 8 min read
Apache Doris 3.1 Unveiled: Variant, Index, and Lakehouse Boosts
Big Data Technology & Architecture
Big Data Technology & Architecture
Oct 10, 2025 · Artificial Intelligence

Explore Cutting-Edge Open-Source AI Projects: Gemini CLI, AI Engineering Hub, and GPT‑5 Demos

This article introduces several noteworthy open‑source AI projects—including Google’s Gemini CLI, the AI‑Engineering‑Hub learning repository, and OpenAI’s GPT‑5 coding examples—providing URLs, key features, and visual previews to help developers quickly explore and adopt cutting‑edge AI tools.

AI Engineering HubArtificial IntelligenceGPT-5
0 likes · 3 min read
Explore Cutting-Edge Open-Source AI Projects: Gemini CLI, AI Engineering Hub, and GPT‑5 Demos