Tagged articles
29 articles
Page 1 of 1
DataFunTalk
DataFunTalk
May 27, 2026 · Industry Insights

Data Agent Tipping Point in 6‑12 Months? Xiaomi, Alibaba Cloud & Datastrato Discuss

The round‑table examines how Data Agent is moving from proof‑of‑concept to production, outlines its three‑stage evolution from NL2SQL to a general AI‑driven agent, highlights verification and semantic‑gap challenges, and presents expert views that the scaling tipping point could arrive within the next six to twelve months.

AIApache GravitinoData Agent
0 likes · 10 min read
Data Agent Tipping Point in 6‑12 Months? Xiaomi, Alibaba Cloud & Datastrato Discuss
DataFunTalk
DataFunTalk
May 21, 2026 · Databases

How the Agent Paradigm Is Redefining Enterprise Data Infrastructure

The article examines how the rise of AI agents is reshaping enterprise data infrastructure, tracing software evolution from rule‑based systems to lakehouses and arguing that real‑time OLAP engines with sub‑second latency, hybrid search, and semantic schemas will become the core of the new Agent‑centric stack.

AgentData InfrastructureHybrid Search
0 likes · 13 min read
How the Agent Paradigm Is Redefining Enterprise Data Infrastructure
Digital Planet
Digital Planet
Apr 23, 2026 · Industry Insights

Why Wusu Beer’s 35% Growth Fell to -8%: The Missed Opportunity of 50 Million Scan Users

After soaring 35% annual growth from 2016‑2022, Wusu Beer’s sales dropped 8.1% in 2025 despite holding 88 billion yuan revenue and 50 million scan‑code users, because the company treated QR‑based “物码” merely as a promotional gimmick instead of building a full‑chain digital infrastructure, leading to a data‑black‑box channel and stalled growth.

Data InfrastructureFMCGMarketing Analytics
0 likes · 16 min read
Why Wusu Beer’s 35% Growth Fell to -8%: The Missed Opportunity of 50 Million Scan Users
Machine Heart
Machine Heart
Apr 10, 2026 · Artificial Intelligence

Why Generalist’s Success Shifts Embodied AI Competition From Models to Infrastructure

The launch of Generalist AI’s GEN‑1 model demonstrates a breakthrough in success rate, speed and resilience, but the article argues that the true competitive frontier has moved from model performance to the underlying data, simulation and evaluation infrastructure that enables continuous learning and scalable testing for embodied intelligence.

AI modelsData InfrastructureEmbodied AI
0 likes · 12 min read
Why Generalist’s Success Shifts Embodied AI Competition From Models to Infrastructure
DataFunSummit
DataFunSummit
Mar 31, 2026 · Industry Insights

How SelectDB Overcomes the ‘Impossible Triangle’ in Real‑Time Automotive Data

The whitepaper explains how the explosive growth, multimodal nature, and real‑time collaboration demands of intelligent connected‑vehicle data create two “impossible triangles,” and how SelectDB’s three technical innovations—Index+Bitmap primary keys, Variant sparse columns, and hybrid full‑text/vector search—enable cost‑effective, high‑performance real‑time analytics across five automotive scenarios with proven case studies from leading OEMs.

AutomotiveData InfrastructureDatabase Innovation
0 likes · 17 min read
How SelectDB Overcomes the ‘Impossible Triangle’ in Real‑Time Automotive Data
DataFunTalk
DataFunTalk
Feb 2, 2026 · Artificial Intelligence

How Alluxio Boosts GPU Utilization to 99.57% for Embodied AI – Inside the MLPerf Success

This article explains how Alluxio’s distributed caching architecture tackles the massive, multimodal data challenges of embodied AI, delivers near‑zero‑millisecond access, achieves 99.57% GPU utilization in MLPerf Storage v2.0, and validates its value through real‑world enterprise deployments.

AI Data PlatformAlluxioData Infrastructure
0 likes · 21 min read
How Alluxio Boosts GPU Utilization to 99.57% for Embodied AI – Inside the MLPerf Success
Alibaba Cloud Observability
Alibaba Cloud Observability
Sep 29, 2025 · Artificial Intelligence

Building a Cloud‑Native Observability Stack for LLM Apps with Alibaba SLS

This article details the engineering practice of constructing a complete data infrastructure for large‑language‑model (LLM) applications using Alibaba Cloud SLS, covering the observability challenges of the Dify platform, the redesign of the architecture, and the resulting improvements in monitoring, diagnosis, and quality optimization.

Cloud NativeData InfrastructureDify
0 likes · 23 min read
Building a Cloud‑Native Observability Stack for LLM Apps with Alibaba SLS
AntTech
AntTech
Sep 13, 2025 · Artificial Intelligence

Why High‑Quality Data Is the New Breakthrough for Large‑Scale AI Models

At the 2025 Inclusion·Bund Conference forum, leading scholars and industry experts revealed how high‑quality data and AI form a dual‑engine that reshapes model training, improves performance, and drives the next evolution of intelligent systems.

AI training dataData InfrastructureMachine Learning
0 likes · 7 min read
Why High‑Quality Data Is the New Breakthrough for Large‑Scale AI Models
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 8, 2025 · Artificial Intelligence

From Generative to Agentic AI: Building AI‑Native Enterprise Applications

This article examines the rapid evolution of artificial intelligence—from Generative AI to Agentic AI—and explains how enterprises can adopt AI‑Native development models, address full‑stack challenges, upgrade data infrastructure, and leverage Chat BI and LangStudio platforms to create intelligent, data‑driven applications.

AI-nativeChat BIData Infrastructure
0 likes · 18 min read
From Generative to Agentic AI: Building AI‑Native Enterprise Applications
Data Thinking Notes
Data Thinking Notes
Dec 19, 2024 · Information Security

Unveiling 41 Official Data Terms: What They Mean for China’s Data Infrastructure

This article compiles the official definitions released by China’s National Data Bureau and other agencies for 41 data‑related terms, explains the concepts of data infrastructure, privacy‑preserving computing, trusted data spaces, and blockchain, and outlines how these definitions guide the nation’s data‑driven development strategy.

BlockchainData InfrastructurePrivacy Computing
0 likes · 25 min read
Unveiling 41 Official Data Terms: What They Mean for China’s Data Infrastructure
AntTech
AntTech
Dec 17, 2024 · Information Security

Jiangxi Secure Computing Summit Launches Industry Chain Initiative to Accelerate Data Security and Value Release

The Jiangxi "Secure Computing Industry Summit" convened on December 17, announcing a collaborative "Secure Computing Industry Chain Co‑construction Action" that unites leading institutions to promote privacy‑preserving computation technologies across finance, healthcare, and public data, aiming to safely unlock the value of data assets.

Data InfrastructureData SecurityIndustry collaboration
0 likes · 7 min read
Jiangxi Secure Computing Summit Launches Industry Chain Initiative to Accelerate Data Security and Value Release
21CTO
21CTO
Dec 4, 2024 · Artificial Intelligence

Why AI Skills Will Dominate Europe’s Tech Job Market by 2025

A new salary survey reveals a persistent talent shortage in Europe, highlighting explosive growth in demand for AI, automation, machine learning, and data‑infrastructure skills, while warning that a widening skills gap could hinder companies’ ability to meet hiring needs by 2025.

2025AIData Infrastructure
0 likes · 6 min read
Why AI Skills Will Dominate Europe’s Tech Job Market by 2025
StarRocks
StarRocks
Jul 24, 2024 · Big Data

Why Lakehouse Architecture Is Redefining Big Data Infrastructure in the AI Era

The article examines the rapid rise of lakehouse architecture, its market momentum, core components—including storage, metadata, table formats, and compute layers—compares Iceberg, Hudi, and Delta Lake, discusses the shift from HDFS to object storage, and outlines the strategic importance of lakehouses for AI-driven data management and future data infrastructure trends.

AIApache IcebergBig Data
0 likes · 28 min read
Why Lakehouse Architecture Is Redefining Big Data Infrastructure in the AI Era
DataFunSummit
DataFunSummit
Apr 5, 2024 · Big Data

HuoLala Big Data Infrastructure: Challenges, Practices, and Future Outlook

Senior big data engineer Zhu Yaogai from HuoLala shares the team’s three‑year journey, detailing background challenges, the construction of a multi‑layer big‑data infrastructure, solutions for cost efficiency, operational automation, heterogeneous computing, and future plans, illustrating how high cost‑effectiveness, operational efficiency, and analytical performance drive their evolution.

Cloud NativeData Infrastructureautomation
0 likes · 11 min read
HuoLala Big Data Infrastructure: Challenges, Practices, and Future Outlook
DataFunSummit
DataFunSummit
Jul 2, 2023 · Big Data

Building a One‑Stop AB Testing Platform at NetEase Cloud Music: Architecture, Metric Infrastructure, Scientific Evaluation, and Efficiency

The article describes how NetEase Cloud Music designed and deployed a comprehensive AB testing platform, covering system infrastructure, metric modeling, scientific experiment validation (including SRM mitigation and statistical power), and operational efficiency improvements to support rapid product iteration across multiple devices.

AB testingBig DataData Infrastructure
0 likes · 13 min read
Building a One‑Stop AB Testing Platform at NetEase Cloud Music: Architecture, Metric Infrastructure, Scientific Evaluation, and Efficiency
DataFunSummit
DataFunSummit
Jan 1, 2023 · Big Data

Shopee Data Infra Presentation: Storage Status, Acceleration, Serviceization, and Future Plans

The Shopee Data Infra talk details the current storage architecture, Presto‑based acceleration with Alluxio caching, service‑oriented storage solutions using Alluxio Fuse and S3 APIs, and outlines future enhancements for Spark/Hive integration and CSI/Fuse optimizations, providing a comprehensive view of large‑scale big data storage engineering.

AlluxioCache ManagerData Infrastructure
0 likes · 16 min read
Shopee Data Infra Presentation: Storage Status, Acceleration, Serviceization, and Future Plans
DataFunSummit
DataFunSummit
Jun 2, 2022 · Databases

An In‑Depth Overview of Apache BookKeeper: Architecture, Features, and Use Cases

This article provides a comprehensive technical overview of Apache BookKeeper, covering its role as a distributed append‑only log service, core concepts, high‑availability mechanisms, storage‑media evolution, comparisons with Raft, and community resources, while illustrating its use in Pulsar and large‑scale data platforms.

Apache BookKeeperData InfrastructureDistributed Log
0 likes · 12 min read
An In‑Depth Overview of Apache BookKeeper: Architecture, Features, and Use Cases
Architects Research Society
Architects Research Society
May 16, 2022 · Big Data

The Four Phases of Netflix’s Trillion‑Scale Real‑Time Data Infrastructure

This article chronicles Netflix’s evolution from a failing batch pipeline to a cloud‑native, multi‑tenant streaming platform across four phases, detailing the motivations, challenges, strategic bets, and patterns that enabled the company to scale real‑time data processing to trillions of events per day.

Data InfrastructureNetflixcloud-native
0 likes · 31 min read
The Four Phases of Netflix’s Trillion‑Scale Real‑Time Data Infrastructure
DataFunTalk
DataFunTalk
Jun 26, 2021 · Big Data

Building a Scalable Big Data Service System at Didi: Practices and Lessons

Zhang Liang shares Didi's four-stage journey of constructing and governing large‑scale open‑source big‑data engine services—including engine selection, hardware sizing, PaaS platform building, proxy architecture, and governance—highlighting practical challenges, solutions, and ROI‑driven best practices for Kafka, Elasticsearch, Flink, and related technologies.

Big DataData InfrastructureElasticsearch
0 likes · 16 min read
Building a Scalable Big Data Service System at Didi: Practices and Lessons
Tencent Cloud Developer
Tencent Cloud Developer
May 19, 2021 · Industry Insights

How Cloud‑Native Principles Transform Big Data Infrastructure

The article analyzes how cloud‑native concepts such as DevOps, micro‑services, continuous delivery, and containerization can be applied to big‑data foundations, outlining four guiding principles—industrialized delivery, cost quantification, load‑adaptive scaling, and data‑centric design—and describing concrete Hadoop‑based architectures and Tencent Cloud solutions that lower cost while boosting performance.

Big DataData InfrastructureHadoop
0 likes · 22 min read
How Cloud‑Native Principles Transform Big Data Infrastructure
DevOps
DevOps
Mar 10, 2021 · Artificial Intelligence

Ant Financial's Intelligent Middle Platform: AI Applications, Data Infrastructure, and Security Practices

This article presents Ant Financial's intelligent middle platform, detailing AI use cases such as risk control, wealth management, lending, marketing, insurance, and customer service, alongside the AI capability map, data foundation architecture, annotation workflows, security measures, and the overall impact on fintech innovation.

Data InfrastructureData SecurityFinTech
0 likes · 8 min read
Ant Financial's Intelligent Middle Platform: AI Applications, Data Infrastructure, and Security Practices
Big Data Technology Architecture
Big Data Technology Architecture
Feb 3, 2020 · Big Data

NetEase Data Foundation Platform Construction – Technical Sharing

This article, originally shared by NetEase’s data expert Jiang Hongxiang on DataFun, outlines the construction of NetEase’s data foundation platform, covering database kernel insights and the implementation of the ad‑hoc query engine Impala with the distributed storage system Kudu, offering valuable big‑data engineering practices.

Data InfrastructureData PlatformImpala
0 likes · 4 min read
NetEase Data Foundation Platform Construction – Technical Sharing
dbaplus Community
dbaplus Community
Nov 20, 2019 · Databases

Key Takeaways: Database Ecosystem & GaussDB Compatibility at Global Data Forum

The Global Data Infrastructure Forum in Shenzhen gathered industry leaders and academics to discuss database ecosystem building, GaussDB's new naming and compatibility certification, the ZnSQL and ZnAiops platforms, and Huawei's comprehensive digital transformation methodology encompassing strategic goals and a five‑step implementation framework.

Data InfrastructureEcosystemGaussDB
0 likes · 13 min read
Key Takeaways: Database Ecosystem & GaussDB Compatibility at Global Data Forum
21CTO
21CTO
Apr 4, 2016 · Big Data

How Asana Scaled Its Data Infrastructure: From MySQL to Redshift & Hadoop

This article details Asana's evolution from a simple Python‑MySQL setup to a robust, scalable data platform using Redshift, Hadoop, Luigi, and modern BI tools, highlighting challenges, solutions, and lessons learned for building reliable data pipelines in fast‑growing startups.

Big DataData InfrastructureETL
0 likes · 15 min read
How Asana Scaled Its Data Infrastructure: From MySQL to Redshift & Hadoop
dbaplus Community
dbaplus Community
Apr 3, 2016 · Big Data

How Asana Scaled Its Data Infrastructure: From MySQL to Redshift & Beyond

Facing rapid growth, Asana overhauled its data infrastructure—from a single‑machine MySQL setup to a Redshift‑backed warehouse, Hadoop‑based log processing, Luigi orchestration, and self‑service BI tools—highlighting the challenges, solutions, and future plans for scalable, reliable analytics.

Big DataData InfrastructureETL
0 likes · 16 min read
How Asana Scaled Its Data Infrastructure: From MySQL to Redshift & Beyond