DataFunSummit
Author

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

1.7k
Articles
0
Likes
5.4k
Views
0
Comments
Recent Articles

Latest from DataFunSummit

100 recent articles max
DataFunSummit
DataFunSummit
May 30, 2026 · Industry Insights

Where Is the Real Moat in the AI Era as Large Models Become Commoditized?

The article analyzes how the rapid commoditization of large‑model capabilities, illustrated by Palantir’s 85% Q1 2026 revenue growth, reshapes AI competition into three layers—model, wrapper, and infrastructure—highlighting ontology as the hard‑to‑copy moat for enterprise AI in high‑risk scenarios.

AI commoditizationAI infrastructurePalantir
0 likes · 11 min read
Where Is the Real Moat in the AI Era as Large Models Become Commoditized?
DataFunSummit
DataFunSummit
May 29, 2026 · Artificial Intelligence

Why the Overlooked Agent Harness Is the Real Reason AI Projects Fail

The article explains that the hidden infrastructure layer called Agent Harness—its OS‑like architecture, three‑layer abstraction, context‑rot problem, compounding error, and verification loops—determines whether impressive agent demos can survive in production, with concrete benchmarks showing harness improvements far outweigh model upgrades.

AI infrastructureAgent HarnessCompounding Error
0 likes · 14 min read
Why the Overlooked Agent Harness Is the Real Reason AI Projects Fail
DataFunSummit
DataFunSummit
May 28, 2026 · Artificial Intelligence

How DataWorks Data Agent Advances from Augmented Assistance to Full Autonomy

The article analyzes DataWorks Data Agent’s evolution from a helper‑style tool to an autonomous data‑centric AI agent, detailing its five‑stage roadmap, dual‑engine CLI/Claw architecture, unified runtime kernel, open skill ecosystem, and CPU‑GPU joint optimization for enterprise‑grade data automation.

AIAgent ArchitectureBig Data
0 likes · 12 min read
How DataWorks Data Agent Advances from Augmented Assistance to Full Autonomy
DataFunSummit
DataFunSummit
May 27, 2026 · Artificial Intelligence

From Text to Images: Building Multi‑Modal Product Search with Elasticsearch Serverless

This article walks through a complete multi‑modal product search solution that transforms textual and visual product data into embeddings, leverages dense, sparse and hybrid models, applies vector similarity and quantization techniques such as SQ and BBQ, and demonstrates how Elasticsearch Serverless provides a serverless, cost‑effective, auto‑scaling backbone for end‑to‑end retrieval.

AI Search Open PlatformElasticsearch ServerlessEmbedding
0 likes · 22 min read
From Text to Images: Building Multi‑Modal Product Search with Elasticsearch Serverless
DataFunSummit
DataFunSummit
May 27, 2026 · Industry Insights

Where Is the Real Moat in the AI Era as Large Models Become Commoditized?

The article analyzes Palantir's 2026 Q1 surge and argues that as large‑model capabilities become cheap commodities, true competitive advantage now lies in deep ontology‑based infrastructure that makes AI outputs trustworthy in high‑risk enterprise scenarios.

AIAI commoditizationPalantir
0 likes · 11 min read
Where Is the Real Moat in the AI Era as Large Models Become Commoditized?
DataFunSummit
DataFunSummit
May 27, 2026 · Artificial Intelligence

How Baidu’s “Sheng Suan” Turns Agents from Outsiders into Business‑Savvy Assistants

The article explains that most AI agents achieve only 80‑90% accuracy in read‑only tasks and cannot handle core production decisions, then details Baidu’s “Sheng Suan” platform which uses a three‑layer business ontology and system‑engineered sandbox, audit, and simulation features to enable agents to execute write operations, citing three real‑world cases where decision latency dropped from months to minutes and accuracy exceeded 95%.

AI agentsContext Engineeringbusiness ontology
0 likes · 8 min read
How Baidu’s “Sheng Suan” Turns Agents from Outsiders into Business‑Savvy Assistants
DataFunSummit
DataFunSummit
May 26, 2026 · Artificial Intelligence

Why Ontology Is the New Semantic Operating System for Large‑Model AI

The article argues that in the era of ever‑larger language models, enterprises lack a unified, computable, and evolvable semantic structure, and that ontology—recast as a semantic operating system—provides the necessary skeleton, guardrails, and actionable knowledge to make AI systems truly understand and execute business processes.

Open Sourceenterprise AIknowledge graph
0 likes · 17 min read
Why Ontology Is the New Semantic Operating System for Large‑Model AI
DataFunSummit
DataFunSummit
May 26, 2026 · Artificial Intelligence

Building an Evolvable Context Layer for Agents with ContextSearch

The article explains how ContextSearch transforms enterprise search from simple document retrieval into an Agentic, multi‑source, runtime‑driven context layer that can understand constraints, gather evidence, verify results, and continuously evolve through trace‑backed optimization.

ContextSearchDiskANNOpenSearch
0 likes · 14 min read
Building an Evolvable Context Layer for Agents with ContextSearch
DataFunSummit
DataFunSummit
May 25, 2026 · Big Data

How Hisense Built an AI‑Ready Multimodal Data Platform: Storage, Governance, and Development

This article details Hisense's journey to create an AI‑ready multimodal data platform, covering the challenges of integrating diverse business systems, the shift from a Hadoop‑based architecture to a cloud‑native data lake, the JuData governance and development platform, and six practical scenarios that demonstrate unified ingestion, metadata management, rule‑based quality control, intelligent asset retrieval, and future AI‑driven DataOps capabilities.

AI PlatformCloud NativeDataOps
0 likes · 23 min read
How Hisense Built an AI‑Ready Multimodal Data Platform: Storage, Governance, and Development
DataFunSummit
DataFunSummit
May 24, 2026 · Industry Insights

Why AI Agents Are Redefining Data Infrastructure Governance

The rise of AI agents as data consumers forces a fundamental shift in data infrastructure design, requiring unified metadata control, a robust semantic layer, and a governed agent access framework to replace traditional human‑centric RBAC models and ensure secure, auditable operations.

AI agentsAgentic Data ProtocolApache Gravitino
0 likes · 18 min read
Why AI Agents Are Redefining Data Infrastructure Governance