DataFunTalk
Author

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

2.5k
Articles
2
Likes
7.3k
Views
1
Comments
Recent Articles

Latest from DataFunTalk

100 recent articles max
DataFunTalk
DataFunTalk
May 6, 2026 · Big Data

How Xiaohongshu Evolved Its Data Architecture for the Big AI Data Era

The article details Xiaohongshu's four‑stage data‑platform evolution—from a simple ClickHouse ad‑hoc setup to a Lambda‑based 2.0 design and finally a lakehouse‑driven 3.0 architecture—highlighting the adoption of general incremental compute, cost‑reduction to one‑third, performance gains of up to ten‑fold, and the SPOT standards that guide the new system.

Big DataClickHouseData Architecture
0 likes · 21 min read
How Xiaohongshu Evolved Its Data Architecture for the Big AI Data Era
DataFunTalk
DataFunTalk
May 6, 2026 · Artificial Intelligence

Why Palantir’s Ontology, Not Just Large Models, Drives Its Valuation Surge

In a 90‑minute round‑table, experts from banking risk control and cloud observability explain how Palantir’s ontology—viewed as the skeleton and memory that structures massive, heterogeneous data—bridges three data gaps, enables large‑model reasoning, and offers concrete steps for building practical knowledge graphs in enterprises.

Digital TwinPalantirdata modeling
0 likes · 16 min read
Why Palantir’s Ontology, Not Just Large Models, Drives Its Valuation Surge
DataFunTalk
DataFunTalk
May 5, 2026 · Artificial Intelligence

How Knora’s Ontology‑Enhanced AI Tackles Hallucinations and Execution Gaps in Enterprise Deployments

The article analyzes Knora 4.0, an ontology‑enhanced AI platform that combines large‑model capabilities with a structured knowledge graph to overcome hallucinations and execution gaps in enterprise deployments, detailing its architecture, autonomous agent Knora Claw, real‑world case studies, and a three‑year roadmap.

AI ArchitectureBusiness Automationautonomous agents
0 likes · 18 min read
How Knora’s Ontology‑Enhanced AI Tackles Hallucinations and Execution Gaps in Enterprise Deployments
DataFunTalk
DataFunTalk
May 5, 2026 · Artificial Intelligence

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

This article reviews cutting‑edge AI search and recommendation techniques—including Alibaba Cloud's Agentic RAG, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's generative ranking model GRAB—detailing their architectural evolution, multimodal retrieval strategies, GPU acceleration, and measured performance gains.

AI SearchAgentic RAGGPU Acceleration
0 likes · 6 min read
Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems
DataFunTalk
DataFunTalk
May 4, 2026 · Artificial Intelligence

Engineering and Algorithm Innovations for RAG Engines in Office Applications

This article analyzes the challenges and practical solutions of building a Retrieval‑Augmented Generation (RAG) system for office scenarios, covering background issues, modular architecture, offline and online pipelines, hybrid retrieval, ranking models, knowledge filtering, prompt design, and two‑stage generation techniques.

AIDocument ParsingHybrid Retrieval
0 likes · 22 min read
Engineering and Algorithm Innovations for RAG Engines in Office Applications
DataFunTalk
DataFunTalk
May 4, 2026 · Artificial Intelligence

Building a Semantic Foundation for Harness Engineering: Ontology‑Driven Controllable Agents

The article analyzes why current AI agents lack reliable control, defines a multi‑dimensional safety framework, and proposes an ontology‑driven architecture—implemented in the Knora platform—that embeds business rules directly into agents, enabling deterministic validation, auditability, and large‑scale efficiency gains.

AIAgentBusiness Control
0 likes · 17 min read
Building a Semantic Foundation for Harness Engineering: Ontology‑Driven Controllable Agents
DataFunTalk
DataFunTalk
May 3, 2026 · Industry Insights

What Does Palantir Really Represent? (Part 5)

The article dissects why China cannot replicate Palantir by analysing the four‑layer ecosystem—political, institutional, capability and cultural—that enabled Palantir’s subscription‑based, long‑term AI platform, and proposes alternative paths suited to China’s market and regulatory soil.

AI ontologyChinese enterprise softwareF‑share structure
0 likes · 15 min read
What Does Palantir Really Represent? (Part 5)
DataFunTalk
DataFunTalk
May 2, 2026 · Industry Insights

Why Palantir’s Ontology Fuels Its Valuation: The Skeleton and Memory Behind AI

In a 90‑minute round‑table, experts from banking risk control and cloud observability explain how Palantir’s ontology bridges three data gaps, turns raw logs into a graph of entities and relationships, and works with large models as a skeleton and memory to make AI trustworthy and scalable.

AI trustworthinessDigital TwinLarge Language Model
0 likes · 16 min read
Why Palantir’s Ontology Fuels Its Valuation: The Skeleton and Memory Behind AI
DataFunTalk
DataFunTalk
May 2, 2026 · Big Data

Building a One-Person Data Team: Core Skills of a Full‑Stack Data Engineer

The article examines why a single data engineer can run an end‑to‑end data team, outlines the essential abilities—semantic ownership, building an agentic data stack, and leveraging historical context—while discussing ChatBI’s limits, validation loops, and the open‑source Datus 0.3 harness for practical implementation.

ChatBIDatusFull-Stack Data Engineer
0 likes · 14 min read
Building a One-Person Data Team: Core Skills of a Full‑Stack Data Engineer