AntData
Author

AntData builds on Ant Group's technical innovation and years of industry practice in big data, databases, and multimedia. Through long-term technology planning and continuous innovation, we strive to build world-class data technology and products.

26 articles · 0 likes · 86 views · 0 comments
Recent Articles

Latest from AntData

26 recent articles
AntData
May 14, 2026 · Artificial Intelligence

How RAG‑Powered DB‑GPT Enables Intelligent Marine‑Environment Queries with Text2SQL

The article presents a privately deployed DB‑GPT solution that combines Retrieval‑Augmented Generation (RAG) and Text2SQL to address three problems: low utilization of unstructured marine‑environment knowledge, difficulty querying across data sources, and security concerns. It details technology selection, implementation steps, and performance gains that cut query time from 30 minutes to 1‑3 minutes.

AI · DB-GPT · Knowledge retrieval
0 likes · 13 min read
AntData
Apr 28, 2026 · Artificial Intelligence

Iterative Agent Evaluation Skill: Automating Bad‑Case Diagnosis with AI Pre‑Annotation

The article presents an end‑to‑end, eight‑phase automated evaluation pipeline for large‑model agents that replaces manual bad‑case inspection with AI‑assisted pre‑annotation. It cuts analysis time from a full day to about 30 minutes, an efficiency gain of over 90%, and enables iterative knowledge‑base refinement.

AI Pre‑annotation · Agent Evaluation · Automated Pipeline
0 likes · 20 min read
AntData
Apr 17, 2026 · Industry Insights

5 Silver Rules That Made Dataphin‑MCP’s AI Platform Scale to 1M Calls in 9 Days

This article shares the practical lessons learned from building Dataphin‑MCP, an AI‑enabled data‑development platform, by outlining five concrete "silver" rules, illustrating each with real‑world cases, and discussing deeper considerations for building robust AI‑first tools and harnesses.

AI Platform · Agent Design · Concept modeling
0 likes · 13 min read
AntData
Dec 3, 2025 · Artificial Intelligence

How to Build and Refine Your Personal AI Agent Assistant

This article walks through turning a generic AI model into a personal assistant by explaining user‑centric workflows, crafting effective natural‑language prompts, adding clarification steps, validating AI‑generated results through multiple methods, and handling errors with product interactions to create a reliable, evolving assistant.

ChatBI · LLM · result validation
0 likes · 10 min read
AntData
Jul 8, 2025 · Artificial Intelligence

How RaBitQ Achieves 32× Vector Compression Without Sacrificing Accuracy

This article explains the challenges of high‑dimensional vector retrieval and introduces quantization techniques, especially the binary RaBitQ method and its MRQ extension. It details their compression ratios, speed gains, compatibility with search indexes, and real‑world performance results in the VSAG system.

AI embeddings · MRQ · Memory Optimization
0 likes · 15 min read
AntData
May 30, 2025 · Artificial Intelligence

DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution

The article presents an in‑depth overview of DeepInsight Copilot, an AI‑driven business intelligence product that streamlines data, information, insight, and decision‑recommendation stages, detailing its functional modules, intelligent agents, multi‑generation technical evolution, architecture, model fine‑tuning, and future challenges and solutions in data analysis.

AI · Data Analysis · Machine Learning
0 likes · 21 min read
AntData
May 20, 2025 · Artificial Intelligence

How Vector Retrieval Powers AI: Challenges, Solutions, and VSAG’s Open‑Source Breakthrough

The article examines the rapid growth of unstructured data and explains the fundamentals and resource‑intensive nature of vector retrieval. It presents Ant Group’s engineering practices, including hybrid HNSW‑DiskANN indexing, performance tricks like BSA pruning and memory prefetching, and sparse‑vector and feedback‑driven recall improvements, and outlines the open‑source VSAG roadmap and ecosystem integrations.

AI infrastructure · Open Source · Performance Optimization
0 likes · 18 min read
AntData
Apr 3, 2025 · Artificial Intelligence

Ray Flow Insight: Visualizing and Debugging Distributed AI Applications

Ray Flow Insight is an Ant Group open‑source tool that visualizes Ray's distributed programming primitives—Actors, Tasks, and Objects—to turn complex reinforcement‑learning systems from opaque "black boxes" into transparent, debuggable workflows, providing logical, physical, distributed stack, and flame‑graph views for performance analysis and optimization.

AI · Debugging · Distributed Systems
0 likes · 32 min read
AntData
Mar 20, 2025 · Big Data

Design and Optimization of Real‑time Data Lake Tables with Paimon and Flink for Advertising Diagnostics

This article presents a comprehensive exploration of using Apache Paimon and Flink to design lake tables that support minute‑level latency, low cost, and unified batch‑stream processing for advertising data, covering schema design, partitioning strategies, performance trade‑offs, cost analysis, and operational best practices.

Big Data · Flink · Paimon
0 likes · 34 min read
AntData
Mar 14, 2025 · Fundamentals

Analysis of DeepSeek 3FS Storage Service Architecture and Design

This article provides an in‑depth technical analysis of DeepSeek's open‑source 3FS distributed file system, focusing on the StorageService architecture, space pooling, allocation mechanisms, reference counting, fragmentation handling, and the RDMA‑based read/write data path.

RDMA · allocation · chunk management
0 likes · 15 min read