Tagged articles

240 articles

Page 1 of 3

May 25, 2026 · Industry Insights

Are You One of the 90% of Companies Trapped in the Digital “Death Spiral”?

The article reveals why most enterprises stall in the deep‑water stage of digital transformation—not because of lacking technology or budget, but due to four organizational bottlenecks: data integration, process optimization, BI adoption, and custom development, and it proposes a governance‑first approach to break the deadlock.

Organizational Changebusiness intelligencecustom development

0 likes · 9 min read

Are You One of the 90% of Companies Trapped in the Digital “Death Spiral”?

Old Zhao – Management Systems Only

May 20, 2026 · Operations

The 8 Essential Tables That Simplify Procurement Management

The article breaks down procurement into eight core tables—demand plan, purchase request, supplier management, price inquiry, purchase order, receipt inspection, inventory entry, and payment ledger—showing how structuring these forms a data‑driven closed loop that reduces risk, stabilizes inventory, and improves cash‑flow visibility.

ERPSupply Chaindata integration

0 likes · 10 min read

The 8 Essential Tables That Simplify Procurement Management

Old Zhao – Management Systems Only

May 18, 2026 · Operations

How I Built a Complete Supply‑Chain Visualization Dashboard in 2 Hours

The article walks through a step‑by‑step process for turning fragmented sales, procurement, production, inventory and shipping data into a single, real‑time supply‑chain dashboard using the 简道云 platform, highlighting data integration, three‑layer visual design and automated alerts that cut down firefighting and improve decision‑making.

AutomationOperationsSupply Chain

0 likes · 9 min read

How I Built a Complete Supply‑Chain Visualization Dashboard in 2 Hours

DataFunSummit

May 16, 2026 · Industry Insights

What Powers Palantir’s 137% Revenue Surge? Inside Its Ontology‑Based Enterprise AI Platform

Palantir’s Q4 2025 revenue jumped 70% to $14.07 billion, with U.S. commercial revenue soaring 137%, driven not merely by AI hype but by its Ontology‑centric approach that tightly integrates data, business logic, actions, and security, locking large enterprises into a deeply embedded decision‑making stack.

AI OpsEnterprise AIOntology

0 likes · 9 min read

What Powers Palantir’s 137% Revenue Surge? Inside Its Ontology‑Based Enterprise AI Platform

Digital Planet

May 12, 2026 · Industry Insights

Why Central SOEs Are Rushing into DRP – It’s More Complex Than It Looks

The Digitalized Resource‑management Platform (DRP) is being adopted en masse by central state‑owned enterprises as a strategic response to tighter regulatory oversight, the need for precise governance, and untapped data value, but its implementation faces legacy system overload, data‑standard fragmentation, and deep organizational resistance that demand strong leadership, cross‑departmental coordination, and phased, value‑driven execution.

DRPDigital GovernanceEnterprise Resource Management

0 likes · 14 min read

Why Central SOEs Are Rushing into DRP – It’s More Complex Than It Looks

DataFunSummit

May 2, 2026 · Artificial Intelligence

How Palantir’s 4‑Layer Ontology Architecture Enables Buildings, Tenants, and Data to ‘Talk’

Healthpeak transformed its commercial‑real‑estate operations by replacing fragmented spreadsheets with Palantir’s AI Platform (AIP), using a four‑layer architecture and ontology‑driven modeling to automate billing, detect anomalies, and orchestrate workflows, dramatically cutting manual effort, errors, and scaling costs.

AI Workflow AutomationCommercial Real EstateEnterprise AI

0 likes · 18 min read

How Palantir’s 4‑Layer Ontology Architecture Enables Buildings, Tenants, and Data to ‘Talk’

DataFunSummit

Apr 29, 2026 · Industry Insights

Beyond the Data Rear‑view Mirror: Palantir’s Strategic Value and Real‑World Cases

Palantir leverages its Ontology‑driven data integration and AI platforms—Gotham, Foundry, and AIP—to transform fragmented data into actionable intelligence, delivering decision‑making advantages in government, aerospace, food, and energy sectors, while shifting from custom‑heavy services to an open, platform‑based ecosystem.

AI PlatformAI agentsEnterprise AI

0 likes · 11 min read

Beyond the Data Rear‑view Mirror: Palantir’s Strategic Value and Real‑World Cases

SuanNi

Apr 27, 2026 · Artificial Intelligence

How MIT’s RUBICON Cuts AI Agent Costs by 90% While Achieving 100% Accuracy

The paper shows that conventional LLM agents fail on real‑world enterprise data because of chaotic data sources, while the RUBICON architecture uses a minimal Agentic Query Language to let users direct data retrieval, achieving 100% accuracy with a much cheaper model and dramatically lower token and monetary costs.

Agentic Query LanguageEnterprise AILLM Agents

0 likes · 11 min read

How MIT’s RUBICON Cuts AI Agent Costs by 90% While Achieving 100% Accuracy

DataFunSummit

Apr 26, 2026 · Industry Insights

Why Palantir AIP Is More Than a Data Platform – The Secret ‘Implementation Orchestration Machine’

The article analyzes how Palantir’s ontology‑driven platforms—Gotham, Foundry, and the 2023 AI Platform (AIP)—break data silos, enable real‑time decision making, and shift the company from custom‑heavy solutions to a low‑code, AI‑agent‑centric ecosystem, illustrated with military, aerospace, and retail case studies.

AI PlatformAIPEnterprise AI

0 likes · 10 min read

Why Palantir AIP Is More Than a Data Platform – The Secret ‘Implementation Orchestration Machine’

Old Zhao – Management Systems Only

Apr 23, 2026 · Operations

Supply Chain vs Logistics vs Procurement: Clear Differences Explained

The article clarifies why many companies confuse procurement, logistics, and supply chain, outlines each function’s specific tasks, shows how fragmented data and unclear boundaries cause order delays, and proposes a linear, data‑driven workflow that links demand, purchasing, inbound, outbound, and delivery for smoother operations.

LogisticsOperations ManagementProcess Flow

0 likes · 10 min read

Supply Chain vs Logistics vs Procurement: Clear Differences Explained

AI Large-Model Wave and Transformation Guide

Apr 22, 2026 · Industry Insights

How to Build a Scalable Ontology‑Driven Investigation Platform: A Full‑Stack Architecture Blueprint

This article dissects the design of an end‑to‑end investigation platform by breaking down its core capabilities, mapping a layered architecture, justifying open‑source component choices, detailing deployment topology, comparing gaps with the commercial Gotham solution, and outlining a phased implementation roadmap.

AIDevOpsGraph Database

0 likes · 12 min read

How to Build a Scalable Ontology‑Driven Investigation Platform: A Full‑Stack Architecture Blueprint

AI Large-Model Wave and Transformation Guide

Apr 22, 2026 · Fundamentals

Why Ontology Is the Hidden Grammar Behind Knowledge Graphs

The article explains that ontology is not merely a list of terms but a formal model defining concepts, relationships, and constraints, outlines three quality standards, shows how it enables data integration and reasoning, compares it with simple taxonomies, and warns of common misconceptions.

AI fundamentalsKnowledge GraphOntology

0 likes · 9 min read

Why Ontology Is the Hidden Grammar Behind Knowledge Graphs

DataFunTalk

Apr 21, 2026 · Industry Insights

How a Chinese Bank Used AI Large Models to Revolutionize Data Development

Facing siloed, tool‑fragmented, and low‑quality data pipelines, China Everbright Bank built an AI‑driven, end‑to‑end data integration platform that unifies heterogeneous databases, automates workflow checkpoints, and adds intelligent code quality checks, delivering faster, higher‑quality data services for the financial sector.

AIData DevelopmentFinancial Industry

0 likes · 8 min read

How a Chinese Bank Used AI Large Models to Revolutionize Data Development

Digital Planet

Apr 14, 2026 · Industry Insights

Why Most FMCG Channel Digitalization Projects Fail and How to Turn Data into Real Incentives

The article analyzes three fundamental pitfalls that cause FMCG channel digitalization projects to produce fake or delayed data, explains why binding sales incentives to real product flow is essential, and outlines a formula and four capability pillars to achieve true online sales expense management.

Channel DigitalizationFMCGdata integration

0 likes · 16 min read

Why Most FMCG Channel Digitalization Projects Fail and How to Turn Data into Real Incentives

Digital Planet

Mar 30, 2026 · Industry Insights

Can Master Kong’s New “One More Bottle” Campaign Reverse Its Decline? A Deep Dive into FMCG Digital Transformation

Facing its first annual revenue decline in a decade, Master Kong revives the classic “One More Bottle” promotion using a five‑code integration that links factories, distributors, stores, and consumers, offering a case study on how digital‑first, full‑chain strategies can rejuvenate legacy FMCG growth models in a saturated market.

FMCGMarketingSupply Chain

0 likes · 15 min read

Can Master Kong’s New “One More Bottle” Campaign Reverse Its Decline? A Deep Dive into FMCG Digital Transformation

Wukong Talks Architecture

Mar 5, 2026 · Databases

Unifying Card and Coin Payments: KaiwuDB’s Dual‑Mode Solution for Amusement Parks

This article presents a detailed technical case study of using KaiwuDB’s multi‑model database to unify card‑based and coin‑based payment processing in amusement parks, covering architecture, schema design, SQL implementations, offline handling, cross‑model analytics, hot‑cold data tiering, visualization, monitoring, security, and high‑availability strategies.

Amusement ParkDual-Mode PaymentsKaiwuDB

0 likes · 42 min read

Unifying Card and Coin Payments: KaiwuDB’s Dual‑Mode Solution for Amusement Parks

AI Large Model Application Practice

Feb 19, 2026 · Artificial Intelligence

When Should You Add a Knowledge Graph? 6 Practical Decision Criteria

This article outlines six concrete criteria—relationship‑centric data, reproducible reasoning, evolving schemas, multi‑hop queries, explainable decisions, and cross‑system data integration—to help engineers decide whether a knowledge graph is the right solution or if a relational database will suffice.

AI EngineeringKnowledge GraphReasoning

0 likes · 15 min read

When Should You Add a Knowledge Graph? 6 Practical Decision Criteria

Fighter's World

Feb 7, 2026 · Artificial Intelligence

Who Will Capture the Trillion‑Dollar Value of Context Graphs?

The article analyzes why Context Graphs can unlock trillion‑dollar value by unifying heterogeneous enterprise systems, how platform‑level compounding effects outpace vertical AI agents, the strategic advantage of data companies in cross‑system integration, and why open standards and unified Context layers will decide the market winners.

AI agentsContext GraphEnterprise AI

0 likes · 25 min read

Who Will Capture the Trillion‑Dollar Value of Context Graphs?

Big Data Tech Team

Jan 19, 2026 · Big Data

What Is Data Fabric and How It Can Eliminate Data Silos Today

This article explains the concept of Data Fabric, debunks common misconceptions, outlines the three key drivers behind its rise, and provides a practical four‑step roadmap—including metadata, semantic layers, policy engines, and AI—to help teams of any size adopt the technology.

AIData FabricMetadata Management

0 likes · 7 min read

What Is Data Fabric and How It Can Eliminate Data Silos Today

Old Zhao – Management Systems Only

Jan 13, 2026 · Operations

How to Build a Lightweight Supply‑Chain Visualization System in Under Two Hours

This article walks through a practical, step‑by‑step case study of creating a lightweight supply‑chain visualization system for small manufacturers, covering problem definition, data unification, dashboard design, automated collaboration rules, pilot testing, and actionable rollout recommendations.

OperationsSMEdata integration

0 likes · 8 min read

How to Build a Lightweight Supply‑Chain Visualization System in Under Two Hours

JavaEdge

Jan 3, 2026 · Blockchain

Why Oracles Are Essential for Real‑Time On‑Chain Data: Methods & Alternatives

Oracles serve as the crucial bridge that enables smart contracts to access off‑chain data, and while they are the dominant solution for real‑time on‑chain updates, the article also explores alternative approaches such as centralized data entry, state channels, sidechains, and cross‑chain oracles, outlining their pros, cons, and challenges.

DecentralizedOracledata integration

0 likes · 6 min read

Why Oracles Are Essential for Real‑Time On‑Chain Data: Methods & Alternatives

Alibaba Cloud Big Data AI Platform

Dec 30, 2025 · Big Data

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

StarRocks and Apache Paimon have been progressively integrated across multiple releases, enabling a unified lakehouse architecture that supports multi-source federated analysis, time-travel queries, native readers/writers, distributed planning, and advanced profiling, while delivering performance gains that bring Paimon query speed on par with native StarRocks tables.

Apache PaimonLakehousePerformance Optimization

0 likes · 9 min read

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

Old Meng AI Explorer

Dec 25, 2025 · Industry Insights

How Open-Source OpenBB Terminal Gives You Bloomberg‑Level Analysis for Free

OpenBB Terminal is a free, open‑source financial analysis platform that consolidates over 500 data sources, offers AI‑driven report generation, one‑click industry comparisons, and local Docker deployment, enabling individual investors and small institutions to perform Bloomberg‑level research, quantitative backtesting, and secure data handling without costly subscriptions.

AIDockerFinancial Analysis

0 likes · 10 min read

How Open-Source OpenBB Terminal Gives You Bloomberg‑Level Analysis for Free

DataFunSummit

Nov 9, 2025 · Artificial Intelligence

How Zilliz Cut an 8‑Minute Sales Lead Process to Seconds with AI‑Powered Dify

This article recounts how Zilliz leveraged the low‑code platform Dify to integrate large‑model AI, private data, and business logic, transforming an eight‑minute, manual sales‑lead workflow into a seconds‑level automated pipeline and illustrating a new human‑AI collaboration paradigm.

AILarge ModelsLow‑code

0 likes · 14 min read

How Zilliz Cut an 8‑Minute Sales Lead Process to Seconds with AI‑Powered Dify

BirdNest Tech Talk

Oct 11, 2025 · Artificial Intelligence

How to Load Documents into LangChain: From Files to APIs

Learn how to use LangChain's Document Loaders to import data from files, web pages, databases, and APIs, understand the Document object structure, compare load() versus lazy_load(), and follow a step‑by‑step Python example that demonstrates loading, inspecting, and optionally processing documents with an LLM.

Document LoaderLLMLangChain

0 likes · 12 min read

How to Load Documents into LangChain: From Files to APIs

Selected Java Interview Questions

Oct 7, 2025 · Backend Development

How to Build a High‑Performance Doris Stream Load Service in Java

This guide walks through the complete architecture, Maven dependencies, configuration classes, annotation‑driven field mapping, utility mappers, a parallel Stream Load core, response handling, and performance tuning for integrating Apache Doris with a Spring Boot backend.

JavaSpring BootStream Load

0 likes · 23 min read

How to Build a High‑Performance Doris Stream Load Service in Java

DataFunTalk

Aug 26, 2025 · Artificial Intelligence

Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide

This resource guide presents a curated list of cutting‑edge topics—including multimodal GraphRAG, knowledge‑graph‑driven large‑model applications in finance, traditional Chinese medicine, automotive manufacturing, and knowledge‑management trends—offering insights into AI‑powered knowledge services, and invites readers to scan the QR code to download the full e‑book.

AIKnowledge GraphLarge Language Model

0 likes · 2 min read

Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide

360 Tech Engineering

Aug 12, 2025 · Artificial Intelligence

How Knowledge Graphs Are Reinventing AI Security: Insights from ISC.AI 2025

At the 13th ISC.AI 2025 Knowledge Graphs Reshaping Intelligent Security Summit in Beijing, leading experts from academia and industry highlighted how knowledge graphs enhance AI model accuracy, explainability, and trust, offering comprehensive data integration and risk monitoring to fortify intelligent systems across sectors.

Knowledge Graphdata integrationrisk monitoring

0 likes · 6 min read

How Knowledge Graphs Are Reinventing AI Security: Insights from ISC.AI 2025

360 Zhihui Cloud Developer

Jul 22, 2025 · Big Data

How Apache SeaTunnel Revolutionizes Heterogeneous Data Integration with Decoupled Connectors

This article explores how Apache SeaTunnel addresses modern data integration challenges by providing a high‑performance, distributed, plugin‑based platform that decouples connectors from execution engines, enabling seamless batch and streaming synchronization across heterogeneous sources such as databases, message queues, and data lakes.

Apache SeaTunnelBatch ProcessingConnector Architecture

0 likes · 24 min read

How Apache SeaTunnel Revolutionizes Heterogeneous Data Integration with Decoupled Connectors

Code Ape Tech Column

Jul 8, 2025 · Backend Development

Mastering Spring Batch: Real-World Use Cases and Hands‑On Guide

This comprehensive guide explains why batch processing is essential, walks through typical banking, e‑commerce, logging and medical data scenarios, details Spring Batch's core architecture and components, provides step‑by‑step setup and code examples, and presents a production‑grade bank reconciliation case with monitoring and troubleshooting tips.

Batch ProcessingJob SchedulingPerformance Optimization

0 likes · 27 min read

Mastering Spring Batch: Real-World Use Cases and Hands‑On Guide

Alibaba Cloud Developer

Jun 13, 2025 · Artificial Intelligence

Accelerate Enterprise Data Insights with Alibaba Cloud Hologres and AI Agents

Learn how to rapidly build an intelligent data analysis agent by integrating multi‑source data through Alibaba Cloud Hologres, leveraging Bailei’s AI model service and the serverless Function AI platform, covering architecture, step‑by‑step deployment, verification, and resource cleanup for cost‑effective, real‑time business insights.

AIAlibaba CloudHologres

0 likes · 8 min read

Accelerate Enterprise Data Insights with Alibaba Cloud Hologres and AI Agents

Alibaba Cloud Big Data AI Platform

Jun 11, 2025 · Big Data

Sync MaxCompute Tables to Milvus with DataWorks: Step‑by‑Step Guide

This guide explains how to use Alibaba Cloud DataWorks to create the necessary resources, configure Milvus and MaxCompute data sources, set up an offline single‑table synchronization task, and verify the imported vectors, enabling efficient AI‑driven vector search on large structured datasets.

Big DataDataWorksMaxCompute

0 likes · 8 min read

Sync MaxCompute Tables to Milvus with DataWorks: Step‑by‑Step Guide

Java Captain

Jun 10, 2025 · Backend Development

Why Spring Batch? Real‑World Scenarios, Core Architecture and Hands‑On Guide

This article explains the necessity of batch processing, presents typical use cases such as daily interest calculation, e‑commerce order archiving, log analysis and medical data migration, then dives deep into Spring Batch's core components, provides step‑by‑step code examples, performance‑tuning tips, production‑grade fault‑tolerance, monitoring solutions and a comprehensive FAQ.

Batch ProcessingJavaMonitoring

0 likes · 20 min read

Why Spring Batch? Real‑World Scenarios, Core Architecture and Hands‑On Guide

DataFunSummit

Jun 2, 2025 · Artificial Intelligence

Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs

This article explains how the rapid development of large language models and knowledge graph technologies creates new opportunities for enterprise knowledge management, outlines the challenges of massive unstructured data, describes the architecture and core data flow of a corporate knowledge brain, and showcases key technologies and real‑world applications.

AI ArchitectureEnterprise AIKnowledge Graph

0 likes · 13 min read

Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs

Data Thinking Notes

May 5, 2025 · Artificial Intelligence

How MCP’s Text2SQL Service Turns Natural Language into Powerful Database Queries

This article explores the MCP platform’s data service capabilities, detailing its core components—Resources, Prompts, and Tools—and demonstrates how its Text2SQL feature enables natural‑language queries to retrieve table schemas, perform data sampling, and execute complex relational analyses across multiple database tables.

AIDatabaseLLM

0 likes · 7 min read

How MCP’s Text2SQL Service Turns Natural Language into Powerful Database Queries

Big Data Tech Team

Apr 21, 2025 · Industry Insights

8 Practical Ways DeepSeek Boosts Data Quality for Better Governance

This guide outlines eight concrete methods DeepSeek uses to improve data quality—including automated cleaning, validation, classification, monitoring, governance standards, anomaly detection, integration, and intelligent analysis—providing actionable steps for organizations to enhance data accuracy, completeness, consistency, and usability.

Data cleaningDeepSeekdata integration

0 likes · 5 min read

8 Practical Ways DeepSeek Boosts Data Quality for Better Governance

Big Data Technology & Architecture

Apr 15, 2025 · Big Data

Designing a Lakehouse with Doris and Paimon: Query Acceleration and Unified Modeling

This article summarizes how the Doris‑Paimon lakehouse architecture leverages Doris' high‑performance OLAP engine to accelerate lake queries, provides a unified data analysis gateway, supports unified data integration, and enables open, layered data modeling for modern big‑data workloads.

Big DataPaimonQuery Acceleration

0 likes · 9 min read

Designing a Lakehouse with Doris and Paimon: Query Acceleration and Unified Modeling

DataFunSummit

Apr 1, 2025 · Big Data

Understanding Flink CDC 3.3: Features, Improvements, and Future Plans

This article provides a comprehensive overview of Flink CDC 3.3, detailing its CDC fundamentals, new connectors, Transform module enhancements, asynchronous snapshot splitting, community adoption, and upcoming roadmap for broader ecosystem support and batch‑mode execution.

Big DataCDCChange Data Capture

0 likes · 15 min read

Understanding Flink CDC 3.3: Features, Improvements, and Future Plans

Alibaba Cloud Big Data AI Platform

Mar 25, 2025 · Big Data

How to Connect EMR Serverless Spark with Apache Doris for Seamless Data Processing

This guide explains how to integrate EMR Serverless Spark with the high‑performance Apache Doris analytical database, covering prerequisites, connector download, OSS upload, network configuration, table creation, and both SQL‑session and Notebook examples for reading and writing Doris tables.

Apache DorisBig DataEMR Serverless Spark

0 likes · 11 min read

How to Connect EMR Serverless Spark with Apache Doris for Seamless Data Processing

Alibaba Cloud Big Data AI Platform

Mar 20, 2025 · Big Data

How to Read and Write StarRocks Data with EMR Serverless Spark

This step‑by‑step guide explains how to use EMR Serverless Spark together with the StarRocks Spark Connector to create a workspace, upload the connector JAR, configure network connections, create databases and tables in StarRocks, and perform read/write operations via SQL sessions, Notebook sessions, or batch Spark jobs, complete with code examples and UI screenshots.

Big DataEMR ServerlessSpark

0 likes · 14 min read

How to Read and Write StarRocks Data with EMR Serverless Spark

AI Product Manager Community

Feb 17, 2025 · Product Management

How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool

In today’s fast‑changing market, traditional product planning falls short, so this article explains how AI‑powered data integration, predictive analytics, and dynamic feedback loops can create a real‑time, data‑driven product roadmap, detailing three implementation phases—data unification, intelligent analysis, and continuous adjustment—with practical steps for product managers.

AIRoadmapdata integration

0 likes · 8 min read

How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool

DataFunSummit

Feb 9, 2025 · Big Data

Modern Data Stack on Alibaba Cloud Using Flink CDC: Architecture, Features, and Use Cases

This article presents a comprehensive overview of Alibaba Cloud's modern data stack built on Flink CDC, detailing its core concepts, extended capabilities, typical application scenarios, performance optimizations, a live demo, and future development plans for large‑scale streaming data integration.

Alibaba CloudBig DataFlink CDC

0 likes · 13 min read

Modern Data Stack on Alibaba Cloud Using Flink CDC: Architecture, Features, and Use Cases

dbaplus Community

Feb 7, 2025 · Artificial Intelligence

How Anthropic’s Model Context Protocol Enables Natural‑Language Database Access

The article explains Anthropic’s Model Context Protocol (MCP), a universal interface that lets AI assistants like Claude query and manipulate data sources—including SQLite databases—using plain natural language, and compares it with existing tools such as Chat2DB.

Artificial IntelligenceClaudeMCP

0 likes · 8 min read

How Anthropic’s Model Context Protocol Enables Natural‑Language Database Access

Alibaba Cloud Big Data AI Platform

Jan 27, 2025 · Big Data

Unlock Real-Time Data Sync with Flink CDC: YAML Integration, Transform & Route Explained

This article summarizes an advanced Flink CDC presentation, covering Flink CDC fundamentals, real‑time Flink integration, CDC‑YAML core capabilities, supported sync links, Transform and Route modules, monitoring metrics, schema‑change strategies, typical use cases, performance optimizations, demo implementations, and future development plans.

CDCFlinkYAML

0 likes · 20 min read

Unlock Real-Time Data Sync with Flink CDC: YAML Integration, Transform & Route Explained

Alibaba Cloud Big Data AI Platform

Jan 23, 2025 · Big Data

How Alibaba Cloud DataWorks Leverages Flink CDC for Scalable Data Lake Integration

Alibaba Cloud DataWorks’ Data Integration platform, built on Flink CDC, offers a comprehensive, serverless solution for real‑time and batch data lake ingestion, detailing its architecture, elastic scaling, productized use cases, and future roadmap, including AI‑driven diagnostics and expanded source support.

Big DataFlink CDCServerless

0 likes · 12 min read

How Alibaba Cloud DataWorks Leverages Flink CDC for Scalable Data Lake Integration

Alibaba Cloud Big Data AI Platform

Jan 21, 2025 · Big Data

Master Flink CDC YAML: Real‑Time Data Integration Best Practices in 10 Minutes

This article introduces Flink CDC YAML, outlines its core capabilities and application scenarios, compares it with SQL and DataStream jobs, showcases enterprise‑grade features of Alibaba Cloud Flink CDC, and provides a step‑by‑step tutorial to build a complete CDC YAML job in just ten minutes.

CDCFlinkYAML

0 likes · 20 min read

Master Flink CDC YAML: Real‑Time Data Integration Best Practices in 10 Minutes

Bilibili Tech

Nov 26, 2024 · Big Data

Bilibili’s Iceberg‑Based Streaming‑Batch Integration: Architecture, Optimizations, and Practices

Bilibili migrated its massive user‑behavior, commercial AI training, and database synchronization pipelines from Hive and Kafka to an Iceberg‑based streaming‑batch architecture, using Flink and the Magnus optimizer to achieve minute‑level freshness, reduce CPU and memory usage by about 20‑22 %, save roughly 3.55 M CNY annually, and dramatically improve query latency and join performance.

BatchFlinkIceberg

0 likes · 20 min read

Bilibili’s Iceberg‑Based Streaming‑Batch Integration: Architecture, Optimizations, and Practices

Data Thinking Notes

Nov 5, 2024 · Big Data

How a Next‑Gen Data Management Platform Boosts Efficiency and Innovation

This article outlines the motivations, objectives, and architectural design of a next‑generation data management platform, detailing its four‑layer “four‑ization” approach, core services such as data integration, modeling, API provisioning, componentization, as well as governance, security, and operational best practices.

Big DataData GovernanceData Platform

0 likes · 20 min read

How a Next‑Gen Data Management Platform Boosts Efficiency and Innovation

AsiaInfo Technology: New Tech Exploration

Nov 4, 2024 · Big Data

How Apache SeaTunnel Redefines Data Integration for Modern Data Platforms

This article reviews the evolution of data‑integration architectures toward EtLT, explains the core capabilities of Apache SeaTunnel, and details how a Chinese data‑platform vendor applied and extended SeaTunnel to simplify batch and streaming ingestion, unify multi‑engine processing, and reduce development and operational costs.

Apache SeaTunnelBig DataConnector Development

0 likes · 17 min read

How Apache SeaTunnel Redefines Data Integration for Modern Data Platforms

DataFunSummit

Nov 1, 2024 · Big Data

DataFun Summit Session Overview and E‑book Access Instructions

The article outlines how to obtain the DataFun Summit e‑book by following the public account instructions and provides concise English summaries of twelve technical sessions covering data lineage, integration, AI language models, multimodal content, game AI agents, lake‑warehouse governance, big‑data architecture, and cluster management.

AIBig DataDataOps

0 likes · 5 min read

DataFunSummit

Oct 27, 2024 · Artificial Intelligence

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

This article describes Siemens' journey in applying generative AI and Retrieval‑Augmented Generation to create an internal knowledge chatbot, detailing the business challenges, technical architecture, data integration, multi‑modal capabilities, deployment outcomes, and strategic lessons for enterprise AI adoption.

AI chatbotEnterprise Knowledge ManagementLarge Language Model

0 likes · 21 min read

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

macrozheng

Sep 27, 2024 · Big Data

Master DataX: Efficient Offline Data Sync for Heterogeneous Sources

This guide walks through the challenges of synchronizing massive datasets across heterogeneous databases, introduces Alibaba's open‑source DataX tool, explains its framework‑plugin architecture, and provides step‑by‑step instructions—including environment setup, installation, job configuration, and both full and incremental MySQL synchronization—complete with code examples and performance metrics.

Big DataDataXETL

0 likes · 15 min read

Master DataX: Efficient Offline Data Sync for Heterogeneous Sources

Data Thinking Notes

Sep 9, 2024 · Fundamentals

Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform

This guide outlines a practical six‑step methodology—covering overall planning, data integration, model construction, data development, asset management, and data services—to help enterprises build a robust data middle platform that unlocks business value and supports agile digital transformation.

Data GovernanceData PlatformEnterprise Architecture

0 likes · 10 min read

Master the 6‑Step Blueprint for Building an Enterprise Data Middle Platform

Volcano Engine Developer Services

Aug 8, 2024 · Databases

How ByConity Powers Real‑Time Telecom Data Analytics: A Deep Dive

This article details Haijing Technology's challenges with real‑time telecom data analysis, explains why traditional Hadoop and ClickHouse solutions fell short, and shows how ByConity's unified engine, multi‑table joins, and elastic scaling enable efficient, low‑latency analytics across complex B‑O‑M domains.

Big DataByConityMPP database

0 likes · 10 min read

How ByConity Powers Real‑Time Telecom Data Analytics: A Deep Dive

Ops Development & AI Practice

Aug 7, 2024 · Artificial Intelligence

How ChatGPT’s New JSON Output Transforms AI Integration

This article examines OpenAI's recent ChatGPT API update that adds JSON‑formatted responses, detailing the technical background, implementation steps, example requests and responses, and the broader impact on developers, enterprises, and future AI applications.

APIArtificial IntelligenceChatGPT

0 likes · 10 min read

How ChatGPT’s New JSON Output Transforms AI Integration

Data Thinking Notes

Jul 29, 2024 · Big Data

What Is a Data Middle Platform and How Does It Transform Enterprise Data Management?

This article explains the concept, design principles, and core components of a data middle platform, detailing its overall, functional, layered, logical, and data architectures, as well as the specific platforms for data collection, processing, organization, governance, quality, sharing, and visualization, illustrated with diagrams.

Big DataData ArchitectureData Governance

0 likes · 27 min read

What Is a Data Middle Platform and How Does It Transform Enterprise Data Management?

DaTaobao Tech

Jul 8, 2024 · Big Data

ODPS (MaxCompute) SQL Basics, Data Integration and Hologres Import Guide

This guide provides a comprehensive, beginner‑to‑advanced reference for ODPS (MaxCompute) SQL, covering table creation, DDL/DML commands, query syntax, join hints, MySQL‑to‑ODPS synchronization, one‑click and custom imports into Hologres, and scheduling variables for automated data pipelines.

ETLHologresODPS

0 likes · 37 min read

ODPS (MaxCompute) SQL Basics, Data Integration and Hologres Import Guide

DataFunSummit

Jun 14, 2024 · Big Data

JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Product Evolution

This article presents JD Logistics' one‑stop agile BI platform, detailing the complex data sources, rapid business demands, the UData solution architecture, performance and usability improvements, and future upgrade plans that together enable faster data integration, self‑service reporting, and enhanced decision‑making across the organization.

Agile AnalyticsBIBig Data

0 likes · 25 min read

JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Product Evolution

DataFunTalk

May 13, 2024 · Big Data

Data Integration Maturity Model: From ETL to EtLT

The article examines the evolution of data integration architectures—from traditional ETL through ELT to the emerging EtLT model—highlighting their advantages, disadvantages, industry trends, maturity stages, and practical guidance for enterprises and professionals navigating modern big‑data pipelines.

Big DataDataOpsELT

0 likes · 31 min read

Data Integration Maturity Model: From ETL to EtLT

DataFunTalk

May 8, 2024 · Big Data

Risk Control and Data Application in the Bulk Commodity Industry: Challenges, Solutions, and Core Capabilities

The article presents Ant Group's exploration of applying its data‑driven risk control and credit assessment capabilities to the traditional bulk commodity sector, detailing industry background, data pain points, core technical solutions, and the construction of a secure, explainable data‑model platform for digital transformation.

AIBig DataBulk Industry

0 likes · 13 min read

Risk Control and Data Application in the Bulk Commodity Industry: Challenges, Solutions, and Core Capabilities

21CTO

Apr 28, 2024 · Artificial Intelligence

5 Transformative Business Use Cases for Conversational AI

This article explores how conversational AI, powered by large language models, is reshaping enterprise operations across five key scenarios—from customer support assistants and AI‑driven data interfaces to HR bots, unstructured data processing, and multi‑agent digital assistants—highlighting benefits, implementation considerations, and privacy challenges.

Conversational AILarge Language Modelsbusiness applications

0 likes · 13 min read

5 Transformative Business Use Cases for Conversational AI

Data Thinking Notes

Apr 9, 2024 · Big Data

What Is a Data Middle Platform and Why It’s Essential for Modern Enterprises

Data middle platforms transform raw enterprise data into reusable assets by integrating collection, storage, processing, governance, and service layers, enabling faster deployment, consistent metrics, improved data quality, and business value across digital transformation, while addressing challenges like siloed data, low efficiency, and inconsistent standards.

Big DataData GovernanceData Platform

0 likes · 23 min read

What Is a Data Middle Platform and Why It’s Essential for Modern Enterprises

DataFunSummit

Apr 7, 2024 · Big Data

Li Auto’s Flink on Kubernetes Data Integration Practice

This article presents Li Auto’s end‑to‑end data integration journey, detailing the evolution of its data platform, the challenges of heterogeneous sources, and how a unified Flink‑on‑K8s solution with cloud‑native architecture, operator management, monitoring, and checkpointing addresses batch‑stream convergence and future scalability.

Batch ProcessingBig DataFlink

0 likes · 12 min read

Li Auto’s Flink on Kubernetes Data Integration Practice

DataFunTalk

Mar 1, 2024 · Big Data

Understanding Data Fabric and Data Virtualization: Concepts, Practices, and Real‑World Case Study

This article explains the fundamentals of Data Fabric and data virtualization, highlights the limitations of traditional centralized data warehouses, describes the three‑layer virtualization architecture, and presents a detailed securities‑industry case study that demonstrates cost, efficiency, and compliance benefits.

Big DataData FabricETL

0 likes · 17 min read

Understanding Data Fabric and Data Virtualization: Concepts, Practices, and Real‑World Case Study

DataFunTalk

Feb 23, 2024 · Artificial Intelligence

Challenges and Opportunities in Applying Large‑Model AI to Healthcare

The article analyzes how large‑model medical AI is rapidly adopted yet struggles with implementation due to doctor shortages, behavioral resistance, data silos, safety regulations, and the need for strategic alignment, while contrasting the more supportive innovation ecosystem in the United States.

AI adoptionHealthcare Innovationdata integration

0 likes · 6 min read

Challenges and Opportunities in Applying Large‑Model AI to Healthcare

DataFunSummit

Feb 20, 2024 · Big Data

BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook

This article introduces ByteDance's open‑source data integration engine BitSail, covering its background, layered architecture, recent feature enhancements, automated testing framework, CDC‑based full‑library synchronization solutions, and future development plans for connectors and real‑time data consistency.

Big DataCDCDistributed Systems

0 likes · 12 min read

BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook

DataFunTalk

Feb 17, 2024 · Big Data

JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Optimization

This article presents JD Logistics' one‑stop agile BI platform, detailing the complex data sources, rapid requirement changes, and Chinese‑style reporting challenges it addresses, while outlining the UData solution, product methodology, performance enhancements, and real‑world case studies that demonstrate significant efficiency gains.

Agile AnalyticsBIBig Data

0 likes · 26 min read

JD Logistics One‑Stop Agile BI Solution: Architecture, Challenges, and Optimization

DataFunSummit

Feb 5, 2024 · Artificial Intelligence

Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models

Ant Group shares its comprehensive knowledge graph initiatives, detailing the fundamentals, construction pipeline, fusion techniques, cognitive representations, diverse business applications, and the emerging synergy between knowledge graphs and large language models, illustrating how graph-based AI enhances accuracy, interpretability, and downstream services.

Artificial IntelligenceGraph FusionKnowledge Graph

0 likes · 14 min read

Ant Group's Knowledge Graph: Overview, Construction, Applications, and Integration with Large Models

DataFunTalk

Jan 29, 2024 · Big Data

Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund

An ultra‑large hedge fund with over $10 billion AUM replaced ksqlDB and Flink with RisingWave, leveraging its PostgreSQL‑compatible streaming SQL to achieve sub‑10 ms latency, lower learning and operational costs, rich connectors, advanced operators, and comprehensive observability for real‑time trade data processing.

Low latencyQuantitative TradingRisingWave

0 likes · 10 min read

Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund

NetEase LeiHuo UX Big Data Technology

Jan 9, 2024 · Artificial Intelligence

Accelerating Recommendation System Development with MindsDB

The article explains how the data team adopted the open‑source machine‑learning platform MindsDB to simplify data integration, enable SQL‑based model training and inference, manage model versions, and dramatically shorten recommendation system development cycles, achieving up to 30% efficiency gains.

Machine LearningMindsDBModel Management

0 likes · 5 min read

Accelerating Recommendation System Development with MindsDB

Alibaba Cloud Native

Dec 28, 2023 · Cloud Computing

How to Set Up No‑Code Data Dump from Alibaba Cloud Kafka to OSS

This guide explains how to use Alibaba Cloud Message Queue Kafka's no‑code, fully managed, serverless dump feature to transfer data to OSS, covering its benefits, typical scenarios, required prerequisites, step‑by‑step configuration, testing, and verification of the resulting objects.

Alibaba CloudKafkaOSS

0 likes · 9 min read

How to Set Up No‑Code Data Dump from Alibaba Cloud Kafka to OSS

Sohu Tech Products

Dec 27, 2023 · Big Data

Practical Implementation of Data Integration with Flink on Kubernetes at Li Auto

Li Auto built a cloud‑native data‑integration platform by deploying Flink on Kubernetes, unifying batch and streaming workloads with a storage layer (JuiceFS + BOS) and Flink Operator, enabling simple source‑sink pipelines, elastic scaling, automated checkpointing, and centralized monitoring while addressing earlier fragmentation and resource inefficiencies.

Big DataCloud NativeFlink

0 likes · 11 min read

Practical Implementation of Data Integration with Flink on Kubernetes at Li Auto

DataFunTalk

Dec 22, 2023 · Big Data

Practical Implementation of Flink on Kubernetes for Data Integration at Li Auto

This article details Li Auto's end‑to‑end data integration practice using Flink on Kubernetes, covering the evolution of their integration platform, architectural design, cloud‑native deployment, operational challenges, and future roadmap, while highlighting unified batch‑stream processing and resource elasticity.

Batch ProcessingBig DataCloud Native

0 likes · 12 min read

Practical Implementation of Flink on Kubernetes for Data Integration at Li Auto

Big Data Technology & Architecture

Dec 20, 2023 · Big Data

Using Flink CDC 3.0 to Enhance Project Summaries, Resumes, and Interview Discussions

The article explains how Flink CDC 3.0 transforms traditional CDC pipelines into an end‑to‑end streaming ELT framework, offers practical guidance for describing such projects on resumes and in interviews, and outlines future challenges and development directions for large‑scale data integration.

Big DataFlink CDCdata integration

0 likes · 6 min read

Using Flink CDC 3.0 to Enhance Project Summaries, Resumes, and Interview Discussions

Baidu Intelligent Cloud Tech Hub

Dec 12, 2023 · Databases

Master Database Migration to Cloud: Challenges & Solutions with Baidu DTS

This article examines the rapid growth of China's database market, the technical hurdles of moving databases to public cloud—including engine selection, lengthy migration processes, efficiency, disaster recovery, and data consistency—and explains how Baidu Intelligent Cloud's DTS platform offers a smooth, reliable, high‑availability, and high‑performance one‑stop solution with real‑world use cases.

Baidu CloudCloud DatabasesDTS

0 likes · 25 min read

Master Database Migration to Cloud: Challenges & Solutions with Baidu DTS

Data Thinking Notes

Dec 5, 2023 · Big Data

How to Overcome Data Governance Challenges and Unlock Business Value

Enterprises face significant hurdles in data governance and integration, from siloed systems and unclear responsibilities to poor data quality, but by establishing clear rules, fostering user department engagement, and aligning governance with business-driven data applications, they can create a cohesive data asset management framework that drives value.

Big DataData AssetsData Governance

0 likes · 10 min read

How to Overcome Data Governance Challenges and Unlock Business Value

Alibaba Cloud Native

Nov 23, 2023 · Cloud Native

How CDC + Serverless Functions Enable Real‑Time ETL in Cloud Native Architectures

This article explains how Alibaba Cloud's Serverless Function Compute combined with Database Change Data Capture (CDC) creates a complete, real‑time ETL pipeline, detailing the ETL model, DTS integration, architecture components, event‑driven processing, and practical use cases such as OLTP‑to‑OLAP data flow.

Alibaba CloudCDCETL

0 likes · 10 min read

How CDC + Serverless Functions Enable Real‑Time ETL in Cloud Native Architectures

DataFunSummit

Oct 24, 2023 · Big Data

Practices of Data Fabric in Data Integration Scenarios

The presentation by Aloudata Vice President Yu Jun introduces his extensive background in large‑scale internet and big‑data platforms and outlines how Data Fabric and data virtualization can be applied to data integration, highlighting the differences from traditional solutions and the business value of logical data warehouses.

Big DataData FabricLogical Data Warehouse

0 likes · 2 min read

Practices of Data Fabric in Data Integration Scenarios

DataFunTalk

Oct 12, 2023 · Big Data

FastData Real‑Time Intelligent Lakehouse Platform: Data Fabric Technology Practice

This article introduces the concept of Data Fabric, explains how Dipu Technology built the FastData real‑time intelligent lakehouse platform on top of it, describes its architecture, core advantages, practical use cases in energy and retail, and outlines the platform’s future roadmap.

AnalyticsBig DataData Fabric

0 likes · 19 min read

FastData Real‑Time Intelligent Lakehouse Platform: Data Fabric Technology Practice

DataFunTalk

Sep 30, 2023 · Big Data

Building a Marketing‑Oriented Data Middle Platform: Concepts and Practices

This article outlines how a marketing‑focused data middle platform can be constructed by integrating online and offline behavior data, business data, and third‑party sources, then applying data integration, modeling, processing, and application capabilities to enable data‑driven user journeys and personalized marketing strategies.

Big DataMarketing Analyticsdata integration

0 likes · 13 min read

Building a Marketing‑Oriented Data Middle Platform: Concepts and Practices

Java High-Performance Architecture

Sep 28, 2023 · Databases

How to Use Debezium for MySQL CDC in Spring Boot Without Adding Extra Middleware

Learn how to capture MySQL data changes using Debezium's CDC capabilities within a Spring Boot application, avoiding heavyweight message brokers by leveraging binlog monitoring, configuring connectors, handling snapshots, and processing change events for use cases like cache invalidation, data integration, and simplifying monolithic architectures.

CDCDebeziumKafka

0 likes · 24 min read

How to Use Debezium for MySQL CDC in Spring Boot Without Adding Extra Middleware

Architects Research Society

Sep 27, 2023 · Fundamentals

What Is the Common Data Model and Why Use It?

The Common Data Model provides a shared, standardized data language and metadata system that simplifies cross‑application data integration, reduces custom development effort, and enables consistent, extensible data structures for business and analytics scenarios across Microsoft Power Platform and Azure services.

Common Data ModelEnterprise DataMicrosoft Power Platform

0 likes · 8 min read

What Is the Common Data Model and Why Use It?

DataFunSummit

Sep 8, 2023 · Big Data

Tianqiong OLAP Real‑time Lakehouse Fusion Platform Architecture Practice

This article explains why lake‑warehouse fusion is needed, describes the challenges of integrating real‑time data warehouses with data lakes, introduces a new StarRocks‑based architecture that supports real‑time ingestion, cooling, offline loading, and adaptive hot‑cold query rewriting, and outlines future plans and Q&A.

Big DataData WarehouseLakehouse

0 likes · 21 min read

Tianqiong OLAP Real‑time Lakehouse Fusion Platform Architecture Practice

Architecture Digest

Aug 21, 2023 · Databases

Redis 7.2 Unified Release: New AI, Vector Search, and Programmable Engine Features

Redis 7.2, the first Unified Redis Release, introduces extensive AI support, vector database capabilities, scalable search, server‑side Triggers and Functions, enhanced geospatial queries, performance‑boosted sorted sets, and the Redis Data Integration tool, while expanding client and protocol compatibility.

AIDatabaseRedis

0 likes · 7 min read

Redis 7.2 Unified Release: New AI, Vector Search, and Programmable Engine Features

Java Backend Technology

Aug 19, 2023 · Big Data

Top ETL Tools Compared: Kettle, DataX, DataPipeline, Talend, DataStage, Sqoop, FineDataLink, Canal

This guide reviews the most popular ETL and data integration tools—including Kettle, DataX, DataPipeline, Talend, DataStage, Sqoop, FineDataLink, and Canal—detailing their core features, architectures, and typical use cases to help you choose the right solution for data migration and synchronization.

Big DataCDCData Migration

0 likes · 13 min read

Top ETL Tools Compared: Kettle, DataX, DataPipeline, Talend, DataStage, Sqoop, FineDataLink, Canal

DataFunSummit

Aug 13, 2023 · Big Data

KwaiBI: Evolution of Kuaishou’s One‑Stop Business Intelligence Platform from 1.0 to 2.0

The article details Kuaishou’s KwaiBI business intelligence platform evolution, covering its 1.0 tool‑based implementation, the 2.0 standardized architecture built on an indicator middle‑platform, core processes, data integration, self‑service features, and future directions for self‑service and intelligent analytics.

BIBig DataData Platform

0 likes · 22 min read

KwaiBI: Evolution of Kuaishou’s One‑Stop Business Intelligence Platform from 1.0 to 2.0

DataFunSummit

Aug 10, 2023 · Databases

ClickHouse Deployment in Lenovo Manufacturing: Architecture, Data Integration, and Performance Optimization

This article details Lenovo's implementation of ClickHouse in a manufacturing environment, covering the current data landscape, cluster architecture, integration challenges, performance enhancements, and solutions such as Seatunnel and query pre‑aggregation, illustrating how OLAP engines can address real‑time analytics and concurrency issues in production data pipelines.

ClickHouseManufacturingOLAP

0 likes · 11 min read

ClickHouse Deployment in Lenovo Manufacturing: Architecture, Data Integration, and Performance Optimization

Data Thinking Notes

Aug 2, 2023 · Fundamentals

Mastering Enterprise Data: A Practical Guide to Master Data Management

This article explains why fragmented data hampers business insight in large enterprises and provides a comprehensive overview of master data concepts, governance structures, standards, processes, and step‑by‑step implementation practices to achieve consistent, high‑quality enterprise data.

Data GovernanceEnterprise DataMDM

0 likes · 18 min read

Mastering Enterprise Data: A Practical Guide to Master Data Management

Architects Research Society

Aug 2, 2023 · Fundamentals

Data Fabric Architecture: Three Patterns, Core Technical Components, and Inherent Limitations

The article explains data fabric architecture as a promising approach for enabling data exchange across distributed systems, outlines its three design patterns, describes key technical components such as data virtualization, data catalog, and knowledge graphs, and discusses the trade‑offs, costs, and limitations that organizations must consider.

Data CatalogData FabricKnowledge Graph

0 likes · 17 min read

Data Fabric Architecture: Three Patterns, Core Technical Components, and Inherent Limitations

Didi Tech

Jul 31, 2023 · Big Data

Data Serviceization at Didi: Architecture, Phases, and Standard Metric Service

Didi’s data serviceization converts raw business data into consumable services through a four‑stage pipeline—integration, development, production, and back‑flow—while the Data Dream Factory and Shu‑Chain platform automate synchronization, provide a unified access gateway for thousands of APIs, and introduce a standard metric service that abstracts storage complexities and ensures high‑performance, secure data delivery.

Data PlatformMetadata Managementdata integration

0 likes · 16 min read

Data Serviceization at Didi: Architecture, Phases, and Standard Metric Service

Inke Technology

Jun 28, 2023 · Big Data

Extending Apache Seatunnel for Trino and Kyuubi Integration: A Practical Guide

This article outlines the challenges of scaling data integration platforms, proposes a comprehensive solution using Apache Seatunnel and Dinky, details the implementation of Trino and Kyuubi JDBC support, and describes the platform's architecture, task publishing workflow, logging, monitoring, resource management, and future enhancements.

Apache SeaTunnelKyuubiTrino

0 likes · 16 min read

Extending Apache Seatunnel for Trino and Kyuubi Integration: A Practical Guide

Architects Research Society

Jun 21, 2023 · Fundamentals

The Strategic Role of Enterprise Architects: Five Strategic and One Tactical Focus Areas

Enterprise architects align IT strategy with business goals by overseeing application portfolio management, technology and risk, IT operations, security and privacy, integration and data, and finance, defining roadmaps for 1‑3‑5 year plans while balancing strategic and tactical responsibilities in a rapidly changing environment.

Application PortfolioIT OperationsIT Strategy

0 likes · 6 min read

The Strategic Role of Enterprise Architects: Five Strategic and One Tactical Focus Areas

21CTO

Jun 20, 2023 · Fundamentals

ETL vs ELT: Which Data Integration Method Wins for Your Business?

ETL extracts, transforms, then loads data, while ELT extracts, loads, and transforms later, each offering distinct advantages; the article compares their processes, key differences, and factors such as data volume, complexity, latency, and cost to help businesses choose the optimal integration approach.

Data WarehousingELTdata integration

0 likes · 12 min read

ETL vs ELT: Which Data Integration Method Wins for Your Business?

Huawei Cloud Developer Alliance

Jun 19, 2023 · Cloud Computing

Build a Real-Time IoT Dashboard with Huawei Cloud: From Device to Edge to Cloud

This article explains how Huawei Cloud’s IoT platform connects devices, leverages edge computing, stores data in RDS, and visualizes it on an Astro real-time dashboard, offering a step-by-step guide for developers to build end-to-end IoT solutions.

Cloud ServicesHuawei CloudIoT

0 likes · 6 min read

Build a Real-Time IoT Dashboard with Huawei Cloud: From Device to Edge to Cloud

DataFunSummit

Jun 12, 2023 · Big Data

From Data Integration to the Modern Data Stack: Concepts, Tools, and Practices

This article explains data integration fundamentals, compares data integration tools such as Stitch, Fivetran, and Airbyte, describes the concepts of data warehouses and data lakes, outlines ETL vs ELT processes, and explores building modern data stacks with Flink CDC and cloud services.

Big DataELTETL

0 likes · 17 min read

From Data Integration to the Modern Data Stack: Concepts, Tools, and Practices

360 Tech Engineering

Jun 2, 2023 · Big Data

Overcoming Challenges in User Profiling: A Big Data‑Driven Framework for Precise Marketing

The article outlines how a unified, big‑data‑based user profiling platform addresses traditional data silos, high costs, and limited functionality by standardizing tags, integrating Spark and RHadoop processing, and enabling a closed‑loop marketing workflow that improves accuracy and operational efficiency.

Big DataMarketing AutomationRHadoop

0 likes · 7 min read

Overcoming Challenges in User Profiling: A Big Data‑Driven Framework for Precise Marketing

StarRocks

May 26, 2023 · Big Data

How SeaTunnel’s StarRocks Connector Enables High‑Performance Data Sync

This article explains SeaTunnel’s architecture and its StarRocks connector, detailing source and sink features such as field projection, predicate push‑down, parallel reading, state recovery, data type mapping, Stream Load writes, CDC support, configuration examples, and future roadmap for exactly‑once semantics.

Big DataConnectorSeaTunnel

0 likes · 16 min read

How SeaTunnel’s StarRocks Connector Enables High‑Performance Data Sync

Top Architect

May 4, 2023 · Big Data

Data Middle Platform: General Architecture and Core Components

The article explains the concept, benefits, and detailed modular architecture of a data middle platform, covering data storage, acquisition, processing, governance, security, and operation frameworks, and illustrates how enterprises can build and evolve such platforms to turn data into valuable services.

Big DataData ArchitectureData Governance

0 likes · 19 min read

Data Middle Platform: General Architecture and Core Components

ITPUB

Apr 26, 2023 · Databases

Mastering Change Data Capture: Open‑Source Tools and How to Choose the Right One

This article explains the concept of Change Data Capture (CDC), outlines its common use cases, compares the main technical approaches—including timestamps, data diff, triggers, and log‑based methods—and reviews popular open‑source CDC solutions and their database‑specific configuration requirements.

CDCChange Data Capturedata integration

0 likes · 15 min read

Mastering Change Data Capture: Open‑Source Tools and How to Choose the Right One

ITPUB

Apr 25, 2023 · Big Data

Top 8 Open‑Source ETL Tools for Data Migration and Integration

This article reviews eight widely used ETL and data‑migration tools—including Kettle, DataX, DataPipeline, Talend, DataStage, Sqoop, FineDataLink, and Canal—detailing their core features, architectures, supported data sources, and typical usage scenarios to help practitioners choose the right solution.

Big DataData MigrationETL

0 likes · 13 min read

Top 8 Open‑Source ETL Tools for Data Migration and Integration