Tag

SQL parsing

1 views collected around this technical thread.

Aikesheng Open Source Community
Aikesheng Open Source Community
Feb 27, 2025 · Information Security

Improving Data Export Workflows and Security: From 1.0 to 2.0 with Classification and Dynamic Approval

This article examines the security challenges of data export work orders in MySQL environments, outlines the shortcomings of the original 1.0 workflow, and presents a comprehensive 2.0 redesign that introduces dynamic approvals, data classification, execution‑plan analysis, and code‑level solutions to mitigate data leakage risks.

Approval WorkflowMySQLSQL parsing
0 likes · 15 min read
Improving Data Export Workflows and Security: From 1.0 to 2.0 with Classification and Dynamic Approval
Beijing SF i-TECH City Technology Team
Beijing SF i-TECH City Technology Team
May 30, 2024 · Big Data

Data Lineage System Design and Implementation for Big Data Platforms

This article presents a comprehensive data lineage system (Data-Lineage) for big data platforms, addressing challenges in heterogeneous data sources, multiple execution engines, and complex dependencies through hook-based architecture and modular design.

SQL parsingbig data architecturedata lineage
0 likes · 12 min read
Data Lineage System Design and Implementation for Big Data Platforms
DataFunTalk
DataFunTalk
Mar 9, 2024 · Big Data

Construction and Application of Tencent Oula Data Lineage Platform

This article presents a comprehensive overview of Tencent Oula's data lineage system, detailing its background, goals, architecture, modular construction, key technologies such as graph databases and SQL parsing, and various internal application scenarios including data governance, cost insight, and baseline monitoring.

Big DataSQL parsingcost analysis
0 likes · 20 min read
Construction and Application of Tencent Oula Data Lineage Platform
Top Architecture Tech Stack
Top Architecture Tech Stack
Nov 5, 2023 · Databases

Understanding MySQL Communication Protocols, Parsing, Optimizer, Storage Engines, and Execution Engine

This article explains how MySQL establishes connections, the supported communication protocols and message formats, the lexical and syntactic parsing process, query optimization and execution plan generation, the role of different storage engines, and how the execution engine uses the plan to operate on the storage layer.

Execution EngineMySQLQuery Optimizer
0 likes · 15 min read
Understanding MySQL Communication Protocols, Parsing, Optimizer, Storage Engines, and Execution Engine
DaTaobao Tech
DaTaobao Tech
Jun 21, 2023 · Databases

Data Space Architecture and Metadata Models

The article outlines a data‑space architecture that employs a wide‑table design with dynamic columns and dedicated metadata tables, a metadata execution engine for business‑logic mapping, upgraded SQL parsing via Druid, MySQL‑proxy protocol handling, and distributed flow control using Redis and Zookeeper to enable scalable, multi‑tenant, low‑code and cloud‑native data management.

Data SpaceDatabase DesignOpen Platform
0 likes · 16 min read
Data Space Architecture and Metadata Models
DataFunSummit
DataFunSummit
May 10, 2023 · Big Data

Field-Level Data Lineage Extraction for FlinkSQL Using Apache Calcite

This article explains how to derive field‑level data lineage for FlinkSQL by leveraging Apache Calcite, covering the Calcite framework, FlinkSQL execution stages, the three‑step parsing approach, core source code details, practical Insert/Join examples, and extensions for lookup joins and UDTFs.

Apache CalciteBig DataFlinkSQL
0 likes · 12 min read
Field-Level Data Lineage Extraction for FlinkSQL Using Apache Calcite
Architect's Guide
Architect's Guide
Jan 7, 2023 · Databases

MySQL Execution Process Overview

This article explains the complete MySQL execution flow, covering the connector, permission verification, caching, parser, optimizer, executor, process states, SQL execution order, and the impact of WHERE‑clause condition ordering on query performance.

Execution ProcessMySQLPermissions
0 likes · 13 min read
MySQL Execution Process Overview
DeWu Technology
DeWu Technology
Nov 30, 2022 · Big Data

Fundamentals and Implementation of Data Lineage in Big Data Environments

Data lineage in big‑data environments tracks how data moves and transforms—from source tables through SQL processing to final storage—enabling management tasks such as domain segmentation, performance tuning, anomaly detection, and dependency verification, with implementations ranging from simple regex extraction to robust AST parsing and optimization, as used by tools like Alibaba DataWorks and Apache Atlas.

ASTBig DataHive
0 likes · 7 min read
Fundamentals and Implementation of Data Lineage in Big Data Environments
vivo Internet Technology
vivo Internet Technology
May 31, 2022 · Databases

Exploring Presto SQL Engine (3) - Implementing WHERE Condition Filtering with Antlr and Dynamic Code Generation

The third article in the Presto SQL Engine series demonstrates how to implement WHERE‑clause filtering with Antlr, contrasting a direct AST‑traversal visitor approach—hampered by branch prediction and JVM inlining issues—with runtime bytecode generation using airlift.bytecode, which yields roughly three‑fold speed gains but adds complexity.

ANTLRAirlift BytecodeBytecode Generation
0 likes · 24 min read
Exploring Presto SQL Engine (3) - Implementing WHERE Condition Filtering with Antlr and Dynamic Code Generation
Bilibili Tech
Bilibili Tech
May 24, 2022 · Big Data

Metadata Infrastructure and Governance in Bilibili Data Platform

Bilibili’s data platform consolidates scattered metadata into a unified URN‑based model stored across TiDB, Elasticsearch, and HugeGraph, offering batch‑pull and embedded collection, flexible SQL‑like queries, comprehensive lineage mapping, and powering data‑map, lineage‑map, and impact‑analysis tools while planning expanded quality assurance and self‑service dictionaries.

Big DataSQL parsingdata governance
0 likes · 21 min read
Metadata Infrastructure and Governance in Bilibili Data Platform
Aikesheng Open Source Community
Aikesheng Open Source Community
Jan 18, 2022 · Databases

Enhancing my2sql for Data Rollback and Query Capabilities with DBeaver Integration

This article describes practical improvements to the my2sql binlog‑parsing tool—adding query support and automatic binlog range detection—and demonstrates how to integrate it into DBeaver to provide developers with easy data‑rollback and original‑SQL tracing without requiring DBA intervention.

DBeaverMySQLSQL parsing
0 likes · 6 min read
Enhancing my2sql for Data Rollback and Query Capabilities with DBeaver Integration
vivo Internet Technology
vivo Internet Technology
Aug 4, 2021 · Big Data

Applying ANTLR4 for Arithmetic Calculator and SQL Parsing over CSV Data

The article demonstrates how ANTLR4 can replace manual parsing by building a four‑operation calculator and a trimmed SQL parser for Presto, showing the workflow from grammar definition to generated lexer/parser and visitor code, then applying the SQL parser to query CSV data efficiently.

ANTLRBig DataCSV
0 likes · 20 min read
Applying ANTLR4 for Arithmetic Calculator and SQL Parsing over CSV Data
Fangduoduo Tech
Fangduoduo Tech
Feb 8, 2021 · Big Data

Why Build Your Own Data Lineage Engine? Lessons from Apache Atlas to Duo-Lineage

This article explains what data lineage is, why it is essential for data governance in large‑scale big‑data platforms, compares Apache Atlas with a custom solution, and details the technical choices, architecture, and performance optimizations behind the self‑built duo‑lineage system.

Apache AtlasBig DataSQL parsing
0 likes · 14 min read
Why Build Your Own Data Lineage Engine? Lessons from Apache Atlas to Duo-Lineage
DataFunTalk
DataFunTalk
Feb 2, 2021 · Big Data

Metadata Management: Concepts, Architecture, and Applications in Data Warehousing

This article explains the fundamentals and value of metadata, describes a comprehensive metadata management system and its layered architecture, outlines key technologies such as automatic SQL metadata extraction, and showcases practical applications like metadata query, impact analysis, data lineage, and business‑driven data needs within modern data warehouses.

Big DataData WarehouseSQL parsing
0 likes · 17 min read
Metadata Management: Concepts, Architecture, and Applications in Data Warehousing
Liulishuo Tech Team
Liulishuo Tech Team
Sep 17, 2020 · Big Data

Automated Task Orchestration and Governance in Data Warehouse Workflows

The article describes how Liulishuo automates data‑warehouse task scheduling by parsing SQL to build DAGs, introduces systematic governance to improve reliability, reduce execution time, and handle unreliable data sources, ultimately enhancing data‑engineering efficiency.

DAGData EngineeringSQL parsing
0 likes · 12 min read
Automated Task Orchestration and Governance in Data Warehouse Workflows
Xueersi Online School Tech Team
Xueersi Online School Tech Team
Aug 21, 2020 · Databases

Understanding MySQL SQL Parsing: Yacc Grammar, Parse Trees, and Call Stacks

This article explains how MySQL parses SQL statements using Yacc, detailing the grammar structure, definition and rule sections, relevant source files, parse‑tree node declarations, common error messages, and the call stacks involved in both parsing and pre‑parsing phases.

Database InternalsMySQLParse Tree
0 likes · 7 min read
Understanding MySQL SQL Parsing: Yacc Grammar, Parse Trees, and Call Stacks
360 Tech Engineering
360 Tech Engineering
Jun 25, 2019 · Fundamentals

Building an LL(1) SQL Parser in Go

This tutorial explains how to implement a simple LL(1) parser in Go for SQL queries, covering lexical analysis, syntax analysis, finite‑state‑machine strategy, and testing, providing complete code snippets and practical guidance for developers interested in parser construction.

CompilerGoLL(1) parser
0 likes · 9 min read
Building an LL(1) SQL Parser in Go
Architects' Tech Alliance
Architects' Tech Alliance
Apr 1, 2018 · Databases

Understanding Oracle AWR Reports and Key Performance Metrics

The article explains how to generate and interpret Oracle Automatic Workload Repository (AWR) reports, detailing key sections such as DB Time, Cache Sizes, Load Profile, parsing behavior, instance efficiency percentages, shared pool statistics, and top wait events to diagnose database performance issues.

AWRInstance EfficiencyOracle
0 likes · 16 min read
Understanding Oracle AWR Reports and Key Performance Metrics
Baidu Waimai Technology Team
Baidu Waimai Technology Team
Mar 23, 2017 · Databases

Design and Implementation of the "Little Boy" Greenplum Optimization and Operations Platform

This article introduces the architecture, key modules, and implementation details of the Little Boy platform, a Greenplum optimization and operations system that parses SQL, applies index and distribution‑key tuning, manages resources, and outlines future enhancements for large‑scale data warehouses.

Big DataDatabase OptimizationGreenplum
0 likes · 15 min read
Design and Implementation of the "Little Boy" Greenplum Optimization and Operations Platform