Databases 24 min read

Apache Doris 1.2.0 Release: New Features, Performance Boosts, and Upgrade Guide

Apache Doris 1.2.0, released on December 7, 2022, introduces full vectorization, Merge‑On‑Write updates, Multi‑Catalog, new data types, Java and Remote UDFs, extensive performance improvements, and detailed deployment and upgrade instructions for users and contributors.

DataFunTalk
DataFunTalk
DataFunTalk
Apache Doris 1.2.0 Release: New Features, Performance Boosts, and Upgrade Guide

Apache Doris 1.2.0 was officially released on December 7, 2022, with 118 contributors delivering over 2400 optimizations and fixes.

Key innovations include full vectorization across all modules, a Merge‑On‑Write update mode for Unique Key tables, Multi‑Catalog for seamless external data source integration, Light Schema Change for millisecond‑level DDL, JDBC external tables, Java and Remote UDF support, new Array and JSONB data types, and DateV2/DatetimeV2 types.

Performance benchmarks show up to 4× improvement on SSB‑Flat and up to 11× on TPC‑H compared with previous versions, and the new version leads the ClickBench ranking.

The release also adds Table Valued Functions, new built‑in functions (cbrt, sequence_match, mask, uuid, etc.), enhanced memory tracking, row‑level policies, improved security settings, and numerous configuration and API changes.

Download packages (FE, BE, Java‑UDF jar) are provided for ARM64, x86_64, and x86_64‑no‑avx2 architectures, with detailed upgrade instructions, deployment notes, and compatibility considerations.

Example configuration to enable Merge‑On‑Write:

"enable_unique_key_merge_on_write" = "true"

Example configuration to enable Light Schema Change:

"light_schema_change" = "true"

Example INSERT using TVF:

insert into tbl select * from s3("s3://bucket/file.*", "ak" = "xx", "sk" = "xxx") where c1 > 2;
PerformanceDatabasereleaseUpgradeApache DorisFeature Update
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.