Author

AntTech

Technology is the core driver of Ant's future creation.

710

Articles

Likes

2.3k

Views

Comments

Latest from AntTech

100 recent articles max

AntTech

Jan 14, 2026 · Artificial Intelligence

Boosting Secure AI: HAWK Accelerator and FHEFusion Compiler Break New Ground

This article highlights two cutting‑edge works from Ant Group’s research team—HAWK, a fixed‑word key decomposition switching accelerator that overcomes hardware challenges for FHE, and FHEFusion, a compiler framework that introduces operator fusion to dramatically speed CKKS‑based DNN inference—showcasing their designs, optimizations, and experimental gains.

Compiler OptimizationDNN inferenceFully Homomorphic Encryption

0 likes · 7 min read

Boosting Secure AI: HAWK Accelerator and FHEFusion Compiler Break New Ground

AntTech

Dec 18, 2025 · Artificial Intelligence

How AEnvironment Powers Scalable Agentic RL with a Unified MCP Protocol

AEnvironment is an open‑source, unified environment platform for Agentic Reinforcement Learning that abstracts all resources as services via the MCP protocol, enabling trillion‑scale model training, rapid app generation, benchmark integration, and seamless deployment through a high‑performance ASandbox runtime.

AEnvironmentEnvironment PlatformMCP protocol

0 likes · 11 min read

How AEnvironment Powers Scalable Agentic RL with a Unified MCP Protocol

AntTech

Dec 11, 2025 · Artificial Intelligence

Unlock Scalable RL: AReaL’s Decoupled Agentic Framework & Single‑Controller Design

This article explains how the open‑source AReaL framework boosts large‑scale reinforcement learning by separating agent execution from training logic, introducing a decoupled Agentic RL service and a Single‑Controller architecture that improves data flow, fault tolerance, and GPU utilization.

Open-sourceScalable RLagentic AI

0 likes · 14 min read

Unlock Scalable RL: AReaL’s Decoupled Agentic Framework & Single‑Controller Design

AntTech

Dec 6, 2025 · Artificial Intelligence

FinEval‑KR: Diagnosing Knowledge vs. Reasoning Gaps in Financial Large Language Models

FinEval‑KR, a new EMNLP2025 evaluation framework co‑authored by Shanghai University of Finance and Economics and Ant Group, separates knowledge coverage from logical reasoning to reveal why financial LLMs often hallucinate on calculation tasks, introduces KS, RS, and CS metrics, and ranks 18 state‑of‑the‑art models on a rigorously curated finance dataset.

Knowledge vs reasoningLLM evaluationfinance AI

0 likes · 14 min read

FinEval‑KR: Diagnosing Knowledge vs. Reasoning Gaps in Financial Large Language Models

AntTech

Dec 4, 2025 · Artificial Intelligence

How AState Reduces Trillion‑Parameter RL Weight Sync to 6 Seconds

AState is a general‑purpose state data management system for reinforcement‑learning tasks that tackles low IO efficiency, slow weight synchronization, and state‑recovery challenges, achieving sub‑10‑second weight sync for trillion‑parameter models through a three‑layer architecture, zero‑redundancy transfers, and hardware‑aware co‑design, with the code openly available on GitHub.

AStateHigh Performance ComputingLarge Models

0 likes · 23 min read

How AState Reduces Trillion‑Parameter RL Weight Sync to 6 Seconds

AntTech

Nov 27, 2025 · Artificial Intelligence

How AMem NCCL‑Plugin Cuts GPU Memory Overhead for Trillion‑Parameter RL Models

The article explains the design, implementation, and performance of the AMem NCCL‑Plugin, a lightweight extension to NVIDIA's NCCL that enables transparent offloading and rapid recovery of GPU memory during reinforcement‑learning training of trillion‑parameter models, detailing its architecture, APIs, benchmarks, installation steps, and integration guidelines.

ASystemGPUNCCL

0 likes · 18 min read

How AMem NCCL‑Plugin Cuts GPU Memory Overhead for Trillion‑Parameter RL Models

AntTech

Nov 21, 2025 · Artificial Intelligence

How Awex Enables Sub‑Second TB‑Scale Weight Sync for Trillion‑Parameter RL Models

Awex is a high‑performance Python framework that synchronizes training and inference weights for trillion‑parameter reinforcement‑learning models in seconds, using unified conversion, metadata management, and NCCL/RDMA transfer plans, dramatically reducing RL training latency and supporting diverse parallel strategies.

High Performance ComputingLarge ModelsPython

0 likes · 17 min read

How Awex Enables Sub‑Second TB‑Scale Weight Sync for Trillion‑Parameter RL Models

AntTech

Nov 11, 2025 · Artificial Intelligence

Breaking the Efficiency Wall: Ant Group’s Bailing Model Paves the Way to AGI

At CNCC 2025, Ant Group’s Vice President Zhou Jun outlined the Bailing large‑model’s five‑layer architecture, hybrid linear attention, Ling Scaling Law, and novel training algorithms that dramatically cut costs and latency, achieving state‑of‑the‑art performance on math and code benchmarks while promoting open‑source collaboration toward AGI.

AGIMixture of Expertslarge language models

0 likes · 8 min read

Breaking the Efficiency Wall: Ant Group’s Bailing Model Paves the Way to AGI

AntTech

Nov 8, 2025 · Artificial Intelligence

Ant Group’s AntBaiLing Model: Pushing AI Scaling Limits with Trillion‑Parameter Efficiency

Ant Group’s President Luo Ji outlined how the AntBaiLing suite, featuring trillion‑parameter open‑source models, three efficiency breakthroughs, and a domestic compute cluster, is advancing AGI research and inclusive applications, especially in healthcare, while emphasizing ethical, trustworthy AI.

AGIOpen Sourcelarge language models

0 likes · 5 min read

Ant Group’s AntBaiLing Model: Pushing AI Scaling Limits with Trillion‑Parameter Efficiency

AntTech

Nov 4, 2025 · Artificial Intelligence

Unlock Native TPU Inference with SGLang-Jax: A Jax‑Powered Open‑Source Engine

SGLang-Jax is a cutting‑edge, fully Jax‑based open‑source inference engine that delivers native TPU performance, integrates advanced features like continuous batching, tensor and expert parallelism, and speculative decoding, while providing detailed installation and usage guidance for developers.

JAXSGLang-JaxTPU inference

0 likes · 10 min read

Unlock Native TPU Inference with SGLang-Jax: A Jax‑Powered Open‑Source Engine