Kuaishou Tech
Author

Kuaishou Tech

Official Kuaishou tech account, providing real-time updates on the latest Kuaishou technology practices.

229
Articles
0
Likes
680
Views
0
Comments
Recent Articles

Latest from Kuaishou Tech

100 recent articles max
Kuaishou Tech
Kuaishou Tech
Jul 31, 2025 · Big Data

How Kuaishou Overcame the ‘Impossible Triangle’ of Performance, Flexibility, and Cost in Real‑Time Big Data Analytics

This article details how Kuaishou’s content middle platform tackled the massive challenges of real‑time, flexible, and cost‑effective data analysis at trillion‑scale by redesigning its architecture, adopting ClickHouse, splitting wide tables, and implementing a scatter‑gather execution model with pre‑shuffle and bitmap optimizations.

Big DataClickHousePerformance Optimization
0 likes · 17 min read
How Kuaishou Overcame the ‘Impossible Triangle’ of Performance, Flexibility, and Cost in Real‑Time Big Data Analytics
Kuaishou Tech
Kuaishou Tech
Jul 29, 2025 · Artificial Intelligence

How Kuaishou’s 8 Groundbreaking Papers Are Shaping AI at KDD 2025

Eight Kuashou research papers covering recommendation systems, multi‑task learning, multimodal large models, large language models, and combinatorial optimization have been accepted to the premier AI data‑mining conference KDD 2025, highlighting the company’s cutting‑edge innovations and their potential impact on the field.

AIMultimodal LearningRecommendation Systems
0 likes · 18 min read
How Kuaishou’s 8 Groundbreaking Papers Are Shaping AI at KDD 2025
Kuaishou Tech
Kuaishou Tech
Jul 23, 2025 · Artificial Intelligence

Revolutionizing Cascade Ranking with LCRON: End-to-End Training for Ads

This article introduces LCRON, a novel end-to-end training framework for cascade ranking systems that aligns training objectives with overall recall, addresses stage interaction challenges, and demonstrates significant performance gains on public benchmarks and in Kuaishou’s commercial advertising platform.

AdvertisingMachine LearningRecommendation Systems
0 likes · 14 min read
Revolutionizing Cascade Ranking with LCRON: End-to-End Training for Ads
Kuaishou Tech
Kuaishou Tech
Jul 22, 2025 · Artificial Intelligence

How Orthus Achieves Lossless Multimodal Generation with a Unified Autoregressive Transformer

Orthus, a new unified multimodal model presented at ICML 2025, leverages an autoregressive Transformer backbone with separate language and diffusion heads to enable lossless image‑text interleaved generation, outperforming existing models on both understanding and generation benchmarks while remaining computationally efficient.

AI researchautoregressive transformerdiffusion models
0 likes · 11 min read
How Orthus Achieves Lossless Multimodal Generation with a Unified Autoregressive Transformer
Kuaishou Tech
Kuaishou Tech
Jul 21, 2025 · Artificial Intelligence

Can AI Models Think on Demand? Inside KAT‑V1 AutoThink’s Dynamic Reasoning

The article introduces KAT‑V1 AutoThink, a dual‑mode large language model that automatically switches between thinking and non‑thinking modes based on problem difficulty, details its novel training paradigm, reinforcement‑learning enhancements, performance benchmarks against leading open‑source models, and provides open‑source resources for further research.

Knowledge DistillationLarge Language Modelauto-think
0 likes · 14 min read
Can AI Models Think on Demand? Inside KAT‑V1 AutoThink’s Dynamic Reasoning
Kuaishou Tech
Kuaishou Tech
Jul 17, 2025 · Artificial Intelligence

How DHPS Boosted Online Inference Throughput by 270% with RDMA

This article details the design and evolution of DHPS, Kuaishou's load‑balanced, RDMA‑based high‑performance service architecture, explaining its network, storage, and traffic‑scheduling innovations that deliver over 270% query‑throughput improvement, lower latency, reduced CPU usage, and near‑five‑nine availability for large‑scale AI inference workloads.

Distributed SystemsRDMAStorage Engine
0 likes · 17 min read
How DHPS Boosted Online Inference Throughput by 270% with RDMA
Kuaishou Tech
Kuaishou Tech
Jul 16, 2025 · Artificial Intelligence

How KuaiMM Conversation Revolutionizes Multimodal Dialogue on Short‑Video Platforms

The KuaiMM Conversation project introduces a multimodal large‑model‑driven dialogue system for Kuaishou, featuring the world‑first short‑video mixed‑dialogue dataset, a Chain‑of‑Thought interaction framework, and large‑scale industrial deployments that dramatically improve live‑stream comments and intelligent customer service.

Kuaishouchain-of-thoughtconversation AI
0 likes · 11 min read
How KuaiMM Conversation Revolutionizes Multimodal Dialogue on Short‑Video Platforms
Kuaishou Tech
Kuaishou Tech
Jul 11, 2025 · Artificial Intelligence

How VARSR Redefines Image Super‑Resolution with Autoregressive Modeling

The VARSR algorithm introduces autoregressive modeling to image super‑resolution, leveraging prefix tokens, scale‑aligned rotary positional encodings, quantization error correction, and image‑quality‑guided diffusion to achieve faster inference and superior visual fidelity, as demonstrated by extensive ICML‑2025 experiments.

ICML 2025VARSRautoregressive modeling
0 likes · 11 min read
How VARSR Redefines Image Super‑Resolution with Autoregressive Modeling
Kuaishou Tech
Kuaishou Tech
Jul 10, 2025 · Artificial Intelligence

How MODA’s Modular Duplex Attention Solves Multimodal Attention Imbalance and Boosts Emotion Understanding

The paper introduces MODA, a modular duplex attention multimodal model that addresses severe cross‑modal attention imbalance in existing large multimodal models, proposes a novel attention paradigm and masking scheme, and demonstrates significant performance gains across 21 benchmarks in perception, cognition, and emotion tasks, earning a Spotlight paper at ICML 2025.

Emotion RecognitionMoDAattention mechanisms
0 likes · 13 min read
How MODA’s Modular Duplex Attention Solves Multimodal Attention Imbalance and Boosts Emotion Understanding
Kuaishou Tech
Kuaishou Tech
Jul 9, 2025 · Artificial Intelligence

How ResULIC Achieves Ultra‑Low‑Rate Image Compression with Semantic Residual Coding and Diffusion

The paper introduces ResULIC, a residual‑guided ultra‑low‑bitrate image compression framework that combines semantic residual coding, a compression‑aware diffusion model, and perceptual fidelity optimization to dramatically improve visual quality and outperform prior diffusion‑based methods on standard benchmarks.

Machine LearningResULICdiffusion model
0 likes · 12 min read
How ResULIC Achieves Ultra‑Low‑Rate Image Compression with Semantic Residual Coding and Diffusion