Zhihu Tech Column
Author

Zhihu Tech Column

Sharing Zhihu tech posts and exploring community technology innovations.

18
Articles
0
Likes
62
Views
0
Comments
Recent Articles

Latest from Zhihu Tech Column

18 recent articles
Zhihu Tech Column
Zhihu Tech Column
Jun 11, 2025 · Artificial Intelligence

How Minute‑Level Time Decay Boosts User Retention Modeling in Recommendation Systems

This article presents a novel minute‑level future‑reward framework with dual‑delay incentives, activity‑based attribution, multi‑task delayed modeling, and sequential streaming training that dramatically improves user retention prediction accuracy and real‑time performance in large‑scale recommendation platforms.

User Retentiondeep learningmulti‑task modeling
0 likes · 17 min read
How Minute‑Level Time Decay Boosts User Retention Modeling in Recommendation Systems
Zhihu Tech Column
Zhihu Tech Column
Mar 14, 2025 · Artificial Intelligence

Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations

The article summarizes Zhihu’s technical talk on the ZhiLight large‑model inference framework, detailing model execution mechanisms, GPU load analysis, multi‑GPU parallel strategies, open‑source engine comparisons, compute‑communication overlap, quantization techniques, benchmark results, and future directions for scalable LLM deployment.

GPU parallelismLarge Language ModelsSGLang
0 likes · 11 min read
Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations
Zhihu Tech Column
Zhihu Tech Column
Jan 17, 2025 · Artificial Intelligence

Zhihu Direct Answer: Product Overview and Technical Practices

This article summarizes the key technical insights from Zhihu Direct Answer, an AI-powered search product, covering its product overview, RAG framework, query understanding, retrieval strategies, chunking, reranking, generation techniques, evaluation methods, and engineering optimizations for cost and performance.

AI SearchChunkingEngineering Optimization
0 likes · 13 min read
Zhihu Direct Answer: Product Overview and Technical Practices
Zhihu Tech Column
Zhihu Tech Column
Dec 31, 2024 · Cloud Native

Cloud Native Innovation Forum: AutoMQ Table Topic, OceanBase Integrated Database, and Observability Practices

The article recaps Zhihu's Cloud Native Innovation Forum where experts from AutoMQ, OceanBase, and Flashcat shared practical solutions on streaming data ingestion, unified database architectures, and AI‑driven observability, highlighting real‑world deployments, performance optimizations, and cost‑saving strategies.

AIAutoMQCloud Native
0 likes · 10 min read
Cloud Native Innovation Forum: AutoMQ Table Topic, OceanBase Integrated Database, and Observability Practices
Zhihu Tech Column
Zhihu Tech Column
Dec 25, 2024 · Databases

Technical Challenges and Solutions for Migrating Zhihu's Self‑Managed MongoDB Cluster to Alibaba Cloud

The article analyzes the storage, sharding, backup, and operational pain points of Zhihu's self‑operated MongoDB cluster, proposes cloud‑based architectural and procedural solutions, and details a step‑by‑step migration plan that ensures zero‑downtime, improved stability, and cost efficiency.

Alibaba CloudCloud ComputingMongoDB
0 likes · 11 min read
Technical Challenges and Solutions for Migrating Zhihu's Self‑Managed MongoDB Cluster to Alibaba Cloud
Zhihu Tech Column
Zhihu Tech Column
Oct 28, 2024 · Databases

TiDB Architecture, Deployment Choices, and Zhihu’s Database Platform Practices

This article explains TiDB’s distributed relational database architecture, compares Kubernetes‑based and physical‑machine deployments, and describes Zhihu’s platform‑driven approach—including TiDB Operator, TiUP, self‑service tooling, monitoring, backup, security, and FinOps—to enable efficient database management for both developers and DBAs.

Data MigrationDatabase operationsFinOps
0 likes · 15 min read
TiDB Architecture, Deployment Choices, and Zhihu’s Database Platform Practices
Zhihu Tech Column
Zhihu Tech Column
Oct 10, 2024 · Artificial Intelligence

Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model

This article presents a method for massive multi-label text classification on Zhihu content by combining a semantic retrieval model with a proprietary large AI model, detailing the challenges of large label spaces, model architecture, loss optimization, and experimental results showing significant accuracy gains.

BGELarge Language Modelmulti-label classification
0 likes · 16 min read
Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model