Xiaohongshu Tech REDtech
Author

Xiaohongshu Tech REDtech

Official account of the Xiaohongshu tech team, sharing tech innovations and problem insights, advancing together.

119
Articles
0
Likes
409
Views
0
Comments
Recent Articles

Latest from Xiaohongshu Tech REDtech

100 recent articles max
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 18, 2025 · Artificial Intelligence

DynamicFace: Composable 3D Facial Priors for High‑Quality, Consistent Face Swaps

DynamicFace introduces a controllable face‑swapping framework that leverages composable 3D facial priors, dual‑stream identity injection, and a FusionTVO module to achieve superior image and video quality, identity preservation, and temporal consistency, outperforming existing state‑of‑the‑art methods on benchmark datasets.

3D facial priorsAIcontrollable generation
0 likes · 13 min read
DynamicFace: Composable 3D Facial Priors for High‑Quality, Consistent Face Swaps
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 13, 2025 · Backend Development

Boosting TLS Performance with Intel QAT and a Custom Keyless Architecture

This article details how XiaoHongShu's infrastructure team built a keyless architecture that offloads CPU‑intensive TLS private‑key signing to Intel QAT hardware, achieving massive HTTPS throughput gains, lower server costs, and valuable insights for similar high‑traffic TLS offload scenarios.

Intel QATKeyless ArchitecturePerformance
0 likes · 10 min read
Boosting TLS Performance with Intel QAT and a Custom Keyless Architecture
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 7, 2025 · Databases

Achieving RPO=0: How XiaoHongShu’s Binlog Server Boosts MySQL Replication Speed and Data Consistency

This article explains how XiaoHongShu’s database team built a lightweight Binlog Server to accelerate semi‑synchronous MySQL replication beyond 300 MB/s, achieve RPO=0 data‑loss‑free failover, and improve high‑availability without manual intervention, backed by performance tests and detailed architecture diagrams.

Binlog ServerRPO=0Semi‑synchronous Replication
0 likes · 15 min read
Achieving RPO=0: How XiaoHongShu’s Binlog Server Boosts MySQL Replication Speed and Data Consistency
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 6, 2025 · Artificial Intelligence

dots.vlm1: Open‑Source Multimodal Vision‑Language Model Near SOTA Performance

dots.vlm1, the first open‑source multimodal large model from Xiaohongshu hi‑lab, combines a 1.2‑billion‑parameter NaViT visual encoder with DeepSeek V3 LLM, achieving near‑state‑of‑the‑art visual understanding and reasoning while remaining competitive on text tasks, and is available on GitHub and HuggingFace.

AIdeep-learninglarge-model
0 likes · 11 min read
dots.vlm1: Open‑Source Multimodal Vision‑Language Model Near SOTA Performance
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jul 31, 2025 · Artificial Intelligence

How dots.ocr Achieves SOTA Multilingual Document Parsing with a 1.7B VLM

dots.ocr is a 1.7 billion-parameter multilingual document-parsing model that unifies layout detection and content recognition within a single visual-language model, delivering state-of-the-art performance across text, tables, formulas and reading order while remaining efficient and extensible for future multimodal AI research.

AIDocument ParsingOCR
0 likes · 10 min read
How dots.ocr Achieves SOTA Multilingual Document Parsing with a 1.7B VLM
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jul 24, 2025 · Backend Development

How Xiaohongshu Boosted Java Performance by 10% with a RedJDK Upgrade

Xiaohongshu’s middleware team migrated thousands of Java services from JDK 8 to RedJDK 11/17, achieving over 10% performance gains, 50% GC pause reduction, and eliminating OOM crashes through systematic JDK upgrades, GC tuning, native‑memory improvements, and standardized deployment pipelines.

GC optimizationNative Memorybackend services
0 likes · 22 min read
How Xiaohongshu Boosted Java Performance by 10% with a RedJDK Upgrade
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 19, 2025 · Artificial Intelligence

Can Adaptive Chain‑of‑Thought Learning Halve LLM Thinking Time?

The article introduces the Think When You Need (TWYN) method, a reinforcement‑learning approach that dynamically adapts chain‑of‑thought length, dramatically cuts redundant token generation in large language models, and maintains or improves accuracy across diverse reasoning benchmarks.

Efficiencyadaptive inferencechain-of-thought
0 likes · 9 min read
Can Adaptive Chain‑of‑Thought Learning Halve LLM Thinking Time?
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 6, 2025 · Artificial Intelligence

How dots.llm1 Sets New Benchmarks for Open‑Source MoE Language Models

dots.llm1, an open‑source 142‑billion‑parameter Mixture‑of‑Experts language model from hi lab, achieves Qwen2.5‑72B‑level performance after training on 11.2 T high‑quality tokens, and the release includes full models, intermediate checkpoints, and detailed training pipelines for the research community.

AI researchLarge Language ModelMixture of Experts
0 likes · 10 min read
How dots.llm1 Sets New Benchmarks for Open‑Source MoE Language Models
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 4, 2025 · Artificial Intelligence

From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN

MarkerGen introduces a novel, plug‑and‑play framework that decomposes length‑controllable text generation into four sub‑abilities—identifying, counting, planning, and aligning—integrates external tokenizers and dynamic markers, and achieves significantly lower length errors and higher quality across diverse models, tasks, and languages.

LLMLength-Controlled GenerationMarkerGen
0 likes · 14 min read
From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 3, 2025 · Artificial Intelligence

Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation

The TailoredBench framework dramatically reduces large‑language‑model evaluation cost and error by using a global probe set, model‑specific source selection, extensible K‑Medoids clustering, and calibration, achieving up to 300× speedup and a 31.4% MAE reduction across diverse benchmarks.

AI researchK-MedoidsLLM evaluation
0 likes · 10 min read
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation