Xiaohongshu Tech REDtech
Author

Official account of the Xiaohongshu tech team, sharing technical innovations and problem‑solving insights so that we can advance together.

119 articles · 0 likes · 409 views · 0 comments
Recent Articles

Latest from Xiaohongshu Tech REDtech

Xiaohongshu Tech REDtech
May 22, 2025 · Artificial Intelligence

Scalable Overload-Aware Graph-Based Index Construction for 10‑Billion‑Scale Vector Similarity Search (SOGAIC)

The paper introduces SOGAIC, a scalable overload‑aware graph‑based index construction system for billion‑scale vector similarity search that uses adaptive overlapping partitioning and load‑balanced distributed scheduling to cut construction time by 47.3% while maintaining high recall.

ANN · Distributed Scheduling · graph index
0 likes · 13 min read
Xiaohongshu Tech REDtech
May 19, 2025 · Industry Insights

How Xiaohongshu Built a Minute‑Level Near‑Real‑Time Data Warehouse with Incremental Computing

Facing billions of daily logs and the need for minute‑level experiment metrics, Xiaohongshu partnered with Yunqi Tech to design a generic incremental‑compute solution that delivers near‑real‑time data warehousing with lower cost, higher accuracy, simplified pipelines, and improved query performance.

Big Data · Flink · Iceberg
0 likes · 24 min read
Xiaohongshu Tech REDtech
Mar 13, 2025 · Artificial Intelligence

UniCBE: A Unified Multi‑Objective Optimization Framework for Contrastive Based Evaluation

UniCBE is a unified multi‑objective optimization framework for contrastive‑based evaluation that mitigates sampling bias, unbalanced uncertainty reduction, and inefficient resource allocation. It combines three decoupled probability matrices using a Hadamard product and a greedy selection strategy, achieving Pearson correlations above 0.995 with only 83% of the annotation budget and cutting evaluation costs by more than 50% across diverse LLM evaluators.

Contrastive Evaluation · Efficiency · Sampling Bias
0 likes · 10 min read
Xiaohongshu Tech REDtech
Mar 6, 2025 · Backend Development

ROFF: A High‑Performance Seven‑Layer Rust‑Based Gateway with TLS Offload, QUIC/HTTP3, and Dynamic Module System

ROFF is a Layer 7 gateway written in Rust that delivers high‑throughput load balancing with memory safety, TLS hardware offload, native QUIC/HTTP3 support, hot reload and upgrade, and an extensible module system that ships more than thirty built‑in filters and supports custom ones through Rust macros.

HTTP/3 · QUIC · Rust
0 likes · 28 min read
Xiaohongshu Tech REDtech
Feb 27, 2025 · Artificial Intelligence

SAFE: A Lightweight General AI Image Detection Method Achieving 96.7% Accuracy Across 33 Test Subsets

SAFE is a lightweight AI‑image detection framework using only 1.44 M parameters and 2.30 B FLOPs that preserves fine‑grained artifacts through crop‑based preprocessing, invariant augmentations, and high‑frequency wavelet features, achieving an average 96.7% accuracy across 33 test subsets and strong generalization to unseen GAN and diffusion generators.

AI image detection · Generative Models · computer vision
0 likes · 11 min read
Xiaohongshu Tech REDtech
Feb 24, 2025 · Artificial Intelligence

AIDE: Hybrid Feature Detector for AI‑Generated Image Detection and the Chameleon Benchmark

The paper introduces AIDE, a hybrid AI‑generated image detector that fuses low‑level pixel statistics with high‑level semantic embeddings, and the manually curated Chameleon benchmark of ~26,000 diverse, high‑realism images, showing AIDE surpasses nine state‑of‑the‑art methods by up to 4.6% while highlighting remaining challenges on this tougher dataset.

AI-generated image detection · benchmark dataset · computer vision
0 likes · 14 min read
Xiaohongshu Tech REDtech
Feb 20, 2025 · Big Data

How Xiaohongshu Accelerated Data Warehouse Queries with Logical Datasets & Materialized Views

Xiaohongshu tackled low reuse of application‑layer (APP) tables, limited scalability of single‑table BI datasets, and poor dashboard query performance by introducing logical datasets and materialized views, which enable query pruning, reduce data redundancy, and accelerate BI queries, achieving up to 80% latency reduction and higher hit rates.

BI · Big Data · Data Warehouse
0 likes · 25 min read
Xiaohongshu Tech REDtech
Feb 17, 2025 · Artificial Intelligence

WorldSense: A New Benchmark for Evaluating Multimodal Large Models in Real‑World Scenarios

WorldSense, a new benchmark of 1,662 real‑world video‑audio clips and 3,172 QA pairs across 26 cognitive tasks, reveals that current multimodal large models achieve only 25%–48% accuracy, highlighting the crucial role of combined visual‑audio input and the difficulty of audio‑ and emotion‑related reasoning.

Large Models · benchmark dataset · model analysis
0 likes · 12 min read
Xiaohongshu Tech REDtech
Jan 16, 2025 · Cloud Native

Xiaohongshu Large-Scale Cloud-Native Mixed Deployment and Elasticity Practices

With more than 90% of its services containerized, Xiaohongshu’s cloud‑native team introduced resource‑pooled mixed deployment, fine‑grained unified scheduling, and an elastic container pool with global HPA and cluster autoscaling. These changes moved 35% of resources into mixed use, supplied tens of millions of core‑hours per day, and saved roughly 30% in cost, while preparing for hybrid‑cloud expansion and FinOps.

Containerization · cloud-native · elasticity
0 likes · 7 min read
Xiaohongshu Tech REDtech
Jan 16, 2025 · Artificial Intelligence

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

The authors present a semantic‑graph‑enhanced uncertainty modeling framework that captures token, sentence, and paragraph dependencies, propagates uncertainty through entity relations and contradiction probabilities, and achieves roughly a 20% gain in paragraph‑level hallucination detection on WikiBio and NoteSum compared with existing uncertainty‑based baselines.

Semantic Graph · Sentence-level Modeling · Token-level Modeling
0 likes · 13 min read