Tagged articles
2 articles
Page 1 of 1
Wuming AI
Wuming AI
May 10, 2026 · Artificial Intelligence

Can Large Models Really Understand 1 M Tokens? Lessons from the RULER Benchmark

The article examines why a model’s advertised context window (e.g., 128 K or 1 M tokens) does not guarantee effective long‑context reasoning, summarizing the RULER framework that breaks long‑context ability into retrieval, interference resistance, multi‑hop tracking, aggregation, and multi‑answer recall, and offering practical guidance for evaluating and using such models.

LLMRULERaggregation
0 likes · 16 min read
Can Large Models Really Understand 1 M Tokens? Lessons from the RULER Benchmark
dbaplus Community
dbaplus Community
Aug 20, 2024 · Databases

How REDgraph Cut Multi‑Hop Query Latency by 50% with Distributed Parallel Execution

Xiaohongshu's REDgraph graph database faced high latency for multi‑hop queries, so the storage team redesigned the query framework using MPP‑inspired distributed parallel execution, edge‑partitioning, operator forwarding, and caching, achieving over 50% latency reduction and making three‑hop queries viable for online services.

Distributed QueryOptimizationREDgraph
0 likes · 30 min read
How REDgraph Cut Multi‑Hop Query Latency by 50% with Distributed Parallel Execution