Tagged articles
9 articles
Page 1 of 1
Data STUDIO
Data STUDIO
May 6, 2026 · Artificial Intelligence

DeepSeek V4 (Flash & Pro) Unveils Million‑Token Context and Trillion‑Parameter Inference

The April 24, 2026 release of DeepSeek V4 introduces Hybrid Attention (CSA/HCA), Manifold‑Constrained Hyper‑Connections, and the Muon optimizer, delivering 1 M‑token context windows, up to 1.6 T parameters, competitive benchmark scores against Claude and GPT, dramatically lower inference costs, and detailed deployment guidelines that expose both performance gains and practical challenges.

AI benchmarkingDeepSeek V4Hybrid Attention
0 likes · 17 min read
DeepSeek V4 (Flash & Pro) Unveils Million‑Token Context and Trillion‑Parameter Inference
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 30, 2026 · Artificial Intelligence

How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier

Traditional full‑attention cannot handle million‑token contexts due to exponential compute and memory growth, but DeepSeek V4’s Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) compress, sparsely index, and precisely compute tokens, cutting KV cache to 10% and FLOPs to 27% while enabling a 1‑M token window on a single GPU.

CSAHCAKV cache compression
0 likes · 12 min read
How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier
DeepHub IMBA
DeepHub IMBA
Apr 27, 2026 · Artificial Intelligence

DeepSeek‑V4 Deep Dive: Engineering Million‑Token Context Efficiency

The article provides a thorough technical analysis of DeepSeek‑V4, detailing how mixed sparse attention (CSA + HCA), manifold‑constrained hyper‑connections, the Muon optimizer, FP4 quantization, and a suite of infrastructure tricks enable stable training and inference with up to one‑million token contexts while achieving state‑of‑the‑art benchmark results.

CSADeepSeek V4FP4 Quantization
0 likes · 22 min read
DeepSeek‑V4 Deep Dive: Engineering Million‑Token Context Efficiency
Architecture & Thinking
Architecture & Thinking
Apr 26, 2026 · Artificial Intelligence

DeepSeek V4: How Million‑Token Context and Open‑Source Design Redefine AI Ecosystems

DeepSeek V4, released on April 24, 2026, introduces a 1‑million‑token context via DSA sparse attention, offers Pro and Flash variants, adapts to domestic AI chips, cuts compute costs dramatically, and leverages open‑source weights to challenge the dominance of closed‑source LLMs, reshaping the global AI landscape.

AI hardware adaptationDeepSeek V4agentic AI
0 likes · 9 min read
DeepSeek V4: How Million‑Token Context and Open‑Source Design Redefine AI Ecosystems
ZhiKe AI
ZhiKe AI
Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Launch: Open‑Source Model Beats Closed‑Source Leaders in Coding & Math, 1.6 T Params, 1 M Context

DeepSeek V4, released today, offers two open‑source models (Pro and Flash) with up to 1.6 T parameters and a 1‑million‑token context, achieving top‑tier programming and mathematics benchmark scores that surpass the three major closed‑source competitors, while cutting API costs to a fraction of the price.

APIDeepSeekV4
0 likes · 7 min read
DeepSeek V4 Launch: Open‑Source Model Beats Closed‑Source Leaders in Coding & Math, 1.6 T Params, 1 M Context
SuanNi
SuanNi
Apr 24, 2026 · Artificial Intelligence

DeepSeek-V4 Launches: Million-Token Context Becomes Affordable for All

DeepSeek-V4 introduces a hybrid attention architecture, manifold‑constrained hyper‑connections, and the Muon optimizer to cut inference FLOPs and KV cache dramatically, enabling open‑source models to handle million‑token contexts at a fraction of the cost of leading closed‑source services while matching their performance.

DeepSeek V4Hybrid AttentionLarge Language Model
0 likes · 7 min read
DeepSeek-V4 Launches: Million-Token Context Becomes Affordable for All
AI Engineering
AI Engineering
Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: How Its Million-Token Context Redefines Open-Source LLMs

DeepSeek released the V4 preview, introducing V4‑Pro (1.6 T parameters, 49 B activation neurons, 33 T tokens) and V4‑Flash (284 B parameters, 13 B activation neurons, 32 T tokens) with 1 M token context, a novel DSA sparse attention that reduces compute and memory, and performance that rivals top closed‑source models in agentic coding, world‑knowledge and reasoning benchmarks, while offering an API compatible with OpenAI and Anthropic.

DeepSeekLarge Language ModelOpenAI API Compatibility
0 likes · 5 min read
DeepSeek V4 Unveiled: How Its Million-Token Context Redefines Open-Source LLMs
AI Insight Log
AI Insight Log
Mar 14, 2026 · Artificial Intelligence

Opus 4.6 Unlocks Full 1M‑Token Context—GPT‑5.4 Slumps to 36% Accuracy

Anthropic opened its million‑token context window for Claude Opus 4.6, showing a 78.3% MRCR v2 accuracy while competing models like GPT‑5.4 and Gemini 3.1 Pro fall below 40%, and the release also removes pricing premiums, expands media limits six‑fold, and requires no code changes, dramatically improving Claude Code workflows.

AI PerformanceAnthropicClaude Opus
0 likes · 8 min read
Opus 4.6 Unlocks Full 1M‑Token Context—GPT‑5.4 Slumps to 36% Accuracy