Tagged articles

9 articles

Page 1 of 1

May 6, 2026 · Artificial Intelligence

DeepSeek V4 (Flash & Pro) Unveils Million‑Token Context and Trillion‑Parameter Inference

The April 24, 2026 release of DeepSeek V4 introduces Hybrid Attention (CSA/HCA), Manifold‑Constrained Hyper‑Connections, and the Muon optimizer, delivering 1 M‑token context windows, up to 1.6 T parameters, competitive benchmark scores against Claude and GPT, dramatically lower inference costs, and detailed deployment guidelines that expose both performance gains and practical challenges.

AI benchmarkingDeepSeek V4Hybrid Attention

0 likes · 17 min read

DeepSeek V4 (Flash & Pro) Unveils Million‑Token Context and Trillion‑Parameter Inference

Lao Guo's Learning Space

Apr 30, 2026 · Artificial Intelligence

How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier

Traditional full‑attention cannot handle million‑token contexts due to exponential compute and memory growth, but DeepSeek V4’s Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) compress, sparsely index, and precisely compute tokens, cutting KV cache to 10% and FLOPs to 27% while enabling a 1‑M token window on a single GPU.

CSAHCAKV cache compression

0 likes · 12 min read

How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier

DeepHub IMBA

Apr 27, 2026 · Artificial Intelligence

DeepSeek‑V4 Deep Dive: Engineering Million‑Token Context Efficiency

The article provides a thorough technical analysis of DeepSeek‑V4, detailing how mixed sparse attention (CSA + HCA), manifold‑constrained hyper‑connections, the Muon optimizer, FP4 quantization, and a suite of infrastructure tricks enable stable training and inference with up to one‑million token contexts while achieving state‑of‑the‑art benchmark results.

CSADeepSeek V4FP4 Quantization

0 likes · 22 min read

DeepSeek‑V4 Deep Dive: Engineering Million‑Token Context Efficiency

Architecture & Thinking

Apr 26, 2026 · Artificial Intelligence

DeepSeek V4: How Million‑Token Context and Open‑Source Design Redefine AI Ecosystems

DeepSeek V4, released on April 24, 2026, introduces a 1‑million‑token context via DSA sparse attention, offers Pro and Flash variants, adapts to domestic AI chips, cuts compute costs dramatically, and leverages open‑source weights to challenge the dominance of closed‑source LLMs, reshaping the global AI landscape.

AI hardware adaptationDeepSeek V4agentic AI

0 likes · 9 min read

DeepSeek V4: How Million‑Token Context and Open‑Source Design Redefine AI Ecosystems

ZhiKe AI

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Launch: Open‑Source Model Beats Closed‑Source Leaders in Coding & Math, 1.6 T Params, 1 M Context

DeepSeek V4, released today, offers two open‑source models (Pro and Flash) with up to 1.6 T parameters and a 1‑million‑token context, achieving top‑tier programming and mathematics benchmark scores that surpass the three major closed‑source competitors, while cutting API costs to a fraction of the price.

APIDeepSeekV4

0 likes · 7 min read

DeepSeek V4 Launch: Open‑Source Model Beats Closed‑Source Leaders in Coding & Math, 1.6 T Params, 1 M Context

SuanNi

Apr 24, 2026 · Artificial Intelligence

DeepSeek-V4 Launches: Million-Token Context Becomes Affordable for All

DeepSeek-V4 introduces a hybrid attention architecture, manifold‑constrained hyper‑connections, and the Muon optimizer to cut inference FLOPs and KV cache dramatically, enabling open‑source models to handle million‑token contexts at a fraction of the cost of leading closed‑source services while matching their performance.

DeepSeek V4Hybrid AttentionLarge Language Model

0 likes · 7 min read

DeepSeek-V4 Launches: Million-Token Context Becomes Affordable for All

AI Insight Log

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: 1.6 T Parameters, Million‑Token Context, Fully Open‑Source

DeepSeek V4 introduces two open‑source MoE models—Pro and Flash—with up to 1.6 T parameters, 1 M token context, a new DSA sparse‑attention mechanism, extensive benchmark results, and a tiered pricing scheme, while remaining compatible with OpenAI and Anthropic APIs.

DeepSeekLarge Language ModelOpen Source

0 likes · 9 min read

DeepSeek V4 Unveiled: 1.6 T Parameters, Million‑Token Context, Fully Open‑Source

AI Engineering

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: How Its Million-Token Context Redefines Open-Source LLMs

DeepSeek released the V4 preview, introducing V4‑Pro (1.6 T parameters, 49 B activation neurons, 33 T tokens) and V4‑Flash (284 B parameters, 13 B activation neurons, 32 T tokens) with 1 M token context, a novel DSA sparse attention that reduces compute and memory, and performance that rivals top closed‑source models in agentic coding, world‑knowledge and reasoning benchmarks, while offering an API compatible with OpenAI and Anthropic.

DeepSeekLarge Language ModelOpenAI API Compatibility

0 likes · 5 min read

DeepSeek V4 Unveiled: How Its Million-Token Context Redefines Open-Source LLMs

AI Insight Log

Mar 14, 2026 · Artificial Intelligence

Opus 4.6 Unlocks Full 1M‑Token Context—GPT‑5.4 Slumps to 36% Accuracy

Anthropic opened its million‑token context window for Claude Opus 4.6, showing a 78.3% MRCR v2 accuracy while competing models like GPT‑5.4 and Gemini 3.1 Pro fall below 40%, and the release also removes pricing premiums, expands media limits six‑fold, and requires no code changes, dramatically improving Claude Code workflows.

AI PerformanceAnthropicClaude Opus

0 likes · 8 min read

Opus 4.6 Unlocks Full 1M‑Token Context—GPT‑5.4 Slumps to 36% Accuracy