Tagged articles
4 articles
Page 1 of 1
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 20, 2026 · Artificial Intelligence

Claude Opus 4.7: Programming Power Peaks but Faces ‘Dumbing‑Down’ Criticism

Anthropic’s Claude Opus 4.7 launches with record‑breaking programming benchmarks, a new xhigh effort mode and a free 1 M‑token context window, yet an AMD audit reveals a steep drop in real‑world engineering accuracy, reduced cache TTL and a shift to usage‑based pricing that has sparked community backlash.

1M token contextAI benchmarksClaude Opus 4.7
0 likes · 10 min read
Claude Opus 4.7: Programming Power Peaks but Faces ‘Dumbing‑Down’ Criticism
Java Tech Enthusiast
Java Tech Enthusiast
Feb 4, 2026 · Artificial Intelligence

Claude Sonnet 5 (Fennec) – The Next‑Gen Coding LLM Set to Outperform All Rivals

Claude Sonnet 5, codenamed Fennec, is about to launch on Google’s infrastructure with a 1‑million‑token context window, pricing half of Opus 4.5, and benchmark scores surpassing 80.9% on SWE‑Bench, while introducing an autonomous “Dev Team” swarm that can generate, test, and deliver full software modules without human intervention.

Benchmarkingmodel releasemulti-agent systems
0 likes · 9 min read
Claude Sonnet 5 (Fennec) – The Next‑Gen Coding LLM Set to Outperform All Rivals
Data Party THU
Data Party THU
Aug 11, 2025 · Artificial Intelligence

What Makes GPT‑5 the Most Powerful AI Model Yet? A Deep Dive into Its Architecture and Benchmarks

The article analyzes GPT‑5’s unified system, advanced reasoning models, and impressive benchmark gains across programming, creative writing, and health domains, highlighting its new router, Verbosity API, and record‑setting performance on tasks such as Aider polyglot, AIME 2025, and HealthBench.

AI benchmarksAI reasoningGPT-5
0 likes · 7 min read
What Makes GPT‑5 the Most Powerful AI Model Yet? A Deep Dive into Its Architecture and Benchmarks
Infra Learning Club
Infra Learning Club
May 10, 2025 · Artificial Intelligence

Testing Gemini 2.5 Pro’s Programming Skills with Cursor

The author evaluates Gemini 2.5 Pro’s coding capabilities inside the Cursor IDE, detailing setup steps, regional API‑key limitations, hands‑on attempts to generate a front‑end project, a comparison with Augment Code’s Sonnet 3.5 model, and overall impressions of AI‑driven code generation.

AI Code GenerationAugment CodeCursor IDE
0 likes · 5 min read
Testing Gemini 2.5 Pro’s Programming Skills with Cursor