Tagged articles

6 articles

Page 1 of 1

May 28, 2026 · Artificial Intelligence

Anthropic Unveils Claude Opus 4.8: Same Price, Agent Power Beats GPT‑5.5

Anthropic released Claude Opus 4.8 with unchanged pricing, new inference‑strength controls, Dynamic Workflows for massive tasks, a fast mode 2.5× quicker and three‑times cheaper, and benchmark results showing its agent capabilities surpass GPT‑5.5 while improving honesty and alignment.

AI AgentsAnthropicClaude Opus 4.8

0 likes · 12 min read

Anthropic Unveils Claude Opus 4.8: Same Price, Agent Power Beats GPT‑5.5

AI Large Model Application Practice

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Preview: Key Technical Highlights, Benchmarks, and Pricing

The DeepSeek‑V4 preview details two model variants—Pro and Flash—with trillion‑scale parameters, outlines benchmark scores that surpass or match leading overseas models across code generation, real‑world fixes, engineering tasks, and world knowledge, and explains core innovations, pricing, API endpoints, and open‑source licensing.

APIDeepSeekHybrid Attention

0 likes · 7 min read

DeepSeek V4 Preview: Key Technical Highlights, Benchmarks, and Pricing

Machine Learning Algorithms & Natural Language Processing

Feb 20, 2026 · Artificial Intelligence

Google Reclaims AI Throne with Gemini 3.1 Pro, Achieving 77.1% ARC‑AGI‑2 Score

Google’s Gemini 3.1 Pro, the latest upgrade to the Gemini 3 series, achieves a verified 77.1 % score on the ARC‑AGI‑2 reasoning benchmark—more than double the performance of Gemini 3 Pro—while leading in GPQA, LiveCodeBench Pro, SWE‑Bench Verified, and MMMLU tests, and is now being rolled out to developers, enterprises and consumers with detailed pricing and integration options.

AI benchmarkingARC-AGI-2Gemini 3.1 Pro

0 likes · 9 min read

Google Reclaims AI Throne with Gemini 3.1 Pro, Achieving 77.1% ARC‑AGI‑2 Score

Wuming AI

Feb 20, 2026 · Artificial Intelligence

Gemini 3.1 Pro: How Google Boosted Reasoning Scores and What It Means for Developers

Google's Gemini 3.1 Pro preview raises reasoning benchmark scores dramatically, offers new pricing tiers, and is already integrated into Gemini API, CLI, Vertex AI, and consumer apps, while community demos showcase SVG animation, real‑time dashboards, 3D simulations, and heat‑transfer analysis.

AI benchmarksGemini 3.1 ProGoogle AI

0 likes · 5 min read

Gemini 3.1 Pro: How Google Boosted Reasoning Scores and What It Means for Developers

AI Insight Log

Dec 11, 2025 · Artificial Intelligence

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

OpenAI’s GPT‑5.2 launch introduces three specialized modes, achieves a record 55.6% score on SWE‑Bench Pro, demonstrates strong front‑end generation, adds a /compact API for long‑context efficiency, offers tiered pricing with cache discounts, and improves safety for younger users.

AI benchmarkingAI safetyGPT-5.2

0 likes · 6 min read

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

AI Algorithm Path

Jun 11, 2025 · Artificial Intelligence

OpenAI's O3‑Pro Model: Deep Reasoning, Pricing, Benchmarks, and Access Guide

OpenAI introduced the O3‑Pro multimodal deep‑reasoning model with an 80% price cut for O3, detailed its training via large‑scale reinforcement learning, compared its capabilities and costs against GPT‑4o, GPT‑4.1 and O3‑Pro, listed its core specs, limitations, access methods, and presented benchmark tests that highlight both strengths and weaknesses.

AIMultimodalO3-Pro

0 likes · 10 min read

OpenAI's O3‑Pro Model: Deep Reasoning, Pricing, Benchmarks, and Access Guide