Tagged articles
6 articles
Page 1 of 1
AI Engineering
AI Engineering
May 28, 2026 · Artificial Intelligence

Anthropic Unveils Claude Opus 4.8: Same Price, Agent Power Beats GPT‑5.5

Anthropic released Claude Opus 4.8 with unchanged pricing, new inference‑strength controls, Dynamic Workflows for massive tasks, a fast mode 2.5× quicker and three‑times cheaper, and benchmark results showing its agent capabilities surpass GPT‑5.5 while improving honesty and alignment.

AI AgentsAnthropicClaude Opus 4.8
0 likes · 12 min read
Anthropic Unveils Claude Opus 4.8: Same Price, Agent Power Beats GPT‑5.5
AI Large Model Application Practice
AI Large Model Application Practice
Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Preview: Key Technical Highlights, Benchmarks, and Pricing

The DeepSeek‑V4 preview details two model variants—Pro and Flash—with trillion‑scale parameters, outlines benchmark scores that surpass or match leading overseas models across code generation, real‑world fixes, engineering tasks, and world knowledge, and explains core innovations, pricing, API endpoints, and open‑source licensing.

APIDeepSeekHybrid Attention
0 likes · 7 min read
DeepSeek V4 Preview: Key Technical Highlights, Benchmarks, and Pricing
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Feb 20, 2026 · Artificial Intelligence

Google Reclaims AI Throne with Gemini 3.1 Pro, Achieving 77.1% ARC‑AGI‑2 Score

Google’s Gemini 3.1 Pro, the latest upgrade to the Gemini 3 series, achieves a verified 77.1 % score on the ARC‑AGI‑2 reasoning benchmark—more than double the performance of Gemini 3 Pro—while leading in GPQA, LiveCodeBench Pro, SWE‑Bench Verified, and MMMLU tests, and is now being rolled out to developers, enterprises and consumers with detailed pricing and integration options.

AI benchmarkingARC-AGI-2Gemini 3.1 Pro
0 likes · 9 min read
Google Reclaims AI Throne with Gemini 3.1 Pro, Achieving 77.1% ARC‑AGI‑2 Score
Wuming AI
Wuming AI
Feb 20, 2026 · Artificial Intelligence

Gemini 3.1 Pro: How Google Boosted Reasoning Scores and What It Means for Developers

Google's Gemini 3.1 Pro preview raises reasoning benchmark scores dramatically, offers new pricing tiers, and is already integrated into Gemini API, CLI, Vertex AI, and consumer apps, while community demos showcase SVG animation, real‑time dashboards, 3D simulations, and heat‑transfer analysis.

AI benchmarksGemini 3.1 ProGoogle AI
0 likes · 5 min read
Gemini 3.1 Pro: How Google Boosted Reasoning Scores and What It Means for Developers
AI Insight Log
AI Insight Log
Dec 11, 2025 · Artificial Intelligence

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

OpenAI’s GPT‑5.2 launch introduces three specialized modes, achieves a record 55.6% score on SWE‑Bench Pro, demonstrates strong front‑end generation, adds a /compact API for long‑context efficiency, offers tiered pricing with cache discounts, and improves safety for younger users.

AI benchmarkingAI safetyGPT-5.2
0 likes · 6 min read
GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro
AI Algorithm Path
AI Algorithm Path
Jun 11, 2025 · Artificial Intelligence

OpenAI's O3‑Pro Model: Deep Reasoning, Pricing, Benchmarks, and Access Guide

OpenAI introduced the O3‑Pro multimodal deep‑reasoning model with an 80% price cut for O3, detailed its training via large‑scale reinforcement learning, compared its capabilities and costs against GPT‑4o, GPT‑4.1 and O3‑Pro, listed its core specs, limitations, access methods, and presented benchmark tests that highlight both strengths and weaknesses.

AIMultimodalO3-Pro
0 likes · 10 min read
OpenAI's O3‑Pro Model: Deep Reasoning, Pricing, Benchmarks, and Access Guide