Tagged articles
9 articles
Old Zhang's AI Learning
Mar 12, 2026 · Artificial Intelligence

Distilling Claude Opus 4.6 into Qwen3.5‑27B: High‑Quality Reasoning on a Single RTX 3090

The article details how chain‑of‑thought data from Claude Opus 4.6 was distilled into the 27‑billion‑parameter Qwen3.5‑27B model using Unsloth and LoRA, enabling full‑context inference on a single RTX 3090 or 4090. It also covers performance numbers, hyper‑parameter tips, benchmark gains, and the trade‑off of losing multimodal abilities.

Claude Opus 4.6 · GPU inference · LoRA
7 min read
AI Engineering
Feb 20, 2026 · Artificial Intelligence

Gemini 3.1 Pro Doubles Reasoning Power and Outperforms Claude Opus 4.6

Google's Gemini 3.1 Pro achieves a 77.1% ARC‑AGI‑2 score, more than double its predecessor's. It leads in multiple benchmark categories, halves inference cost compared to top rivals, and demonstrates advanced multimodal and programming capabilities in real‑world demos.

AI benchmarks · ARC-AGI-2 · Claude Opus 4.6
9 min read
Black & White Path
Feb 10, 2026 · Artificial Intelligence

Claude Opus 4.6 Finds 500 Zero‑Day Bugs Out‑of‑the‑Box, Redefining Code Audits

Anthropic’s Claude Opus 4.6 shattered AI benchmarks in coding, reasoning and search. When sandboxed with standard fuzzers and debuggers, it also autonomously uncovered over 500 high‑severity zero‑day vulnerabilities, including a GhostScript crash and buffer‑overflow bugs. The findings prompted a market sell‑off and raised both excitement and concerns about misuse.

AI code audit · Anthropic · Claude Opus 4.6
5 min read
Fun with Large Models
Feb 8, 2026 · Artificial Intelligence

How the US‑China LLM ‘War’ Plays Out: Deep Dive into Claude Opus 4.6 vs GPT‑5.3 CodeX

The article provides a detailed technical comparison of Anthropic's Claude Opus 4.6 and OpenAI's GPT‑5.3‑Codex, covering performance gains, context window size, agent teamwork, programming benchmarks, and new features such as adaptive thinking and interactive development. It also offers guidance on choosing the right model for specific workflows.

AI model comparison · Claude Opus 4.6 · GPT-5.3-Codex
15 min read
AI Insight Log
Feb 7, 2026 · Artificial Intelligence

Claude Opus 4.6 Unveils ‘Swarm’ Agent Teams: One Prompt, 16 Parallel AIs in Action

Claude Opus 4.6 and GPT‑5.3‑Codex both introduce Agent Teams that let a single user orchestrate up to 16 parallel AI agents, cutting latency by 78%, boosting accuracy to 78.4%, and enabling feats like building a C compiler for the Linux kernel, with Kimi K2.5 offering a more user‑friendly, zero‑code alternative.

AI Collaboration · Agent Teams · C compiler generation
13 min read
Shuge Unlimited
Feb 6, 2026 · Artificial Intelligence

Claude 4.6 vs GPT‑5.3: How Simultaneous Model Releases Are Redefining SaaS

On February 5, 2026, Anthropic and OpenAI launched Claude Opus 4.6 and GPT‑5.3‑Codex within an hour of each other, sparking a fierce AI model rivalry. The releases bring 1‑million‑token context windows, adaptive reasoning, and self‑training, and mark a shift from AI tools to AI colleagues that is reshaping SaaS, developer workflows, and security considerations.

AI agents · Claude Opus 4.6 · GPT-5.3
13 min read
AI Insight Log
Feb 5, 2026 · Artificial Intelligence

GPT-5.3-Codex vs Claude Opus 4.6: Is the 15% Terminal Coding Boost the Real Game‑Changer for Developers?

The article objectively compares OpenAI's GPT‑5.3‑Codex and Anthropic's Claude Opus 4.6 across Terminal‑Bench 2.0 and SWE‑Bench, revealing a 15% terminal‑coding edge for Codex, modest gains in pure code generation, and a strategic split between specialist and generalist AI approaches.

AI model comparison · Claude Opus 4.6 · GPT-5.3-Codex
9 min read