Old Zhang's AI Learning
Author

Old Zhang's AI Learning

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

210
Articles
0
Likes
266
Views
0
Comments
Recent Articles

Latest from Old Zhang's AI Learning

100 recent articles max
Old Zhang's AI Learning
Old Zhang's AI Learning
May 21, 2026 · Artificial Intelligence

SkillOS: Enabling Agents to Self‑Manage Their Skills

SkillOS reframes skill management for LLM agents as a long‑horizon reinforcement‑learning problem, letting a trainable Skill Curator automatically insert, update, or delete markdown‑based skills, which the frozen Agent Executor then consumes, improving memory‑free performance and cross‑task transfer.

LLM agentsMarkdownSkillOS
0 likes · 6 min read
SkillOS: Enabling Agents to Self‑Manage Their Skills
Old Zhang's AI Learning
Old Zhang's AI Learning
May 20, 2026 · Artificial Intelligence

Qwen 3.7‑Max vs Claude 4.7: 7 In‑Depth Tests Reveal a Smooth, Powerful Model

The author evaluates Alibaba’s newly released Qwen 3.7‑Max across seven rigorous tasks—including reading comprehension, HTML fireworks generation, 3D particle visualizations, PDF‑to‑PPT conversion, Excel data analysis, GitHub trending scraping, and complex video generation—showing it often surpasses GPT‑5.5‑level models and rivals Claude 4.7, especially in long‑duration agent tasks.

AI BenchmarkAgentClaude 4.7
0 likes · 9 min read
Qwen 3.7‑Max vs Claude 4.7: 7 In‑Depth Tests Reveal a Smooth, Powerful Model
Old Zhang's AI Learning
Old Zhang's AI Learning
May 19, 2026 · Artificial Intelligence

ByteDance’s Agent Plan Enhances Hermes Agent and Claude Code with Models, Seedance Skills, and Web Search

The article examines Volcano Engine’s new Agent Plan, detailing how its bundled flagship models, Seedance image and video generation skills, web‑search and memory capabilities streamline tasks such as browser‑plugin replication, data‑analysis report creation, full‑stack web dashboards, PDF translation, PPT generation, and Three.js visualizations within Claude Code and Hermes Agent, while comparing it to the earlier Coding Plan model.

AI agentsAgent PlanByteDance
0 likes · 8 min read
ByteDance’s Agent Plan Enhances Hermes Agent and Claude Code with Models, Seedance Skills, and Web Search
Old Zhang's AI Learning
Old Zhang's AI Learning
May 17, 2026 · Mobile Development

How Gemini Intelligence Turns Android Phones into Personal Assistants

Google's Gemini Intelligence upgrades Android from an operating system to an AI-driven platform, enabling cross‑app automation, Chrome‑based browsing tasks, intelligent autofill, spoken‑to‑text messaging, and natural‑language widget creation, while reshaping hardware strategy and developer interfaces.

AIAndroidCross-app automation
0 likes · 6 min read
How Gemini Intelligence Turns Android Phones into Personal Assistants
Old Zhang's AI Learning
Old Zhang's AI Learning
May 17, 2026 · Artificial Intelligence

Why DeepSeek V4 Flash’s Quantized Model Is Gaining Traction

The DeepSeek V4 Flash quantized GGUF model and the dedicated ds4 inference engine, both released by antirez, offer dramatically reduced activation parameters, massive 1‑million‑token context windows, aggressive KV‑cache compression and hardware‑specific quantizations that enable smooth local inference on high‑memory Macs and CUDA machines, while sacrificing generality for performance.

DeepSeek V4 FlashGGUFLLM inference
0 likes · 11 min read
Why DeepSeek V4 Flash’s Quantized Model Is Gaining Traction
Old Zhang's AI Learning
Old Zhang's AI Learning
May 16, 2026 · Artificial Intelligence

Inside X’s New For‑You Recommendation Pipeline: What Creators Must Know

The May 15 open‑source release of X’s For‑You recommendation system reveals a full pipeline—from query hydration and candidate sourcing to multi‑stage scoring—showing that the platform predicts a range of user actions, emphasizes content‑level signals, and offers creators concrete guidance to improve visibility.

GroxMachine LearningPhoenix
0 likes · 17 min read
Inside X’s New For‑You Recommendation Pipeline: What Creators Must Know
Old Zhang's AI Learning
Old Zhang's AI Learning
May 16, 2026 · Artificial Intelligence

vLLM 0.21.0 Arrives: Speculative Decoding Now Supports Reasoning Models

The vLLM 0.21.0 release brings five major updates—including Transformers v4 deprecation, a C++20 build requirement, KV offload with hybrid memory, speculative decoding that respects thinking budgets, and a Blackwell token‑speed backend—while offering detailed upgrade guidance for different user groups.

C++20KV CacheSpeculative Decoding
0 likes · 12 min read
vLLM 0.21.0 Arrives: Speculative Decoding Now Supports Reasoning Models
Old Zhang's AI Learning
Old Zhang's AI Learning
May 16, 2026 · Artificial Intelligence

Can Your PC Run Large Language Models? Meet BenchLoop, the Local Benchmarking Tool

BenchLoop is a CLI‑plus‑Web application that lets you reproducibly benchmark locally‑run LLMs across seven suites—including speed, tool‑calling, coding and agent tasks—while recording hardware details, scoring results with a weighted formula, and optionally publishing them to a public leaderboard.

AI evaluationBenchLoopLLM benchmarking
0 likes · 14 min read
Can Your PC Run Large Language Models? Meet BenchLoop, the Local Benchmarking Tool
Old Zhang's AI Learning
Old Zhang's AI Learning
May 15, 2026 · Artificial Intelligence

Alibaba’s Qoder 1.0 Transforms Desktop AI Coding – Hands‑On Review

Qoder 1.0 upgrades from a 0.x prototype to a full‑featured AI IDE with a new independent Quest view, multi‑agent parallelism, end‑to‑end delivery, long‑term memory, extensible expert teams, and full‑stack quality checks, demonstrated by recreating a browser extension in minutes.

AI IDEAgentic CodingBrowser Agent
0 likes · 14 min read
Alibaba’s Qoder 1.0 Transforms Desktop AI Coding – Hands‑On Review