Author

Old Zhang's AI Learning

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

210

Articles

Likes

266

Views

Comments

Latest from Old Zhang's AI Learning

100 recent articles max

Old Zhang's AI Learning

May 21, 2026 · Artificial Intelligence

SkillOS: Enabling Agents to Self‑Manage Their Skills

SkillOS reframes skill management for LLM agents as a long‑horizon reinforcement‑learning problem, letting a trainable Skill Curator automatically insert, update, or delete markdown‑based skills, which the frozen Agent Executor then consumes, improving memory‑free performance and cross‑task transfer.

LLM agentsMarkdownSkillOS

0 likes · 6 min read

SkillOS: Enabling Agents to Self‑Manage Their Skills

Old Zhang's AI Learning

May 20, 2026 · Artificial Intelligence

Qwen 3.7‑Max vs Claude 4.7: 7 In‑Depth Tests Reveal a Smooth, Powerful Model

The author evaluates Alibaba’s newly released Qwen 3.7‑Max across seven rigorous tasks—including reading comprehension, HTML fireworks generation, 3D particle visualizations, PDF‑to‑PPT conversion, Excel data analysis, GitHub trending scraping, and complex video generation—showing it often surpasses GPT‑5.5‑level models and rivals Claude 4.7, especially in long‑duration agent tasks.

AI BenchmarkAgentClaude 4.7

0 likes · 9 min read

Qwen 3.7‑Max vs Claude 4.7: 7 In‑Depth Tests Reveal a Smooth, Powerful Model

Old Zhang's AI Learning

May 19, 2026 · Artificial Intelligence

ByteDance’s Agent Plan Enhances Hermes Agent and Claude Code with Models, Seedance Skills, and Web Search

The article examines Volcano Engine’s new Agent Plan, detailing how its bundled flagship models, Seedance image and video generation skills, web‑search and memory capabilities streamline tasks such as browser‑plugin replication, data‑analysis report creation, full‑stack web dashboards, PDF translation, PPT generation, and Three.js visualizations within Claude Code and Hermes Agent, while comparing it to the earlier Coding Plan model.

AI agentsAgent PlanByteDance

0 likes · 8 min read

ByteDance’s Agent Plan Enhances Hermes Agent and Claude Code with Models, Seedance Skills, and Web Search

Old Zhang's AI Learning

May 18, 2026 · Artificial Intelligence

Testing a Cloud AI Agent: From Data Analysis to PPT to Video with a Single Input

The author walks through a hands‑on test of the Skywork cloud AI Agent, showing how it can ingest exported Excel data, generate a data‑analysis report, automatically create a PPT, produce narrated video and images, all via a single input without any local deployment.

AI agentData AnalysisPPT generation

0 likes · 8 min read

Testing a Cloud AI Agent: From Data Analysis to PPT to Video with a Single Input

Old Zhang's AI Learning

May 17, 2026 · Mobile Development

How Gemini Intelligence Turns Android Phones into Personal Assistants

Google's Gemini Intelligence upgrades Android from an operating system to an AI-driven platform, enabling cross‑app automation, Chrome‑based browsing tasks, intelligent autofill, spoken‑to‑text messaging, and natural‑language widget creation, while reshaping hardware strategy and developer interfaces.

AIAndroidCross-app automation

0 likes · 6 min read

How Gemini Intelligence Turns Android Phones into Personal Assistants

Old Zhang's AI Learning

May 17, 2026 · Artificial Intelligence

Why DeepSeek V4 Flash’s Quantized Model Is Gaining Traction

The DeepSeek V4 Flash quantized GGUF model and the dedicated ds4 inference engine, both released by antirez, offer dramatically reduced activation parameters, massive 1‑million‑token context windows, aggressive KV‑cache compression and hardware‑specific quantizations that enable smooth local inference on high‑memory Macs and CUDA machines, while sacrificing generality for performance.

DeepSeek V4 FlashGGUFLLM inference

0 likes · 11 min read

Why DeepSeek V4 Flash’s Quantized Model Is Gaining Traction

Old Zhang's AI Learning

May 16, 2026 · Artificial Intelligence

Inside X’s New For‑You Recommendation Pipeline: What Creators Must Know

The May 15 open‑source release of X’s For‑You recommendation system reveals a full pipeline—from query hydration and candidate sourcing to multi‑stage scoring—showing that the platform predicts a range of user actions, emphasizes content‑level signals, and offers creators concrete guidance to improve visibility.

GroxMachine LearningPhoenix

0 likes · 17 min read

Inside X’s New For‑You Recommendation Pipeline: What Creators Must Know

Old Zhang's AI Learning

May 16, 2026 · Artificial Intelligence

vLLM 0.21.0 Arrives: Speculative Decoding Now Supports Reasoning Models

The vLLM 0.21.0 release brings five major updates—including Transformers v4 deprecation, a C++20 build requirement, KV offload with hybrid memory, speculative decoding that respects thinking budgets, and a Blackwell token‑speed backend—while offering detailed upgrade guidance for different user groups.

C++20KV CacheSpeculative Decoding

0 likes · 12 min read

vLLM 0.21.0 Arrives: Speculative Decoding Now Supports Reasoning Models

Old Zhang's AI Learning

May 16, 2026 · Artificial Intelligence

Can Your PC Run Large Language Models? Meet BenchLoop, the Local Benchmarking Tool

BenchLoop is a CLI‑plus‑Web application that lets you reproducibly benchmark locally‑run LLMs across seven suites—including speed, tool‑calling, coding and agent tasks—while recording hardware details, scoring results with a weighted formula, and optionally publishing them to a public leaderboard.

AI evaluationBenchLoopLLM benchmarking

0 likes · 14 min read

Can Your PC Run Large Language Models? Meet BenchLoop, the Local Benchmarking Tool

Old Zhang's AI Learning

May 15, 2026 · Artificial Intelligence

Alibaba’s Qoder 1.0 Transforms Desktop AI Coding – Hands‑On Review

Qoder 1.0 upgrades from a 0.x prototype to a full‑featured AI IDE with a new independent Quest view, multi‑agent parallelism, end‑to‑end delivery, long‑term memory, extensible expert teams, and full‑stack quality checks, demonstrated by recreating a browser extension in minutes.

AI IDEAgentic CodingBrowser Agent

0 likes · 14 min read

Alibaba’s Qoder 1.0 Transforms Desktop AI Coding – Hands‑On Review