Old Zhang's AI Learning
Author


AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

210 articles · 0 likes · 266 views · 0 comments
Recent Articles

May 26, 2026 · Artificial Intelligence

How Anthropic’s Finance Skills Power Real‑World CFO Workflows

This article breaks down Anthropic’s blog on using Claude for finance, extracts reusable Skills from the open‑source repository, explains the three core CFO workflows, shows how to mount the Skills in Claude Code, Cursor and GitHub Copilot, and provides a detailed guide for adapting the Skills to Chinese data sources.

AI · Anthropic · Claude
0 likes · 18 min read
May 25, 2026 · Product Management

Boost Feishu CLI with Claude Code: Create Diagrams and PPTs in One Command

The article shows how the open‑source Feishu CLI, now integrated with Claude Code, lets users generate editable whiteboard diagrams, mind maps, cheat‑sheets, UI mockups and PPT slides directly inside Feishu documents using simple commands, streamlining documentation and collaboration for product managers, architects and developers.

AI agent · Claude Code · Feishu CLI
0 likes · 9 min read
May 24, 2026 · Artificial Intelligence

LM Studio Adds MTP Support, Boosting Qwen3.6‑35B to ~130 Tokens/s

LM Studio 0.4.14+ now implements Multi‑Token Prediction (MTP) speculative decoding, which removes the need for a separate draft model and roughly doubles token throughput; for example, Qwen3.6‑35B reaches about 130 tokens/s on an RTX 3090. The article also gives a six‑step activation guide and a list of known pitfalls.

LM Studio · MTP · Qwen3.6
0 likes · 6 min read
May 24, 2026 · Industry Insights

How a Fake vLLM PR Exposed the Risks of AI‑Generated Resume Padding

The article dissects a fabricated vLLM pull request that pretended to fix a non‑existent NVIDIA Eagle3 checkpoint bug, explains its bogus test plan, shows how AI‑assisted PR generation can flood open‑source projects, and warns of the trust damage such resume‑padding schemes cause.

AI coding agents · Eagle3 · NVIDIA
0 likes · 7 min read
May 24, 2026 · Artificial Intelligence

Taming Vibe Coding: How OpenSpec Enforces Spec‑Driven AI Coding

The article examines the pitfalls of "vibe coding" with AI assistants like Claude Code, introduces Spec‑Driven Development, compares major tools, and provides a detailed walkthrough of the open‑source OpenSpec workflow that aligns AI output with explicit specifications.

AI coding · Claude Code · OpenSpec
0 likes · 10 min read
May 23, 2026 · Artificial Intelligence

The Underrated Lifesaving Template for Qwen Local Deployment

This article analyzes the hidden pitfalls of Qwen's official Jinja chat template, explains how the community‑maintained Qwen‑Fixed‑Chat‑Templates v19 fixes rendering errors, KV‑Cache loss, token waste and agent dead‑locks, and provides step‑by‑step installation instructions for LM Studio, llama.cpp, vLLM and MLX.

Agent Loop · Chat Template · KV Cache
0 likes · 10 min read
May 23, 2026 · Artificial Intelligence

Qwopus 3.6‑27B‑v2: Trace‑Inversion Distillation Cuts Token Use by 36% and Boosts Accuracy

Qwopus 3.6‑27B‑v2 uses a Trace‑Inverter to reconstruct full step‑by‑step reasoning from compressed Claude outputs and builds two high‑quality SFT datasets. It reports 35.9% token savings, a 2.57‑point accuracy gain on MMLU‑Pro, and 75.25% success on SWE‑bench while running on a single consumer‑grade RTX 5090.

GGUF · MMLU · Qwen
0 likes · 11 min read
May 21, 2026 · Artificial Intelligence

Matt Pocock Open‑Sources His Complete .claude Skills Repository

The article reviews Matt Pocock’s newly released mattpocock/skills GitHub repository, explaining its purpose, installation steps, folder structure, core engineering skills, four common failure modes, and how its concise, composable prompts differ from Anthropic’s official skills, while offering practical recommendations for Claude Code and Codex users.

AI agents · Claude · DevOps
0 likes · 12 min read