Author

Old Zhang's AI Learning

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

210

Articles

Likes

266

Views

Comments

Latest from Old Zhang's AI Learning

100 recent articles max

Old Zhang's AI Learning

May 3, 2026 · Artificial Intelligence

One‑Command Setup of Reusable Claude Code Configurations (Full Toolkit)

The article reviews the GitHub project claude-code-templates, which aggregates over 100 reusable Claude Code assets—including agents, commands, MCPs, settings, hooks, and skills—into an npm‑like repository and a web dashboard, showing how a single npx command can install a complete development stack, detailing usage examples, pros, cons, and target audiences.

AI codingCLIClaude Code

0 likes · 9 min read

One‑Command Setup of Reusable Claude Code Configurations (Full Toolkit)

Old Zhang's AI Learning

May 3, 2026 · Artificial Intelligence

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Qwen‑Scope adds a sparse autoencoder (SAE) to the Qwen‑3.5‑27B model, exposing a top‑K 50‑feature, residual‑stream hook across all 64 layers for interpretability, controllable generation, data analysis, and training diagnostics, while detailing installation, usage, and practical trade‑offs.

InterpretabilityLarge Language ModelQwen

0 likes · 11 min read

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Old Zhang's AI Learning

May 2, 2026 · Artificial Intelligence

Andrew Ng’s AI Prompting Course: Deep Research, Writing, Data Analysis

The review dissects Andrew Ng’s 2026 “AI Prompting for Everyone” course, outlining its three modules—information retrieval tiers, AI‑assisted writing with progressive outlining, and building apps or analyzing data via code execution—while highlighting the Deep Research agentic loop, AI‑slop detection, and a decision‑tree for choosing the right AI tool.

AI promptingAI writingCode Execution

0 likes · 12 min read

Andrew Ng’s AI Prompting Course: Deep Research, Writing, Data Analysis

Old Zhang's AI Learning

May 1, 2026 · Artificial Intelligence

Claude Code Hackathon Top 3: How a Turkish Doctor Won Gold with AI‑Powered MedKit

The Anthropic "Built with Opus 4.7" hackathon showcased three standout projects—MedKit, Wrench Board, and Maieutic—each built by creators from medicine, electronics repair, and education, demonstrating how deep domain expertise combined with Claude Code agents can deliver real‑world AI solutions.

AI agentsClaude CodeElectronics repair

0 likes · 10 min read

Claude Code Hackathon Top 3: How a Turkish Doctor Won Gold with AI‑Powered MedKit

Old Zhang's AI Learning

May 1, 2026 · Artificial Intelligence

NVIDIA’s Open‑Source Multimodal Nemotron 3 Nano Omni: Run Locally on Consumer GPUs (English‑Only)

NVIDIA’s Nemotron 3 Nano Omni 30B‑A3B‑Reasoning model, an open‑source multimodal LLM with 30 B parameters, 256K context and video‑audio‑image‑text capabilities, outperforms comparable models by up to 9.2× in video throughput, runs on consumer GPUs via 4‑bit GGUF quantization, but currently supports only English input.

GGUFGPUMultimodal

0 likes · 17 min read

NVIDIA’s Open‑Source Multimodal Nemotron 3 Nano Omni: Run Locally on Consumer GPUs (English‑Only)

Old Zhang's AI Learning

May 1, 2026 · Artificial Intelligence

DeepSeek‑V4 Local Deployment: How SGLang Overcomes the Architecture Challenges

The article analyzes DeepSeek‑V4's architectural innovations—including mixed sparse attention, mHC, and native FP4 weights—explains SGLang's ShadowRadix, HiSparse, and in‑graph speculative decoding solutions, presents benchmark gains, provides Docker deployment steps, and warns of key pitfalls for long‑context inference.

DeepSeek V4HiSparseSGLang

0 likes · 15 min read

DeepSeek‑V4 Local Deployment: How SGLang Overcomes the Architecture Challenges

Old Zhang's AI Learning

Apr 30, 2026 · Artificial Intelligence

How Codex, Claude, and Gemini Are Battling for the Desktop AI Market

The article reviews the emerging desktop clients for AI assistants—Claude, Gemini, and Codex—detailing their features, integration advantages, new capabilities like voice input and screen‑based memory, as well as limitations such as rate‑limit consumption and security risks.

AI assistantsAI desktop appsClaude

0 likes · 5 min read

How Codex, Claude, and Gemini Are Battling for the Desktop AI Market

Old Zhang's AI Learning

Apr 29, 2026 · Artificial Intelligence

Top 10 Open‑Source LLM Benchmarks: Scores, Rankings, and What They Test

This article walks through ten mainstream open‑source large‑model benchmarks—SWE‑bench Verified and Pro, MMLU‑Pro, GPQA Diamond, HLE, AIME, HMMT, olmOCR‑bench, Terminal‑Bench 2.0, and EvasionBench—explaining their data, evaluation metrics, current leading models, and the capability dimensions they reveal.

AI evaluationLLM benchmarksMMLU-Pro

0 likes · 20 min read

Top 10 Open‑Source LLM Benchmarks: Scores, Rankings, and What They Test

Old Zhang's AI Learning

Apr 28, 2026 · Frontend Development

Recreating a Social Media Anti‑Addiction Chrome Extension with AI (Open‑Source)

The author adapts a Japanese developer’s forced‑break Chrome extension using GitHub Copilot (Codex) to create an open‑source add‑on that blocks prolonged browsing on any specified sites—by default Zhihu and Weibo—displaying a full‑screen cat overlay after a configurable time.

AI Code GenerationAnti-AddictionChrome Extension

0 likes · 3 min read

Recreating a Social Media Anti‑Addiction Chrome Extension with AI (Open‑Source)

Old Zhang's AI Learning

Apr 28, 2026 · Artificial Intelligence

vLLM 0.20 Arrives with DeepSeek V4 Support – What’s New?

The vLLM 0.20.0 release dramatically upgrades the inference engine with DeepSeek V4 support, default CUDA 13, PyTorch 2.11, Transformers v5 compatibility, FlashAttention 4 MLA prefill, TurboQuant 2‑bit KV cache, an online quantization front‑end, IR enhancements, Model Runner V2 features, and a slew of new models, while providing detailed installation and upgrade guidance.

CUDA 13DeepSeek V4FlashAttention

0 likes · 10 min read

vLLM 0.20 Arrives with DeepSeek V4 Support – What’s New?