Old Zhang's AI Learning
Author

Old Zhang's AI Learning

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

210
Articles
0
Likes
266
Views
0
Comments
Recent Articles

Latest from Old Zhang's AI Learning

100 recent articles max
Old Zhang's AI Learning
Old Zhang's AI Learning
May 3, 2026 · Artificial Intelligence

One‑Command Setup of Reusable Claude Code Configurations (Full Toolkit)

The article reviews the GitHub project claude-code-templates, which aggregates over 100 reusable Claude Code assets—including agents, commands, MCPs, settings, hooks, and skills—into an npm‑like repository and a web dashboard, showing how a single npx command can install a complete development stack, detailing usage examples, pros, cons, and target audiences.

AI codingCLIClaude Code
0 likes · 9 min read
One‑Command Setup of Reusable Claude Code Configurations (Full Toolkit)
Old Zhang's AI Learning
Old Zhang's AI Learning
May 3, 2026 · Artificial Intelligence

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Qwen‑Scope adds a sparse autoencoder (SAE) to the Qwen‑3.5‑27B model, exposing a top‑K 50‑feature, residual‑stream hook across all 64 layers for interpretability, controllable generation, data analysis, and training diagnostics, while detailing installation, usage, and practical trade‑offs.

InterpretabilityLarge Language ModelQwen
0 likes · 11 min read
Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B
Old Zhang's AI Learning
Old Zhang's AI Learning
May 2, 2026 · Artificial Intelligence

Andrew Ng’s AI Prompting Course: Deep Research, Writing, Data Analysis

The review dissects Andrew Ng’s 2026 “AI Prompting for Everyone” course, outlining its three modules—information retrieval tiers, AI‑assisted writing with progressive outlining, and building apps or analyzing data via code execution—while highlighting the Deep Research agentic loop, AI‑slop detection, and a decision‑tree for choosing the right AI tool.

AI promptingAI writingCode Execution
0 likes · 12 min read
Andrew Ng’s AI Prompting Course: Deep Research, Writing, Data Analysis
Old Zhang's AI Learning
Old Zhang's AI Learning
May 1, 2026 · Artificial Intelligence

Claude Code Hackathon Top 3: How a Turkish Doctor Won Gold with AI‑Powered MedKit

The Anthropic "Built with Opus 4.7" hackathon showcased three standout projects—MedKit, Wrench Board, and Maieutic—each built by creators from medicine, electronics repair, and education, demonstrating how deep domain expertise combined with Claude Code agents can deliver real‑world AI solutions.

AI agentsClaude CodeElectronics repair
0 likes · 10 min read
Claude Code Hackathon Top 3: How a Turkish Doctor Won Gold with AI‑Powered MedKit
Old Zhang's AI Learning
Old Zhang's AI Learning
May 1, 2026 · Artificial Intelligence

NVIDIA’s Open‑Source Multimodal Nemotron 3 Nano Omni: Run Locally on Consumer GPUs (English‑Only)

NVIDIA’s Nemotron 3 Nano Omni 30B‑A3B‑Reasoning model, an open‑source multimodal LLM with 30 B parameters, 256K context and video‑audio‑image‑text capabilities, outperforms comparable models by up to 9.2× in video throughput, runs on consumer GPUs via 4‑bit GGUF quantization, but currently supports only English input.

GGUFGPUMultimodal
0 likes · 17 min read
NVIDIA’s Open‑Source Multimodal Nemotron 3 Nano Omni: Run Locally on Consumer GPUs (English‑Only)
Old Zhang's AI Learning
Old Zhang's AI Learning
May 1, 2026 · Artificial Intelligence

DeepSeek‑V4 Local Deployment: How SGLang Overcomes the Architecture Challenges

The article analyzes DeepSeek‑V4's architectural innovations—including mixed sparse attention, mHC, and native FP4 weights—explains SGLang's ShadowRadix, HiSparse, and in‑graph speculative decoding solutions, presents benchmark gains, provides Docker deployment steps, and warns of key pitfalls for long‑context inference.

DeepSeek V4HiSparseSGLang
0 likes · 15 min read
DeepSeek‑V4 Local Deployment: How SGLang Overcomes the Architecture Challenges
Old Zhang's AI Learning
Old Zhang's AI Learning
Apr 30, 2026 · Artificial Intelligence

How Codex, Claude, and Gemini Are Battling for the Desktop AI Market

The article reviews the emerging desktop clients for AI assistants—Claude, Gemini, and Codex—detailing their features, integration advantages, new capabilities like voice input and screen‑based memory, as well as limitations such as rate‑limit consumption and security risks.

AI assistantsAI desktop appsClaude
0 likes · 5 min read
How Codex, Claude, and Gemini Are Battling for the Desktop AI Market
Old Zhang's AI Learning
Old Zhang's AI Learning
Apr 29, 2026 · Artificial Intelligence

Top 10 Open‑Source LLM Benchmarks: Scores, Rankings, and What They Test

This article walks through ten mainstream open‑source large‑model benchmarks—SWE‑bench Verified and Pro, MMLU‑Pro, GPQA Diamond, HLE, AIME, HMMT, olmOCR‑bench, Terminal‑Bench 2.0, and EvasionBench—explaining their data, evaluation metrics, current leading models, and the capability dimensions they reveal.

AI evaluationLLM benchmarksMMLU-Pro
0 likes · 20 min read
Top 10 Open‑Source LLM Benchmarks: Scores, Rankings, and What They Test
Old Zhang's AI Learning
Old Zhang's AI Learning
Apr 28, 2026 · Frontend Development

Recreating a Social Media Anti‑Addiction Chrome Extension with AI (Open‑Source)

The author adapts a Japanese developer’s forced‑break Chrome extension using GitHub Copilot (Codex) to create an open‑source add‑on that blocks prolonged browsing on any specified sites—by default Zhihu and Weibo—displaying a full‑screen cat overlay after a configurable time.

AI Code GenerationAnti-AddictionChrome Extension
0 likes · 3 min read
Recreating a Social Media Anti‑Addiction Chrome Extension with AI (Open‑Source)
Old Zhang's AI Learning
Old Zhang's AI Learning
Apr 28, 2026 · Artificial Intelligence

vLLM 0.20 Arrives with DeepSeek V4 Support – What’s New?

The vLLM 0.20.0 release dramatically upgrades the inference engine with DeepSeek V4 support, default CUDA 13, PyTorch 2.11, Transformers v5 compatibility, FlashAttention 4 MLA prefill, TurboQuant 2‑bit KV cache, an online quantization front‑end, IR enhancements, Model Runner V2 features, and a slew of new models, while providing detailed installation and upgrade guidance.

CUDA 13DeepSeek V4FlashAttention
0 likes · 10 min read
vLLM 0.20 Arrives with DeepSeek V4 Support – What’s New?