Wu Shixiong's Large Model Academy
Author

Wu Shixiong's Large Model Academy

We continuously share large‑model know‑how, helping you master core skills—LLM, RAG, fine‑tuning, deployment—from zero to job offer, tailored for career‑switchers, autumn recruiters, and those seeking stable large‑model positions.

109
Articles
0
Likes
109
Views
0
Comments
Recent Articles

Latest from Wu Shixiong's Large Model Academy

100 recent articles max
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Sep 18, 2025 · Artificial Intelligence

How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark

This guide explains why RAG systems often produce off‑topic answers, outlines how to measure hit‑rate, retrieval, reranking and generation metrics, provides step‑by‑step evaluation pipelines, code examples, real‑world case studies, and interview‑ready templates for diagnosing and optimizing each stage of the pipeline.

AIRAGgeneration
0 likes · 18 min read
How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Aug 23, 2025 · Artificial Intelligence

Why LoRA, QLoRA, Prompt & Prefix Tuning Are Changing Large‑Model Fine‑Tuning

This article explains the mathematical basis of LoRA, compares it with QLoRA, Prompt Tuning, Prefix Tuning and P‑tuning, shows practical PyTorch implementations, and provides mixed‑precision training tips so readers can choose the most memory‑efficient fine‑tuning method for their large language models.

LoRAPrompt TuningQLoRA
0 likes · 17 min read
Why LoRA, QLoRA, Prompt & Prefix Tuning Are Changing Large‑Model Fine‑Tuning
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Aug 20, 2025 · Artificial Intelligence

Mastering Large‑Model Interview Questions: MHA, KV‑Cache, Scaled Dot‑Product, and Speculative Decoding

This guide walks through common large‑model interview challenges, including a hands‑on implementation of multi‑head attention with KV‑cache, the mathematical reason for scaling by sqrt(dₖ), a concise speculative decoding algorithm, and systematic debugging steps for NaN loss during training.

KV CacheLarge Model InterviewMulti‑Head Attention
0 likes · 14 min read
Mastering Large‑Model Interview Questions: MHA, KV‑Cache, Scaled Dot‑Product, and Speculative Decoding
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 12, 2023 · Fundamentals

How to Compute the Shortest Distance on a Circular Road Efficiently

Given a circular road with n stations and the distances between each consecutive pair, this article explains how to determine the minimal travel distance between any two stations by evaluating both clockwise and counter‑clockwise routes, providing problem details, examples, solution logic, reference implementations in Python, Java, and C++, and complexity analysis.

ArrayJavaSimulation
0 likes · 8 min read
How to Compute the Shortest Distance on a Circular Road Efficiently