Author

Wu Shixiong's Large Model Academy

We continuously share large‑model know‑how, helping you master core skills—LLM, RAG, fine‑tuning, deployment—from zero to job offer, tailored for career‑switchers, autumn recruiters, and those seeking stable large‑model positions.

109

Articles

Likes

109

Views

Comments

Latest from Wu Shixiong's Large Model Academy

100 recent articles max

Wu Shixiong's Large Model Academy

Mar 10, 2026 · Artificial Intelligence

RRF vs Weighted Sum in RAG: Boost Retrieval, Solve Timeliness & Interview Challenges

This article explains why Reciprocal Rank Fusion often outperforms weighted‑sum fusion in Retrieval‑Augmented Generation, presents a three‑layer approach to keep knowledge bases timely, discusses HyDE’s cost‑benefit trade‑offs, and offers concrete interview‑ready answers for common RAG follow‑up questions.

HyDEHybrid RetrievalInterview Tips

0 likes · 13 min read

RRF vs Weighted Sum in RAG: Boost Retrieval, Solve Timeliness & Interview Challenges

Wu Shixiong's Large Model Academy

Mar 7, 2026 · Artificial Intelligence

Mastering Offline Document Parsing for RAG: From PDFs to Multimodal Knowledge Bases

This article provides a comprehensive guide to offline document parsing for Retrieval‑Augmented Generation, covering multi‑format extraction, layout analysis, OCR pitfalls, chunking strategies, hierarchical metadata tagging, and how these steps directly affect retrieval accuracy and overall RAG performance.

Document ParsingRAGmetadata

0 likes · 14 min read

Mastering Offline Document Parsing for RAG: From PDFs to Multimodal Knowledge Bases

Wu Shixiong's Large Model Academy

Feb 4, 2026 · Artificial Intelligence

Why LLM Agents Rush to Call Tools and How to Stop Them

The article explains that premature tool calls in LLM agents stem from a data‑distribution bias in fine‑tuning, and it presents practical fixes such as adding non‑tool samples, enforcing a Thought chain, and using negative sampling to teach the model when to think before acting.

AgentLLMThought Chain

0 likes · 10 min read

Why LLM Agents Rush to Call Tools and How to Stop Them

Wu Shixiong's Large Model Academy

Feb 3, 2026 · Artificial Intelligence

Why Loss Masking Is the Hidden Key to Effective LLM Fine‑Tuning

The article explains how loss masking in supervised fine‑tuning of large language models prevents the model from learning irrelevant tokens such as user inputs, system prompts, tool outputs, and padding, thereby focusing training on the assistant’s responses and improving performance and generalization.

AI trainingLLMfine-tuning

0 likes · 10 min read

Why Loss Masking Is the Hidden Key to Effective LLM Fine‑Tuning

Wu Shixiong's Large Model Academy

Dec 18, 2025 · Artificial Intelligence

Why Text2SQL Must Be Integrated into AI Agents – An Interviewer's Guide

The article explains how Text2SQL should be treated as a read‑only tool within an AI Agent, covering its role in function calls, dynamic schema pruning, ambiguity handling, SQL safety checks, result validation, semantic caching, and logging to build a production‑grade system.

AI agentLLMSQL Safety

0 likes · 11 min read

Why Text2SQL Must Be Integrated into AI Agents – An Interviewer's Guide

Wu Shixiong's Large Model Academy

Dec 17, 2025 · Artificial Intelligence

How Should Text2SQL Fit Inside an Agent System? Practical Guide for Interviews

This article explains the proper role of Text2SQL within an Agent architecture, detailing its placement as a tool, function‑call implementation, decision logic for invocation, multi‑turn handling, failure management, and how to clearly present these concepts in technical interviews.

AIAgentLLM

0 likes · 9 min read

How Should Text2SQL Fit Inside an Agent System? Practical Guide for Interviews

Wu Shixiong's Large Model Academy

Dec 15, 2025 · Artificial Intelligence

Mastering Text2SQL: From Schema Design to Secure Multi‑Step LLM Pipelines

This article explains how Text2SQL works by teaching LLMs to understand a closed‑world database schema, constructing tightly constrained prompts, validating generated SQL, handling execution errors, and using a second LLM call to translate results into natural language, while highlighting common pitfalls and engineering best practices.

LLMSQL ValidationText2SQL

0 likes · 9 min read

Mastering Text2SQL: From Schema Design to Secure Multi‑Step LLM Pipelines

Wu Shixiong's Large Model Academy

Dec 12, 2025 · Artificial Intelligence

Why Fixing Bad Cases Beats Adding More Data in RLHF

In industrial RLHF, repairing bad cases—structural error samples—provides explicit alignment signals that improve model capability far more efficiently than simply increasing data volume, because it teaches the model how to correct mistakes rather than just exposing it to more examples.

Capability ImprovementRLHFbad case

0 likes · 9 min read

Why Fixing Bad Cases Beats Adding More Data in RLHF

Wu Shixiong's Large Model Academy

Dec 11, 2025 · Artificial Intelligence

Why Reward Models Need Reasoning: From Scalar Scores to RM‑R1

Interviewers increasingly ask why modern reward models must go beyond scalar scores to incorporate reasoning, and this article explains the limitations of traditional scalar reward models, the benefits of the RM‑R1 framework, and how reasoning‑based rewards improve alignment, stability, and task performance in large language model training.

AI alignmentLLMRLHF

0 likes · 11 min read

Why Reward Models Need Reasoning: From Scalar Scores to RM‑R1

Wu Shixiong's Large Model Academy

Dec 10, 2025 · Artificial Intelligence

Why RLHF Success Relies on Data Engineering, Not Just Model Tricks

The article explains that the real difficulty of RLHF lies in designing and curating high‑quality preference data, building robust reward models through bad‑case rewriting, human‑in‑the‑loop labeling, and inference‑based reward modeling, while algorithmic details like PPO are secondary concerns.

GRPORLHFRM-R1

0 likes · 9 min read

Why RLHF Success Relies on Data Engineering, Not Just Model Tricks