Author

Wu Shixiong's Large Model Academy

We continuously share large‑model know‑how, helping you master core skills—LLM, RAG, fine‑tuning, deployment—from zero to job offer, tailored for career‑switchers, autumn recruiters, and those seeking stable large‑model positions.

111

Articles

Likes

110

Views

Comments

Latest from Wu Shixiong's Large Model Academy

100 recent articles max

Wu Shixiong's Large Model Academy

Mar 23, 2026 · Artificial Intelligence

From RAG to Deep Research: Building Autonomous AI Agents for Industry Reports

This article explains how Deep Research extends traditional Retrieval‑Augmented Generation by adding autonomous planning, multi‑step search, self‑correction, and long‑context synthesis to enable AI agents that can generate comprehensive industry analysis reports.

AI AgentAutonomous RetrievalLLM

0 likes · 18 min read

From RAG to Deep Research: Building Autonomous AI Agents for Industry Reports

Wu Shixiong's Large Model Academy

Mar 22, 2026 · Artificial Intelligence

How to Overcome MinerU’s Top 9 Limitations for Reliable Document Parsing

This article examines MinerU’s strengths and nine critical shortcomings—such as reading order errors, split tables, merged cells, OCR misrecognition, formula handling, heading hierarchy loss, output inconsistency, hardware limits, and licensing issues—and provides concrete improvement strategies and interview‑ready talking points for engineers.

Document ParsingInterview TipsMinerU

0 likes · 12 min read

How to Overcome MinerU’s Top 9 Limitations for Reliable Document Parsing

Wu Shixiong's Large Model Academy

Mar 21, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

This article breaks down the end‑to‑end retrieval function used in a RAG system, detailing each of the five stages—from request construction, hybrid vector + BM25 search, RRF fusion, cross‑encoder reranking, to threshold filtering—and provides concrete Python code, parameter choices, and performance insights.

Cross-EncoderElasticsearchHybrid Retrieval

0 likes · 13 min read

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

Wu Shixiong's Large Model Academy

Mar 20, 2026 · Artificial Intelligence

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

This article examines MinerU's strengths and nine critical shortcomings—such as layout order errors, cross‑page table splits, merged‑cell failures, OCR misrecognition, and licensing issues—and provides concrete improvement strategies, interview‑ready resume bullets, and practical response frameworks for engineers.

LLMLayout AnalysisMinerU

0 likes · 13 min read

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

Wu Shixiong's Large Model Academy

Mar 19, 2026 · Artificial Intelligence

Making LLM Answers Trustworthy: Citation Attribution and Hallucination Detection

This article explains why simple prompt‑based citation is insufficient for Retrieval‑Augmented Generation, introduces a sentence‑level attribution pipeline, combines semantic similarity with NLI verification, and presents practical hallucination detection and structured JSON output to ensure answer reliability.

LLM ReliabilityNLIPrompt Engineering

0 likes · 10 min read

Making LLM Answers Trustworthy: Citation Attribution and Hallucination Detection

Wu Shixiong's Large Model Academy

Mar 17, 2026 · Artificial Intelligence

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Chunk splitting, a critical yet often overlooked step in RAG pipelines, dramatically impacts retrieval recall and LLM output quality; this guide walks through three evolution stages—from naive fixed‑length splits to sentence‑aware overlaps and finally semantic, structure‑driven segmentation—complete with code, experiments, and practical pitfalls.

ChunkingLLMRAG

0 likes · 15 min read

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Wu Shixiong's Large Model Academy

Mar 16, 2026 · Artificial Intelligence

Designing a Complete RAG System from Zero: A Step‑by‑Step Interview Guide

This article outlines a full‑stack RAG architecture—offline parsing, query understanding, online retrieval, and context generation—explains six critical module interactions, and provides a concise interview framework for presenting the design from start to finish.

Interview PreparationLLMRAG

0 likes · 14 min read

Designing a Complete RAG System from Zero: A Step‑by‑Step Interview Guide

Wu Shixiong's Large Model Academy

Mar 15, 2026 · Artificial Intelligence

Choosing the Right Embedding and Rerank Models for RAG (Interview‑Ready Guide)

This article explains the role of embedding models in Retrieval‑Augmented Generation, compares the most popular 2024‑2025 open‑source embeddings and rerankers, offers concrete selection rules, shows how to read the MTEB leaderboard, and provides a structured answer framework for interviewers.

AIEmbeddingMTEB

0 likes · 13 min read

Choosing the Right Embedding and Rerank Models for RAG (Interview‑Ready Guide)

Wu Shixiong's Large Model Academy

Mar 14, 2026 · Interview Experience

How to Turn Your RAG Project into an Interview‑Winning Resume Bullet

This guide shows how to translate concrete RAG project work—mixing retrieval, embedding fine‑tuning, and reranking—into concise, quantified resume bullet points that instantly signal depth to interviewers and prepare you for the detailed follow‑up questions they will ask.

AIInterviewRAG

0 likes · 11 min read

How to Turn Your RAG Project into an Interview‑Winning Resume Bullet

Wu Shixiong's Large Model Academy

Mar 13, 2026 · Artificial Intelligence

Why Every RAG System Needs Smart Query Understanding and Routing

The article explains how diverse user queries in a RAG‑based insurance system require intent classification, entity extraction, and multi‑path routing to choose between vector search, calculation, database lookup, or chit‑chat, and outlines practical rule‑ML‑LLM hybrid solutions with safety safeguards.

LLMRAGRouting

0 likes · 11 min read

Why Every RAG System Needs Smart Query Understanding and Routing