Author

Lao Guo's Learning Space

AI learning, discussion, and hands‑on practice with self‑reflection


Latest from Lao Guo's Learning Space

62 recent articles

Lao Guo's Learning Space

May 3, 2026 · Artificial Intelligence

2026 Enterprise Guide to Large Model Fine‑Tuning: Choosing, Training, and Deploying

This comprehensive guide explains why enterprises should fine‑tune large language models instead of using raw APIs or RAG, compares six fine‑tuning techniques (Full, LoRA, QLoRA, AdaLoRA, DoRA, Prompt‑Tuning), evaluates popular toolchains, outlines a step‑by‑step workflow, presents cost analyses, real‑world case studies, and practical best‑practice recommendations for 2026.

Enterprise AI · LoRA · Model Deployment

0 likes · 18 min read


Lao Guo's Learning Space

May 2, 2026 · Industry Insights

AI News Flash: DeepSeek Multimodal Breakthrough, Codex Major Update, Grok 4.3 Launch (May 1‑2)

The AI roundup covers OpenAI's Codex upgrade with Workspace Agents and 40% token efficiency, xAI's Grok 4.3 API offering 128K context and 60% lower pricing, Ant Group's open‑source Ling 2.6‑1T model, DeepSeek's multimodal Visual Primitives framework and its sudden removal, plus the ongoing GPT‑Plus account bans and their mitigation.

AI model benchmarks · Codex · DeepSeek

0 likes · 11 min read


Lao Guo's Learning Space

Apr 30, 2026 · Artificial Intelligence

How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier

Traditional full attention becomes impractical at million‑token contexts because its compute cost grows quadratically with sequence length and its KV cache grows along with it. DeepSeek V4’s Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) compress tokens, index them sparsely, and then compute attention precisely, cutting KV cache to 10% and FLOPs to 27% while enabling a 1‑M token window on a single GPU.

CSA · HCA · KV cache compression

0 likes · 12 min read


Lao Guo's Learning Space

Apr 30, 2026 · Artificial Intelligence

Xiaomi Opens MiMo‑V2.5 and Gives 100 Trillion Free Tokens – A Must‑Grab

Xiaomi has open‑sourced its MiMo‑V2.5 series, including a 1.02T‑parameter Pro model, and is giving developers up to 100 trillion free tokens for 30 days. The article details the models' token‑efficiency benchmarks, a macOS‑like demo, MIT‑license benefits, and step‑by‑step usage instructions.

AI benchmarking · Large Language Model · MIT license

0 likes · 12 min read


Lao Guo's Learning Space

Apr 29, 2026 · Artificial Intelligence

What’s Inside GPT‑6’s ‘Spud’ Release? 5‑6 Trillion Parameters and 2 M Token Context

OpenAI’s GPT‑6 ‘Spud’ launch packs 5‑6 trillion parameters with MoE sparsity, a unified Symphony multimodal architecture, dual System‑1/2 reasoning, a 2‑million‑token window, and competitive benchmark results, while keeping pricing flat and introducing autonomous agent capabilities that reshape AI workflows.

GPT-6 · Large Language Model · Sparse MoE

0 likes · 15 min read


Lao Guo's Learning Space

Apr 29, 2026 · Big Data

Designing a Full-Stack Credit Data System: From Ingestion to Real-Time Decision

The article dissects a credit data system architecture, detailing six logical layers—from multi-source data collection and feature engineering (including graph features and feature stores) to model training, real‑time stream processing, decision engine integration, and privacy‑preserving computation—while explaining the trade‑offs, tools, and performance targets needed for accurate, low‑latency risk assessment.

Credit Scoring · Feature Store · Flink

0 likes · 16 min read


Lao Guo's Learning Space

Apr 27, 2026 · Artificial Intelligence

Build a Private Knowledge Base from Scratch with DeepSeek V4 and AnythingLLM

This guide walks you through creating a fully local, zero‑cloud RAG knowledge base using DeepSeek V4, AnythingLLM, and the BGE‑M3 embedding model, covering component choices, step‑by‑step installation, advanced tuning, troubleshooting, use‑case scenarios, and cost estimation.

AnythingLLM · BGE‑M3 · DeepSeek V4

0 likes · 18 min read


Lao Guo's Learning Space

Apr 27, 2026 · Artificial Intelligence

DeepSeek V4 & Huawei Ascend 950PR: Is Domestic Compute Ready for Enterprise AI?

DeepSeek V4, paired with Huawei’s Ascend 950PR chip, delivers inference speed up to 2.87× that of Nvidia H20 and introduces CSA+HCA attention compression that cuts KV cache usage to under 10%. However, its 94‑96% hallucination rate and high token consumption raise concerns for production use.

AI inference · CSA+HCA · DeepSeek V4

0 likes · 13 min read


Lao Guo's Learning Space

Apr 26, 2026 · Industry Insights

April 2026 AI Explosion: Sealed Model, Dual Model Showdown, and a 24‑Hour Shift

In April 2026 the AI landscape accelerated dramatically: Anthropic sealed its most powerful model, OpenAI and DeepSeek released competing flagship systems on the same day, and Chinese firms unveiled groundbreaking world‑model and full‑duplex voice technologies. Daily token usage surged to 140 trillion, signaling a shift toward AI as essential infrastructure.

Anthropic · Claude Mythos · DeepSeek V4

0 likes · 16 min read


Lao Guo's Learning Space

Apr 25, 2026 · Artificial Intelligence

30 Proven Prompt Templates to Unlock Tongyi Lingma’s Full Potential

This guide compiles the 30 most effective prompt templates for Alibaba's Tongyi Lingma code‑assistant, explains its three interaction modes, and offers concrete examples—from code generation and unit‑test creation to multi‑file refactoring—plus five universal tips to double output quality.

AI coding assistant · Debugging · Prompt Engineering

0 likes · 13 min read
