Lao Guo's Learning Space
Author

Lao Guo's Learning Space

AI learning, discussion, and hands‑on practice with self‑reflection

62
Articles
0
Likes
69
Views
0
Comments
Recent Articles

Latest from Lao Guo's Learning Space

62 recent articles
Lao Guo's Learning Space
Lao Guo's Learning Space
May 3, 2026 · Artificial Intelligence

2026 Enterprise Guide to Large Model Fine‑Tuning: Choosing, Training, and Deploying

This comprehensive guide explains why enterprises should fine‑tune large language models instead of using raw APIs or RAG, compares six fine‑tuning techniques (Full, LoRA, QLoRA, AdaLoRA, DoRA, Prompt‑Tuning), evaluates popular toolchains, outlines a step‑by‑step workflow, presents cost analyses, real‑world case studies, and practical best‑practice recommendations for 2026.

Enterprise AILoRAModel Deployment
0 likes · 18 min read
2026 Enterprise Guide to Large Model Fine‑Tuning: Choosing, Training, and Deploying
Lao Guo's Learning Space
Lao Guo's Learning Space
May 2, 2026 · Industry Insights

AI News Flash: DeepSeek Multimodal Breakthrough, Codex Major Update, Grok 4.3 Launch (May 1‑2)

The AI roundup covers OpenAI's Codex upgrade with Workspace Agents and 40% token efficiency, xAI's Grok 4.3 API offering 128K context and 60% lower pricing, Ant Group's open‑source Ling 2.6‑1T model, DeepSeek's multimodal Visual Primitives framework and its sudden removal, plus the ongoing GPT‑Plus account bans and their mitigation.

AI model benchmarksCodexDeepSeek
0 likes · 11 min read
AI News Flash: DeepSeek Multimodal Breakthrough, Codex Major Update, Grok 4.3 Launch (May 1‑2)
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 30, 2026 · Artificial Intelligence

How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier

Traditional full‑attention cannot handle million‑token contexts due to exponential compute and memory growth, but DeepSeek V4’s Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) compress, sparsely index, and precisely compute tokens, cutting KV cache to 10% and FLOPs to 27% while enabling a 1‑M token window on a single GPU.

CSAHCAKV cache compression
0 likes · 12 min read
How DeepSeek V4’s CSA + HCA Break the Million‑Token Barrier
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 30, 2026 · Artificial Intelligence

Xiaomi Opens MiMo‑V2.5 and Gives 100 Trillion Free Tokens – A Must‑Grab

Xiaomi has open‑sourced its MiMo‑V2.5 series, including a 1.02 T‑parameter Pro model, and is giving developers up to 100 trillion free tokens for 30 days; the article details the models' token‑efficiency benchmarks, a macOS‑like demo, MIT‑license benefits, and step‑by‑step usage instructions.

AI benchmarkingLarge Language ModelMIT license
0 likes · 12 min read
Xiaomi Opens MiMo‑V2.5 and Gives 100 Trillion Free Tokens – A Must‑Grab
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 29, 2026 · Artificial Intelligence

What’s Inside GPT‑6’s ‘Spud’ Release? 5‑6 Trillion Parameters and 2 M Token Context

OpenAI’s GPT‑6 ‘Spud’ launch packs 5‑6 trillion parameters with MoE sparsity, a unified Symphony multimodal architecture, dual System‑1/2 reasoning, a 2‑million‑token window, and competitive benchmark results, while keeping pricing flat and introducing autonomous agent capabilities that reshape AI workflows.

GPT-6Large Language ModelSparse MoE
0 likes · 15 min read
What’s Inside GPT‑6’s ‘Spud’ Release? 5‑6 Trillion Parameters and 2 M Token Context
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 29, 2026 · Big Data

Designing a Full-Stack Credit Data System: From Ingestion to Real-Time Decision

The article dissects a credit data system architecture, detailing six logical layers—from multi-source data collection and feature engineering (including graph features and feature stores) to model training, real‑time stream processing, decision engine integration, and privacy‑preserving computation—while explaining the trade‑offs, tools, and performance targets needed for accurate, low‑latency risk assessment.

Credit ScoringFeature StoreFlink
0 likes · 16 min read
Designing a Full-Stack Credit Data System: From Ingestion to Real-Time Decision
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 26, 2026 · Industry Insights

April 2026 AI Explosion: Sealed Model, Dual Model Showdown, and a 24‑Hour Shift

In April 2026 the AI landscape accelerated dramatically as Anthropic sealed its most powerful model, OpenAI and DeepSeek released competing flagship systems on the same day, Chinese firms unveiled groundbreaking world‑model and full‑duplex voice technologies, and token usage surged to 140 trillion calls per day, signaling a shift toward AI as essential infrastructure.

AnthropicClaude MythosDeepSeek V4
0 likes · 16 min read
April 2026 AI Explosion: Sealed Model, Dual Model Showdown, and a 24‑Hour Shift
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 25, 2026 · Artificial Intelligence

30 Proven Prompt Templates to Unlock Tongyi Lingma’s Full Potential

This guide compiles the 30 most effective prompt templates for Alibaba's Tongyi Lingma code‑assistant, explains its three interaction modes, and offers concrete examples—from code generation and unit‑test creation to multi‑file refactoring—plus five universal tips to double output quality.

AI coding assistantDebuggingPrompt Engineering
0 likes · 13 min read
30 Proven Prompt Templates to Unlock Tongyi Lingma’s Full Potential