Tagged articles

1070 articles

Oct 25, 2025 · Artificial Intelligence

How InfLLM‑V2 Delivers Fast, Low‑Cost Sparse Attention for Long‑Context LLMs

InfLLM‑V2 introduces a zero‑parameter, training‑efficient sparse‑attention framework that dramatically speeds up long‑sequence processing while requiring only 5 B tokens of training, and the open‑source MiniCPM4.1 model performs comparably to dense attention on both long‑text understanding and deep‑thinking benchmarks.

Efficiency · InfLLM-V2 · Large Language Models

0 likes · 10 min read

Baidu Tech Salon

Oct 24, 2025 · Artificial Intelligence

How Wenxin X1.1 Tops China’s LLMs on the New SuperCLUE-CPIF Benchmark

The recently released SuperCLUE-CPIF benchmark shows Baidu’s Wenxin X1.1 achieving the highest score among Chinese large language models, surpassing competitors such as DeepSeek‑V3.2‑Exp‑Thinking and Hunyuan‑T1, with notable advantages in precise instruction following and complex task handling.

AI evaluation · Benchmark · Large Language Models

0 likes · 4 min read

DataFunTalk

Oct 24, 2025 · Artificial Intelligence

Why OpenAI’s Adult Content Plans Could Reshape AI Performance and Markets

The article examines OpenAI’s plan to open its models to adult content, tracing such content’s historical role as a technology testbed and analyzing market incentives, data‑bias risks, the alignment tax, and regulatory hurdles; it suggests the move could boost model capabilities while raising ethical and legal challenges.

AI · Large Language Models · adult content

0 likes · 12 min read

DataFunSummit

Oct 22, 2025 · Artificial Intelligence

When Claude Leaves China: How Domestic AI Models Are Rising to Fill the Gap

Anthropic's ban on Claude for Chinese‑owned firms forces developers to seek home‑grown alternatives, prompting a deep dive into Claude's strengths, the rapid growth of Chinese AI models, and the gaps that still separate them from the international benchmark.

AI models · Chinese AI · Claude

0 likes · 10 min read

DataFunTalk

Oct 22, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how Xiaomi’s Xiao AI assistant leverages large language models for intent routing, domain‑specific intent understanding, and response generation, detailing the system architecture, challenges such as knowledge requirements and latency constraints, and the shift from prompt engineering to model fine‑tuning.

AI Assistant · Intent Routing · Large Language Models

0 likes · 5 min read

Wuming AI

Oct 20, 2025 · Artificial Intelligence

How to Let AI Instantly Draw Professional UML Diagrams with Mermaid

This article walks through using large language models such as Claude, Gemini, DeepSeek, and Kimi to generate accurate, colorful UML diagrams via Mermaid syntax, covering model selection, prompt engineering, step‑by‑step demonstrations, and practical tips for reliable AI‑driven diagram creation.

AI‑generated diagrams · Automation · Large Language Models

0 likes · 5 min read

Alibaba Cloud Infrastructure

Oct 20, 2025 · Artificial Intelligence

How ACK Inference Gateway Tripled Large‑Model Performance for an Insurance Giant

This article details how Guotai Insurance tackled the high latency and cost of large‑model inference by deploying Alibaba Cloud's ACK Inference Gateway, which uses load‑aware, prefix‑aware routing, intelligent queuing, and comprehensive observability to boost efficiency threefold while reducing expenses.

ACK Gateway · AI inference · Cloud Native

0 likes · 18 min read

AntTech

Oct 20, 2025 · Artificial Intelligence

How a Constraint-Aware Multi-Agent System Won the IJCAI Travel Planning Challenge

Leveraging a proprietary “large model + optimization” approach, Ant Group and East China Normal University built a constraint-aware multi-agent framework that secured first place in the Original OS track and second in the DSL track of the IJCAI-2025 Autonomous Travel Planning Competition.

IJCAI · Large Language Models · Travel Planning

0 likes · 7 min read

Data Thinking Notes

Oct 19, 2025 · Artificial Intelligence

How GSPO Improves Stability in Large Language Model Training

GSPO (Group Sequence Policy Optimization) is a reinforcement‑learning algorithm for LLMs that replaces token‑level GRPO with sequence‑level optimization, addressing instability in ultra‑large model training, especially for long‑sequence and MoE architectures, by aligning reward granularity and reducing variance.

GRPO · GSPO · Large Language Models

0 likes · 11 min read

IT Services Circle

Oct 18, 2025 · Artificial Intelligence

Unlock Multi‑Model AI Collaboration with Zen MCP – A Deep Dive

The Zen MCP open‑source server, now with over 8.6K stars, acts as a bridge that lets Claude Code, Codex CLI, Gemini CLI and other AI tools invoke dozens of large models simultaneously, offering seamless multi‑model cooperation, automatic model selection, conversation continuity, and local execution for privacy‑preserving AI workflows.

AI orchestration · AI tooling · Large Language Models

0 likes · 5 min read

Amap Tech

Oct 17, 2025 · Artificial Intelligence

How Ranking Improves In-Context Example Retrieval: Insights from NeurIPS ’25

This article explains the limitations of current pointwise in‑context learning methods, introduces a novel ranking‑based approach called SeDPO that learns preference orders among examples, and demonstrates its superior performance across multiple NLP tasks through extensive experiments and ablation studies.

In-Context Learning · Large Language Models · NeurIPS

0 likes · 10 min read

Wuming AI

Oct 16, 2025 · Industry Insights

Top AI Model Releases This Week: NanoChat, Ring‑1T, Qwen3‑VL, Veo 3.1, Claude Haiku 4.5

This week’s AI landscape saw Karpathy’s NanoChat open‑sourcing an 8K‑line ChatGPT replica, Ant Group unveiling a trillion‑parameter Ring‑1T model, Alibaba releasing the 4B/8B Qwen3‑VL visual language models that outperform Gemini 2.5 Flash Lite and GPT‑5 Nano, Google launching Veo 3.1 for high‑fidelity video generation, and Anthropic announcing Claude Haiku 4.5, a faster and cheaper LLM that excels on SWE‑bench benchmarks.

AI models · Large Language Models · Multimodal

0 likes · 7 min read

Meituan Technology Team

Oct 15, 2025 · Artificial Intelligence

What’s New in Large Model Research? Top Meituan AI Papers Up to Oct 2025

This curated list showcases Meituan’s latest large‑model breakthroughs and academic papers up to October 2025, spanning LLM system optimizations, multimodal generation, evaluation benchmarks, quantization techniques, and reinforcement‑learning‑driven improvements, offering researchers valuable insights and resources across the AI landscape.

AI research · Benchmarking · Large Language Models

0 likes · 10 min read

Shopee Tech Team

Oct 14, 2025 · Artificial Intelligence

How SPEC‑RL Boosts On‑Policy Reinforcement Learning Speed by Up to 3×

SPEC‑RL introduces speculative rollouts that reuse verified historical rollouts as prefixes, cutting rollout time by 2–3× while maintaining or improving performance across various math and reasoning benchmarks, and works seamlessly with PPO, GRPO, DAPO and other on‑policy algorithms.

AI efficiency · Large Language Models · Training Acceleration

0 likes · 8 min read

HyperAI Super Neural

Oct 14, 2025 · Artificial Intelligence

NeurIPS 2025: OCRBench v2 Shows Gemini Leads the Chinese OCR Ranking Yet Barely Passes

OCRBench v2, introduced at NeurIPS 2025, evaluates 58 multimodal models on 23 OCR‑related tasks in Chinese and English, revealing that even top models like Gemini‑2.5‑Pro barely exceed the passing threshold and that most models struggle with fine‑grained text localization and multilingual performance.

Gemini · Large Language Models · NeurIPS 2025

0 likes · 8 min read

Practical DevOps Architecture

Oct 14, 2025 · Artificial Intelligence

Master AI Agents: From Basics to Advanced Multi-Model Development

This comprehensive AI agent development course covers 18 chapters, ranging from fundamental concepts and architecture to large‑model integration, tool and browser control, memory, RAG self‑learning, sandboxing, database manipulation, multi‑agent architectures, code assistance, and a real‑world frontend automation project, complete with source code and documentation.

AI agents · LangChain · Large Language Models

0 likes · 3 min read

DataFunSummit

Oct 13, 2025 · Artificial Intelligence

How Large Language Models Supercharge Douyin’s User Experience

This article explains how Douyin leverages large language models to build an end‑to‑end user‑experience pipeline that detects signals, understands feedback, attributes issues, and automates governance, turning reactive fixes into proactive, data‑driven product improvements.

AI · Large Language Models · Signal Processing

0 likes · 20 min read

Alibaba Cloud Developer

Oct 13, 2025 · Artificial Intelligence

Can AI Cut Taobao Recommendation Development from a Week to Two Days?

This article explains how Alibaba's WaterFlow, an AI‑driven end‑to‑end development platform, tackles the high demand volume, diverse tech stacks, and slow collaboration of Taobao's recommendation feed, enabling many features to be delivered in just two days instead of a week.

AI · Large Language Models · continuous integration

0 likes · 16 min read

Data Party THU

Oct 11, 2025 · Artificial Intelligence

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

This article surveys the most influential large language models released since 2017, detailing the core innovations of Transformer, BERT, GPT series, T5, Retrieval‑Augmented Generation, and the latest LLaMA and Meta models, while highlighting their architectures, training paradigms, and impact on NLP research.

LLM · Large Language Models · Model Scaling

0 likes · 21 min read

Kuaishou Large Model

Oct 11, 2025 · Artificial Intelligence

How Large-Scale Reinforcement Learning Boosted KAT-Dev-72B-Exp to 74.6% on SWE‑Bench

The KwaiPilot team introduced KAT-Dev-72B-Exp, an open‑source LLM trained with large‑scale reinforcement learning that achieved a record‑breaking 74.6% score on SWE‑Bench Verified, thanks to innovations like Trie Packing, entropy‑aware advantage scaling, and a decoupled data‑environment architecture.

KAT-Dev-72B-Exp · Large Language Models · Trie Packing

0 likes · 6 min read

Bilibili Tech

Oct 11, 2025 · Artificial Intelligence

Can Dual-Agent AI Transform Web Video Editing? Inside VibeCut’s Architecture

VibeCut introduces a novel Orchestrator‑Executor dual‑agent framework for WebCut, leveraging large language models, shared structured context, and modular tool integration to automate complex video editing tasks, demonstrating improved efficiency, transparency, and adaptability across diverse scenarios while addressing challenges of multi‑agent coordination.

AI video editing · Large Language Models · WebCut

0 likes · 35 min read

Bighead's Algorithm Notes

Oct 10, 2025 · Artificial Intelligence

Quantitative Finance Paper Digest (Sep 27 – Oct 10 2025)

This digest summarizes recent arXiv papers that introduce new AI‑driven methods for portfolio similarity, Bayesian portfolio optimization, end‑to‑end deep‑learning portfolio construction, large‑language‑model‑based financial prediction, and multi‑agent crypto‑trading systems, highlighting their datasets, architectures, and empirical gains.

Bayesian Optimization · Large Language Models · asset allocation

0 likes · 18 min read

Baidu Tech Salon

Oct 10, 2025 · Artificial Intelligence

Navigating the 2025 AI Model Boom: Practical Evaluation Strategies

This article examines the rapid surge of large AI models in 2024‑2025, critiques the reliability of public leaderboards, and presents a business‑focused evaluation framework—including dataset construction, metric selection, automation, and LLM‑as‑judge techniques—to help developers choose the right model for real‑world applications.

AI Performance · AI benchmarks · Dataset Construction

0 likes · 17 min read

Data Party THU

Oct 10, 2025 · Artificial Intelligence

Can Language Models Self‑Train Without Data? Inside the Language Self‑Play Framework

This article examines the Language Self‑Play (LSP) approach for data‑free training of large language models, detailing its challenger‑solver game formulation, advantage calculations, loss functions, self‑reward extension, experimental setup on AlpacaEval, and results that show LSP can match or surpass data‑driven baselines.

LLM · Large Language Models · data-free training

0 likes · 14 min read

DataFunTalk

Oct 10, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how large language models are integrated into Xiaomi’s Xiao AI assistant, covering intent distribution, domain‑specific intent understanding, response generation, architectural design, challenges such as knowledge requirements and latency, and the shift from prompt engineering to model fine‑tuning.

AI Assistant · Intent Routing · Large Language Models

0 likes · 5 min read

Data Party THU

Oct 9, 2025 · Artificial Intelligence

How Reinforcement Learning Is Transforming the Full Lifecycle of Large Language Models

This survey systematically reviews recent advances in applying reinforcement learning across the entire lifecycle of large language models, detailing methods, datasets, benchmarks, open‑source tools, and future challenges such as scalability, reward design, and evaluation standards.

AI Survey · LLM lifecycle · Large Language Models

0 likes · 9 min read

Data Party THU

Oct 9, 2025 · Artificial Intelligence

Can One Model Master All Audio‑Visual Tasks? Introducing Crab’s Unified Approach

This article presents Crab, a unified audio‑visual scene understanding model that leverages a novel explicit‑cooperation learning paradigm, introduces the AV‑UIE dataset with explicit reasoning steps, and demonstrates superior performance across temporal, spatial, pixel‑level, and spatio‑temporal tasks through extensive experiments and ablations.

Benchmark · Large Language Models · LoRA

0 likes · 12 min read

DataFunTalk

Oct 9, 2025 · Artificial Intelligence

From Physics to DeepMind: How a Tsinghua Star Is Shaping AI Research

Google DeepMind hired Shunyu Yao, a Tsinghua physics prodigy and former Anthropic researcher, whose rapid transition from theoretical physics to AI highlights the intense workload, values clash, and the accelerating pace of large‑model research.

AI research · Career Transition · DeepMind

0 likes · 9 min read

HyperAI Super Neural

Oct 5, 2025 · Artificial Intelligence

Which Sci‑Fi AI Are Already Real? Voice Assistants, Companion Bots, Digital Immortality

The article reviews iconic AI portrayals from movies such as Iron Man, Her, The Wandering Earth 2, Terminator and The Matrix, then compares each vision with today’s voice assistants, large‑language‑model chatbots, companion robots, brain‑computer interfaces and autonomous weapon systems, highlighting what has materialized and what remains speculative.

AI · Large Language Models · autonomous weapons

0 likes · 15 min read

AI2ML AI to Machine Learning

Oct 1, 2025 · Artificial Intelligence

2025 Large Model Engineering Breakthroughs: Cutting Costs, Boosting Performance, and Extending Context

The 2025 open‑source reports reveal major advances in large‑model engineering, including drastic cost cuts such as DeepSeek‑V3 training for $5.57 M, performance gains where Gemma 3 4B matches Gemma 2 27B, memory efficiencies like 85 % KV‑cache reduction, and a suite of new techniques—from loss‑free MoE balancing to multi‑token prediction—that together push context lengths to one million tokens and enable multimodal, aligned, and industry‑specific models.

Cost Reduction · Large Language Models · Model Compression

0 likes · 13 min read

DataFunSummit

Sep 29, 2025 · Artificial Intelligence

How Large Language Models Power XiaoAI: From Intent Routing to Response Generation

This article explores how large language models are integrated into Xiaomi’s XiaoAI assistant, detailing the system’s architecture, intent distribution, domain-specific understanding, and response generation, while sharing practical challenges, prompt engineering solutions, and fine‑tuning strategies that boosted user retention and query satisfaction.

AI assistants · Intent Routing · Large Language Models

0 likes · 4 min read

21CTO

Sep 29, 2025 · Artificial Intelligence

Why Open‑Source Is the Key to China’s AI Future, According to Li Kaifu

Li Kaifu argues that open‑source large‑model ecosystems are essential for China to close the AI gap with the United States, highlighting DeepSeek’s impact, shifting scaling laws, and the emerging role of AI‑to‑AI teaching as the next development frontier.

Artificial Intelligence · China AI · Large Language Models

0 likes · 4 min read

Volcano Engine Developer Services

Sep 28, 2025 · Artificial Intelligence

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

This guide breaks down the complex terminology of large language models—explaining tokens, transformers, self‑attention, RAG, scaling laws, dense vs. sparse architectures, and training stages—using clear analogies and step‑by‑step explanations so readers can confidently understand and work with modern AI systems.

AI fundamentals · Large Language Models · RAG

0 likes · 35 min read

DataFunSummit

Sep 26, 2025 · Artificial Intelligence

How Large Language Models Are Transforming Recommendation Systems: Insights from Huawei

This article reviews Huawei Noah's Ark Lab's exploration of large language models in recommendation systems, covering background challenges, the KAR and Uni-CTR projects, experimental results, and future research directions for open, knowledge‑driven recommendation pipelines.

AI research · Huawei · Large Language Models

0 likes · 13 min read

HyperAI Super Neural

Sep 26, 2025 · Artificial Intelligence

Nvidia’s ReaSyn Uses Chain‑of‑Reaction Reasoning to Boost Molecule Reconstruction and Path Diversity

ReaSyn, a new framework from Nvidia’s research team, treats synthesis pathways as chain‑of‑thought reasoning using a novel Chain‑of‑Reaction representation, achieving the highest reconstruction rates and path diversity in molecule synthesis tasks, and outperforming prior methods across multiple benchmark optimizations.

AI drug discovery · Large Language Models · ReaSyn

0 likes · 14 min read

Instant Consumer Technology Team

Sep 25, 2025 · Artificial Intelligence

Inside Qwen’s Midnight Release: New Guard, Travel Agent, LiveTranslate, Code & Vision Models Unveiled

Late at night on the 23rd, Lin Junyang of Tongyi Lab announced six AI model releases—including a safety‑audit guard, a personal travel planner, a real‑time multilingual translator, upgraded coding models, a powerful vision‑language model, and the flagship Qwen3‑Max—each detailed with capabilities, highlights, and direct download links.

Artificial Intelligence · Large Language Models · Safety

0 likes · 11 min read

Data Thinking Notes

Sep 24, 2025 · Artificial Intelligence

How AI Agents Are Transforming Smart Logistics at SF Express

This article explains how SF Express leverages AI agents and large language models to create a full‑process intelligent management framework that optimizes order forecasting, dynamic scheduling, resource allocation, and operational decision‑making across the entire logistics chain.

AI · Intelligent agents · Large Language Models

0 likes · 21 min read

Fun with Large Models

Sep 24, 2025 · Artificial Intelligence

Interview Guide: Core Differences Between PPO and GRPO Algorithms for Large Model Fine‑Tuning

The article explains the fundamental principles of PPO and GRPO reinforcement‑learning algorithms, compares their architectures and training workflows, highlights why GRPO is gaining traction in large‑model fine‑tuning, discusses associated risks, and offers practical guidance on group size selection for engineers preparing for interviews.

GRPO · Large Language Models · PPO

0 likes · 9 min read

Data Thinking Notes

Sep 21, 2025 · Artificial Intelligence

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

Amid the rapid rise of generative AI, this article examines the limitations of large language models and explains how Retrieval‑Augmented Generation (RAG), followed by the advanced paradigms DeepSearch and DeepResearch, progressively enhance knowledge handling through dynamic retrieval, multi‑agent reasoning, and autonomous research capabilities.

AI Knowledge Management · DeepResearch · DeepSearch

0 likes · 16 min read

Bighead's Algorithm Notes

Sep 20, 2025 · Artificial Intelligence

Weekly Quantitative Finance Paper Digest (Sep 13‑19, 2025)

This digest summarizes seven recent arXiv papers that apply reinforcement learning, multi‑agent frameworks, dynamic factor models, high‑frequency trading LLMs, quantum GANs, multi‑LLM sentiment analysis, and context‑aware language models to advance quantitative finance and AI‑driven market prediction.

Large Language Models · Quantitative Finance · Quantum Machine Learning

0 likes · 12 min read

DataFunTalk

Sep 19, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, GraphRAG, and autonomous agents boost model intelligence, improve user experience, and enable advanced content generation, understanding, and multi‑step reasoning.

Artificial Intelligence · GraphRAG · Large Language Models

0 likes · 4 min read

Data Party THU

Sep 19, 2025 · Artificial Intelligence

How RepoMaster Enables AI Agents to Master GitHub Repositories for Complex Tasks

RepoMaster is an AI‑driven framework that automatically discovers, analyzes, and executes code from massive GitHub repositories, turning them into reusable tools and achieving state‑of‑the‑art performance on challenging benchmarks while drastically reducing token consumption and engineering effort.

AI agents · Large Language Models · RepoMaster

0 likes · 9 min read

Data Party THU

Sep 19, 2025 · Artificial Intelligence

How DeepSeek R1 Redefines AI Reasoning with Pure Reinforcement Learning

DeepSeek R1 replaces traditional supervised fine‑tuning with a pure reinforcement‑learning pipeline, introducing the GRPO algorithm and a four‑stage training regime that dramatically lowers cost, boosts reasoning and code‑generation performance, and raises important ethical, privacy, and societal considerations for large language models.

AI reasoning · DeepSeek · GRPO

0 likes · 14 min read

HyperAI Super Neural

Sep 19, 2025 · Artificial Intelligence

Weekly AI Paper Roundup: RL Advances, Tree‑Structured QA, and GraphRAG Breakthroughs

This article surveys five recent AI papers, covering reinforcement learning for large reasoning models, a tree‑structured table QA framework (ST‑Raptor), visual representation alignment for multimodal LLMs, GraphRAG‑based generation, and an LLM‑driven cryptographic vulnerability detector, each with key insights and links.

Large Language Models · cryptographic vulnerability detection · graph retrieval

0 likes · 5 min read

AsiaInfo Technology: New Tech Exploration

Sep 16, 2025 · Industry Insights

Can Ontology Bridge the Gap Between Large Language Models and Executable Code?

This article analyzes how combining ontology with large language models can create a new intelligent application development paradigm that unites semantic understanding and executable behavior, proposing a three‑layer architecture, a Model Control Protocol, and real‑world case studies to illustrate its potential and challenges.

AI integration · Large Language Models · Software Architecture

0 likes · 22 min read

DataFunTalk

Sep 15, 2025 · Artificial Intelligence

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

In an interview, Di Xingxing of Autohome details their AI+Data framework—unified lake‑warehouse, intelligent engine, and agent services—that breaks data silos, blends traditional models with LLMs, leverages causal inference and RAG knowledge bases, and uses continuous feedback to build explainable, evolving data agents for accurate sales forecasting, competitive analysis, and end‑to‑end business automation in the automotive industry.

AI · Automotive · Data Engineering

0 likes · 10 min read

DataFunSummit

Sep 14, 2025 · Artificial Intelligence

How AI Is Revolutionizing Chemistry and Drug Discovery: From Data to Breakthroughs

This article explores how AI-driven models and data pipelines are transforming the chemistry and pharmaceutical sectors by accelerating drug design, improving protein‑antibody predictions, automating patent data extraction, and outlining future goals for end‑to‑end AI‑enabled scientific discovery.

AI for Science · Chemistry AI · Large Language Models

0 likes · 13 min read

Alibaba Cloud Developer

Sep 12, 2025 · Operations

How to Build End‑to‑End Observability for Large‑Model Applications on Alibaba Cloud

This guide explains how to design and implement a complete observability solution for large‑model AI services on Alibaba Cloud, covering architecture, core metrics, logging standards, demo code, log collection, dashboard design, alerting, monitoring tools, troubleshooting SOPs, and recovery procedures.

AI Operations · Alibaba Cloud · Large Language Models

0 likes · 21 min read

Fun with Large Models

Sep 12, 2025 · Artificial Intelligence

When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews

The article explains the technical background and suitable scenarios for Retrieval‑Augmented Generation (RAG) and model fine‑tuning, compares their strengths, discusses how they can be combined, and provides interview‑style Q&A on their capabilities, risks, and differences from model distillation.

AI Interview · Fine‑Tuning · Large Language Models

0 likes · 7 min read

AI2ML AI to Machine Learning

Sep 11, 2025 · Industry Insights

Key Takeaways from Asset Management Leaders on Large‑Model AI at the Bund Conference

The article compiles senior asset‑management executives' perspectives on applying large‑model AI—covering vertical versus generic models, integration strategies, talent and cost considerations, innovative C2C development, AI‑native platforms, and the practical challenges of using LLMs in investment research.

AI applications · C2C development · Large Language Models

0 likes · 5 min read

Baidu Geek Talk

Sep 10, 2025 · Artificial Intelligence

How to Cut Through the LLM SOTA Hype: Practical Evaluation Strategies for 2025

Amid the 2025 surge of large language models, this article demystifies misleading SOTA claims, critiques benchmark reliability, and presents a comprehensive, business‑focused evaluation framework—including dataset construction, metric selection, automated scoring, and practical guidelines—to help developers and product teams choose the right model for real‑world applications.

AI benchmarking · LLM-as-judge · Large Language Models

0 likes · 18 min read

Baobao Algorithm Notes

Sep 10, 2025 · Artificial Intelligence

Qwen3-Next Unveiled: Sparse MoE, Hybrid Attention & Multi‑Token Prediction

A recent Hugging Face pull request reveals Alibaba’s upcoming Qwen3‑Next series, highlighting its extreme‑context, parameter‑efficient design that combines a 1:50 high‑sparsity MoE, a hybrid attention architecture mixing gated attention with Gated DeltaNet, and a Multi‑Token Prediction technique, promising ten‑fold throughput gains for 32K‑plus token contexts.

AI Architecture · Hybrid Attention · Large Language Models

0 likes · 8 min read

DataFunSummit

Sep 9, 2025 · Artificial Intelligence

How Baidu’s GRAB Model Uses Scaling Laws to Transform Ad Ranking

This article explains Baidu's generative ranking model GRAB, detailing how scaling laws from large language models inspire a new recommendation paradigm, the model's architecture, custom attention mechanisms, training strategies, deployment optimizations, and the resulting business gains in CTR and revenue.

Baidu · CTR prediction · Generative AI

0 likes · 22 min read

JD Cloud Developers

Sep 9, 2025 · Artificial Intelligence

How JD’s PODM‑MI Framework Revolutionized E‑commerce Search Ranking

This article recounts a JD engineer’s journey from theory to practice, detailing the development of the PODM‑MI re‑ranking framework, its three‑layer distribution‑based design, the discovery of a novel SID bottleneck, and the resulting multi‑million‑order impact validated at SIGIR 2024.

E-commerce AI · Large Language Models · Re‑ranking

0 likes · 8 min read

DataFunSummit

Sep 8, 2025 · Artificial Intelligence

How High‑Quality Inference Data Is Powering the Next AI Revolution

This article explores how high‑quality inference data has become a new paradigm driving AI breakthroughs, detailing Ant Group's research on inference data paradigms, financial‑sector applications, intelligent labeling and quality inspection, and the AIGD AI data synthesis platform, followed by a technical Q&A.

AI data · AIGD · Financial AI

0 likes · 11 min read

DaTaobao Tech

Sep 8, 2025 · Artificial Intelligence

How to Make Large Language Models Understand Third‑Party Java Packages: From Failure to Success

This article explains why AI coding assistants like Cursor and Claude fail to read external Java libraries, explores naive "feed‑the‑code" tricks, evaluates built‑in IDE tools, and ultimately presents a robust solution using a local decompilation pipeline (MCP) that lets LLMs query class definitions and generate correct backend code.

AI Code Generation · Java decompilation · Large Language Models

0 likes · 19 min read

DataFunTalk

Sep 8, 2025 · Artificial Intelligence

When Claude Leaves China: How Domestic AI Models Are Rising to Fill the Gap

Anthropic's new ban on Claude for Chinese‑controlled firms forces developers to seek home‑grown alternatives, prompting a deep dive into Claude's strengths, the rapid rise of Chinese large‑language models, and the gaps that still separate them from the world‑leading offering.

AI models · AI safety · Chinese AI

0 likes · 11 min read

Bighead's Algorithm Notes

Sep 5, 2025 · Artificial Intelligence

Weekly Quantitative Finance Paper Digest (Aug 30 – Sep 5, 2025)

This digest reviews four recent AI‑driven finance papers: a robust MCVaR portfolio optimizer with ellipsoidal support and RKHS uncertainty, a PPO‑based adaptive weighting system for LLM‑generated alphas, an empirical comparison of price‑based, GICS‑based, and LLM‑embedding stock clustering, and a diffusion‑model approach that generates future financial chart images from current charts and text prompts.

Large Language Models · Quantitative Finance · diffusion models

0 likes · 9 min read

ShiZhen AI

Sep 5, 2025 · Artificial Intelligence

Andrew Ng Highlights Core AI Engineer Skills Amidst Major AI Industry Updates

The article reports that ChatGPT now supports branch conversations and that Anthropic is restricting service use in certain regions. It then summarizes Andrew Ng’s view of the essential AI engineer capabilities (AI‑assisted software building, prompting, and agentic workflows) and the market demand for them, and covers the Kimi K2 model upgrade, Hugging Face’s FineVision dataset release, and Google’s AI‑driven Deep Loop Shaping method published in *Science*.

AI Engineering · AI for astronomy · AI safety

0 likes · 8 min read

Instant Consumer Technology Team

Sep 5, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Frontier for Large Language Models

This article surveys over 1,400 papers to define context engineering as a systematic discipline that structures retrieval, memory, tools, and multi‑agent coordination for LLMs, highlighting the critical asymmetry between understanding long contexts and generating equally complex outputs.

Context Engineering · LLM evaluation · Large Language Models

0 likes · 8 min read

DataFunSummit

Sep 4, 2025 · Artificial Intelligence

Unlocking Multi‑Agent AI: How Ant Group’s agentUniverse Transforms Financial Services

The article explores Ant Group’s agentUniverse team’s experience applying multi‑agent technology in finance, covering background on large language models, the agentUniverse framework, real‑world implementations, and the advantages of coordinated multi‑agent collaboration for complex analytical and decision‑making tasks.

AI Collaboration · Financial AI · Large Language Models

0 likes · 4 min read

Amap Tech

Sep 4, 2025 · Artificial Intelligence

How Hierarchical Sampling Boosts Self‑Taught Reasoning in LLMs

HS‑STAR introduces a three‑stage hierarchical sampling framework that identifies high‑utility boundary problems, reallocates computation budget to them, and fine‑tunes large language models, achieving significant accuracy gains on math reasoning benchmarks without extra sampling cost.

HS-STAR · Hierarchical Sampling · Large Language Models

0 likes · 10 min read

Data Party THU

Sep 3, 2025 · Artificial Intelligence

Exploring Multimodal Generative AI: A Tsinghua Tutorial at IJCAI 2025

This article introduces a 1.5‑hour tutorial presented by Tsinghua researchers at IJCAI 2025, covering the latest advances in multimodal generative AI, including multimodal large language models, diffusion models, post‑training generalization techniques, and unified understanding‑generation frameworks.

Generative Models · IJCAI 2025 · Large Language Models

0 likes · 5 min read

AI2ML AI to Machine Learning

Sep 2, 2025 · Artificial Intelligence

Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities

The article analyzes why enterprise‑wide large‑model AI projects face steep hurdles. It outlines the human capabilities required, historical labor shifts, current hot technologies (RAG, agents, CoT, and multimodal models) and their limits, a three‑stage implementation roadmap, typical case pitfalls, and the key success factors for sustainable digital transformation.

Agent · CoT · Enterprise AI

0 likes · 15 min read

Amap Tech

Sep 2, 2025 · Artificial Intelligence

How Pos2Distill Eliminates Positional Bias in Large Language Models

This article introduces Pos2Distill, a novel knowledge‑distillation framework that transfers capabilities from advantageous to disadvantaged positions in large language models, effectively mitigating positional bias and improving performance on long‑text retrieval and in‑context reasoning tasks.

Knowledge Distillation · Large Language Models · in-context reasoning

0 likes · 10 min read

Alibaba Cloud Developer

Sep 2, 2025 · Artificial Intelligence

Turning Large Language Models into Business Results: Alibaba Cloud’s Playbook

In this talk, Alibaba Cloud CIO Jiang Linquan shares how his team systematically tackled organizational, technical, and operational challenges to deploy large‑language‑model applications across dozens of enterprise scenarios, presenting real‑world case studies, a RIDE methodology, and practical metrics for success.

AI · Enterprise AI · Large Language Models

0 likes · 36 min read

DataFunSummit

Sep 1, 2025 · Artificial Intelligence

Turning Large AI Models into Real Business Value: A Logistics Ops Expert’s Playbook

In this interview, senior AI product operations manager Lu Xinting shares how to identify high‑value AI scenarios, apply three practical metrics, build a closed‑loop AIGC operation framework, and design user incentives to achieve product‑market fit for large language models in logistics.

AI Operations · AIGC · Large Language Models

0 likes · 8 min read

DataFunSummit

Aug 28, 2025 · Artificial Intelligence

Why Finance Needs Its Own Large Language Model: Insights from Du Xiaoman

This article explains how the unique data‑driven, knowledge‑intensive, and complex nature of the financial industry makes large language models especially valuable, outlines the limitations of generic models, and shows how domain‑specific, cost‑effective models can deliver superior performance for finance.

AI · Large Language Models · cost efficiency

0 likes · 5 min read

Architects' Tech Alliance

Aug 26, 2025 · Artificial Intelligence

How DeepSeek‑V3.1’s New FP8 Precision Supercharges Domestic Chip Performance

DeepSeek‑V3.1 introduces the UE8M0 FP8 Scale precision, which cuts memory usage by up to 75% and lets next‑generation Chinese chips such as Ascend 910B run 128K‑context models efficiently. The ecosystem is adopting FP8 rapidly, but challenges in IP autonomy and software maturity remain before global competitiveness is achieved.

AI hardware · DeepSeek · Domestic Chips

0 likes · 10 min read

JD Tech

Aug 25, 2025 · Artificial Intelligence

How JD’s Large‑Model Tools are Shaping AI in Enterprise: Insights & Roadmap

JD’s recent technical salon reveals the rapid evolution of large‑model tools, detailing industry trends, JD’s JoyAI ecosystem (including JoyAgent, OxyGent and JoyCode), real‑world applications across office, code review, logistics and local services, and future policy and multi‑agent visions.

AI applications · AI tools · Enterprise AI

0 likes · 13 min read

Architecture and Beyond

Aug 24, 2025 · Artificial Intelligence

Why Master‑Slave Architecture Powers Modern Multi‑Agent AI Systems

The article explains how the master‑slave (or manager‑worker) architecture, inspired by both software micro‑services and biological systems, solves context fragmentation and coordination challenges in large‑model multi‑agent applications, detailing design principles, technical implementations, advantages, limitations, and suitable use cases.

AI coordination · Context Management · Large Language Models

0 likes · 15 min read

Wu Shixiong's Large Model Academy

Aug 23, 2025 · Artificial Intelligence

Why LoRA, QLoRA, Prompt & Prefix Tuning Are Changing Large‑Model Fine‑Tuning

This article explains the mathematical basis of LoRA, compares it with QLoRA, Prompt Tuning, Prefix Tuning and P‑tuning, shows practical PyTorch implementations, and provides mixed‑precision training tips so readers can choose the most memory‑efficient fine‑tuning method for their large language models.

Large Language Models · LoRA · Prompt Tuning

0 likes · 17 min read

DataFunSummit

Aug 23, 2025 · Artificial Intelligence

Mastering Role‑Playing AI Agents: Challenges, Techniques, and Future Directions

This article surveys the latest research on role‑playing AI agents, covering their definition, core components, application scenarios, three main challenges—role fidelity, long‑term memory, and evaluation—and presents four technical approaches for each challenge along with future research directions and references.

AI agents · Large Language Models · Memory

0 likes · 22 min read

JD Retail Technology

Aug 22, 2025 · Artificial Intelligence

How JD’s Open‑Source Large‑Model Tools Are Shaping the Future of Enterprise AI

This article explores the rapid evolution of large‑model AI tools, outlines JD’s open‑source solutions such as JoyAI, JoyAgent, OxyGent and JoyCode, and examines real‑world applications, design principles, policy considerations, and future directions for AI agents and embodied intelligence.

AI Policy · AI applications · Enterprise AI

0 likes · 12 min read

JD Tech Talk

Aug 20, 2025 · Artificial Intelligence

How Large AI Models Are Transforming Software Testing

This article explains what large AI models are, how they enhance capabilities across domains, and details their practical use in software testing—covering code review, automated test case generation, security and performance checks—while envisioning future impacts on manual testing efficiency.

AI in QA · Large Language Models · model‑driven testing

0 likes · 4 min read

Data Party THU

Aug 20, 2025 · Artificial Intelligence

How Dual‑Granularity Prompting Boosts Graph‑Enhanced LLMs for Fraud Detection

The article analyzes the Dual Granularity Prompting (DGP) framework, which mitigates information overload in graph‑enhanced large language models for fraud detection by applying fine‑grained processing to target nodes and coarse‑grained summarization to neighbors, achieving superior accuracy and token efficiency across multiple public and industrial datasets.

Large Language Models · dual granularity prompting · fraud detection

0 likes · 6 min read

Kuaishou Large Model

Aug 19, 2025 · Artificial Intelligence

How Klear-Reasoner Achieves SOTA Math & Code Reasoning with GPPO

Klear-Reasoner, built on Qwen3‑8B‑Base, introduces the Gradient‑Preserving Clipping Policy Optimization (GPPO) algorithm to overcome traditional clip limitations, achieving state‑of‑the‑art performance on AIME2024/2025 and LiveCodeBench while providing detailed experimental analysis and data‑quality insights.

GPPO · Large Language Models · code reasoning

0 likes · 11 min read

Alibaba Cloud Developer

Aug 18, 2025 · Artificial Intelligence

Mastering Claude Prompt Engineering: 9 Proven Strategies to Boost LLM Performance

This guide systematically breaks down Anthropic's official prompt‑engineering recommendations—clear instructions, multishot examples, chain‑of‑thought prompting, XML structuring, response pre‑filling, prompt chaining, long‑context handling, extended thinking, and practical code snippets—showing how to unlock Claude's full potential across complex tasks.

AI · Claude · Large Language Models

0 likes · 15 min read

Fighter's World

Aug 15, 2025 · Artificial Intelligence

Why GPT‑5 Is Still Far From AGI Yet Near Scalable Profitability

The article analyzes GPT‑5’s release, its unified multi‑model architecture with a real‑time router, improved reasoning, coding and tool‑use capabilities, reduced hallucinations, and how these technical shifts reshape AI commercialization, investment logic, competition and enterprise adoption.

AI commercialization · GPT-5 · Large Language Models

0 likes · 20 min read

Data Party THU

Aug 14, 2025 · Artificial Intelligence

How FilterLLM Turns One LLM Pass into Billion‑User Cold‑Start Recommendations

The article analyzes the FilterLLM approach, which augments a frozen LLM with billions of learnable user tokens to predict a full‑user interaction probability distribution in a single forward pass, dramatically speeding up cold‑start recommendation while preserving recommendation quality across multiple benchmarks.

AI · FilterLLM · LLM

0 likes · 8 min read

Data Party THU

Aug 13, 2025 · Artificial Intelligence

How Large Language Models Are Revolutionizing Automated Scholarly Paper Review

This survey examines the rapid rise of large language models in automated scholarly paper review (ASPR), analyzing model types, technical breakthroughs such as long‑text, multimodal, and multi‑turn capabilities, new generation methods, datasets, open‑source tools, current challenges, publisher policies, and future research directions.

ASPR · Large Language Models · automated paper review

0 likes · 19 min read

AI Info Trend

Aug 13, 2025 · Industry Insights

How China’s AI Labs Are Closing the Gap with the US in Q2 2025

The Q2 2025 State of AI report analyzes Chinese AI labs’ rapid progress across language models, open‑source weights, and multimodal generation, showing a shrinking performance gap with US leaders, detailed benchmark scores, ecosystem classifications, and emerging competitive dynamics.

AI · Benchmark · China

0 likes · 10 min read

AI Info Trend

Aug 11, 2025 · Industry Insights

What Q2 2025 Reveals About the AI Landscape: Key Trends and Model Rankings

The Q2 2025 State of AI Highlights Report analyzes benchmark data, model performance, and market dynamics, revealing five major industry trends, the rise of AI agents, rapid advances in language, vision, and speech models, and shifting hardware acceleration strategies that shape the future of artificial intelligence.

AI · AI agents · Benchmark

0 likes · 11 min read

Data Party THU

Aug 11, 2025 · Artificial Intelligence

Can Hidden Signals Reveal Multimodal Model Jailbreaks? Introducing HiddenDetect

This article presents HiddenDetect, a training‑free method that leverages refusal‑semantic vectors and layer‑wise activation analysis to detect jailbreak attempts in multimodal large language models, revealing distinct safety signals across text and image modalities and demonstrating strong performance on several LVLM benchmarks.

LVLM · Large Language Models · Multimodal

0 likes · 7 min read

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

This article summarizes a Cloud Summit talk where Alibaba Cloud’s AI expert Zhang Yingying explains how large language models can enhance big‑data intelligent operations, covering opportunities, challenges, RAG‑based Q&A, multi‑agent diagnostics, and the engineering architecture needed for reliable, scalable deployment.

AI Engineering · Big Data Operations · Large Language Models

0 likes · 20 min read

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs

The article explores how Von Neumann’s insights on the brain‑computer analogy illuminate modern large‑language‑model prompt engineering, comparing logical reasoning chains, memory mechanisms, and DSL‑driven computation to improve accuracy, reduce hallucinations, and balance reasoning depth with precise calculation.

DSL · Large Language Models · Prompt Engineering

0 likes · 14 min read

Data Thinking Notes

Aug 6, 2025 · Artificial Intelligence

OpenAI Unveils gpt-oss 120B & 20B: Open‑Source MoE Models with 4‑Bit Quantization

OpenAI's gpt-oss series introduces two open‑source large language models—gpt‑oss‑120b and gpt‑oss‑20b—featuring Mixture‑of‑Experts architecture, 4‑bit MXFP4 quantization, extensive benchmark results, and broad deployment options across cloud and consumer hardware.

4-bit quantization · AI inference · GPT-OSS

0 likes · 11 min read

AI Frontier Lectures

Jul 31, 2025 · Artificial Intelligence

What’s Driving the Latest LLM Architecture Trends? DeepSeek, OLMo, Gemma, and More Explained

This article examines the evolution of large language model architectures over the past seven years, comparing key design choices such as Multi‑Head Latent Attention, Grouped‑Query Attention, Mixture‑of‑Experts, sliding‑window attention, normalization placement, and optimizer variants across models like DeepSeek V3, OLMo 2, Gemma 3, Llama 4, Qwen 3, SmolLM 3, and Kimi K2.

AI research · LLM comparison · Large Language Models

0 likes · 30 min read

Data Thinking Notes

Jul 30, 2025 · Artificial Intelligence

Tracing the Evolution of Large Language Models: Key Papers and Breakthroughs

This article reviews the most influential papers in large language model research since 2017, covering foundational works such as the Transformer, GPT‑3, BERT, scaling laws, and recent innovations like FlashAttention, Mamba, and QLoRA, highlighting their core contributions and impact on AI development.

AI research · Large Language Models · Transformer

0 likes · 28 min read

JD Tech

Jul 29, 2025 · Artificial Intelligence

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

This article describes a QCon talk that combines causal inference with large language models to build a retrieval‑augmented generation pricing system for e‑commerce, detailing the three‑step algorithm, LLM‑driven modeling challenges, process‑reward tree search, reinforcement‑learning fine‑tuning, and experimental gains in accuracy and speed.

Large Language Models · Retrieval-Augmented Generation · causal inference

0 likes · 17 min read

FunTester

Jul 29, 2025 · Artificial Intelligence

Why AI Hallucinations Happen and How Test Engineers Can Reset Conversations

AI models can hallucinate, producing misleading or illogical answers, especially during lengthy testing dialogues. The article attributes this to context overload, limited training data, ambiguous prompts, and the model’s creative tendencies, and shows how resetting the conversation with a new session and a proper handoff can dramatically improve accuracy and efficiency for software test engineers.

AI Hallucination · Large Language Models · Prompt Engineering

0 likes · 10 min read

AI Algorithm Path

Jul 27, 2025 · Artificial Intelligence

Understanding RLHF: How Human Feedback Trains Modern LLMs

This article explains the RLHF (Reinforcement Learning from Human Feedback) pipeline that powers ChatGPT and other large language models, covering the limitations of traditional fine‑tuning, the creation of human‑feedback datasets, reward‑model training, loss design, and the final PPO‑based fine‑tuning step.

ChatGPT · Human Feedback · Large Language Models

0 likes · 8 min read

AI Info Trend

Jul 24, 2025 · Industry Insights

What’s Driving AI Adoption in 2025? Six Key Trends Uncovered

The AI Adoption Survey H1 2025 reveals that nearly half of organizations have deployed AI in production, that engineering and R&D lead usage, and that Chinese LLMs are gaining overseas interest. Cost, reliability and intelligence remain the top challenges, while tool preferences and multimodal trends reshape the market.

AI adoption · AI infrastructure · AI trends

0 likes · 7 min read

AI2ML AI to Machine Learning

Jul 24, 2025 · Artificial Intelligence

Exploring Recent Large‑Model Agent Papers: Insights and Analyses

This article reviews a series of recent research papers on large‑model agents. Topics include reinforcement‑learning‑driven ML agents, the premise‑critique ability of LLMs, long‑term tool‑augmented LLM evaluation, agentic RAG, set‑based retrieval for multi‑hop QA, mobile VLM agents, and broader surveys of LLM applications. For each work, it summarizes the problem statement, prior approaches, novel contributions, experimental results, limitations, and future directions.

Benchmark · LLM evaluation · Large Language Models

0 likes · 46 min read

Alibaba Cloud Big Data AI Platform

Jul 23, 2025 · Artificial Intelligence

How to Distill Large Language Models for Efficient Text Generation with EasyDistill

This guide explains how to use the EasyDistill framework and Alibaba Cloud PAI to distill large language models for high‑quality text generation, covering model deployment, SFT and DPO training data construction, code examples, configuration files, and best practices for achieving resource‑efficient, high‑performance student models.

DPO · EasyDistill · Large Language Models

0 likes · 14 min read

Alibaba Cloud Big Data AI Platform

Jul 23, 2025 · Artificial Intelligence

Unlock Efficient LLMs: How Alibaba’s PAI EasyDistill Powers Model Post‑Training

This article explains how Alibaba Cloud's AI platform PAI leverages the EasyDistill framework for post‑training model optimization, covering knowledge distillation concepts, data synthesis techniques, basic and advanced distillation training, the DistilQwen model family, real‑world customer cases, and step‑by‑step practical demos.

AI Platform · EasyDistill · Knowledge Distillation

0 likes · 12 min read

Tencent Cloud Developer

Jul 23, 2025 · Artificial Intelligence

Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search

This article explains how the inherent knowledge limits of large language models drive the rise of Retrieval‑Augmented Generation (RAG), outlines its three evolutionary stages, introduces Agentic RAG and DeepSearch, and discusses the knowledge and ability boundaries that shape future AI search systems.

AI Search · DeepSearch · Knowledge retrieval

0 likes · 19 min read

Alibaba Cloud Developer

Jul 21, 2025 · Artificial Intelligence

Unlocking LLM Power: How Context Engineering Transforms AI Assistants

Context engineering, the emerging discipline of structuring and managing input information for large language models, goes beyond simple prompt design by addressing issues such as context poisoning, overload, and conflict, offering strategies like intelligent retrieval, isolation, pruning, and compression to build reliable, high‑performing AI agents.

AI productivity · Agent Design · Context Engineering

0 likes · 19 min read

DataFunTalk

Jul 21, 2025 · Artificial Intelligence

From Prompt Engineering to Context Engineering: Transforming LLM Interactions

This article traces the evolution from prompt engineering to context engineering, detailing technical milestones, core concepts, practical strategies, and future trends that together reshape large language model applications and enable sophisticated AI agents across diverse domains.

Large Language Models · Prompt Engineering · Retrieval-Augmented Generation

0 likes · 35 min read

Data Thinking Notes

Jul 20, 2025 · Artificial Intelligence

Mastering Context Engineering: Boost LLM Performance with Advanced Techniques

This article introduces context engineering, a new discipline for optimizing large language model inputs. It discusses expanding context windows, compares context engineering with prompt engineering, outlines core techniques such as information organization, dynamic management, and semantic retrieval, and offers practical applications and recommendations for improving AI performance across domains.

Large Language Models · Prompt Engineering · ai-optimization

0 likes · 11 min read
