Tagged articles

675 articles

Jan 22, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Calling Locally Deployed LLMs via OpenAI API Format in Python

This tutorial explains the OpenAI‑style request and response schema, demonstrates low‑level API calls with the requests library, compares them to the high‑level openai package, and walks through building a streaming multi‑turn chatbot that interacts with a locally hosted large language model.

Chatbot · Large Language Model · OpenAI API

0 likes · 17 min read

AI Engineering

Jan 21, 2026 · Artificial Intelligence

Running Large Language Models on Phones: Liquid AI’s LFM2.5‑1.2B‑Thinking Fits in 900 MB

Liquid AI’s LFM2.5‑1.2B‑Thinking model runs entirely on a smartphone with only 900 MB of memory, scores 88 on MATH‑500, 69 on Multi‑IF, and 57 on BFCLv3 benchmarks, outperforms larger rivals, and achieves real‑time speeds on Snapdragon 8 Elite and AMD Ryzen 9 3950X, signaling a shift toward edge AI.

LFM2.5 · Large Language Model · Ryzen

0 likes · 4 min read

PaperAgent

Jan 17, 2026 · Artificial Intelligence

How Qwen3‑VL Embedding and Reranker Set New SOTA in Multimodal Retrieval

The article analyzes the Qwen3‑VL‑Embedding and Qwen3‑VL‑Reranker models, detailing their unified vector space, multi‑stage training pipeline, Matryoshka representation learning, quantization techniques, massive synthetic data generation, and benchmark results that push multimodal retrieval performance to a new state‑of‑the‑art.

Embedding · Knowledge Distillation · Large Language Model

0 likes · 7 min read

PaperAgent

Jan 16, 2026 · Artificial Intelligence

How a 4B Model Beats 30B Giants: Inside AgentCPM-Explore’s SOTA Performance

AgentCPM-Explore, a 4‑billion‑parameter open‑source model, achieves state‑of‑the‑art results on long‑range exploration tasks, matching or surpassing larger 8B and even 30B models, thanks to a full‑stack infrastructure, novel training tricks, and extensive benchmark evaluations across eight agent‑centric datasets.

Agent · AgentCPM-Explore · Large Language Model

0 likes · 10 min read

PaperAgent

Jan 13, 2026 · Artificial Intelligence

How C2LLM Redefines Code Retrieval with Attention‑Based Pooling

Introducing C2LLM, a contrastive code LLM series that replaces mean and EOS pooling with a multi‑head attention pooling module, achieving top scores on the MTEB‑Code benchmark across 12 tasks and demonstrating cost‑effective, high‑precision code retrieval for both production and AI agent applications.

Large Language Model · MTEB-Code · Retrieval-Augmented Generation

0 likes · 8 min read

DeWu Technology

Jan 12, 2026 · Mobile Development

How We Built an AI‑Powered Smart Inspection System for Mobile Apps

This article details the design and implementation of an AI‑driven smart inspection platform for a mobile app, covering background challenges, system architecture, core detection features—including layout, visual, consistency, and AI‑operation checks—platform configuration, result feedback, and the measurable improvements achieved.

AI inspection · Large Language Model · Smart Inspection

0 likes · 19 min read

PaperAgent

Jan 10, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: Why Its Coding Power Beats Claude and GPT

The article reports that DeepSeek's newly announced V4 model, the successor to its December 2024 V3 release, shows stronger coding abilities than the Claude and GPT series, and it covers the model's data composition, infrastructure, training costs, failed experimental attempts, expanded benchmark comparisons, and a comprehensive safety report.

AI model analysis · DeepSeek · Large Language Model

0 likes · 4 min read

Bighead's Algorithm Notes

Jan 8, 2026 · Artificial Intelligence

Alpha‑R1: Reinforcement‑Learning‑Driven Large‑Model Alpha Factor Selection

Alpha‑R1 integrates reinforcement learning with an 8‑billion‑parameter LLM to jointly process price and news data, creating context‑aware factor embeddings that outperform traditional quantitative and generic LLM baselines on CSI 300 and CSI 1000 portfolios, demonstrating robust alpha‑decay resistance and zero‑shot generalization.

Financial AI · Large Language Model · alpha factor selection

0 likes · 16 min read

AI Info Trend

Jan 7, 2026 · Artificial Intelligence

MiroThinker 1.5: 30B Model Beats 1T‑Scale LLMs via Interactive Scaling

Released by the MiroMind team, MiroThinker 1.5 demonstrates that a 30‑billion‑parameter model can match or surpass the performance of 1‑trillion‑parameter LLMs by leveraging Interactive Scaling, achieving top rankings on multiple search benchmarks, dramatically lower inference cost, and open‑source availability for developers.

AI benchmarks · Large Language Model · MiroThinker

0 likes · 6 min read

Amap Tech

Dec 29, 2025 · Artificial Intelligence

How G‑Plan Transforms Map Recommendations with AI Agents and Multi‑Demand Planning

This article details how Gaode's G‑Plan combines large‑model AI agents, generative ranking, and spatiotemporal counterfactual DPO to model and prioritize multiple user intents on the home page, presents the system architecture, experimental setup, online gains, and ablation results, and explains how it moves recommendation from passive to proactive planning.

AI recommendation · Large Language Model · intent planning

0 likes · 21 min read

AI Insight Log

Dec 28, 2025 · Artificial Intelligence

GLM-4.7 Hits Global #6 and Leads Open‑Source LLM Rankings, Outperforming Claude 4.5 Sonnet

GLM-4.7 scores 68 points to rank sixth worldwide and first among open‑source models, surpassing Claude 4.5 Sonnet, with strong reasoning performance, fast generation speed, but higher cost and weaker code‑generation and math abilities compared to rivals.

GLM-4.7 · Large Language Model · Open Source

0 likes · 7 min read

DataFunTalk

Dec 25, 2025 · Artificial Intelligence

How DeepAgent Redefines General AI Reasoning with Scalable Toolsets

DeepAgent, a new end‑to‑end reasoning agent, integrates autonomous thinking, dynamic tool search, and execution to handle over 16,000 APIs, embodied tasks, and research assistance, achieving state‑of‑the‑art performance on benchmarks like TMDB, ToolBench, ALFWorld, WebShop, and GAIA.

Large Language Model · Memory Management · Reasoning

0 likes · 15 min read

Xiaomi Tech

Dec 24, 2025 · Artificial Intelligence

DeepLight & AgentMat: Xiaomi and SJTU Launch AI Platform for Light Alloy Design

Xiaomi and Shanghai Jiao Tong University introduced DeepLight, an AI‑driven large model for lightweight alloys, together with the AgentMat multi‑agent framework that accelerates the full design cycle tenfold, and the LightAlloy‑Bench benchmark where DeepLight outperforms DeepSeek‑V3 and GPT‑4o by about 20 %.

AI · Large Language Model · Lightweight Alloys

0 likes · 8 min read

PaperAgent

Dec 23, 2025 · Artificial Intelligence

CATArena: A Competitive Benchmark That Turns Agent Scoring into Evolutionary Learning

CATArena introduces a tournament‑style evaluation framework where AI agents iteratively code, compete, and improve across classic board games, using three‑dimensional quantitative scores to measure strategy programming, global learning, and generalization, and reveals how different LLM‑based agents learn and adapt over multiple rounds.

AI Benchmark · Agent Evaluation · CATArena

0 likes · 8 min read

DataFunSummit

Dec 20, 2025 · Artificial Intelligence

How AutoHome Built the Cangjie Large Model: From Training Architecture to Real-World AI Applications

This article details AutoHome's end‑to‑end development of the Cangjie large model, covering the training infrastructure with distributed data, pipeline and tensor parallelism, core business use cases such as video script generation and multi‑tool Agent capabilities, inference optimizations through quantization and fast serving frameworks, and future directions for personalized automotive AI services.

Agent AI · Large Language Model · Video Generation

0 likes · 19 min read

PaperAgent

Dec 19, 2025 · Artificial Intelligence

Inside Xiaomi’s MiMo‑V2‑Flash: How a Hybrid SWA Design Powers Fast, Efficient AI Reasoning

Xiaomi’s newly open‑sourced MiMo‑V2‑Flash model combines a hybrid sliding‑window attention (SWA) architecture with a 309B‑parameter MoE design, delivering top‑tier reasoning, coding and agent performance while introducing the efficient MOPD post‑training paradigm that dramatically reduces RL compute costs.

Hybrid SWA · Large Language Model · MOPD

0 likes · 5 min read

AI Insight Log

Dec 18, 2025 · Artificial Intelligence

Xiaomi’s New MiMo‑V2‑Flash LLM Rivals DeepSeek‑V3.2 and Near‑GPT‑5 High

Xiaomi’s MiMo‑V2‑Flash, a 309B‑parameter MoE LLM with only 15B active weights, uses Hybrid SWA, Multi‑Token Prediction and Multi‑Teacher On‑Policy Distillation to reduce the KV cache sixfold, boost inference speed 2.6×, and achieve performance comparable to DeepSeek‑V3.2, Kimi‑K2 and near‑GPT‑5 High, including a 73.4% SWE‑Bench code‑agent score.

Hybrid SWA · Large Language Model · MOPD

0 likes · 7 min read

AI Insight Log

Dec 17, 2025 · Artificial Intelligence

Google Unveils Gemini 3 Flash: Free, Lightning‑Fast, and Outperforms Its Predecessor

Google released Gemini 3 Flash without warning, offering Pro‑level intelligence at Flash‑speed, costing just $0.5 per million input tokens and $3 per million output tokens, delivering three‑times faster inference than Gemini 2.5 Pro and surpassing it on benchmarks such as GPQA Diamond (90.4%), SWE‑bench (78.0%) and MMMU‑Pro (81.2%), while being freely accessible to all users and developers via the Gemini app, AI Studio, or API.

Gemini 3 Flash · Google AI · Large Language Model

0 likes · 5 min read

DataFunTalk

Dec 17, 2025 · Artificial Intelligence

How Large Language Models Unlock Field‑Level Data Lineage at Scale

This talk explains how a data platform tackled massive, heterogeneous enterprise data by using large language models and prompt engineering to automatically extract field‑level lineage from SQL scripts, achieve over 80% coverage, and raise accuracy above 95%, dramatically cutting impact‑analysis time.

AI for data engineering · Big Data · Data Lineage

0 likes · 6 min read

Design Hub

Dec 12, 2025 · Artificial Intelligence

GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work

OpenAI's newly released GPT-5.2 claims to outperform human experts on about 70% of real tasks, achieve a perfect score on the AIME 2025 competition, and deliver dramatic efficiency gains—up to 390× cost reduction—while showcasing impressive examples such as one‑shot ocean shader generation, a full 3D engine built in a single file, and visual‑perception scores rivaling top models.

AI benchmarks · Agent AI · Efficiency

0 likes · 8 min read

AI Insight Log

Dec 11, 2025 · Artificial Intelligence

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

OpenAI’s GPT‑5.2 launch introduces three specialized modes, achieves a record 55.6% score on SWE‑Bench Pro, demonstrates strong front‑end generation, adds a /compact API for long‑context efficiency, offers tiered pricing with cache discounts, and improves safety for younger users.

AI benchmarking · AI safety · GPT-5.2

0 likes · 6 min read

Data Party THU

Dec 10, 2025 · Artificial Intelligence

How DeepSeek‑V3.2 Cuts Inference Cost and Boosts Agent Skills with Sparse Attention

DeepSeek's V3.2 release introduces a dual‑model lineup, a Sparse Attention architecture that halves long‑context inference cost, a post‑training reinforcement‑learning pipeline that exceeds 10% of pre‑training compute, and a revamped agent framework that dramatically improves tool‑use and reasoning performance across benchmarks.

DeepSeek · Large Language Model · Sparse Attention

0 likes · 11 min read

Alibaba Cloud Developer

Dec 9, 2025 · Artificial Intelligence

Building Human‑in‑the‑Loop Agent Workflows with MCP on OpenLM

This article explains how to design and implement Human‑in‑the‑Loop (HITL) interactions for large‑model agents on Alibaba's OpenLM platform, covering the challenges of server‑side execution, MCP transport extensions, tool‑calling patterns, timeout handling, and UI rendering strategies across multiple client devices.

Agent · Human-in-the-Loop · Large Language Model

0 likes · 39 min read

Amap Tech

Dec 3, 2025 · Artificial Intelligence

How Gaode’s G‑Action Uses Generative AI to Predict Users’ Next Move

Gaode’s G‑Action framework combines large‑language‑model pre‑training with fine‑tuned generative recommendation to predict a user’s immediate action and destination, transforming static map services into a dynamic, context‑aware experience and delivering measurable gains in click‑through and engagement metrics.

AI · Large Language Model · Map Services

0 likes · 15 min read

DataFunTalk

Dec 2, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

This article reviews three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's GRAB generative ranking model—detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI Search · AI agents · Generative Ranking

0 likes · 8 min read

Frontend AI Walk

Dec 2, 2025 · Artificial Intelligence

Understanding LLMs: A Frontend Developer’s Primer on Large Language Models

The article demystifies large language models for frontend developers by likening token prediction to autocomplete, explaining tokens, context windows, temperature, the two-stage training process, and the critical role of prompts, using concrete code examples and analogies to familiar frontend concepts.

Frontend Analogy · LLM · Large Language Model

0 likes · 10 min read

Wuming AI

Nov 30, 2025 · Artificial Intelligence

What Exactly Is a Large Language Model? A Simple Guide to AI, Transformers, and How They Work

This article explains the relationship between AI, machine learning, deep learning, and large language models, detailing their evolution, training stages, transformer architecture, attention mechanisms, inference APIs, and practical usage examples, while demystifying common misconceptions about LLM capabilities.

AI fundamentals · Large Language Model · Machine Learning

0 likes · 10 min read

Kuaishou Tech

Nov 28, 2025 · Artificial Intelligence

Keye-VL-671B-A37B Leads Vision, Video, and Math Benchmarks

Kwai has open‑sourced its new flagship multimodal model Keye‑VL‑671B‑A37B, which upgrades visual perception, cross‑modal alignment and complex reasoning, achieving top scores on image, video, and mathematical reasoning benchmarks while detailing its architecture, three‑stage pre‑training, post‑training strategies, and future multimodal agent plans.

Large Language Model · Multimodal · Open Source

0 likes · 10 min read

AsiaInfo Technology: New Tech Exploration

Nov 28, 2025 · Artificial Intelligence

Boosting 5G Complaint Intent Detection with Large-Model-Enhanced Few-Shot Learning

This paper presents a collaborative framework where a large language model generates high‑quality synthetic samples to augment a lightweight model, dramatically improving few‑shot user‑complaint intent recognition in 5G networks, achieving a 21% boost for rare categories and a 9% overall accuracy gain.

Knowledge Distillation · Large Language Model · complaint intent detection

0 likes · 27 min read

Alibaba Cloud Developer

Nov 27, 2025 · Artificial Intelligence

How AI Powers Ethnic Product Categorization for Global E‑Commerce

This article presents an end‑to‑end AI solution that builds a cultural knowledge base and leverages large language models to automatically identify and match ethnic‑specific product categories on a cross‑border e‑commerce platform, reducing mis‑matches from 8.4% to 1.8% and cutting iteration time from days to under one day.

AI · Knowledge Base · Large Language Model

0 likes · 19 min read

DataFunSummit

Nov 20, 2025 · Artificial Intelligence

How 1688 Reinvented E‑commerce Search with AI‑Powered Generative Retrieval

This article details Alibaba’s 1688 platform’s shift from traditional e‑commerce search to AI‑driven generative retrieval, covering the AI Deep Search 1.0 and 2.0 cascaded frameworks, multimodal capabilities, an end‑to‑end “model‑as‑search‑engine” approach, experimental results, challenges, and future directions.

AI · E-commerce Search · Generative Retrieval

0 likes · 18 min read

HyperAI Super Neural

Nov 20, 2025 · Artificial Intelligence

From 9,874 Papers to 15,000 Structures: MOF‑ChemUnity Rebuilds MOF Knowledge for Explainable AI

MOF‑ChemUnity constructs a scalable, extensible knowledge graph that links millions of MOF names and synonyms to over 15,000 crystal structures using LLM‑driven entity matching, enabling accurate, explainable AI‑assisted material discovery, water‑stability prediction, expert recommendation validation, and graph‑enhanced retrieval across diverse applications.

Graph RAG · Knowledge Graph · Large Language Model

0 likes · 17 min read

Alibaba Cloud Developer

Nov 19, 2025 · Artificial Intelligence

Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation

This article details a practical case study of designing, implementing, and evaluating an AI-driven proofreading agent for a media client, covering background challenges, a three‑layer architecture, prompt engineering techniques, RAG knowledge‑base construction, model selection, fine‑tuning, automated metrics, and lessons learned.

AI · Large Language Model · Proofreading

0 likes · 26 min read

Wuming AI

Nov 10, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? A Clear, Practical Guide

This article explains the concept of AI agents, contrasting them with chatbots, detailing their capability and structural layers, summarizing academic surveys and whitepapers, and illustrating how agents plan, perceive, and act to autonomously accomplish user‑defined goals.

AI Agent · Agent Architecture · Autonomous Planning

0 likes · 9 min read

JD Tech Talk

Nov 10, 2025 · Artificial Intelligence

Designing an AI-Powered Experiment Analysis Agent: Architecture, Workflow, and Future Enhancements

This article outlines the motivation, design, architecture, engineering implementation, large‑model selection, and future improvement plans for an AI‑driven experiment analysis agent that integrates data aggregation, modular workflow orchestration, and interactive frontend features to streamline AB‑test insights.

AI Agent · Frontend Development · Large Language Model

0 likes · 14 min read

Efficient Ops

Nov 9, 2025 · Operations

How Tencent’s PCG Achieves Full‑Link Observability and AI‑Powered SRE

The talk details Tencent PCG’s end‑to‑end observability platform, its data‑standardization pipeline, client‑backend session linking, AI‑enhanced SRE Agent with large language models, and the roadmap toward a SaaS offering, illustrating how modern operations integrate AI for rapid fault localization.

AI · Large Language Model · Observability

0 likes · 17 min read

Bighead's Algorithm Notes

Nov 7, 2025 · Artificial Intelligence

Weekly AI Finance Paper Digest (Nov 1‑7 2025)

This digest summarizes three recent AI‑driven finance papers—DeltaLag’s dynamic lead‑lag detection, MS‑HGFN’s multi‑scale graph network for stock movement, and LiveTradeBench’s real‑time LLM trading benchmark—highlighting their methods, datasets, and performance gains.

Financial AI · Graph Neural Network · Large Language Model

0 likes · 8 min read

Tencent Advertising Technology

Nov 6, 2025 · Artificial Intelligence

Boosting Web UI Test Efficiency with AIGC: From Manual Scripts to Intelligent Automation

This report examines the challenges of Web UI testing in Tencent's advertising platform, analyzes current inefficiencies, and presents an AIGC-driven solution that leverages large language models, semantic scripts, and automated pipelines to dramatically improve test case generation, execution accuracy, and CI/CD integration.

AIGC · Automation · Large Language Model

0 likes · 27 min read

Amap Tech

Nov 4, 2025 · Artificial Intelligence

Spacetime‑GR: AI‑Powered Spatiotemporal Model Transforming POI Recommendations

This article introduces Spacetime‑GR, a large‑scale generative recommendation model that integrates hierarchical geographic POI indexing and spatiotemporal token encoding to enhance POI prediction for Amap, detailing its pre‑training pipeline, data cleaning, curriculum learning strategy, experimental results, scaling law observations, and the resulting improvements in hit rate and discovery rate.

Amap · Curriculum Learning · Large Language Model

0 likes · 14 min read

21CTO

Nov 4, 2025 · Artificial Intelligence

LongCat-Flash-Omni: How an Open-Source 560B Model Achieves Real-Time Multimodal Mastery

LongCat-Flash-Omni, an open‑source 560 billion‑parameter multimodal model, combines efficient Shortcut‑Connected MoE architecture with advanced perception and speech modules to deliver low‑latency real‑time audio‑video interaction and state‑of‑the‑art performance across text, image, video, and audio tasks.

Efficient Inference · Large Language Model · Real-Time Interaction

0 likes · 10 min read

Meituan Technology Team

Nov 3, 2025 · Artificial Intelligence

LongCat-Flash-Omni: 560B Open‑Source Multimodal Model with Real‑Time Interaction

LongCat-Flash-Omni, the latest open‑source model from Meituan, combines a 560 billion‑parameter architecture, efficient multimodal perception and speech reconstruction modules, and a progressive training strategy to deliver real‑time audio‑video interaction and state‑of‑the‑art performance across text, image, audio, and video tasks.

AI · Large Language Model · Multimodal

0 likes · 9 min read

Huawei Cloud Developer Alliance

Nov 3, 2025 · Artificial Intelligence

How AI Agents Are Revolutionizing Technology: The New Engine of Innovation

This article explores the rise of AI agents—from their definition as intelligent digital assistants powered by large language models to their evolution through planning, memory, and tool use—highlighting real‑world applications, core technical mechanisms, code implementations, and future trends such as autonomy, multimodal fusion, standardization, and safety considerations.

AI Agent · Large Language Model · Multimodal

0 likes · 24 min read

Data Party THU

Nov 2, 2025 · Artificial Intelligence

From RNN to LLM: How Transformers Power Modern Language Models

This article explains the evolution from RNNs through Encoder‑Decoder models to Transformers, detailing self‑attention, multi‑head attention, and masked attention, and then describes what Large Language Models are, their key components, capabilities, limitations, and common applications.

AI · LLM · Large Language Model

0 likes · 9 min read

DataFunSummit

Oct 31, 2025 · Artificial Intelligence

How OPPO’s AndesVL Is Revolutionizing On‑Device Multimodal AI

OPPO AI Center introduces AndesVL, an open‑source, fully‑adapted multimodal large model ranging from 0.6B to 4B parameters, designed for high‑performance, privacy‑preserving, low‑latency AI on mobile devices, with advanced architecture, training pipelines, on‑device optimizations, and state‑of‑the‑art benchmark results.

Large Language Model · Model Compression · mobile AI

0 likes · 21 min read

DataFunSummit

Oct 30, 2025 · Artificial Intelligence

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

This article explores how the explosion of unstructured data exposes the limits of traditional OCR and shows how emerging multimodal large language models provide end‑to‑end document understanding, reduce pipeline complexity, cut training costs, enable hybrid retrieval‑augmented generation, and drive real‑world industry deployments.

AI · Document Processing · Large Language Model

0 likes · 28 min read

DataFunSummit

Oct 30, 2025 · Artificial Intelligence

Bilibili’s AI Assistant: Using Large Language Models to Tackle Big Data Ops

This article explains how Bilibili’s massive video platform built a five‑layer, storage‑compute separated big‑data infrastructure and employed a large language model‑driven intelligent assistant to automatically diagnose and resolve frequent offline task failures and slowdowns, addressing common user queries about task reliability and performance.

Intelligent Assistant · Large Language Model · big data platform

0 likes · 4 min read

Zhuanzhuan Tech

Oct 29, 2025 · Artificial Intelligence

How Reinforcement Learning Boosts Stability and Speed in LLM QA Systems

This article examines how reinforcement‑learning techniques such as PPO, DPO, and GRPO are integrated into the Baixiaosheng QA system to improve answer stability, deepen domain knowledge understanding, and accelerate response generation, and it evaluates the impact of Reinforcement Fine‑Tuning (RFT) on real‑world performance.

AI · DPO · GRPO

0 likes · 16 min read

AntTech

Oct 29, 2025 · Artificial Intelligence

Inside Ant’s Baoling: Balancing Efficiency and Reasoning in a 1‑Trillion‑Parameter Model

At the Ant Star Innovation Journey event, the Baoling team unveiled their roadmap for trillion‑parameter models, detailing the development of Ling‑1T, Ring‑1T and multimodal Ming series, the scaling‑law‑guided architecture, training innovations, evaluation methods, and open‑source releases that aim to advance efficient, high‑performance AI.

Efficient Inference · Large Language Model · Scaling Law

0 likes · 24 min read

AntTech

Oct 28, 2025 · Artificial Intelligence

Ming-Flash-Omni-Preview: 103B Open-Source Multimodal Model Excelling in Image, Video, and Speech

Introducing Ming‑Flash‑Omni‑Preview, a 103‑billion‑parameter open‑source multimodal model built on a sparse MoE architecture that delivers state‑of‑the‑art performance in controllable image generation, streaming video understanding, and context‑aware speech recognition, surpassing prior models on GenEval and GEdit benchmarks.

Large Language Model · Multimodal · Sparse MoE

0 likes · 8 min read

DataFunTalk

Oct 28, 2025 · Artificial Intelligence

How Bilibili Uses Large Language Models to Solve Big Data Platform Issues

This article explains Bilibili's massive data platform architecture, the common offline‑task failures and slowdowns users encounter, and how the company applies a large‑language‑model‑driven intelligent assistant to diagnose and resolve these engineering problems efficiently.

AI assistance · Bilibili · Large Language Model

0 likes · 4 min read

Amap Tech

Oct 27, 2025 · Artificial Intelligence

Turning Maps into a Living Map: Amap’s G-Where Generative AI Recommendation

Amap upgrades its homepage recommendation by integrating large‑model capabilities—G‑Where, G‑Action, and G‑Plan—through semantic ID generation, item tokenization, and multi‑stage LLM training, achieving significant offline and online performance gains while illustrating a scalable generative recommendation framework.

AI · Generative Recommendation · Large Language Model

0 likes · 21 min read

Advanced AI Application Practice

Oct 23, 2025 · Artificial Intelligence

Using an AI Large Model to Automate Report Comparison Testing

The article demonstrates how Tencent's Hunyuan large model can generate and iteratively refine Python scripts that automatically compare Excel‑based reports, highlight differences, and handle multiple files, thereby streamlining regression testing and reducing manual effort.

AI · Automation · Large Language Model

0 likes · 5 min read

DataFunTalk

Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agents · Large Language Model · RAG

0 likes · 4 min read

Wu Shixiong's Large Model Academy

Oct 23, 2025 · Artificial Intelligence

Why the Transformer Core Structure Is the Key to AI Interview Success

This article explains the fundamental purpose, architecture, and variants of the Transformer model—including Encoder‑Decoder, Encoder‑only, and Decoder‑only designs—while detailing how attention mechanisms work and why modern large‑language models favor the Decoder‑only approach, providing a concise framework for answering interview questions.

AI Interview · Encoder-Decoder · Large Language Model

0 likes · 10 min read


Data Party THU

Oct 22, 2025 · Artificial Intelligence

Demystifying Large‑Model Reinforcement Learning: From MDP Basics to Bellman and Advantage Functions

This article provides a comprehensive introduction to reinforcement learning for large language models, covering the Markov Decision Process formulation, the four core elements of RL, state‑value and action‑value functions, Bellman equations, and the advantage function that underpins modern policy‑gradient algorithms.

AI fundamentals · Bellman equation · Large Language Model

0 likes · 13 min read


IT Services Circle

Oct 20, 2025 · Artificial Intelligence

How NanoChat Lets Anyone Train a ChatGPT‑Like Model for $100

NanoChat, an open‑source full‑stack AI model solution created by Andrej Karpathy, enables users to train a functional chat model on a modest $100 cloud GPU rental, offering a low‑cost, hands‑on alternative to proprietary large‑language‑model services.

AI training · Large Language Model · cost-effective

0 likes · 4 min read


AI2ML AI to Machine Learning

Oct 15, 2025 · Artificial Intelligence

NanoChat Source Code Deep Dive: Karpathy’s Full‑Stack LLM Pipeline Explained

This article dissects NanoChat’s end‑to‑end LLM pipeline—from a lightweight 561M‑parameter transformer and custom Rust BPE tokenizer to Chinchilla‑scaled training, multi‑task fine‑tuning, optional RL on GSM8K, KV‑cache inference optimizations, and benchmark results that slightly surpass GPT‑2 Large.

CORE benchmark · Chinchilla scaling · FastAPI

0 likes · 10 min read


AntTech

Oct 14, 2025 · Artificial Intelligence

How Ring-1T Achieves Trillion-Scale Deep Thinking and Competitive Benchmarks

The Ring-1T model, a trillion-parameter AI system released as open source, leverages advanced reinforcement learning techniques, extensive benchmark evaluations, and custom training frameworks to deliver balanced performance across math, code, reasoning, and creative tasks while highlighting current limitations and future development plans.

AI model · Large Language Model · benchmark evaluation

0 likes · 8 min read


AI2ML AI to Machine Learning

Oct 13, 2025 · Artificial Intelligence

How Large‑and‑Small Language Model Collaboration Is Shaping the Future

The article argues that combining large, high‑capacity models with lightweight, fine‑tuned small models can cut costs, lower latency, enable specialized vertical tasks, and shift development from chasing ever‑bigger models toward optimal system architectures, outlining key techniques such as state‑space models, knowledge distillation, and staged fine‑tuning.

AI Architecture · Efficiency · Knowledge Distillation

0 likes · 3 min read


DataFunTalk

Oct 13, 2025 · Artificial Intelligence

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while explaining the underlying technologies of Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems.

AI applications · Agent · Large Language Model

0 likes · 4 min read


Bighead's Algorithm Notes

Oct 11, 2025 · Artificial Intelligence

Recent Advances in Multivariate Time Series Forecasting: Paper Summaries (Sep 27 – Oct 10 2025)

This article summarizes eight newly released AI papers on multivariate time‑series forecasting and anomaly detection, detailing each work's motivation, proposed methodology, key innovations such as CRIB, TS‑JEPA, DSAT‑HD, DIMIGNN, ASTGI, IndexNet, TsLLM, Moon, TimeSeriesScientist, MLG‑4TS, and Augur, and reports their experimental validation on real‑world datasets.

Anomaly Detection · Large Language Model · Transformer

0 likes · 23 min read


DataFunSummit

Oct 10, 2025 · Artificial Intelligence

How Ping An Life Built ChatBI: An AI‑Powered Intelligent BI Platform

This article details Ping An Life's self‑developed large‑model reporting product ChatBI, covering its background, goals, solution architecture, technical stack, real‑world use cases, deployment challenges, and future outlook, offering practical insights for enterprises adopting AI‑driven business intelligence.

AI · Chatbot · Data Platform

0 likes · 17 min read


DataFunTalk

Oct 9, 2025 · Artificial Intelligence

How Bilibili Uses Large Language Models to Solve Big Data Task Failures

This article explains Bilibili's massive data platform architecture, the common reasons offline tasks fail or slow down, and how the company is exploring large‑language‑model‑driven assistants to automatically diagnose and resolve these engineering issues.

AI assistance · Bilibili · Large Language Model

0 likes · 4 min read


HyperAI Super Neural

Oct 8, 2025 · Artificial Intelligence

From WeChat’s AI Podcast Trial to Google, ByteDance and Xiaohongshu: Can AI Podcasts Capture the Emerging AIGC Blue Ocean?

The article examines how breakthroughs in large language models and high‑fidelity TTS are powering AI‑generated podcasts, analyzes the technical advances behind the "human‑like" sound, surveys major players such as Google, ByteDance, Xiaohongshu and startups, and evaluates the market potential of this rapidly expanding AIGC niche.

AI podcast · AIGC · ByteDance

0 likes · 9 min read


DataFunSummit

Oct 7, 2025 · Artificial Intelligence

Bilibili’s AI‑Powered Assistant: Solving Big Data Task Failures with LLMs

This article details Bilibili's implementation of a large‑language‑model‑driven intelligent assistant that helps engineers diagnose and resolve massive offline and real‑time data‑processing failures, describing the platform’s five‑layer architecture, common failure and slowdown causes, and the need for AI‑powered troubleshooting support.

Bilibili · Intelligent Assistant · Large Language Model

0 likes · 4 min read


DataFunSummit

Oct 6, 2025 · Artificial Intelligence

How Bilibili Leverages Large Language Models to Solve Big Data Platform Failures

This article explains Bilibili's massive video platform data architecture, the huge daily workload of offline and real‑time tasks, common user problems like task failures and slowdowns, their root causes, and how a large language model assistant is being used to automate troubleshooting.

AI assistance · Bilibili · Large Language Model

0 likes · 4 min read


Fun with Large Models

Sep 30, 2025 · Artificial Intelligence

DeepSeek-V3.2 Architecture Breakthrough: A 5‑Minute Guide to Its Core Features

The article introduces DeepSeek-V3.2, highlighting its new DeepSeek Sparse Attention (DSA) that boosts training and inference efficiency by up to 50%, cuts model usage costs dramatically, explains the updated API endpoints, and details the four‑stage post‑training pipeline that underpins the model’s performance improvements.

AI Architecture · DSA · DeepSeek-V3.2

0 likes · 8 min read


DataFunTalk

Sep 30, 2025 · Artificial Intelligence

DeepSeek‑V3.2‑Exp Unveiled: Million‑Token Context, Sparse Attention, and Cost‑Effective Inference

DeepSeek‑V3.2‑Exp, the latest experimental large‑language model, is open‑sourced with a paper, featuring a million‑token context window, a new sparse attention mechanism, GRPO‑enhanced reasoning, and detailed cost‑analysis showing up to ten‑fold inference savings.

DeepSeek · GRPO · Inference Optimization

0 likes · 5 min read


HyperAI Super Neural

Sep 30, 2025 · Artificial Intelligence

SpikingBrain-1.0 Achieves 100× Faster Inference with Brain‑Inspired Spiking Architecture

SpikingBrain-1.0, the first domestically‑produced brain‑inspired spiking large model, links spiking neuron dynamics to linear attention, delivering over 100× faster first‑token latency on 4‑million‑token sequences, 23.4% FLOP utilization, 69% sparsity, and a one‑click deployment tutorial on HyperAI.

Large Language Model · SpikingBrain-1.0 · brain-inspired AI

0 likes · 7 min read


Alipay Experience Technology

Sep 29, 2025 · Artificial Intelligence

How a Constraint-Aware Multi-Agent AI Won the IJCAI‑2025 Travel Planning Challenge

Alipay’s AI research team, together with Ant Group and East China Normal University, leveraged a self‑developed large‑model‑plus‑optimization framework to create a constraint‑aware multi‑agent system that won both the Original OS Track and DSL Track at the IJCAI‑2025 Autonomous Travel Itinerary Planning Competition.

AI · Large Language Model · Optimization

0 likes · 8 min read


DataFunTalk

Sep 28, 2025 · Artificial Intelligence

How Bilibili Leverages Large Language Models to Automate Big Data Operations

This article explores Bilibili’s implementation of a large‑language‑model‑driven intelligent assistant that helps troubleshoot massive offline and real‑time data processing tasks, detailing the platform’s five‑layer architecture, common failure causes, and how AI can streamline issue resolution.

AI Operations · Intelligent Assistant · Large Language Model

0 likes · 4 min read


DataFunTalk

Sep 27, 2025 · Artificial Intelligence

Bilibili’s AI Assistant: Using Large Language Models to Tackle Massive Data Tasks

This article explains how Bilibili leverages a large‑language‑model‑based intelligent agent to diagnose and resolve failures and slowdowns in its massive big‑data platform, detailing the platform architecture, workload scale, common user issues, and the need for automated assistance.

AI Operations · Bilibili · Intelligent Assistant

0 likes · 5 min read


Data Party THU

Sep 26, 2025 · Artificial Intelligence

How Keye‑VL‑1.5 Redefines Video Understanding with Slow‑Fast Encoding

Keye‑VL‑1.5, an 8‑billion‑parameter multimodal large language model, introduces a Slow‑Fast video encoding strategy, a four‑stage progressive pre‑training pipeline with 128K context, and a sophisticated post‑training regime that together achieve state‑of‑the‑art performance on video and vision‑language benchmarks while maintaining strong general capabilities.

Large Language Model · Multimodal LLM · benchmark

0 likes · 21 min read


Alibaba Cloud Big Data AI Platform

Sep 25, 2025 · Artificial Intelligence

Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide

This article explains the opportunities and challenges of Mixture of Experts (MoE) models, introduces expert parallelism as a solution to scaling and deployment bottlenecks, and provides a step‑by‑step guide for deploying MoE models with Alibaba Cloud PAI‑EAS, including configuration tips and code examples.

AI model deployment · Expert Parallelism · Large Language Model

0 likes · 11 min read


DataFunSummit

Sep 25, 2025 · Artificial Intelligence

Can NL→MQL→SQL Bridge the Gap to End‑to‑End Intelligent BI?

Aloudata Agent introduces a novel NL→MQL→SQL framework that combines large language models with a custom metric query language, enabling business users to perform end‑to‑end intelligent data analysis, attribution, and reporting without technical expertise, while balancing accuracy, cost, and performance.

Data Analysis · Intelligent BI · Large Language Model

0 likes · 18 min read


Fighter's World

Sep 24, 2025 · Artificial Intelligence

Aivis: Pioneering Autonomous Agents for Alibaba Cloud’s Next‑Gen Intelligent Services

The talk outlines how Alibaba Cloud’s Aivis autonomous service agent tackles the “impossible triangle” of ultra‑high experience, low cost, and complex services by evolving from tool‑based chatbots to teammate‑level agents, detailing a four‑layer architecture, domain‑model training, and actionable steps for enterprise AI service transformation.

AI Agent · Agent Architecture · Cloud Service

0 likes · 14 min read


AIWalker

Sep 23, 2025 · Artificial Intelligence

Manzano: A Small 3B Multimodal Model That Unifies Image Understanding and Generation with SOTA Performance

Manzano introduces a hybrid vision tokenizer and a three‑stage training recipe that let a 3‑billion‑parameter multimodal LLM achieve state‑of‑the‑art results on both image‑understanding benchmarks and text‑to‑image generation, while scaling smoothly to larger sizes and minimizing task conflict.

AI research · Large Language Model · Manzano

0 likes · 25 min read


DataFunTalk

Sep 23, 2025 · Artificial Intelligence

DeepSeek‑V3.1‑Terminus Fixes the ‘Extreme’ Bug and Outperforms Gemini 2.5 Pro

DeepSeek released the V3.1‑Terminus model, fixing the notorious bug in which the stray character "极" ("extreme") appeared in outputs, improving language consistency and Agent capabilities, and achieving notable benchmark gains that surpass Gemini 2.5 Pro, while providing download links and hinting at upcoming V4/R2 releases.

Agent · Artificial Intelligence · DeepSeek

0 likes · 6 min read


Meituan Technology Team

Sep 22, 2025 · Artificial Intelligence

LongCat-Flash-Thinking: The New SOTA Open-Source LLM for Deep Reasoning and Tool Use

Meituan’s LongCat team unveiled LongCat-Flash-Thinking, an open‑source large language model that combines deep logical reasoning with tool‑calling capabilities, achieving state‑of‑the‑art performance across logic, mathematics, code, and agentic tasks, and introducing novel training frameworks such as domain‑parallel RL and DORA.

AI · Large Language Model · Reasoning

0 likes · 7 min read


Data Party THU

Sep 20, 2025 · Artificial Intelligence

How DeepSeek Trained a $30M LLM for Just $294K – Inside the R1 Model

The article reports that DeepSeek’s R1 large language model, detailed in a peer‑reviewed Nature paper, cost about $294,000 (roughly $0.3 million) to train, using Nvidia H800 chips and novel pure reinforcement‑learning techniques, achieving competitive performance while remaining open‑source.

DeepSeek · Large Language Model · Nvidia H800

0 likes · 9 min read


DataFunTalk

Sep 20, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across content generation, intelligent customer service, and role‑playing, detailing the underlying SFT, Retrieval‑Augmented Generation, GraphRAG, and Agent technologies that enable smarter, more reliable AI solutions.

Agent · Knowledge Graph · Large Language Model

0 likes · 4 min read


HyperAI Super Neural

Sep 18, 2025 · Artificial Intelligence

DeepSeek‑R1 Costs $294K to Train, Hits Nature Cover as First Peer‑Reviewed Large Model

DeepSeek‑R1, the first mainstream large language model to pass peer review in Nature, was trained for $294,000 using 512 (64×8) H800 GPUs, and its RL‑enhanced version, DeepSeek‑R1‑Zero, achieved up to 86.7% pass@1 on AIME 2024, outperforming human averages across math, coding, and science tasks.

AI research · DeepSeek-R1 · Large Language Model

0 likes · 10 min read


DataFunTalk

Sep 18, 2025 · Artificial Intelligence

How DeepSeek‑R1’s Reinforcement Learning Earned a Nature Cover

DeepSeek‑R1, the first peer‑reviewed large language model, leveraged a pure reinforcement‑learning framework and the novel GRPO algorithm to achieve breakthrough reasoning performance, low training cost, and widespread acclaim, culminating in a Nature magazine cover story.

AI reasoning · DeepSeek · GRPO

0 likes · 14 min read


DataFunSummit

Sep 17, 2025 · Artificial Intelligence

How Tencent’s Large Language Model Powers Real-World AI Applications

This article explores Tencent’s large language model across diverse business scenarios—content generation, intelligent customer service, role‑playing, and more—detailing the principles and practical uses of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent technologies, and how they enhance model intelligence and user experience.

AI · Agent · Knowledge Graph

0 likes · 4 min read


DataFunSummit

Sep 14, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

This article examines Tencent's large language model deployments across various business scenarios, detailing the use of Retrieval‑Augmented Generation, GraphRAG for role‑playing, and Agent technologies, while also outlining core application areas and the three main technical approaches—SFT, RAG, and Agents.

AI agents · AI applications · GraphRAG

0 likes · 4 min read


Data Party THU

Sep 13, 2025 · Artificial Intelligence

How a Multi‑Agent Large Model Transforms Ecological Big‑Data Analysis

This report details a university project that built a flexible, high‑performance multi‑agent large‑model framework for ecological environment big‑data analysis, covering system architecture, individual agents, memory mechanisms, report generation, a FastAPI‑LangGraph backend, a React frontend, testing methodology, and future directions.

AI · Big Data · FastAPI

0 likes · 7 min read


Instant Consumer Technology Team

Sep 12, 2025 · Cloud Native

Deploy Large Language Models on Kubernetes with Ollama and Open-WebUI

This guide walks through deploying a local LLM on Kubernetes using Ollama for model serving and Open-WebUI for a web interface, covering namespace creation, storage setup, GPU support, service exposure, validation, and model download to ensure privacy, low latency, and high availability.

GPU · Kubernetes · Large Language Model

0 likes · 9 min read


Bighead's Algorithm Notes

Sep 11, 2025 · Artificial Intelligence

Fin-PRM: Alibaba’s Dianjin Team Introduces a Domain-Specific Process Reward Model for Financial Reasoning

Fin‑PRM, a domain‑specific process reward model for financial reasoning introduced by Alibaba’s Dianjin team, employs dual‑level step and trajectory rewards to provide fine‑grained supervision, achieving up to 12.9% accuracy gains in supervised fine‑tuning and 5.1% improvements in Best‑of‑N inference on benchmarks such as CFLUE and FinQA.

CFLUE · Fin-PRM · FinQA

0 likes · 11 min read


DataFunSummit

Sep 10, 2025 · Artificial Intelligence

Claude’s Exit from China: How Domestic AI Models Can Fill the Void

Anthropic’s new policy blocks Chinese‑controlled firms from using Claude and Claude Code, prompting a deep dive into the model’s strengths and exploring fast‑growing domestic AI alternatives—such as Qwen3‑Coder, GLM‑4.5, and others—to understand their capabilities, gaps, and future opportunities for Chinese developers.

AI · Chinese AI · Claude

0 likes · 11 min read


Eric Tech Circle

Sep 10, 2025 · Artificial Intelligence

Deploy High‑Performance Local LLMs with vLLM: A Step‑by‑Step Guide

This article walks through installing and configuring vLLM for local large language model inference, compares it with Ollama and LM Studio, details environment setup, model download, testing scripts, and shows how to expose an OpenAI‑compatible API for production use.

Inference Optimization · Large Language Model · ModelScope

0 likes · 11 min read


AI Product Manager Community

Sep 10, 2025 · Industry Insights

Avoid These 6 Common Pitfalls When Deploying AI Chatbots in Customer Service

Deploying large‑model AI in customer service can boost efficiency, but without proper boundaries, feedback loops, and emotional handling it often creates costly mistakes, brand damage, and poor user experience, as this article explains the six most frequent traps and how to sidestep them.

AI · Best Practices · Chatbot

0 likes · 8 min read


Wuming AI

Sep 6, 2025 · Artificial Intelligence

Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM

The article reviews Alibaba's 1‑trillion‑parameter Qwen3‑Max‑Preview model, comparing its benchmark scores, hallucination rate, math and coding accuracy, and SVG generation quality against Claude, Kimi K2, and DeepSeek, while providing usage links and real‑world user impressions.

AI Benchmark · Large Language Model · Qwen3

0 likes · 4 min read


Kuaishou Tech

Sep 5, 2025 · Artificial Intelligence

How Keye‑VL‑1.5‑8B Sets New Benchmarks in Multimodal AI

Kuaishou has open‑sourced the 8‑billion‑parameter multimodal LLM Keye‑VL‑1.5, which introduces a slow‑fast frame encoding, a progressive four‑stage pre‑training pipeline, and an automated data construction workflow, achieving state‑of‑the‑art results on video and vision‑language benchmarks and surpassing many closed‑source models.

Large Language Model · benchmark performance · multimodal AI

0 likes · 12 min read


Efficient Ops

Sep 2, 2025 · Artificial Intelligence

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

At the 27th GOPS Global Operations Conference in Shanghai (Oct 17‑18, 2025), Professor Wang Peng of Fudan University will reveal how large language models can extract and structure heterogeneous operational data into high‑quality knowledge bases, and how RAG‑driven Q&A enhances fault diagnosis, SOP generation, and automated decision‑making.

Artificial Intelligence · Intelligent Operations · Knowledge Base

0 likes · 3 min read


Efficient Ops

Sep 2, 2025 · Artificial Intelligence

Inside Meituan’s LongCat‑Flash‑Chat: 560B‑Parameter MoE Model with Ultra‑Fast Inference

Meituan has open‑sourced LongCat‑Flash‑Chat, a 560‑billion‑parameter Mixture‑of‑Experts model that activates only a fraction of its weights per token, delivering mainstream‑level performance, high inference speed, and low cost for complex agent applications.

Artificial Intelligence · Inference Optimization · Large Language Model

0 likes · 4 min read


Baobao Algorithm Notes

Sep 2, 2025 · Artificial Intelligence

How LongCat‑Flash Achieves Record Speed and Efficiency for a 560B MoE Model

LongCat‑Flash is a 560‑billion‑parameter Mixture‑of‑Experts LLM that combines a dynamic zero‑computation expert design, shortcut‑connected MoE communication, variance‑aligned scaling, and a three‑stage agent‑centric pre‑training pipeline, delivering over 100 TPS on H800 GPUs at a cost of $0.70 per million tokens.

Artificial Intelligence · Inference Optimization · Large Language Model

0 likes · 23 min read


Java Tech Enthusiast

Sep 1, 2025 · Artificial Intelligence

How Meituan’s LongCat‑Flash‑Chat Beats Top LLMs with Zero‑Computation Experts

LongCat‑Flash‑Chat, Meituan’s newly open‑sourced 560B MoE model, outperforms leading LLMs on agent tool use and instruction following benchmarks, introduces zero‑computation experts and shortcut‑connected MoE for higher throughput, and demonstrates strong programming and reasoning abilities across diverse evaluation tasks.

Large Language Model · Meituan AI · Zero Computation Experts

0 likes · 12 min read


DataFunTalk

Sep 1, 2025 · Artificial Intelligence

Unlocking 560B‑Parameter AI: Inside LongCat‑Flash‑Chat’s Zero‑Computation MoE

LongCat‑Flash‑Chat, a 560‑billion‑parameter Mixture‑of‑Experts model with Zero‑Computation Experts, delivers top‑tier benchmark scores and fast inference while activating only a fraction of its parameters, and is fully open‑sourced with easy deployment scripts.

Artificial Intelligence · Benchmarking · Large Language Model

0 likes · 6 min read


DataFunTalk

Aug 29, 2025 · Artificial Intelligence

Grok Code Fast 1: xAI’s New Coding Model 3× Faster, 6× Cheaper

Elon Musk’s xAI has launched Grok Code Fast 1, a new code‑generation model that claims to be three times faster and six times cheaper than GPT‑5, offering agentic programming capabilities, broad language support, free‑week trials on major IDE platforms, and competitive pricing with high cache hit rates.

AI code model · Large Language Model · agentic programming

0 likes · 6 min read


DataFunTalk

Aug 26, 2025 · Artificial Intelligence

Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide

This resource guide presents a curated list of cutting‑edge topics—including multimodal GraphRAG, knowledge‑graph‑driven large‑model applications in finance, traditional Chinese medicine, automotive manufacturing, and knowledge‑management trends—offering insights into AI‑powered knowledge services, and invites readers to scan the QR code to download the full e‑book.

AI · Knowledge Graph · Large Language Model

0 likes · 2 min read
