Tagged articles
675 articles
Page 3 of 7
Fun with Large Models
Fun with Large Models
Jan 22, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Calling Locally Deployed LLMs via OpenAI API Format in Python

This tutorial explains the OpenAI‑style request and response schema, demonstrates low‑level API calls with the requests library, compares them to the high‑level openai package, and walks through building a streaming multi‑turn chatbot that interacts with a locally hosted large language model.

ChatbotLarge Language ModelOpenAI API
0 likes · 17 min read
Step‑by‑Step Guide to Calling Locally Deployed LLMs via OpenAI API Format in Python
AI Engineering
AI Engineering
Jan 21, 2026 · Artificial Intelligence

Running Large Language Models on Phones: Liquid AI’s LFM2.5‑1.2B‑Thinking Fits in 900 MB

Liquid AI’s LFM2.5‑1.2B‑Thinking model runs entirely on a smartphone with only 900 MB of memory, scores 88 on MATH‑500, 69 on Multi‑IF, and 57 on BFCLv3 benchmarks, outperforms larger rivals, and achieves real‑time speeds on Snapdragon 8 Elite and AMD Ryzen 9 3950X, signaling a shift toward edge AI.

LFM2.5Large Language ModelRyzen
0 likes · 4 min read
Running Large Language Models on Phones: Liquid AI’s LFM2.5‑1.2B‑Thinking Fits in 900 MB
PaperAgent
PaperAgent
Jan 17, 2026 · Artificial Intelligence

How Qwen3‑VL Embedding and Reranker Set New SOTA in Multimodal Retrieval

The article analyzes the Qwen3‑VL‑Embedding and Qwen3‑VL‑Reranker models, detailing their unified vector space, multi‑stage training pipeline, Matryoshka representation learning, quantization techniques, massive synthetic data generation, and benchmark results that push multimodal retrieval performance to a new state‑of‑the‑art.

EmbeddingKnowledge DistillationLarge Language Model
0 likes · 7 min read
How Qwen3‑VL Embedding and Reranker Set New SOTA in Multimodal Retrieval
PaperAgent
PaperAgent
Jan 16, 2026 · Artificial Intelligence

How a 4B Model Beats 30B Giants: Inside AgentCPM-Explore’s SOTA Performance

AgentCPM-Explore, a 4‑billion‑parameter open‑source model, achieves state‑of‑the‑art results on long‑range exploration tasks, matching or surpassing larger 8B and even 30B models, thanks to a full‑stack infrastructure, novel training tricks, and extensive benchmark evaluations across eight agent‑centric datasets.

AgentAgentCPM-ExploreLarge Language Model
0 likes · 10 min read
How a 4B Model Beats 30B Giants: Inside AgentCPM-Explore’s SOTA Performance
PaperAgent
PaperAgent
Jan 13, 2026 · Artificial Intelligence

How C2LLM Redefines Code Retrieval with Attention‑Based Pooling

Introducing C2LLM, a contrastive code LLM series that replaces mean and EOS pooling with a multi‑head attention pooling module, achieving top scores on the MTEB‑Code benchmark across 12 tasks and demonstrating cost‑effective, high‑precision code retrieval for both production and AI agent applications.

Large Language ModelMTEB-CodeRetrieval-Augmented Generation
0 likes · 8 min read
How C2LLM Redefines Code Retrieval with Attention‑Based Pooling
DeWu Technology
DeWu Technology
Jan 12, 2026 · Mobile Development

How We Built an AI‑Powered Smart Inspection System for Mobile Apps

This article details the design and implementation of an AI‑driven smart inspection platform for a mobile app, covering background challenges, system architecture, core detection features—including layout, visual, consistency, and AI‑operation checks—platform configuration, result feedback, and the measurable improvements achieved.

AI inspectionLarge Language ModelSmart Inspection
0 likes · 19 min read
How We Built an AI‑Powered Smart Inspection System for Mobile Apps
PaperAgent
PaperAgent
Jan 10, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: Why Its Coding Power Beats Claude and GPT

DeepSeek's newly announced V4 model, the successor to its December 2024 V3 release, demonstrates superior coding abilities over Claude and GPT series, details its data composition, infrastructure, training costs, failed experimental attempts, expanded benchmark comparisons, and includes a comprehensive safety report.

AI model analysisDeepSeekLarge Language Model
0 likes · 4 min read
DeepSeek V4 Unveiled: Why Its Coding Power Beats Claude and GPT
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Jan 8, 2026 · Artificial Intelligence

Alpha‑R1: Reinforcement‑Learning‑Driven Large‑Model Alpha Factor Selection

Alpha‑R1 integrates reinforcement learning with an 8‑billion‑parameter LLM to jointly process price and news data, creating context‑aware factor embeddings that outperform traditional quantitative and generic LLM baselines on CSI 300 and CSI 1000 portfolios, demonstrating robust alpha‑decay resistance and zero‑sample generalization.

Financial AILarge Language Modelalpha factor selection
0 likes · 16 min read
Alpha‑R1: Reinforcement‑Learning‑Driven Large‑Model Alpha Factor Selection
AI Info Trend
AI Info Trend
Jan 7, 2026 · Artificial Intelligence

MiroThinker 1.5: 30B Model Beats 1T‑Scale LLMs via Interactive Scaling

Released by the MiroMind team, MiroThinker 1.5 demonstrates that a 30‑billion‑parameter model can match or surpass the performance of 1‑trillion‑parameter LLMs by leveraging Interactive Scaling, achieving top rankings on multiple search benchmarks, dramatically lower inference cost, and open‑source availability for developers.

AI benchmarksLarge Language ModelMiroThinker
0 likes · 6 min read
MiroThinker 1.5: 30B Model Beats 1T‑Scale LLMs via Interactive Scaling
Amap Tech
Amap Tech
Dec 29, 2025 · Artificial Intelligence

How G‑Plan Transforms Map Recommendations with AI Agents and Multi‑Demand Planning

This article details how Gaode's G‑Plan combines large‑model AI agents, generative ranking, and spatiotemporal counterfactual DPO to model and prioritize multiple user intents on the home page, presents the system architecture, experimental setup, online gains, and ablation results, and explains how it moves recommendation from passive to proactive planning.

AI recommendationLarge Language Modelintent planning
0 likes · 21 min read
How G‑Plan Transforms Map Recommendations with AI Agents and Multi‑Demand Planning
DataFunTalk
DataFunTalk
Dec 25, 2025 · Artificial Intelligence

How DeepAgent Redefines General AI Reasoning with Scalable Toolsets

DeepAgent, a new end‑to‑end reasoning agent, integrates autonomous thinking, dynamic tool search, and execution to handle over 16,000 APIs, embodied tasks, and research assistance, achieving state‑of‑the‑art performance on benchmarks like TMDB, ToolBench, ALFWorld, WebShop, and GAIA.

Large Language ModelMemory ManagementReasoning
0 likes · 15 min read
How DeepAgent Redefines General AI Reasoning with Scalable Toolsets
Xiaomi Tech
Xiaomi Tech
Dec 24, 2025 · Artificial Intelligence

DeepLight & AgentMat: Xiaomi and SJTU Launch AI Platform for Light Alloy Design

Xiaomi and Shanghai Jiao Tong University introduced DeepLight, an AI‑driven large‑model for lightweight alloys, together with the AgentMat multi‑agent framework that accelerates the full design cycle tenfold, and the LightAlloy‑Bench benchmark where DeepLight outperforms DeepSeek‑V3 and GPT‑4o by about 20 %.

AILarge Language ModelLightweight Alloys
0 likes · 8 min read
DeepLight & AgentMat: Xiaomi and SJTU Launch AI Platform for Light Alloy Design
PaperAgent
PaperAgent
Dec 23, 2025 · Artificial Intelligence

CATArena: A Competitive Benchmark That Turns Agent Scoring into Evolutionary Learning

CATArena introduces a tournament‑style evaluation framework where AI agents iteratively code, compete, and improve across classic board games, using three‑dimensional quantitative scores to measure strategy programming, global learning, and generalization, and reveals how different LLM‑based agents learn and adapt over multiple rounds.

AI BenchmarkAgent EvaluationCATArena
0 likes · 8 min read
CATArena: A Competitive Benchmark That Turns Agent Scoring into Evolutionary Learning
DataFunSummit
DataFunSummit
Dec 20, 2025 · Artificial Intelligence

How AutoHome Built the Cangjie Large Model: From Training Architecture to Real-World AI Applications

This article details AutoHome's end‑to‑end development of the Cangjie large model, covering the training infrastructure with distributed data, pipeline and tensor parallelism, core business use cases such as video script generation and multi‑tool Agent capabilities, inference optimizations through quantization and fast serving frameworks, and future directions for personalized automotive AI services.

Agent AILarge Language ModelVideo Generation
0 likes · 19 min read
How AutoHome Built the Cangjie Large Model: From Training Architecture to Real-World AI Applications
PaperAgent
PaperAgent
Dec 19, 2025 · Artificial Intelligence

Inside Xiaomi’s MiMo‑V2‑Flash: How a Hybrid SWA Design Powers Fast, Efficient AI Reasoning

Xiaomi’s newly open‑sourced MiMo‑V2‑Flash model combines a hybrid sliding‑window/attention architecture with a 309B‑parameter MoE design, delivering top‑tier reasoning, coding and agent performance while introducing the efficient MOPD post‑training paradigm that dramatically reduces RL compute costs.

Hybrid SWALarge Language ModelMOPD
0 likes · 5 min read
Inside Xiaomi’s MiMo‑V2‑Flash: How a Hybrid SWA Design Powers Fast, Efficient AI Reasoning
AI Insight Log
AI Insight Log
Dec 18, 2025 · Artificial Intelligence

Xiaomi’s New MiMo‑V2‑Flash LLM Rivals DeepSeek‑V3.2 and Near‑GPT‑5 High

Xiaomi’s MiMo‑V2‑Flash, a 309B‑parameter MoE LLM with only 15B active weights, uses Hybrid SWA, Multi‑Token Prediction and Multi‑Teacher On‑Policy Distillation to cut KV‑cache by six times, boost inference speed 2.6×, and achieve performance comparable to DeepSeek‑V3.2, Kimi‑K2 and near‑GPT‑5 High, including a 73.4% SWE‑Bench code‑agent score.

Hybrid SWALarge Language ModelMOPD
0 likes · 7 min read
Xiaomi’s New MiMo‑V2‑Flash LLM Rivals DeepSeek‑V3.2 and Near‑GPT‑5 High
AI Insight Log
AI Insight Log
Dec 17, 2025 · Artificial Intelligence

Google Unveils Gemini 3 Flash: Free, Lightning‑Fast, and Outperforms Its Predecessor

Google released Gemini 3 Flash without warning, offering Pro‑level intelligence at Flash‑speed, costing just $0.5 per million input tokens and $3 per million output tokens, delivering three‑times faster inference than Gemini 2.5 Pro and surpassing it on benchmarks such as GPQA Diamond (90.4%), SWE‑bench (78.0%) and MMMU‑Pro (81.2%), while being freely accessible to all users and developers via the Gemini app, AI Studio, or API.

Gemini 3 FlashGoogle AILarge Language Model
0 likes · 5 min read
Google Unveils Gemini 3 Flash: Free, Lightning‑Fast, and Outperforms Its Predecessor
DataFunTalk
DataFunTalk
Dec 17, 2025 · Artificial Intelligence

How Large Language Models Unlock Field‑Level Data Lineage at Scale

This talk explains how a data platform tackled massive, heterogeneous enterprise data by using large language models and prompt engineering to automatically extract field‑level lineage from SQL scripts, achieve over 80% coverage, and raise accuracy above 95%, dramatically cutting impact‑analysis time.

AI for data engineeringBig DataData Lineage
0 likes · 6 min read
How Large Language Models Unlock Field‑Level Data Lineage at Scale
Design Hub
Design Hub
Dec 12, 2025 · Artificial Intelligence

GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work

OpenAI's newly released GPT-5.2 claims to outperform human experts on about 70% of real tasks, achieve a perfect score on the AIME 2025 competition, and deliver dramatic efficiency gains—up to 390× cost reduction—while showcasing impressive examples such as one‑shot ocean shader generation, a full 3D engine built in a single file, and visual‑perception scores rivaling top models.

AI benchmarksAgent AIEfficiency
0 likes · 8 min read
GPT-5.2 Unveiled: A Cutting-Edge AI Super-Assistant Built for Real-World Work
AI Insight Log
AI Insight Log
Dec 11, 2025 · Artificial Intelligence

GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro

OpenAI’s GPT‑5.2 launch introduces three specialized modes, achieves a record 55.6% score on SWE‑Bench Pro, demonstrates strong front‑end generation, adds a /compact API for long‑context efficiency, offers tiered pricing with cache discounts, and improves safety for younger users.

AI benchmarkingAI safetyGPT-5.2
0 likes · 6 min read
GPT-5.2 Released: How It Outperforms Claude 4.5 and Gemini 3 Pro
Data Party THU
Data Party THU
Dec 10, 2025 · Artificial Intelligence

How DeepSeek‑V3.2 Cuts Inference Cost and Boosts Agent Skills with Sparse Attention

DeepSeek's V3.2 release introduces a dual‑model lineup, a Sparse Attention architecture that halves long‑context inference cost, a post‑training reinforcement‑learning pipeline that exceeds 10% of pre‑training compute, and a revamped agent framework that dramatically improves tool‑use and reasoning performance across benchmarks.

DeepSeekLarge Language ModelSparse Attention
0 likes · 11 min read
How DeepSeek‑V3.2 Cuts Inference Cost and Boosts Agent Skills with Sparse Attention
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 9, 2025 · Artificial Intelligence

Building Human‑in‑the‑Loop Agent Workflows with MCP on OpenLM

This article explains how to design and implement Human‑in‑the‑Loop (HITL) interactions for large‑model agents on Alibaba's OpenLM platform, covering the challenges of server‑side execution, MCP transport extensions, tool‑calling patterns, timeout handling, and UI rendering strategies across multiple client devices.

AgentHuman-in-the-LoopLarge Language Model
0 likes · 39 min read
Building Human‑in‑the‑Loop Agent Workflows with MCP on OpenLM
Amap Tech
Amap Tech
Dec 3, 2025 · Artificial Intelligence

How Gaode’s G‑Action Uses Generative AI to Predict Users’ Next Move

Gaode’s G‑Action framework combines large‑language‑model pre‑training with fine‑tuned generative recommendation to predict a user’s immediate action and destination, transforming static map services into a dynamic, context‑aware experience and delivering measurable gains in click‑through and engagement metrics.

AILarge Language ModelMap Services
0 likes · 15 min read
How Gaode’s G‑Action Uses Generative AI to Predict Users’ Next Move
DataFunTalk
DataFunTalk
Dec 2, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

This article reviews three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's GRAB generative ranking model—detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI SearchAI agentsGenerative Ranking
0 likes · 8 min read
How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search
Frontend AI Walk
Frontend AI Walk
Dec 2, 2025 · Artificial Intelligence

Understanding LLMs: A Frontend Developer’s Primer on Large Language Models

The article demystifies large language models for frontend developers by likening token prediction to autocomplete, explaining tokens, context windows, temperature, the two-stage training process, and the critical role of prompts, using concrete code examples and analogies to familiar frontend concepts.

Frontend AnalogyLLMLarge Language Model
0 likes · 10 min read
Understanding LLMs: A Frontend Developer’s Primer on Large Language Models
Wuming AI
Wuming AI
Nov 30, 2025 · Artificial Intelligence

What Exactly Is a Large Language Model? A Simple Guide to AI, Transformers, and How They Work

This article explains the relationship between AI, machine learning, deep learning, and large language models, detailing their evolution, training stages, transformer architecture, attention mechanisms, inference APIs, and practical usage examples, while demystifying common misconceptions about LLM capabilities.

AI fundamentalsLarge Language ModelMachine Learning
0 likes · 10 min read
What Exactly Is a Large Language Model? A Simple Guide to AI, Transformers, and How They Work
Kuaishou Tech
Kuaishou Tech
Nov 28, 2025 · Artificial Intelligence

Keye-VL-671B-A37B Leads Vision, Video, and Math Benchmarks

Kwai has open‑sourced its new flagship multimodal model Keye‑VL‑671B‑A37B, which upgrades visual perception, cross‑modal alignment and complex reasoning, achieving top scores on image, video, and mathematical reasoning benchmarks while detailing its architecture, three‑stage pre‑training, post‑training strategies, and future multimodal agent plans.

Large Language ModelMultimodalOpen Source
0 likes · 10 min read
Keye-VL-671B-A37B Leads Vision, Video, and Math Benchmarks
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Nov 28, 2025 · Artificial Intelligence

Boosting 5G Complaint Intent Detection with Large-Model-Enhanced Few-Shot Learning

This paper presents a collaborative framework where a large language model generates high‑quality synthetic samples to augment a lightweight model, dramatically improving few‑shot user‑complaint intent recognition in 5G networks, achieving a 21% boost for rare categories and a 9% overall accuracy gain.

Knowledge DistillationLarge Language Modelcomplaint intent detection
0 likes · 27 min read
Boosting 5G Complaint Intent Detection with Large-Model-Enhanced Few-Shot Learning
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 27, 2025 · Artificial Intelligence

How AI Powers Ethnic Product Categorization for Global E‑Commerce

This article presents an end‑to‑end AI solution that builds a cultural knowledge base and leverages large language models to automatically identify and match ethnic‑specific product categories on a cross‑border e‑commerce platform, reducing mis‑matches from 8.4% to 1.8% and cutting iteration time from days to under one day.

AIKnowledge BaseLarge Language Model
0 likes · 19 min read
How AI Powers Ethnic Product Categorization for Global E‑Commerce
DataFunSummit
DataFunSummit
Nov 20, 2025 · Artificial Intelligence

How 1688 Reinvented E‑commerce Search with AI‑Powered Generative Retrieval

This article details Alibaba’s 1688 platform’s shift from traditional e‑commerce search to AI‑driven generative retrieval, covering the AI Deep Search 1.0 and 2.0 cascaded frameworks, multimodal capabilities, an end‑to‑end “model‑as‑search‑engine” approach, experimental results, challenges, and future directions.

AIE-commerce SearchGenerative Retrieval
0 likes · 18 min read
How 1688 Reinvented E‑commerce Search with AI‑Powered Generative Retrieval
HyperAI Super Neural
HyperAI Super Neural
Nov 20, 2025 · Artificial Intelligence

From 9,874 Papers to 15,000 Structures: MOF‑ChemUnity Rebuilds MOF Knowledge for Explainable AI

MOF‑ChemUnity constructs a scalable, extensible knowledge graph that links millions of MOF names and synonyms to over 15,000 crystal structures using LLM‑driven entity matching, enabling accurate, explainable AI‑assisted material discovery, water‑stability prediction, expert recommendation validation, and graph‑enhanced retrieval across diverse applications.

Graph RAGKnowledge GraphLarge Language Model
0 likes · 17 min read
From 9,874 Papers to 15,000 Structures: MOF‑ChemUnity Rebuilds MOF Knowledge for Explainable AI
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 19, 2025 · Artificial Intelligence

Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation

This article details a practical case study of designing, implementing, and evaluating an AI-driven proofreading agent for a media client, covering background challenges, a three‑layer architecture, prompt engineering techniques, RAG knowledge‑base construction, model selection, fine‑tuning, automated metrics, and lessons learned.

AILarge Language ModelProofreading
0 likes · 26 min read
Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation
Wuming AI
Wuming AI
Nov 10, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? A Clear, Practical Guide

This article explains the concept of AI agents, contrasting them with chatbots, detailing their ability and structural layers, summarizing academic surveys and whitepapers, and illustrating how agents plan, perceive, and act to autonomously accomplish user‑defined goals.

AI AgentAgent ArchitectureAutonomous Planning
0 likes · 9 min read
What Exactly Is an AI Agent? A Clear, Practical Guide
JD Tech Talk
JD Tech Talk
Nov 10, 2025 · Artificial Intelligence

Designing an AI-Powered Experiment Analysis Agent: Architecture, Workflow, and Future Enhancements

This article outlines the motivation, design, architecture, engineering implementation, large‑model selection, and future improvement plans for an AI‑driven experiment analysis agent that integrates data aggregation, modular workflow orchestration, and interactive frontend features to streamline AB‑test insights.

AI AgentFrontend DevelopmentLarge Language Model
0 likes · 14 min read
Designing an AI-Powered Experiment Analysis Agent: Architecture, Workflow, and Future Enhancements
Efficient Ops
Efficient Ops
Nov 9, 2025 · Operations

How Tencent’s PCG Achieves Full‑Link Observability and AI‑Powered SRE

The talk details Tencent PCG’s end‑to‑end observability platform, its data‑standardization pipeline, client‑backend session linking, AI‑enhanced SRE Agent with large language models, and the roadmap toward a SaaS offering, illustrating how modern operations integrate AI for rapid fault localization.

AILarge Language ModelObservability
0 likes · 17 min read
How Tencent’s PCG Achieves Full‑Link Observability and AI‑Powered SRE
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Nov 7, 2025 · Artificial Intelligence

Weekly AI Finance Paper Digest (Nov 1‑7 2025)

This digest summarizes three recent AI‑driven finance papers—DeltaLag’s dynamic lead‑lag detection, MS‑HGFN’s multi‑scale graph network for stock movement, and LiveTradeBench’s real‑time LLM trading benchmark—highlighting their methods, datasets, and performance gains.

Financial AIGraph Neural NetworkLarge Language Model
0 likes · 8 min read
Weekly AI Finance Paper Digest (Nov 1‑7 2025)
Tencent Advertising Technology
Tencent Advertising Technology
Nov 6, 2025 · Artificial Intelligence

Boosting Web UI Test Efficiency with AIGC: From Manual Scripts to Intelligent Automation

This report examines the challenges of Web UI testing in Tencent's advertising platform, analyzes current inefficiencies, and presents an AIGC-driven solution that leverages large language models, semantic scripts, and automated pipelines to dramatically improve test case generation, execution accuracy, and CI/CD integration.

AIGCAutomationLarge Language Model
0 likes · 27 min read
Boosting Web UI Test Efficiency with AIGC: From Manual Scripts to Intelligent Automation
Amap Tech
Amap Tech
Nov 4, 2025 · Artificial Intelligence

Spacetime‑GR: AI‑Powered Spatiotemporal Model Transforming POI Recommendations

This article introduces Spacetime‑GR, a large‑scale generative recommendation model that integrates hierarchical geographic POI indexing and spatiotemporal token encoding to enhance POI prediction for Amap, detailing its pre‑training pipeline, data cleaning, curriculum learning strategy, experimental results, scaling law observations, and the resulting improvements in hit rate and discovery rate.

AmapCurriculum LearningLarge Language Model
0 likes · 14 min read
Spacetime‑GR: AI‑Powered Spatiotemporal Model Transforming POI Recommendations
21CTO
21CTO
Nov 4, 2025 · Artificial Intelligence

LongCat-Flash-Omni: How an Open-Source 560B Model Achieves Real-Time Multimodal Mastery

LongCat-Flash-Omni, an open‑source 560 billion‑parameter multimodal model, combines efficient Shortcut‑Connected MoE architecture with advanced perception and speech modules to deliver low‑latency real‑time audio‑video interaction and state‑of‑the‑art performance across text, image, video, and audio tasks.

Efficient InferenceLarge Language ModelReal-Time Interaction
0 likes · 10 min read
LongCat-Flash-Omni: How an Open-Source 560B Model Achieves Real-Time Multimodal Mastery
Meituan Technology Team
Meituan Technology Team
Nov 3, 2025 · Artificial Intelligence

LongCat-Flash-Omni: 560B Open‑Source Multimodal Model with Real‑Time Interaction

LongCat-Flash-Omni, the latest open‑source model from Meituan, combines a 560 billion‑parameter architecture, efficient multimodal perception and speech reconstruction modules, and a progressive training strategy to deliver real‑time audio‑video interaction and state‑of‑the‑art performance across text, image, audio, and video tasks.

AILarge Language ModelMultimodal
0 likes · 9 min read
LongCat-Flash-Omni: 560B Open‑Source Multimodal Model with Real‑Time Interaction
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Nov 3, 2025 · Artificial Intelligence

How AI Agents Are Revolutionizing Technology: The New Engine of Innovation

This article explores the rise of AI agents—from their definition as intelligent digital assistants powered by large language models to their evolution through planning, memory, and tool use—highlighting real‑world applications, core technical mechanisms, code implementations, and future trends such as autonomy, multimodal fusion, standardization, and safety considerations.

AI AgentLarge Language ModelMultimodal
0 likes · 24 min read
How AI Agents Are Revolutionizing Technology: The New Engine of Innovation
Data Party THU
Data Party THU
Nov 2, 2025 · Artificial Intelligence

From RNN to LLM: How Transformers Power Modern Language Models

This article explains the evolution from RNNs through Encoder‑Decoder models to Transformers, detailing self‑attention, multi‑head attention, and masked attention, and then describes what Large Language Models are, their key components, capabilities, limitations, and common applications.

AILLMLarge Language Model
0 likes · 9 min read
From RNN to LLM: How Transformers Power Modern Language Models
DataFunSummit
DataFunSummit
Oct 31, 2025 · Artificial Intelligence

How OPPO’s AndesVL Is Revolutionizing On‑Device Multimodal AI

OPPO AI Center introduces AndesVL, an open‑source, fully‑adapted multimodal large model ranging from 0.6B to 4B parameters, designed for high‑performance, privacy‑preserving, low‑latency AI on mobile devices, with advanced architecture, training pipelines, on‑device optimizations, and state‑of‑the‑art benchmark results.

Large Language ModelModel Compressionmobile AI
0 likes · 21 min read
How OPPO’s AndesVL Is Revolutionizing On‑Device Multimodal AI
DataFunSummit
DataFunSummit
Oct 30, 2025 · Artificial Intelligence

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

This article explores how the explosion of unstructured data exposes the limits of traditional OCR and shows how emerging multimodal large language models provide end‑to‑end document understanding, reduce pipeline complexity, cut training costs, enable hybrid retrieval‑augmented generation, and drive real‑world industry deployments.

AIDocument ProcessingLarge Language Model
0 likes · 28 min read
How Multimodal Large Models Are Revolutionizing Document Processing and OCR
DataFunSummit
DataFunSummit
Oct 30, 2025 · Artificial Intelligence

Bilibili’s AI Assistant: Using Large Language Models to Tackle Big Data Ops

This article explains how Bilibili’s massive video platform built a five‑layer, storage‑compute separated big‑data infrastructure and employed a large language model‑driven intelligent assistant to automatically diagnose and resolve frequent offline task failures and slowdowns, addressing common user queries about task reliability and performance.

Intelligent AssistantLarge Language Modelbig data platform
0 likes · 4 min read
Bilibili’s AI Assistant: Using Large Language Models to Tackle Big Data Ops
Zhuanzhuan Tech
Zhuanzhuan Tech
Oct 29, 2025 · Artificial Intelligence

How Reinforcement Learning Boosts Stability and Speed in LLM QA Systems

This article examines how reinforcement‑learning techniques such as PPO, DPO, and GRPO are integrated into the Baixiaosheng QA system to improve answer stability, deepen domain knowledge understanding, and accelerate response generation, and it evaluates the impact of Reinforcement Fine‑Tuning (RFT) on real‑world performance.

AIDPOGRPO
0 likes · 16 min read
How Reinforcement Learning Boosts Stability and Speed in LLM QA Systems
AntTech
AntTech
Oct 29, 2025 · Artificial Intelligence

Inside Ant’s Baoling: Balancing Efficiency and Reasoning in a 1‑Trillion‑Parameter Model

At the Ant Star Innovation Journey event, the Baoling team unveiled their roadmap for trillion‑parameter models, detailing the development of Ling‑1T, Ring‑1T and multimodal Ming series, the scaling‑law‑guided architecture, training innovations, evaluation methods, and open‑source releases that aim to advance efficient, high‑performance AI.

Efficient InferenceLarge Language ModelScaling Law
0 likes · 24 min read
Inside Ant’s Baoling: Balancing Efficiency and Reasoning in a 1‑Trillion‑Parameter Model
AntTech
AntTech
Oct 28, 2025 · Artificial Intelligence

Ming-Flash-Omni-Preview: 103B Open-Source Multimodal Model Excelling in Image, Video, and Speech

Introducing Ming‑Flash‑Omni‑Preview, a 103‑billion‑parameter open‑source multimodal model built on a sparse MoE architecture that delivers state‑of‑the‑art performance in controllable image generation, streaming video understanding, and context‑aware speech recognition, surpassing prior models on GenEval and GEdit benchmarks.

Large Language ModelMultimodalSparse MoE
0 likes · 8 min read
Ming-Flash-Omni-Preview: 103B Open-Source Multimodal Model Excelling in Image, Video, and Speech
DataFunTalk
DataFunTalk
Oct 28, 2025 · Artificial Intelligence

How Bilibili Uses Large Language Models to Solve Big Data Platform Issues

This article explains Bilibili's massive data platform architecture, the common offline‑task failures and slowdowns users encounter, and how the company applies a large‑language‑model‑driven intelligent assistant to diagnose and resolve these engineering problems efficiently.

AI assistanceBilibiliLarge Language Model
0 likes · 4 min read
How Bilibili Uses Large Language Models to Solve Big Data Platform Issues
Amap Tech
Amap Tech
Oct 27, 2025 · Artificial Intelligence

Turning Maps into a Living Map: Amap’s G-Where Generative AI Recommendation

Amap upgrades its homepage recommendation by integrating large‑model capabilities—G‑Where, G‑Action, and G‑Plan—through semantic ID generation, item tokenization, and multi‑stage LLM training, achieving significant offline and online performance gains while illustrating a scalable generative recommendation framework.

AIGenerative RecommendationLarge Language Model
0 likes · 21 min read
Turning Maps into a Living Map: Amap’s G-Where Generative AI Recommendation
DataFunTalk
DataFunTalk
Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agentsLarge Language ModelRAG
0 likes · 4 min read
How Tencent Leverages RAG and Agents to Supercharge Large Language Models
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Oct 23, 2025 · Artificial Intelligence

Why the Transformer Core Structure Is the Key to AI Interview Success

This article explains the fundamental purpose, architecture, and variants of the Transformer model—including Encoder‑Decoder, Encoder‑only, and Decoder‑only designs—while detailing how attention mechanisms work and why modern large‑language models favor the Decoder‑only approach, providing a concise framework for answering interview questions.

AI InterviewEncoder-DecoderLarge Language Model
0 likes · 10 min read
Why the Transformer Core Structure Is the Key to AI Interview Success
Data Party THU
Data Party THU
Oct 22, 2025 · Artificial Intelligence

Demystifying Large‑Model Reinforcement Learning: From MDP Basics to Bellman and Advantage Functions

This article provides a comprehensive introduction to reinforcement learning for large language models, covering the Markov Decision Process formulation, the four core elements of RL, state‑value and action‑value functions, Bellman equations, and the advantage function that underpins modern policy‑gradient algorithms.

AI fundamentalsBellman equationLarge Language Model
0 likes · 13 min read
Demystifying Large‑Model Reinforcement Learning: From MDP Basics to Bellman and Advantage Functions
IT Services Circle
IT Services Circle
Oct 20, 2025 · Artificial Intelligence

How NanoChat Lets Anyone Train a ChatGPT‑Like Model for $100

NanoChat, an open‑source full‑stack AI model solution created by Andrej Karpathy, enables users to train a functional chat model on a modest $100 cloud GPU rental, offering a low‑cost, hands‑on alternative to proprietary large‑language‑model services.

AI trainingLarge Language Modelcost-effective
0 likes · 4 min read
How NanoChat Lets Anyone Train a ChatGPT‑Like Model for $100
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Oct 15, 2025 · Artificial Intelligence

NanoChat Source Code Deep Dive: Karpathy’s Full‑Stack LLM Pipeline Explained

This article dissects NanoChat’s end‑to‑end LLM pipeline—from a lightweight 561M‑parameter transformer and custom Rust BPE tokenizer to Chinchilla‑scaled training, multi‑task fine‑tuning, optional RL on GSM8K, KV‑cache inference optimizations, and benchmark results that slightly surpass GPT‑2 Large.

CORE benchmarkChinchilla scalingFastAPI
0 likes · 10 min read
NanoChat Source Code Deep Dive: Karpathy’s Full‑Stack LLM Pipeline Explained
AntTech
AntTech
Oct 14, 2025 · Artificial Intelligence

How Ring-1T Achieves Trillion-Scale Deep Thinking and Competitive Benchmarks

The Ring-1T model, a trillion-parameter AI system released as open source, leverages advanced reinforcement learning techniques, extensive benchmark evaluations, and custom training frameworks to deliver balanced performance across math, code, reasoning, and creative tasks while highlighting current limitations and future development plans.

AI modelLarge Language Modelbenchmark evaluation
0 likes · 8 min read
How Ring-1T Achieves Trillion-Scale Deep Thinking and Competitive Benchmarks
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Oct 13, 2025 · Artificial Intelligence

How Large‑and‑Small Language Model Collaboration Is Shaping the Future

The article argues that combining large, high‑capacity models with lightweight, fine‑tuned small models can cut costs, lower latency, enable specialized vertical tasks, and shift development from chasing ever‑bigger models toward optimal system architectures, outlining key techniques such as state‑space models, knowledge distillation, and staged fine‑tuning.

AI ArchitectureEfficiencyKnowledge Distillation
0 likes · 3 min read
How Large‑and‑Small Language Model Collaboration Is Shaping the Future
DataFunTalk
DataFunTalk
Oct 13, 2025 · Artificial Intelligence

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while explaining the underlying technologies of Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems.

AI applicationsAgentLarge Language Model
0 likes · 4 min read
How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Oct 11, 2025 · Artificial Intelligence

Recent Advances in Multivariate Time Series Forecasting: Paper Summaries (Sep 27 – Oct 10 2025)

This article summarizes eight newly released AI papers on multivariate time‑series forecasting and anomaly detection, detailing each work's motivation, proposed methodology, key innovations such as CRIB, TS‑JEPA, DSAT‑HD, DIMIGNN, ASTGI, IndexNet, TsLLM, Moon, TimeSeriesScientist, MLG‑4TS, and Augur, and reports their experimental validation on real‑world datasets.

Anomaly DetectionLarge Language ModelTransformer
0 likes · 23 min read
Recent Advances in Multivariate Time Series Forecasting: Paper Summaries (Sep 27 – Oct 10 2025)
DataFunSummit
DataFunSummit
Oct 10, 2025 · Artificial Intelligence

How Ping An Life Built ChatBI: An AI‑Powered Intelligent BI Platform

This article details Ping An Life's self‑developed large‑model reporting product ChatBI, covering its background, goals, solution architecture, technical stack, real‑world use cases, deployment challenges, and future outlook, offering practical insights for enterprises adopting AI‑driven business intelligence.

AIChatbotData Platform
0 likes · 17 min read
How Ping An Life Built ChatBI: An AI‑Powered Intelligent BI Platform
DataFunTalk
DataFunTalk
Oct 9, 2025 · Artificial Intelligence

How Bilibili Uses Large Language Models to Solve Big Data Task Failures

This article explains Bilibili's massive data platform architecture, the common reasons offline tasks fail or slow down, and how the company is exploring large‑language‑model‑driven assistants to automatically diagnose and resolve these engineering issues.

AI assistanceBilibiliLarge Language Model
0 likes · 4 min read
How Bilibili Uses Large Language Models to Solve Big Data Task Failures
HyperAI Super Neural
HyperAI Super Neural
Oct 8, 2025 · Artificial Intelligence

From WeChat’s AI Podcast Trial to Google, ByteDance and Xiaohongshu: Can AI Podcasts Capture the Emerging AIGC Blue Ocean?

The article examines how breakthroughs in large language models and high‑fidelity TTS are powering AI‑generated podcasts, analyzes the technical advances behind the "human‑like" sound, surveys major players such as Google, ByteDance, Xiaohongshu and startups, and evaluates the market potential of this rapidly expanding AIGC niche.

AI podcastAIGCByteDance
0 likes · 9 min read
From WeChat’s AI Podcast Trial to Google, ByteDance and Xiaohongshu: Can AI Podcasts Capture the Emerging AIGC Blue Ocean?
DataFunSummit
DataFunSummit
Oct 7, 2025 · Artificial Intelligence

Bilibili’s AI‑Powered Assistant: Solving Big Data Task Failures with LLMs

This article details Bilibili's implementation of a large‑language‑model‑driven intelligent assistant that helps engineers diagnose and resolve massive offline and real‑time data‑processing failures, describing the platform’s five‑layer architecture, common failure and slowdown causes, and the need for AI‑powered troubleshooting support.

BilibiliIntelligent AssistantLarge Language Model
0 likes · 4 min read
Bilibili’s AI‑Powered Assistant: Solving Big Data Task Failures with LLMs
DataFunSummit
DataFunSummit
Oct 6, 2025 · Artificial Intelligence

How Bilibili Leverages Large Language Models to Solve Big Data Platform Failures

This article explains Bilibili's massive video platform data architecture, the huge daily workload of offline and real‑time tasks, common user problems like task failures and slowdowns, their root causes, and how a large language model assistant is being used to automate troubleshooting.

AI assistanceBilibiliLarge Language Model
0 likes · 4 min read
How Bilibili Leverages Large Language Models to Solve Big Data Platform Failures
Fun with Large Models
Fun with Large Models
Sep 30, 2025 · Artificial Intelligence

DeepSeek-V3.2 Architecture Breakthrough: A 5‑Minute Guide to Its Core Features

The article introduces DeepSeek-V3.2, highlighting its new DeepSeek Sparse Attention (DSA) that boosts training and inference efficiency by up to 50%, cuts model usage costs dramatically, explains the updated API endpoints, and details the four‑stage post‑training pipeline that underpins the model’s performance improvements.

AI ArchitectureDSADeepSeek-V3.2
0 likes · 8 min read
DeepSeek-V3.2 Architecture Breakthrough: A 5‑Minute Guide to Its Core Features
HyperAI Super Neural
HyperAI Super Neural
Sep 30, 2025 · Artificial Intelligence

SpikingBrain-1.0 Achieves 100× Faster Inference with Brain‑Inspired Spiking Architecture

SpikingBrain-1.0, the first domestically‑produced brain‑inspired spiking large model, links spiking neuron dynamics to linear attention, delivering over 100× faster first‑token latency on 4‑million‑token sequences, 23.4% FLOP utilization, 69% sparsity, and a one‑click deployment tutorial on HyperAI.

Large Language ModelSpikingBrain-1.0brain-inspired AI
0 likes · 7 min read
SpikingBrain-1.0 Achieves 100× Faster Inference with Brain‑Inspired Spiking Architecture
Alipay Experience Technology
Alipay Experience Technology
Sep 29, 2025 · Artificial Intelligence

How a Constraint-Aware Multi-Agent AI Won the IJCAI‑2025 Travel Planning Challenge

Alipay’s AI research team, together with Ant Group and East China Normal University, leveraged a self‑developed large‑model‑plus‑optimization framework to create a constraint‑aware multi‑agent system that won both the Original OS Track and DSL Track at the IJCAI‑2025 Autonomous Travel Itinerary Planning Competition.

AILarge Language ModelOptimization
0 likes · 8 min read
How a Constraint-Aware Multi-Agent AI Won the IJCAI‑2025 Travel Planning Challenge
DataFunTalk
DataFunTalk
Sep 28, 2025 · Artificial Intelligence

How Bilibili Leverages Large Language Models to Automate Big Data Operations

This article explores Bilibili’s implementation of a large‑language‑model‑driven intelligent assistant that helps troubleshoot massive offline and real‑time data processing tasks, detailing the platform’s five‑layer architecture, common failure causes, and how AI can streamline issue resolution.

AI OperationsIntelligent AssistantLarge Language Model
0 likes · 4 min read
How Bilibili Leverages Large Language Models to Automate Big Data Operations
DataFunTalk
DataFunTalk
Sep 27, 2025 · Artificial Intelligence

Bilibili’s AI Assistant: Using Large Language Models to Tackle Massive Data Tasks

This article explains how Bilibili leverages a large‑language‑model‑based intelligent agent to diagnose and resolve failures and slowdowns in its massive big‑data platform, detailing the platform architecture, workload scale, common user issues, and the need for automated assistance.

AI OperationsBilibiliIntelligent Assistant
0 likes · 5 min read
Bilibili’s AI Assistant: Using Large Language Models to Tackle Massive Data Tasks
Data Party THU
Data Party THU
Sep 26, 2025 · Artificial Intelligence

How Keye‑VL‑1.5 Redefines Video Understanding with Slow‑Fast Encoding

Keye‑VL‑1.5, an 8‑billion‑parameter multimodal large language model, introduces a Slow‑Fast video encoding strategy, a four‑stage progressive pre‑training pipeline with 128K context, and a sophisticated post‑training regime that together achieve state‑of‑the‑art performance on video and vision‑language benchmarks while maintaining strong general capabilities.

Large Language ModelMultimodal LLMbenchmark
0 likes · 21 min read
How Keye‑VL‑1.5 Redefines Video Understanding with Slow‑Fast Encoding
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 25, 2025 · Artificial Intelligence

Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide

This article explains the opportunities and challenges of Mixture of Experts (MoE) models, introduces expert parallelism as a solution to scaling and deployment bottlenecks, and provides a step‑by‑step guide for deploying MoE models with Alibaba Cloud PAI‑EAS, including configuration tips and code examples.

AI model deploymentExpert ParallelismLarge Language Model
0 likes · 11 min read
Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide
DataFunSummit
DataFunSummit
Sep 25, 2025 · Artificial Intelligence

Can NL→MQL→SQL Bridge the Gap to End‑to‑End Intelligent BI?

Aloudata Agent introduces a novel NL→MQL→SQL framework that combines large language models with a custom metric query language, enabling business users to perform end‑to‑end intelligent data analysis, attribution, and reporting without technical expertise, while balancing accuracy, cost, and performance.

Data AnalysisIntelligent BILarge Language Model
0 likes · 18 min read
Can NL→MQL→SQL Bridge the Gap to End‑to‑End Intelligent BI?
Fighter's World
Fighter's World
Sep 24, 2025 · Artificial Intelligence

Aivis: Pioneering Autonomous Agents for Alibaba Cloud’s Next‑Gen Intelligent Services

The talk outlines how Alibaba Cloud’s Aivis autonomous service agent tackles the “impossible triangle” of ultra‑high experience, low cost, and complex services by evolving from tool‑based chatbots to teammate‑level agents, detailing a four‑layer architecture, domain‑model training, and actionable steps for enterprise AI service transformation.

AI AgentAgent ArchitectureCloud Service
0 likes · 14 min read
Aivis: Pioneering Autonomous Agents for Alibaba Cloud’s Next‑Gen Intelligent Services
AIWalker
AIWalker
Sep 23, 2025 · Artificial Intelligence

Manzano: A Small 3B Multimodal Model That Unifies Image Understanding and Generation with SOTA Performance

Manzano introduces a hybrid vision tokenizer and a three‑stage training recipe that let a 3‑billion‑parameter multimodal LLM achieve state‑of‑the‑art results on both image‑understanding benchmarks and text‑to‑image generation, while scaling smoothly to larger sizes and minimizing task conflict.

AI researchLarge Language ModelManzano
0 likes · 25 min read
Manzano: A Small 3B Multimodal Model That Unifies Image Understanding and Generation with SOTA Performance
Meituan Technology Team
Meituan Technology Team
Sep 22, 2025 · Artificial Intelligence

LongCat-Flash-Thinking: The New SOTA Open-Source LLM for Deep Reasoning and Tool Use

Meituan’s LongCat team unveiled LongCat-Flash-Thinking, an open‑source large language model that combines deep logical reasoning with tool‑calling capabilities, achieving state‑of‑the‑art performance across logic, mathematics, code, and agentic tasks, and introducing novel training frameworks such as domain‑parallel RL and DORA.

AILarge Language ModelReasoning
0 likes · 7 min read
LongCat-Flash-Thinking: The New SOTA Open-Source LLM for Deep Reasoning and Tool Use
Data Party THU
Data Party THU
Sep 20, 2025 · Artificial Intelligence

How DeepSeek Trained a $30M LLM for Just $29.4K – Inside the R1 Model

The article reports that DeepSeek’s R1 large language model, detailed in a peer‑reviewed Nature paper, was built with roughly $300 k in total cost—about $29.4 k for training—using Nvidia H800 chips and novel pure reinforcement‑learning techniques, achieving competitive performance while remaining open‑source.

DeepSeekLarge Language ModelNvidia H800
0 likes · 9 min read
How DeepSeek Trained a $30M LLM for Just $29.4K – Inside the R1 Model
HyperAI Super Neural
HyperAI Super Neural
Sep 18, 2025 · Artificial Intelligence

DeepSeek‑R1 Costs $294K to Train, Hits Nature Cover as First Peer‑Reviewed Large Model

DeepSeek‑R1, the first mainstream large language model to pass peer review in Nature, was trained for $294,000 using 648 H800 GPUs, and its RL‑enhanced version, DeepSeek‑R1‑Zero, achieved up to 86.7% pass@1 on AIME 2024, outperforming human averages across math, coding, and science tasks.

AI researchDeepSeek-R1Large Language Model
0 likes · 10 min read
DeepSeek‑R1 Costs $294K to Train, Hits Nature Cover as First Peer‑Reviewed Large Model
DataFunTalk
DataFunTalk
Sep 18, 2025 · Artificial Intelligence

How DeepSeek‑R1’s Reinforcement Learning Earned a Nature Cover

DeepSeek‑R1, the first peer‑reviewed large language model, leveraged a pure reinforcement‑learning framework and the novel GRPO algorithm to achieve breakthrough reasoning performance, low training cost, and widespread acclaim, culminating in a Nature magazine cover story.

AI reasoningDeepSeekGRPO
0 likes · 14 min read
How DeepSeek‑R1’s Reinforcement Learning Earned a Nature Cover
DataFunSummit
DataFunSummit
Sep 17, 2025 · Artificial Intelligence

How Tencent’s Large Language Model Powers Real-World AI Applications

This article explores Tencent’s large language model across diverse business scenarios—content generation, intelligent customer service, role‑playing, and more—detailing the principles and practical uses of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent technologies, and how they enhance model intelligence and user experience.

AIAgentKnowledge Graph
0 likes · 4 min read
How Tencent’s Large Language Model Powers Real-World AI Applications
DataFunSummit
DataFunSummit
Sep 14, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

This article examines Tencent's large language model deployments across various business scenarios, detailing the use of Retrieval‑Augmented Generation, GraphRAG for role‑playing, and Agent technologies, while also outlining core application areas and the three main technical approaches—SFT, RAG, and Agents.

AI agentsAI applicationsGraphRAG
0 likes · 4 min read
How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents
Data Party THU
Data Party THU
Sep 13, 2025 · Artificial Intelligence

How a Multi‑Agent Large Model Transforms Ecological Big‑Data Analysis

This report details a university project that built a flexible, high‑performance multi‑agent large‑model framework for ecological environment big‑data analysis, covering system architecture, individual agents, memory mechanisms, report generation, a FastAPI‑LangGraph backend, a React frontend, testing methodology, and future directions.

AIBig DataFastAPI
0 likes · 7 min read
How a Multi‑Agent Large Model Transforms Ecological Big‑Data Analysis
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 11, 2025 · Artificial Intelligence

Fin-PRM: Alibaba’s Dianjin Team Introduces a Domain-Specific Process Reward Model for Financial Reasoning

Fin‑PRM, a domain‑specific process reward model for financial reasoning introduced by Alibaba’s Dianjin team, employs dual‑level step and trajectory rewards to provide fine‑grained supervision, achieving up to 12.9% accuracy gains in supervised fine‑tuning and 5.1% improvements in Best‑of‑N inference on benchmarks such as CFLUE and FinQA.

CFLUEFin-PRMFinQA
0 likes · 11 min read
Fin-PRM: Alibaba’s Dianjin Team Introduces a Domain-Specific Process Reward Model for Financial Reasoning
DataFunSummit
DataFunSummit
Sep 10, 2025 · Artificial Intelligence

Claude’s Exit from China: How Domestic AI Models Can Fill the Void

Anthropic’s new policy blocks Chinese‑controlled firms from using Claude and Claude Code, prompting a deep dive into the model’s strengths and exploring fast‑growing domestic AI alternatives—such as Qwen3‑Coder, GLM‑4.5, and others—to understand their capabilities, gaps, and future opportunities for Chinese developers.

AIChinese AIClaude
0 likes · 11 min read
Claude’s Exit from China: How Domestic AI Models Can Fill the Void
Eric Tech Circle
Eric Tech Circle
Sep 10, 2025 · Artificial Intelligence

Deploy High‑Performance Local LLMs with vLLM: A Step‑by‑Step Guide

This article walks through installing and configuring vLLM for local large language model inference, compares it with Ollama and LM Studio, details environment setup, model download, testing scripts, and shows how to expose an OpenAI‑compatible API for production use.

Inference OptimizationLarge Language ModelModelScope
0 likes · 11 min read
Deploy High‑Performance Local LLMs with vLLM: A Step‑by‑Step Guide
Wuming AI
Wuming AI
Sep 6, 2025 · Artificial Intelligence

Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM

The article reviews Alibaba's 1‑trillion‑parameter Qwen3‑Max‑Preview model, comparing its benchmark scores, hallucination rate, math and coding accuracy, and SVG generation quality against Claude, Kimi K2, and DeepSeek, while providing usage links and real‑world user impressions.

AI BenchmarkLarge Language ModelQwen3
0 likes · 4 min read
Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM
Kuaishou Tech
Kuaishou Tech
Sep 5, 2025 · Artificial Intelligence

How Keye‑VL‑1.5‑8B Sets New Benchmarks in Multimodal AI

Fast‑search platform Kwai has open‑sourced the 8‑billion‑parameter multimodal LLM Keye‑VL‑1.5, which introduces a slow‑fast frame encoding, a progressive four‑stage pre‑training pipeline, and an automated data construction workflow, achieving state‑of‑the‑art results on video and vision‑language benchmarks and surpassing many closed‑source models.

Large Language Modelbenchmark performancemultimodal AI
0 likes · 12 min read
How Keye‑VL‑1.5‑8B Sets New Benchmarks in Multimodal AI
Efficient Ops
Efficient Ops
Sep 2, 2025 · Artificial Intelligence

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

At the 27th GOPS Global Operations Conference in Shanghai (Oct 17‑18, 2025), Professor Wang Peng of Fudan University will reveal how large language models can extract and structure heterogeneous operational data into high‑quality knowledge bases, and how RAG‑driven Q&A enhances fault diagnosis, SOP generation, and automated decision‑making.

Artificial IntelligenceIntelligent OperationsKnowledge Base
0 likes · 3 min read
How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations
Efficient Ops
Efficient Ops
Sep 2, 2025 · Artificial Intelligence

Inside Meituan’s LongCat‑Flash‑Chat: 560B‑Parameter MoE Model with Ultra‑Fast Inference

Meituan has open‑sourced LongCat‑Flash‑Chat, a 5.6‑trillion‑parameter Mixture‑of‑Experts model that activates only a fraction of its weights per token, delivering mainstream‑level performance, high inference speed, and low cost for complex agent applications.

Artificial IntelligenceInference OptimizationLarge Language Model
0 likes · 4 min read
Inside Meituan’s LongCat‑Flash‑Chat: 560B‑Parameter MoE Model with Ultra‑Fast Inference
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 2, 2025 · Artificial Intelligence

How LongCat‑Flash Achieves Record Speed and Efficiency for a 560B MoE Model

LongCat‑Flash is a 560‑billion‑parameter Mixture‑of‑Experts LLM that combines a dynamic zero‑computation expert design, shortcut‑connected MoE communication, variance‑aligned scaling, and a three‑stage agent‑centric pre‑training pipeline, delivering over 100 TPS on H800 GPUs at a cost of $0.70 per million tokens.

Artificial IntelligenceInference OptimizationLarge Language Model
0 likes · 23 min read
How LongCat‑Flash Achieves Record Speed and Efficiency for a 560B MoE Model
Java Tech Enthusiast
Java Tech Enthusiast
Sep 1, 2025 · Artificial Intelligence

How Meituan’s LongCat‑Flash‑Chat Beats Top LLMs with Zero‑Computation Experts

LongCat‑Flash‑Chat, Meituan’s newly open‑sourced 560B MoE model, outperforms leading LLMs on agent tool use and instruction following benchmarks, introduces zero‑computation experts and shortcut‑connected MoE for higher throughput, and demonstrates strong programming and reasoning abilities across diverse evaluation tasks.

Large Language ModelMeituan AIZero Computation Experts
0 likes · 12 min read
How Meituan’s LongCat‑Flash‑Chat Beats Top LLMs with Zero‑Computation Experts
DataFunTalk
DataFunTalk
Sep 1, 2025 · Artificial Intelligence

Unlocking 560B‑Parameter AI: Inside LongCat‑Flash‑Chat’s Zero‑Computation MoE

LongCat‑Flash‑Chat, a 560‑billion‑parameter Mixture‑of‑Experts model with Zero‑Computation Experts, delivers top‑tier benchmark scores and fast inference while activating only a fraction of its parameters, and is fully open‑sourced with easy deployment scripts.

Artificial IntelligenceBenchmarkingLarge Language Model
0 likes · 6 min read
Unlocking 560B‑Parameter AI: Inside LongCat‑Flash‑Chat’s Zero‑Computation MoE
DataFunTalk
DataFunTalk
Aug 29, 2025 · Artificial Intelligence

Grok Code Fast 1: xAI’s New Coding Model 3× Faster, 6× Cheaper

Elon Musk’s xAI has launched Grok Code Fast 1, a new code‑generation model that claims to be three times faster and six times cheaper than GPT‑5, offering agentic programming capabilities, broad language support, free‑week trials on major IDE platforms, and competitive pricing with high cache hit rates.

AI code modelLarge Language Modelagentic programming
0 likes · 6 min read
Grok Code Fast 1: xAI’s New Coding Model 3× Faster, 6× Cheaper
DataFunTalk
DataFunTalk
Aug 26, 2025 · Artificial Intelligence

Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide

This resource guide presents a curated list of cutting‑edge topics—including multimodal GraphRAG, knowledge‑graph‑driven large‑model applications in finance, traditional Chinese medicine, automotive manufacturing, and knowledge‑management trends—offering insights into AI‑powered knowledge services, and invites readers to scan the QR code to download the full e‑book.

AIKnowledge GraphLarge Language Model
0 likes · 2 min read
Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide