Tagged articles

675 articles

Aug 25, 2025 · Artificial Intelligence

How 1688 AI App Redefines B2B E‑commerce with AI‑Powered Search and Multimodal Interfaces

The article examines the design shift from the traditional 1688 App to the AI‑native 1688 AI App, detailing how AI‑driven interfaces, system prompts, embedding‑based retrieval, multi‑agent routing, and AI gateways transform B2B product discovery, recommendation, and customization.

AI Search · B2B e-commerce · Large Language Model

0 likes · 20 min read

Baidu Geek Talk

Aug 25, 2025 · Artificial Intelligence

How ERNIE‑4.5‑VL Redefines Multimodal AI with 100+ Language Support

The ERNIE‑4.5‑VL visual‑language model moves beyond single‑modality limits, handling image, video, and text understanding across more than 100 languages. It is lightweight yet competitive with models like Qwen2.5‑VL, supports a 128K context, offers dual “thinking” modes, and ships with extensive deployment resources.

AI research · Ernie · Large Language Model

0 likes · 4 min read

Data Party THU

Aug 24, 2025 · Artificial Intelligence

Can a ‘Centaur’ AI Model Truly Predict Human Decisions? A Deep Dive

This article reviews the Centaur foundation model—fine‑tuned from Llama 3‑70B on the Psych‑101 dataset—to assess its ability to predict human choices, brain activity, and decision rationales across diverse psychological experiments, while discussing generalization, over‑fitting, and future research limits.

Centaur · Foundation Model · Large Language Model

0 likes · 17 min read

Kuaishou Tech

Aug 23, 2025 · Artificial Intelligence

How Thyme Enables Models to Think Beyond Images with Code‑Driven Multimodal Reasoning

The Kwai Keye team presents Thyme, a novel multimodal reasoning framework that lets large language models generate and safely execute Python code for image manipulation and complex calculations, achieving significant performance gains over existing vision‑language models across perception, reasoning, and hallucination‑reduction benchmarks.

AI research · Large Language Model · Multimodal

0 likes · 12 min read

Open Source Tech Hub

Aug 22, 2025 · Artificial Intelligence

Automate User Feedback Classification with a Large‑Model API in PHP

This guide shows how to use the Tongyi Qianwen large‑model API with PHP to automatically classify user feedback into predefined categories, eliminating manual analysis and complex NLP development while providing clear steps, code, and result interpretation for rapid business insights.

API · Automation · Large Language Model

0 likes · 7 min read

AI Algorithm Path

Aug 20, 2025 · Artificial Intelligence

DeepSeek V3.1 Open‑Source: Unlocking a New Era of Long‑Context AI

DeepSeek V3.1, a 685‑billion‑parameter open‑source model, supports up to 128,000 tokens, delivers mixed‑architecture capabilities, matches top‑tier closed systems in benchmarks, and its rapid community adoption signals a shift toward democratized AI development and new industry dynamics.

AI Performance · DeepSeek · Large Language Model

0 likes · 6 min read

Fun with Large Models

Aug 20, 2025 · Artificial Intelligence

DeepSeek V3.1 Review: 128K Context, Knowledge, Programming & Agent Skills Near Claude 4

DeepSeek V3.1, released on August 19, expands context length to 128 K tokens and updates its knowledge base to July 2024, and the author’s benchmarks show its programming and agent capabilities now rival Claude 4, with detailed prompt examples, code generation demos, and performance comparisons.

Agent Evaluation · Claude 4 · DeepSeek

0 likes · 9 min read

Instant Consumer Technology Team

Aug 15, 2025 · Artificial Intelligence

Master the iFLYTEK Prohibited Words Classification Challenge: Baselines & BERT

This article introduces the iFLYTEK AI Developer Competition on prohibited‑word classification, outlines the task, dataset, evaluation metric, and provides three baseline solutions—including a logistic‑regression model, a BERT fine‑tuning approach, and a large‑model prompt method—along with code snippets and performance notes.

BERT · Large Language Model · NLP

0 likes · 15 min read

Data Party THU

Aug 11, 2025 · Artificial Intelligence

What Makes GPT‑5 the Most Powerful AI Model Yet? A Deep Dive into Its Architecture and Benchmarks

The article analyzes GPT‑5’s unified system, advanced reasoning models, and impressive benchmark gains across programming, creative writing, and health domains, highlighting its new router, Verbosity API, and record‑setting performance on tasks such as Aider polyglot, AIME 2025, and HealthBench.

AI benchmarks · AI reasoning · GPT-5

0 likes · 7 min read

AI Algorithm Path

Aug 8, 2025 · Artificial Intelligence

GPT‑5 Is Here: In‑Depth Technical Walkthrough of Architecture, Features, and Benchmarks

OpenAI’s GPT‑5, released on August 7 2025, introduces a unified system with real‑time routing, up to 400 k token context windows, multiple model families, refined safety mechanisms, new API controls, and benchmark results that show it surpasses GPT‑4 across intelligence, coding, instruction following, function calling and multimodal tasks.

AI Architecture · API · Benchmark

0 likes · 9 min read

AntTech

Aug 6, 2025 · Artificial Intelligence

Ring-lite-2507: Boosted Deep Reasoning and Balanced General Capabilities

The AntBailing team releases Ring-lite-2507, which strengthens deep reasoning through a two‑stage RL pipeline while keeping general capabilities balanced. The model shows notable gains on benchmarks like ARC‑AGI‑v1 and is available as an open‑source resource across major platforms.

Large Language Model · RL training · Ring-lite

0 likes · 5 min read

AI Info Trend

Aug 4, 2025 · Industry Insights

How AI Agents and Small Models Are Redefining Productivity in 2025 H1

The report analyzes first‑half‑2025 AI breakthroughs, covering the rise of general‑purpose agents, rapid inference improvements, small‑model proliferation, reinforcement‑learning compute dominance, evolving transformer architectures, and shifting industry dynamics, offering actionable insights for researchers, product leaders, and decision‑makers.

AI · Agent · Large Language Model

0 likes · 9 min read

Full-Stack Cultivation Path

Aug 2, 2025 · Artificial Intelligence

Is There a Design Pattern for AI Workflows? Exploring Prompt Chaining

The article explains how breaking complex LLM tasks into sequential steps—known as prompt chaining—improves answer accuracy, debuggability, flexibility, and enables sophisticated AI workflows such as report generation, chatbots, and content creation using tools like n8n and Ollama.

AI workflow · Automation · Large Language Model

0 likes · 6 min read

Baobao Algorithm Notes

Aug 1, 2025 · Artificial Intelligence

Unlocking Qwen3-Coder-30B: Features, Fast Start, and Agentic Coding Guide

The article introduces Qwen3‑Coder‑30B‑A3B‑Instruct (aka Qwen3‑Coder‑Flash), detailing its architecture, 256K‑to‑1M token context, agentic coding capabilities, installation steps with Transformers, sample code for tool use, optimal sampling parameters, and deployment tips across various runtimes.

AI coding assistant · Agentic Coding · Large Language Model

0 likes · 6 min read

AI Algorithm Path

Jul 29, 2025 · Artificial Intelligence

Why GLM‑4.5 Sets a New Benchmark for Open‑Source Large Language Models

GLM‑4.5 and its lightweight Air variant, featuring a deep‑layered MoE design, grouped‑query attention, and dual inference modes, achieve third‑place overall on 12 hard‑core benchmarks, excel in web‑browsing and tool‑calling with a 90.6 % success rate, and introduce novel training tricks such as the Muon optimizer and Slime RL framework.

AI · Benchmark · GLM-4.5

0 likes · 8 min read

AntTech

Jul 29, 2025 · Artificial Intelligence

How Ant Group’s Agentar‑Fin‑R1 Redefines Financial AI with Expert‑Level Reasoning

Ant Digital Technologies, an Ant Group company, released Agentar‑Fin‑R1, a finance‑focused large model that claims expert‑level knowledge, efficient training, and continuous self‑evolution. It reportedly outperforms open‑source rivals on benchmarks such as FinEval1.0, FinanceIQ, and Finova, and supports industry standards through a collaborative AI alliance.

Agentar-Fin-R1 · Ant Group · Financial AI

0 likes · 5 min read

Model Perspective

Jul 27, 2025 · Artificial Intelligence

Build a Practical AI Agent from Scratch with Coze’s Low‑Code Platform

This guide walks you through creating a functional AI agent using the Coze low‑code platform, covering account setup, goal definition, visual workflow design with large‑model and image‑generation nodes, variable configuration, testing, and publishing the agent to multiple channels.

AI Agent · Coze · Large Language Model

0 likes · 10 min read

Architecture and Beyond

Jul 27, 2025 · Artificial Intelligence

What Makes an AI Agent Tick? From Expert Systems to Modern Architectures

This article traces the evolution of AI agents from early expert systems to today’s multimodal, memory‑rich agents, explains their perception, reasoning, memory and action modules, discusses model selection, prompt engineering, RAG techniques, and highlights current limitations such as hallucinations, reliability, cost, and security.

AI Agent · Function Calling · Large Language Model

0 likes · 28 min read

AI Algorithm Path

Jul 26, 2025 · Artificial Intelligence

Qwen3-Coder: Alibaba’s 480‑Billion‑Parameter Open‑Source Code Model Takes on Claude 4

Alibaba’s Qwen team has released Qwen3-Coder, a 480‑billion‑parameter open‑source LLM specialized for code, featuring a 1‑million‑token context via YaRN, extensive benchmark superiority over most open models, and performance that rivals Claude 4 Sonnet while remaining fully accessible.

API · Benchmark · Large Language Model

0 likes · 12 min read

Zhihu Tech Column

Jul 25, 2025 · Artificial Intelligence

Boost Creative Writing with Zhi-Create-Qwen3-32B: Training, Eval & Deployment

This article introduces the open‑source Zhi‑Create‑Qwen3‑32B model, detailing its fine‑tuned training on creative‑writing data, the multi‑domain dataset strategy, curriculum‑learning based SFT, evaluation on WritingBench, and practical deployment options across various hardware and inference frameworks.

Deployment · Large Language Model · creative writing

0 likes · 11 min read

Fun with Large Models

Jul 24, 2025 · Artificial Intelligence

Qwen3‑Coder vs Claude 4: In‑Depth Performance Review and Usage Guide

This article evaluates the open‑source Qwen3‑Coder‑480B‑A35B model, comparing its programming and agentic capabilities with Claude 4 and other leading models, and detailing its architecture, context length, post‑training reinforcement learning, ecosystem tools, and real‑world code‑generation case studies.

AI coding · Agent RL · Benchmark

0 likes · 14 min read

DataFunTalk

Jul 23, 2025 · Artificial Intelligence

Qwen3‑Coder: Open‑Source AI Programming Agent That Beats the Competition

Alibaba’s Tongyi team unveiled the open‑source Qwen3‑Coder, a massive 480‑billion‑parameter programming model that outperforms leading closed‑source solutions, supports up to 1 M token context, offers a free CLI tool, and demonstrates impressive code generation capabilities across animations, games, and real‑world tasks.

AI programming · Large Language Model · Open Source

0 likes · 5 min read

Model Perspective

Jul 22, 2025 · Artificial Intelligence

How AI‑Powered “Deep Research” Supercharges Data Retrieval for Modeling

This article explains how large‑language‑model tools like Metaso AI’s “Deep Research” can dramatically speed up reliable data collection for mathematical modeling by providing systematic retrieval workflows, visual summaries, and interactive reports within minutes.

AI · Data Analysis · Data Retrieval

0 likes · 6 min read

Kuaishou Tech

Jul 21, 2025 · Artificial Intelligence

Can AI Models Think on Demand? Inside KAT‑V1 AutoThink’s Dynamic Reasoning

The article introduces KAT‑V1 AutoThink, a dual‑mode large language model that automatically switches between thinking and non‑thinking modes based on problem difficulty, details its novel training paradigm, reinforcement‑learning enhancements, performance benchmarks against leading open‑source models, and provides open‑source resources for further research.

Knowledge Distillation · Large Language Model · auto-think

0 likes · 14 min read

Alibaba Cloud Developer

Jul 21, 2025 · Artificial Intelligence

How Browser‑Use Leverages AI Prompts for Seamless Browser Automation

This article explains how the open‑source browser‑use framework combines carefully designed SystemMessage prompts, structured HumanMessage inputs, and LangChain‑driven tool calls to enable large language models to automate complex web tasks such as shopping, CRM updates, résumé processing, and document generation, while providing concrete code examples and best‑practice tips.

AI automation · LangChain · Large Language Model

0 likes · 21 min read

Mingyi World Elasticsearch

Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

Elasticsearch · Knowledge Base · Large Language Model

0 likes · 1 min read

Architect's Alchemy Furnace

Jul 17, 2025 · Artificial Intelligence

Explore the Ultimate Open-Source LLM Catalog: Models, Tools, and Resources

This article compiles a comprehensive, up‑to‑date inventory of open‑source large language models from Chinese and international organizations, detailing each model’s architecture, parameter count, multilingual capabilities, deployment requirements, and associated tools, offering a valuable reference for AI researchers and developers.

AI · LLM · Large Language Model

0 likes · 50 min read

AntTech

Jul 17, 2025 · Artificial Intelligence

How M2-Reasoning-7B Achieves State‑of‑the‑Art Spatial Reasoning in Multimodal AI

M2-Reasoning-7B, an open‑source 7B multimodal model from Ant Group, combines a high‑quality data pipeline with dynamic multi‑task training and a novel reward function to deliver state‑of‑the‑art performance on both general and spatial reasoning benchmarks, surpassing many larger competitors.

Benchmark · Large Language Model · M2-Reasoning

0 likes · 9 min read

AI Algorithm Path

Jul 14, 2025 · Artificial Intelligence

The Most Powerful Open‑Source Agent Model: Kimi K2

Kimi K2, an open‑source trillion‑parameter AI model released by Moonshot AI, offers Base and Instruct variants, achieves leading scores on benchmarks such as SWE‑bench, LiveCodeBench and AceBench, and introduces a novel post‑training autonomous‑exploration stage with MuonClip optimization to enable robust tool use and reinforcement‑learning‑driven self‑improvement.

Kimi K2 · Large Language Model · autonomous agents

0 likes · 8 min read

Architecture and Beyond

Jul 12, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? History, Architecture, and Future Challenges

This article traces the evolution of AI agents from early expert systems to modern large‑language‑model‑driven assistants, explains their core perception, reasoning, memory, and action modules, compares thinking and execution models, and discusses current limitations such as hallucinations, reliability, cost, and security.

AI Agent · Large Language Model · Memory Architecture

0 likes · 20 min read

Data Thinking Notes

Jul 8, 2025 · Artificial Intelligence

How Xiaohongshu Leverages Large Models to Revolutionize Content Recommendation

This article details Xiaohongshu's multi‑stage recommendation pipeline—using massive multi‑modal pre‑training, long‑sequence modeling, real‑time context features, reinforcement learning and online deep learning—to precisely surface valuable content, address cold‑start challenges, and break information bubbles for billions of users.

Large Language Model · Multimodal Learning · online deep learning

0 likes · 16 min read

JD Tech Talk

Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

New developer Li Ming’s frustrating onboarding experience highlights hidden business rules, undocumented code, and poor knowledge transfer, prompting him to build an AI‑driven knowledge base that links code changes, requirements, and operational docs, ultimately streamlining troubleshooting, accelerating feature development, and improving knowledge retention across teams.

AI · Large Language Model · RAG

0 likes · 18 min read

DataFunSummit

Jul 6, 2025 · Artificial Intelligence

AI-Driven Knowledge Graphs: Key Insights from Multimodal GraphRAG Research

This article presents a comprehensive overview of cutting‑edge research on integrating large language models with knowledge graphs, covering multimodal GraphRAG, financial AI solutions, traditional Chinese medicine decision support, and industry‑specific knowledge services, guiding readers through emerging paradigms and practical implementations.

AI · Enterprise AI · Knowledge Graph

0 likes · 2 min read

DataFunTalk

Jul 5, 2025 · Artificial Intelligence

DeepSeek R1T2 Chimera: Faster, High‑Performance LLM with Assembly of Experts

The DeepSeek R1T2 Chimera model, an open‑source LLM built with Assembly of Experts technology, delivers up to 200% faster inference than R1‑0528, surpasses R1 on GPQA‑Diamond and AIME‑24 benchmarks, and offers a 671‑billion‑parameter MoE architecture, though it lacks function‑calling support and trails the highest‑end R1‑0528 on the toughest tests.

AI · Assembly of Experts · DeepSeek

0 likes · 5 min read

DataFunTalk

Jul 3, 2025 · Artificial Intelligence

Inside xAI’s Grok 4: Massive Funding, Extreme Iteration, and Power Challenges

Elon Musk’s xAI has quietly leaked its upcoming Grok 4 and Grok 4 Code models, skipped Grok 3.5, secured $10 billion in new financing, and is building massive GPU super‑computing facilities, while raising concerns about model bias, data integrity, and unprecedented power‑grid strain.

AI funding · Artificial Intelligence · GPU computing

0 likes · 6 min read

DataFunSummit

Jul 2, 2025 · Artificial Intelligence

How End-to-End Reinforcement Learning Powers the Kimi Researcher AI Agent

The article explains how Kimi Researcher, an AI Agent built with end‑to‑end reinforcement learning, achieves state‑of‑the‑art performance on the Humanity’s Last Exam benchmark, scales via data‑driven training, and supports diverse research and analysis scenarios.

AI Agent · Kimi Researcher · Large Language Model

0 likes · 9 min read

Baobao Algorithm Notes

Jun 30, 2025 · Artificial Intelligence

How End‑to‑End Reinforcement Learning Powers the Kimi‑Researcher AI Agent

The article examines Kimi‑Researcher, an AI research agent built with end‑to‑end reinforcement learning, detailing its technical motivations, advantages over traditional workflow‑based and SFT methods, performance breakthroughs on benchmark exams, and diverse real‑world use cases ranging from literature reviews to legal analysis.

AI Agent · End-to-End RL · Kimi Researcher

0 likes · 10 min read

Network Intelligence Research Center (NIRC)

Jun 29, 2025 · Artificial Intelligence

Multimodal AI Assistant Boosts Network Config: 96.6% Accuracy, 26× Labor Cut

The paper presents NLI2Conf, an intent‑driven network configuration model that fuses configuration files, topology and performance data via a multimodal interface, using large language and graph neural models to align natural‑language intents with forwarding and performance constraints, achieving 96.6% accuracy and a 26‑fold reduction in manual effort.

Graph Neural Network · Large Language Model · NLI2Conf

0 likes · 6 min read

Alibaba Cloud Big Data AI Platform

Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI Search · Elasticsearch · Large Language Model

0 likes · 11 min read

Alibaba Cloud Developer

Jun 25, 2025 · Cloud Computing

Control Alibaba Cloud Resources with LLMs and MCP Server in Minutes

This article explains how to combine Alibaba Cloud's MCP Server with large language models to enable natural‑language operations on cloud products, covering setup, tool selection, OAuth authentication, code examples, troubleshooting context‑length limits, and future enhancements for more efficient, secure cloud management.

API integration · Cloud Computing · Large Language Model

0 likes · 20 min read

Instant Consumer Technology Team

Jun 23, 2025 · Artificial Intelligence

What Are AI Agents? Architecture, Applications, and Future Trends

AI Agents, autonomous intelligent programs that perceive, reason, and act, are reshaping industries from healthcare to autonomous driving; this article explains their core components, differences from large language models, planning techniques, memory mechanisms, tool use, real‑world applications, current challenges, and future directions.

AI Agent · Applications · Large Language Model

0 likes · 35 min read

Instant Consumer Technology Team

Jun 19, 2025 · Artificial Intelligence

Exploring II-Agent: An Open‑Source AI Agent Framework for Multi‑Domain Automation

II-Agent is an open‑source, multi‑domain AI agent framework that leverages powerful large language models, a rich toolset, planning‑and‑reflection mechanisms, and advanced context management to enable autonomous task execution, real‑time interaction, and seamless integration across development, data analysis, and enterprise workflows.

AI Agent · Automation · Context Management

0 likes · 21 min read

ByteDance Data Platform

Jun 18, 2025 · Artificial Intelligence

How Imperfect AI Can Unlock the Hidden 80% of Enterprise Data

Enterprises face a sharp paradox: despite exploding data volumes, they mostly use only the roughly 20% of their data that is structured, while the remaining 80%, which is unstructured, stays frozen. This talk explores how Data Agent‑powered imperfect AI can awaken that hidden value.

AI · Data Agent · Data Analysis

0 likes · 16 min read

JD Tech

Jun 16, 2025 · Artificial Intelligence

How JD Engineers Leverage LLMs and Sparse Models to Boost Search and Ads

This article showcases three JD tech case studies—using large language models for e‑commerce query expansion, applying sparse large models with scaling‑law experiments to improve ad prediction, and building proactive risk‑prevention systems—to illustrate practical AI engineering that drives higher recall, conversion, and system robustness.

Advertising · Large Language Model · Scaling Law

0 likes · 8 min read

TAL Education Technology

Jun 13, 2025 · Operations

How Large Language Models Are Revolutionizing Fault Localization

This article explores how the rapid rise of large language models and techniques like Retrieval‑Augmented Generation, Chain‑of‑Thought prompting, and multi‑agent architectures can dramatically improve the speed, accuracy, and automation of fault localization in modern operations environments.

Agent Architecture · CoT · Fault Localization

0 likes · 14 min read

Baidu Tech Salon

Jun 11, 2025 · Artificial Intelligence

Why Baidu’s Wenxin Model Dominates IDC’s 2025 Large Model Evaluation

IDC’s 2025 China foundational large‑model evaluation crowns Baidu’s Wenxin as the top performer, scoring perfect marks in seven of eight criteria and highlighting its superior multimodal, dialogue, and ecosystem capabilities among twelve leading models.

AI · Baidu Wenxin · IDC evaluation

0 likes · 5 min read

Nightwalker Tech

Jun 11, 2025 · Artificial Intelligence

Turn Your AI Coding Assistant into a Critical Mentor, Not Just a Tool

This guide explains how to shift AI coding tools like Cursor, Windsurf, and RooCode from simple code generators into proactive mentors that critique, suggest improvements, and adopt multiple specialized modes, while also covering prompt design, multi‑round dialogue, and practical code examples.

AI · Large Language Model · Prompt Engineering

0 likes · 15 min read

DataFunSummit

Jun 10, 2025 · Artificial Intelligence

How Quwan’s Kaitian Model Tackles Emotional AI for Social Apps – Architecture, Training Tricks, and Safety

Quwan Technology presents its Kaitian social large model, designed for personalized, emotionally rich, multimodal AI interactions, detailing its scene‑specific goals, CPT+SFT+RLHF training pipeline, data desensitization, LoRA fine‑tuning, evaluation methods, pruning, latency trade‑offs, safety mechanisms, and future feedback loops.

AI safety · Large Language Model · LoRA

0 likes · 13 min read

Xiaohongshu Tech REDtech

Jun 6, 2025 · Artificial Intelligence

How dots.llm1 Sets New Benchmarks for Open‑Source MoE Language Models

dots.llm1, an open‑source 142‑billion‑parameter Mixture‑of‑Experts language model from hi lab, achieves Qwen2.5‑72B‑level performance after training on 11.2 T high‑quality tokens, and the release includes full models, intermediate checkpoints, and detailed training pipelines for the research community.

AI research · Large Language Model · Mixture of Experts

0 likes · 10 min read

Alibaba Cloud Developer

Jun 5, 2025 · Artificial Intelligence

How Deep (Re)Search Transforms Code Search and AI-Powered Knowledge Retrieval

This article systematically explains the concepts of Deep Search and Deep Research, contrasts them with traditional Retrieval‑Augmented Generation, reviews leading commercial and open‑source solutions, details their architecture for code retrieval, and outlines future plans for specialized code‑search agents.

AI research · Code search · Knowledge retrieval

0 likes · 13 min read

Java Web Project

Jun 4, 2025 · Artificial Intelligence

Why DeepSeek V3 Stands Out: Architecture, Performance, and Open‑Source Edge

The article analyzes DeepSeek's rapid adoption, detailing its seven core models, the third‑generation MoE architecture, FP8 mixed‑precision training, 128K context window, benchmark superiority on MMLU/HumanEval/CMMLU, low training cost, and fully open‑source release, while also introducing a companion guide for developers.

AI Architecture · DeepSeek · FP8 training

0 likes · 9 min read

Kuaishou Tech

Jun 4, 2025 · Artificial Intelligence

KwaiCoder-AutoThink-preview: An Automatic‑Thinking Large Model Enhanced with Step‑SRPO Reinforcement Learning

The KwaiPilot team released the KwaiCoder‑AutoThink‑preview model, which introduces a novel automatic‑thinking training paradigm and a process‑supervised reinforcement‑learning method called Step‑SRPO, enabling the model to dynamically switch between thinking and non‑thinking modes, reduce inference cost, and achieve up to 20‑point gains on code and math benchmarks while handling large‑scale codebases.

AI research · Large Language Model · automatic thinking

0 likes · 12 min read

Satori Komeiji's Programming Classroom

Jun 3, 2025 · Artificial Intelligence

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

The article explains Retrieval‑Augmented Generation (RAG) by describing how a programmer, frustrated with oversized prompts for a large language model, discovers that retrieving relevant document fragments, embedding them, and feeding the augmented context to the model yields accurate, fact‑based answers.

AI · Chunking · Embedding

0 likes · 6 min read

AI Frontier Lectures

May 30, 2025 · Artificial Intelligence

Can a 5% Parameter LLM Rival Full‑Scale Models? Inside FairyR1‑32B

A Peking University team unveils FairyR1‑32B, a 32‑billion‑parameter LLM built on DeepSeek‑R1‑Distill‑Qwen‑32B that uses self‑merging, multi‑teacher cross‑distillation, and lightweight distillation to achieve competitive math and code benchmark scores with only about 5% of the parameters of full‑scale models.

Large Language Model · Model Compression · distillation

0 likes · 6 min read

Efficient Ops

May 29, 2025 · Artificial Intelligence

DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3

DeepSeek quietly launched the R1 0528 model, which early testers report matches OpenAI’s o3 in benchmarks and style while adding deeper chain‑of‑thought reasoning, better writing output, and extended thinking windows. The article closes with a promotion for the GOPS Global Ops Conference.

AI Performance · DeepSeek · Large Language Model

0 likes · 3 min read

Network Intelligence Research Center (NIRC)

May 27, 2025 · Artificial Intelligence

Simplify Large‑Model Fine‑Tuning with LLaMA‑Factory

This article walks through using LLaMA‑Factory—a unified framework that supports over 100 LLMs—to install dependencies, prepare Alpaca‑style datasets, perform LoRA fine‑tuning, run inference, and export the tuned model, all with concrete command‑line examples.

GitHub · LLaMA-Factory · Large Language Model

0 likes · 6 min read

IT Services Circle

May 25, 2025 · Artificial Intelligence

DeepSeek Core Technologies and Model Innovations: DeepSeek‑V3 and DeepSeek‑R1 Technical Overview

The article provides a detailed technical overview of DeepSeek's flagship large language models, DeepSeek‑V3 and DeepSeek‑R1, describing their MoE architecture, training frameworks, reinforcement‑learning based fine‑tuning, inference optimizations, and the broader impact of these innovations on the AI landscape while also promoting related books and resources.

AI · DeepSeek · Large Language Model

0 likes · 10 min read

Fun with Large Models

May 25, 2025 · Artificial Intelligence

A Complete Breakdown of Claude 4’s Core Features – How Close Are We to Programmer Unemployment?

Claude 4, released in May 2025 in Opus and Sonnet variants, combines hybrid reasoning, a 200K context window, an advanced code interpreter, RAG retrieval, and MCP integration, delivering industry‑leading programming and AI‑agent performance at relatively low cost, as confirmed by multiple company and user evaluations.

AI agents · Anthropic · Claude 4

0 likes · 10 min read

JD Retail Technology

May 22, 2025 · Industry Insights

Cracking Hidden Ad Fraud: JD’s AI‑Driven Anti‑Cheat System Explained

This article recounts the journey of a JD PhD trainee who transformed academic research on anomaly detection into a production‑grade, LLM‑enhanced anti‑fraud system that identifies concealed address codes in CPS ads, detailing model design, LoRA fine‑tuning, reinforcement learning, distillation, cost‑aware deployment, and lessons learned for scalable ad risk management.

Large Language Model · ad fraud detection · industry AI

0 likes · 12 min read

DataFunSummit

May 17, 2025 · Artificial Intelligence

Integrating Knowledge Graphs with DeepSeek AI for Enterprise Knowledge Management

This presentation explores how combining knowledge graphs with DeepSeek large‑model agents can revolutionize enterprise knowledge management, detailing DeepSeek’s technical strengths, the graph‑model complementarity paradigm, various knowledge types, practical frameworks, case studies, and future outlooks for AI‑enhanced intelligent systems.

Artificial Intelligence · DeepSeek · Enterprise Knowledge Management

0 likes · 23 min read

Architecture and Beyond

May 16, 2025 · Artificial Intelligence

Understanding AI Hallucinations: The Fictional Reality of Large Language Models

The essay explores why AI systems produce hallucinations by viewing their reality as a vast fictional narrative built from human language data, arguing that their knowledge is bounded by the corpus they ingest, and reflecting on philosophical limits of language and truth.

AI · Language · Large Language Model

0 likes · 11 min read

Alimama Tech

May 14, 2025 · Artificial Intelligence

Deep Research‑Driven Risk Root‑Cause Analysis with Domain Graph Constraints for Large‑Scale Advertising Traffic

This article presents a large‑scale advertising risk‑control solution that combines deep‑research paradigms, domain‑graph constraints, and large language models to enable explainable, responsible, and high‑precision fraud detection, detailing system architecture, challenges, demo workflow, and future directions.

AI · Large Language Model · advertising fraud

0 likes · 11 min read

Alimama Tech

May 12, 2025 · Artificial Intelligence

Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising

The article presents the Universal Recommendation Model (URM), a large‑language‑model‑based recall framework that integrates world knowledge and e‑commerce expertise through knowledge injection and prompt‑driven alignment, achieving significant offline recall gains and a 3.1% increase in ad consumption while meeting high‑QPS, low‑latency production constraints.

Advertising · Large Language Model · Multimodal

0 likes · 17 min read

DevOps

May 5, 2025 · Artificial Intelligence

DeepSeek Releases Math‑Specialized Large Model V2 and ProverBench Evaluation Suite

DeepSeek has quietly open‑sourced a new mathematics‑focused large language model, DeepSeek‑Prover‑V2 (available in 671B and 7B variants), achieving 88.9% on MiniF2F and strong results on PutnamBench, alongside the high‑quality ProverBench dataset and a novel recursive theorem‑proving pipeline.

AI · DeepSeek · Large Language Model

0 likes · 4 min read

Architects' Tech Alliance

May 2, 2025 · Artificial Intelligence

DeepSeek‑Prover‑V2‑671B: A Massive AI Model for Formal Mathematical Theorem Proving

DeepSeek‑Prover‑V2‑671B, a 671 billion‑parameter AI model released on Hugging Face, dramatically advances formal mathematical theorem proving with MoE architecture, FP8 quantization, 163 k token context, superior performance over GPT‑4 Turbo and other models, and broad implications for research and industry.

AI · DeepSeek · FP8 quantization

0 likes · 11 min read

JavaEdge

May 2, 2025 · Artificial Intelligence

Exploring Qwen3: Open‑Source LLM Features, Benchmarks, and Deployment Guides

This article introduces the Qwen3 family of open‑source large language models, details their architecture, parameter counts, multilingual support, and benchmark performance, and provides step‑by‑step instructions for deploying them with frameworks like SGLang, vLLM, and local runtimes such as Ollama and LMStudio.

AI · Agent · Large Language Model

0 likes · 22 min read

AI Algorithm Path

May 2, 2025 · Artificial Intelligence

Qwen3 Launch: Open-Source Models Redefine General AI

The Qwen3 series introduces eight open‑source large language models ranging from 0.6B to 235B parameters, combines dense and Mixture‑of‑Experts architectures, supports multimodal input, offers mixed inference modes, and demonstrates benchmark superiority over leading models such as OpenAI o1 and Gemini 2.5 Pro.

AI agents · Benchmark · Large Language Model

0 likes · 10 min read

Mafengwo Technology

Apr 30, 2025 · Artificial Intelligence

How MaFengWo’s mfw-32B Travel LLM Outperforms DeepSeek‑R1 in Speed and Accuracy

The article details the development, training, and evaluation of MaFengWo's 32‑billion‑parameter travel large language model (mfw‑32B), highlighting its superior itinerary planning, personalized demand capture, budget management, and resource efficiency compared to DeepSeek‑R1, and describing the SFT and reinforcement‑learning stages that enabled these gains.

Large Language Model · LoRA · ai-optimization

0 likes · 14 min read

Alibaba Cloud Big Data AI Platform

Apr 29, 2025 · Artificial Intelligence

Unlock Qwen3: Powerful LLM Features and Zero‑Code Deployment on Alibaba Cloud

This article introduces Qwen3, the latest family of dense and MoE large language models with dual‑mode reasoning, enhanced inference, multilingual support, and strong agent capabilities, and explains how Alibaba Cloud's PAI‑Model Gallery enables zero‑code, one‑click deployment and enterprise‑grade usage.

Alibaba Cloud · Large Language Model · Qwen3

0 likes · 6 min read

Programmer DD

Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is Redefining Open‑Source LLMs: Mixed‑Inference Power and Unmatched Performance

Qwen3, Alibaba’s latest open‑source large language model, introduces a pioneering mixed‑inference architecture that blends top‑tier reasoning and non‑reasoning capabilities, delivering record‑breaking benchmark scores, multilingual support for 119 languages, cost‑effective deployment, and a 128K context window, now accessible via Ollama and OpenRouter.

AI Benchmark · Large Language Model · Qwen3

0 likes · 5 min read

DataFunTalk

Apr 29, 2025 · Artificial Intelligence

ChatGPT Adds Shopping Feature and Alibaba Unveils Qwen3 Model Series

OpenAI announced new shopping capabilities for ChatGPT, improving product recommendation, visual presentation, and direct purchase links, while Alibaba released the Qwen3 series of large and MoE language models with detailed parameter counts and benchmark performance, highlighting rapid advancements in consumer‑focused AI applications.

AI · Artificial Intelligence · ChatGPT

0 likes · 4 min read

Java Architecture Diary

Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is the New Powerhouse in Open‑Source AI Models

Qwen3 introduces a suite of open‑source models—from a 235B expert model to compact 0.6B versions—offering competitive performance against top proprietary models, multilingual support, flexible thinking modes, and low deployment requirements, with detailed usage instructions via Ollama and OpenRouter.

Large Language Model · Ollama · Qwen3

0 likes · 8 min read

Baidu Tech Salon

Apr 28, 2025 · Artificial Intelligence

Inside Baidu’s Wenxin 4.5 Turbo & X1 Turbo: Architecture, Training Tricks, and Real-World Impact

At the Create2025 AI Developer Conference, Baidu unveiled the multimodal Wenxin 4.5 Turbo and X1 Turbo models, detailing their innovative architecture, self‑feedback post‑training, composite reasoning chains, data pipelines, and the new Wenxin KuaiMa 3.5 code assistant, while also showcasing ecosystem growth and cultural AI applications.

AI Conference · Baidu · Large Language Model

0 likes · 9 min read

21CTO

Apr 26, 2025 · Artificial Intelligence

Baidu Launches Low-Cost ERNIE 4.5 Turbo & X1 Turbo Multimodal AI Models

Baidu unveiled upgraded ERNIE 4.5 Turbo and ERNIE X1 Turbo models with enhanced multimodal abilities, lower costs and free access, while analysts debated the performance of its new P800 chip cluster and its strategic impact in the global AI race.

AI competition · Baidu · Ernie

0 likes · 5 min read

Tencent Technical Engineering

Apr 22, 2025 · Artificial Intelligence

Conan-Embedding-V2: A 1.4B LLM‑Based Multilingual Embedding Model Achieving SOTA on MTEB

Conan‑Embedding‑V2, a newly trained 1.4 B‑parameter LLM with a custom tokenizer, 32 k token context, SoftMask, cross‑lingual retrieval data and dynamic hard‑negative mining, delivers state‑of‑the‑art multilingual embeddings that surpass larger models on both English and Chinese MTEB benchmarks while remaining compact and fast.

Embedding · Large Language Model · MTEB

0 likes · 14 min read

dbaplus Community

Apr 21, 2025 · Operations

Turn Zabbix Alerts into AI‑Powered Insights with DeepSeek

This guide shows how to integrate Zabbix with a locally deployed DeepSeek large language model via Webhook, enabling automatic analysis of alerts, generation of root‑cause explanations and remediation suggestions, and delivering results through WeChat bots, dashboards, or email to reduce MTTR and manual effort.

AI Ops · Alert Automation · DeepSeek

0 likes · 4 min read

AI2ML AI to Machine Learning

Apr 17, 2025 · Artificial Intelligence

Inside Qwen: A Deep Dive into the Large Model’s Source Code

The article provides a comprehensive technical walkthrough of Qwen’s large‑model series, covering data preparation, tokenization, model tweaks, training settings, RLHF pipeline, Code‑Qwen specifics, Qwen2 and Qwen3 architectural changes, scaling‑law experiments, and detailed source‑code analysis with illustrative diagrams.

Large Language Model · MoE · Qwen

0 likes · 7 min read

21CTO

Apr 17, 2025 · Artificial Intelligence

What’s New in OpenAI’s GPT‑4.1? Bigger Context, Faster, Cheaper AI

OpenAI has launched GPT‑4.1, a multimodal AI model that expands context windows to one million tokens, improves coding and instruction following, offers cheaper Mini and Nano variants, and signals a shift in its release roadmap, including plans to retire GPT‑4 and delay GPT‑5.

AI research · GPT-4.1 · Large Language Model

0 likes · 5 min read

AIWalker

Apr 13, 2025 · Artificial Intelligence

Huawei Pangu Ultra: 135B Ascend‑Native Dense LLM Without Nvidia GPUs

Huawei's Pangu Ultra introduces a 135‑billion‑parameter dense language model trained entirely on Ascend NPUs, detailing novel stability architectures, a domain‑aware tokenizer, multi‑stage pre‑training, extensive system optimizations, and benchmark results that surpass leading models such as Llama 405B and DeepSeek‑R1.

Ascend NPU · Dense Model · Large Language Model

0 likes · 15 min read

AntTech

Apr 11, 2025 · Artificial Intelligence

Understanding MCP and Function Call: A Comprehensive Guide to LLM Tool Integration

This article explains the MCP protocol and Function Call mechanism for large language models, detailing how tools are described, invoked, and processed, and provides practical code examples ranging from OpenAI JSON specifications to fast‑MCP Python and Spring MVC implementations.

AI tool integration · Large Language Model · MCP

0 likes · 14 min read

JD Tech Talk

Apr 11, 2025 · Artificial Intelligence

A Billion-Scale Pure Time Series Large Model: PCTLM with SFT and TPO for Forecasting

This article presents a pioneering billion‑parameter pure time‑series large model (PCTLM) trained on a 1.5‑billion‑sample dataset, introduces a novel RLHF framework (TPO) for time‑series forecasting, and demonstrates state‑of‑the‑art performance across multiple public benchmarks, surpassing existing models such as GPT4TS.

Large Language Model · PCTLM · RLHF

0 likes · 11 min read

Volcano Engine Developer Services

Apr 8, 2025 · Artificial Intelligence

Which Cloud Platform Delivers the Fastest DeepSeek‑R1 API? A Comprehensive Benchmark

This article aggregates multiple independent evaluations of DeepSeek‑R1 across major cloud providers, comparing accuracy on AIME math problems, token‑per‑second throughput, first‑token latency, stability under high concurrency, and overall service reliability, ultimately highlighting Volcano Engine as the top performer.

AI inference · API performance · Benchmark

0 likes · 12 min read

DevOps

Apr 7, 2025 · Artificial Intelligence

Meta Llama 4 Scout, Maverick, and Behemoth: Architecture, NoPE Innovation, and Training Advances

The article introduces Meta's newly open‑sourced Llama 4 series, including Scout with a 10‑million‑token context window, Maverick with 400 billion total parameters, and the upcoming Behemoth teacher model, and details their Mixture‑of‑Experts architecture, the NoPE layers that drop positional encoding, training pipelines, performance benchmarks, and infrastructure improvements for large‑scale AI research.

AI research · Large Language Model · Llama 4

0 likes · 8 min read

21CTO

Apr 7, 2025 · Artificial Intelligence

Llama 4 Unveiled: Breakthrough Multimodal Models Redefine AI Capabilities

Meta's Llama 4 series introduces the Scout, Maverick, and Behemoth models—featuring Mixture‑of‑Experts architectures, unprecedented 10‑million‑token context windows, and state‑of‑the‑art performance across vision, language, and multimodal benchmarks—while emphasizing efficient training, open‑source availability, and robust safety safeguards.

AI safety · Large Language Model · Llama 4

0 likes · 14 min read

AI Algorithm Path

Apr 2, 2025 · Artificial Intelligence

Vision‑Reasoning Model: Enabling LLMs to See and Think

The article analyzes the limitations of current visual language models and large reasoning models, proposes a combined Vision‑Reasoning Model (VRM), details its architecture using LLaVA, describes end‑to‑end fine‑tuning and reinforcement‑learning reward design, and argues that such models will become the next breakthrough in AI.

DeepSeek · LLaVA · Large Language Model

0 likes · 9 min read

Java Architect Essentials

Apr 2, 2025 · Backend Development

Integrating DeepSeek Large Language Model with Spring Boot to Build an AI Chat Application

This guide demonstrates how to create a Spring Boot backend that integrates DeepSeek's large language model via the Spring AI OpenAI starter, covering project setup, dependency configuration, API key management, and a sample controller that provides AI-powered chat responses such as weather forecasts.

AI integration · Chatbot · DeepSeek

0 likes · 8 min read

Nightwalker Tech

Apr 1, 2025 · Artificial Intelligence

Evaluation of AutoGLM: Features, Architecture, and Practical Test Results

This article reviews AutoGLM, the first "think‑while‑doing" AI agent released by Zhipu AI, detailing its core capabilities, full‑stack architecture, user experience, identified limitations, and the outcomes of three hands‑on tests using both the client application and a Chrome extension.

AI Agent · AutoGLM · Large Language Model

0 likes · 4 min read

DaTaobao Tech

Mar 31, 2025 · Artificial Intelligence

AI Audio Generation and Voice Synthesis Practices at Taobao

The article surveys Taobao’s AI‑generated audio pipeline, detailing eight technical papers on image‑to‑video, OpenAI o1, multimodal video, and large‑model voice synthesis, while highlighting advances like VALL‑E, CosyVoice, F5‑TTS, data‑cleaning methods, and e‑commerce applications such as voice‑cloned live streams, multilingual TTS, AI video‑audio integration, and audiobook production.

AI audio · Data cleaning · Large Language Model

0 likes · 11 min read

AI Frontier Lectures

Mar 31, 2025 · Artificial Intelligence

How Anthropic’s Path Tracing Reveals the Inner Workings of Claude 3.5 Haiku

Anthropic’s recent paper introduces a path‑tracing technique that uses cross‑layer transcoders and attribution graphs to sparsely visualize and analyze the decision‑making process of the Claude 3.5 Haiku large language model, demonstrating Pareto‑optimal improvements and a four‑stage reverse‑engineering framework while acknowledging current limitations.

Anthropic · Attribution Graph · Claude 3.5

0 likes · 14 min read

Alibaba Cloud Big Data AI Platform

Mar 31, 2025 · Artificial Intelligence

Unlock AI-Powered Data Processing with MaxFrame’s AI Function

This article introduces MaxFrame’s AI Function, a new feature built on MaxCompute that integrates large language models like Qwen 2.5 and DeepSeek‑R1‑Distill‑Qwen to simplify model deployment and enable scalable text classification, information extraction, summarization, translation, and other AI-driven data processing tasks on massive datasets.

AI Function · Large Language Model · MaxCompute

0 likes · 19 min read

Architects' Tech Alliance

Mar 28, 2025 · Artificial Intelligence

How DeepSeek Leverages Huawei Ascend to Boost AI Inference Efficiency

The report analyzes DeepSeek's latest V3 and R1 models, highlights their scaling‑law‑driven cost reductions, explains how Huawei Ascend optimizes inference by cutting KV‑Cache storage and improving compute efficiency, and surveys the model’s deployments across finance, government, manufacturing, and healthcare sectors.

AI efficiency · AI inference · DeepSeek

0 likes · 4 min read

21CTO

Mar 27, 2025 · Artificial Intelligence

Google Unveils Gemini 2.5: The Most Advanced Reasoning AI Yet

Google's Gemini 2.5, billed as its most intelligent AI model, introduces advanced reasoning capabilities that outperform rivals on benchmarks like LMArena and Humanity's Last Exam, excels at web and agent code generation, and is now available to premium users via AI Studio with a 1‑million token context window.

AI reasoning · Google Gemini · Large Language Model

0 likes · 4 min read

Sohu Tech Products

Mar 26, 2025 · Artificial Intelligence

How SpatialLM Turns 3D Point Clouds into Structured Scene Understanding

SpatialLM is a large language model designed for 3D spatial understanding that converts point‑cloud data from videos, RGB‑D images or LiDAR into structured scene descriptions, and this guide explains its architecture, model versions, repository links, and step‑by‑step deployment on Ubuntu with PyTorch.

3D point cloud · Large Language Model · PyTorch

0 likes · 7 min read

MaGe Linux Operations

Mar 26, 2025 · Artificial Intelligence

Why Qwen2.5‑VL‑32B Is the New AI Breakthrough for Vision and Math

Alibaba's newly released Qwen2.5‑VL‑32B multimodal model delivers state‑of‑the‑art visual and textual performance, offering human‑aligned responses, superior mathematical reasoning, fine‑grained image understanding, and efficient deployment features that make it a compelling tool for developers and AI researchers alike.

AI research · Large Language Model · Qwen2.5-VL-32B

0 likes · 9 min read

21CTO

Mar 25, 2025 · Artificial Intelligence

Which LLM Is Best for Coding? Speed, Hallucination, and Context Compared

This article breaks down major large language models, defining key comparison metrics such as speed, hallucination rate, and context window, then evaluates each model with benchmarks like HumanEval+, ChatBot Arena, and Aider to help you choose the most suitable LLM for your coding tasks.

AI · Benchmark · LLM

0 likes · 10 min read

Cognitive Technology Team

Mar 22, 2025 · Artificial Intelligence

Three Stages of Developing Large Language Models and Practical Guidance

The article outlines the three development phases of large language models—building, pre‑training, and fine‑tuning—describes usage options, highlights key factors such as data scale, architecture, training processes, and evaluation, and offers practical advice for cost‑effective development.

LLM · Large Language Model · Model Development

0 likes · 3 min read

Architect's Alchemy Furnace

Mar 19, 2025 · Artificial Intelligence

Choosing the Right Deployment Strategy for Large Language Models: QwQ‑32B vs DeepSeek‑R1

This article compares QwQ‑32B and DeepSeek‑R1 large language models across performance, technical breakthroughs, deployment costs, and open‑source ecosystems, then evaluates pure‑local, hybrid, and pure‑cloud deployment options, and finally provides practical guidelines for preparing knowledge‑base documents and indexing methods.

AI · Deployment · Knowledge Base

0 likes · 10 min read

JD Tech

Mar 19, 2025 · Artificial Intelligence

JD Retail's End‑to‑End AI Engine Compatible with GPU and Domestic NPU: Architecture, Optimization, and Real‑World Applications

This article details JD Retail's AI engine that seamlessly supports both GPU and domestic NPU hardware, describing its heterogeneous cluster architecture, unified training and inference APIs, performance optimizations, extensive model coverage, and multiple production use cases across e‑commerce, logistics, and intelligent assistance.

AI Engine · GPU · JD Retail

0 likes · 20 min read

Baidu Geek Talk

Mar 19, 2025 · Artificial Intelligence

Inside Baidu’s New Wenxin 4.5 & X1: Multimodal Breakthroughs and Tool‑Enabled AI

Baidu officially launched the Wenxin 4.5 and X1 large language models, showcasing native multimodal foundations, advanced attention masks, heterogeneous expert extensions, and tool‑calling capabilities, while offering low‑cost API access on the Qianfan platform and outlining the underlying technical innovations that drive their performance gains.

AI Platform · Baidu · Large Language Model

0 likes · 8 min read

Java Architecture Diary

Mar 19, 2025 · Artificial Intelligence

Unlocking Google’s Gemma 3: Multimodal Power, 128k Context & Local Deployment Guide

This article introduces Google’s open‑source Gemma 3 model, highlighting its multimodal capabilities, massive 128k token context window, multilingual support, and provides step‑by‑step instructions for installing Ollama, pulling the model, and running local tests with code examples.

AI model · Gemma 3 · Large Language Model

0 likes · 7 min read
