Tag

Large Models


DataFunSummit
Jun 12, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

This article details Alibaba Cloud AI Search’s development journey, covering its dual product lines, the evolution of Agentic RAG technology, multi‑agent architectures, vector retrieval breakthroughs, GPU‑accelerated indexing, NL2SQL capabilities, deployment models, and future directions for AI‑driven search solutions.

AI Search · GPU Acceleration · Large Models
0 likes · 33 min read
DataFunSummit
Jun 2, 2025 · Artificial Intelligence

Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs

This article explains how the rapid development of large language models and knowledge graph technologies creates new opportunities for enterprise knowledge management, outlines the challenges of massive unstructured data, describes the architecture and core data flow of a corporate knowledge brain, and showcases key technologies and real‑world applications.

AI architecture · Large Models · data integration
0 likes · 13 min read
JD Retail Technology
May 7, 2025 · Artificial Intelligence

Solving Technical Challenges with Large AI Models at JD Retail: Reward Modeling, Query Expansion, and Model Pruning

JD Retail’s engineering team tackles hard AI problems by replacing a monolithic reward model with specialized small models for ad‑image generation, deploying an LLM‑driven query‑expansion pipeline that lifts conversion rates, and pruning text‑to‑image transformers using FFT and RDP to boost throughput by 40% without loss, while building comprehensive evaluation tools and a semantic smart‑assistant.

AI · Large Models · Reward Modeling
0 likes · 14 min read
DevOps
Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agents · Artificial Intelligence · Large Models
0 likes · 14 min read
JD Retail Technology
Apr 22, 2025 · Artificial Intelligence

Generative Large‑Model Architecture for JD Advertising: Practices, Challenges, and Optimization

JD’s advertising platform replaces rule‑based recall with a generative large‑model pipeline that unifies e‑commerce knowledge, multimodal user intent, and semantic IDs across recall, coarse‑ranking, fine‑ranking and creative optimization, while meeting sub‑100 ms latency and sub‑¥1‑per‑million‑token cost through quantization, parallelism, caching, and joint generative‑discriminative inference, delivering double‑digit performance gains and paving the way for domain‑specific foundation models.

Large Models · advertising · distributed systems
0 likes · 20 min read
DataFunSummit
Apr 19, 2025 · Artificial Intelligence

Enterprise Knowledge Management and Knowledge Platform Development in the Age of Large AI Models

This article summarizes a recent sharing session led by Wang Chaolun of the China Academy of Information and Communications Technology, covering the department overview, enterprise knowledge management challenges, knowledge platform trends, standardization efforts, and the impact of large AI models on knowledge services.

AI · Large Models · Standards
0 likes · 15 min read
DataFunSummit
Apr 16, 2025 · Artificial Intelligence

ChatBI: NetEase’s AI‑Powered Business Intelligence Platform – Architecture, Capabilities, and Real‑World Applications

This article introduces ChatBI, NetEase’s AI‑driven BI solution, detailing its product architecture, the opportunities and challenges AI brings to data analysis, the underlying NL2SQL model, performance‑optimizing techniques such as materialized views, open integration capabilities, and several enterprise deployment cases.

AI · Business Intelligence · Chatbot
0 likes · 21 min read
Beijing SF i-TECH City Technology Team
Apr 8, 2025 · Artificial Intelligence

Automatic Algorithm Design for Operations Optimization Using Large Language Models and Evolutionary Techniques

This document outlines how large language models can be combined with evolutionary algorithms such as genetic algorithms to automatically generate, evaluate, and iteratively improve operations‑optimization code for logistics, resource allocation, and staffing scenarios, reducing development cycles, enhancing adaptability, and achieving higher solution quality.

AI optimization · Large Models · automated-code-generation
0 likes · 21 min read
Baidu Tech Salon
Apr 2, 2025 · Artificial Intelligence

PaddlePaddle Framework 3.0 Released: Five Core Innovations for Large Models and Scientific Computing

PaddlePaddle 3.0, launched on April 1, 2025, introduces five core innovations—including dynamic‑static unified automatic parallelism, a training‑inference integrated PIR, high‑order automatic differentiation for scientific computing, a one‑stage CINN compiler, and heterogeneous multi‑chip adaptation—that dramatically reduce distributed‑training code, boost performance up to four‑fold, and extend the framework to aerospace, automotive, meteorology and life‑science applications while remaining fully compatible with the 2.0 API.

Large Models · Neural Network Compiler · PaddlePaddle
0 likes · 21 min read
DataFunTalk
Apr 2, 2025 · Artificial Intelligence

Trends, Applications, and Future Directions of Large Models and Inference Acceleration

This article examines the current state and future prospects of large AI models and inference acceleration, covering technology trends, diverse application scenarios from research to industry, and the challenges and opportunities that lie ahead for intelligent data governance, multimodal agents, and AGI.

AGI · AI · Inference Acceleration
0 likes · 11 min read
AntTech
Apr 1, 2025 · Artificial Intelligence

AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance

The Ant Research Institute and Tsinghua University's Wu Yi team released AReaL‑boba 0.2, an open‑source reinforcement‑learning training framework that dramatically speeds up large‑scale model training, achieves state‑of‑the‑art mathematical reasoning results, and provides all code, data, and scripts for reproducible research.

AI · Large Models · performance
0 likes · 5 min read
Nightwalker Tech
Mar 15, 2025 · Artificial Intelligence

Guide to Accessing International AI Large Models via Aggregation Tools, APIs, and Python Code

This article introduces major international and domestic AI large models, recommends desktop aggregation tools and APIs such as POE, Monica, and OpenRouter, and provides complete Python code examples for synchronous and streaming text and multimodal conversations, along with additional API and compute‑rental resources.

AI · API · Large Models
0 likes · 11 min read
DevOps
Mar 13, 2025 · Artificial Intelligence

Large Model Commercialization Reshapes Cloud AI Competition: Capital Spending, Strategic Paths, and Multi‑Model Ecosystems

The article analyzes how the commercialization of large AI models is redefining cloud providers' competitive dynamics, highlighting Amazon Bedrock's DeepSeek‑R1 launch, IDC forecasts on model usage, major vendors' capital expenditures, and the shift toward flexible, cost‑effective multi‑model ecosystems for enterprise AI.

AI · Capital Expenditure · Cloud Computing
0 likes · 14 min read
JD Tech
Mar 12, 2025 · Artificial Intelligence

From Low‑Resource Large Model Training to Dynamic Margin Selection: A JD Engineer’s Journey

The article recounts a JD retail engineer’s rapid growth through tackling low‑resource large‑model training, developing a margin‑based dynamic data selection method (DynaMS) that earned an ICLR paper, and sharing practical insights on aligning business needs with cutting‑edge AI research.

AI research · ICLR · Large Models
0 likes · 11 min read
Cognitive Technology Team
Mar 9, 2025 · Artificial Intelligence

AGI Learning Framework and Practical AI Application Guide

This article outlines a systematic AGI learning framework across five capability levels, recommends key papers and books, and provides practical steps for engineers to combine study with hands‑on large‑model projects, identify suitable use‑cases, and stay competitive in the evolving AI landscape.

AGI · AI applications · Large Models
0 likes · 7 min read
DataFunTalk
Mar 8, 2025 · Artificial Intelligence

DeepSeek Reflection Wave and the Shifting Landscape of AGI Development in China

The article analyzes how DeepSeek's rapid rise has triggered a strategic rethink across Chinese AI startups and tech giants, prompting a shift from product‑centric growth to deep‑model research, while examining the real barriers to AGI and the importance of time‑advantage in the large‑model race.

AGI · AI · Chinese tech
0 likes · 12 min read
AntData
Mar 7, 2025 · Artificial Intelligence

Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training

This article examines the challenges of AI storage for massive models, describes Ant Group's multi‑cloud, high‑availability PCache architecture, and details its GPU‑mixed deployment, metadata services, data‑link optimizations, and performance results that enable petabyte‑scale training with low cost and high stability.

AI Storage · Large Models · Multi-Cloud
0 likes · 19 min read
DataFunTalk
Mar 6, 2025 · Artificial Intelligence

AI Large Model Applications in Chinese Regional Banks: Cases, Challenges, and Strategies

Chinese regional banks are leveraging AI large models across fourteen use cases—from intelligent customer service and risk control to credit approval and regulatory compliance—highlighting operational efficiencies, data-driven credit assessments, and challenges such as compute costs, data sovereignty, and talent gaps, while proposing solutions like elastic compute pools and privacy-preserving federated learning.

AI · FinTech · Large Models
0 likes · 12 min read
DaTaobao Tech
Mar 5, 2025 · Artificial Intelligence

Multimodal Large‑Model Cover Generation AI Agent for Taobao Video and Live Streams

Taobao’s new multimodal AI Agent automatically creates high‑quality static and dynamic video covers by planning tasks, consulting a memory of quality criteria, executing frame selection with ReKV streaming and dual‑stage evaluation, generating marketing copy via fine‑tuned Qwen2.5‑7B, and refining layout, resulting in significantly higher click‑through rates, lower latency, and reduced manual effort.

AI · Content AI · Large Models
0 likes · 17 min read
DataFunSummit
Mar 3, 2025 · Artificial Intelligence

DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training

The DeepSeek open‑source week introduced seven breakthrough technologies—FlashMLA, DeepGEMM, DeepEP, DualPipe, EPLB, 3FS, and Smallpond—that together overhaul data flow, algorithmic complexity, hardware utilization, MoE communication, and resource balancing, dramatically improving large‑model training efficiency and lowering entry barriers for the AI industry.

AI hardware · DeepSeek · Large Models
0 likes · 17 min read