Tag

large model

1 views collected around this technical thread.

DeWu Technology
DeWu Technology
May 9, 2025 · Artificial Intelligence

Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu

Meng, a former Tencent and Alibaba engineer, led Dewu’s one‑stop large‑model training and inference platform, cutting integration costs, creating a shared GPU pool and CI/CD pipeline, building a Milvus vector‑database, and driving self‑directed learning that boosted business value, user experience, and set a roadmap for future RAG and cloud‑native optimizations.

AI PlatformCloud NativePerformance Optimization
0 likes · 18 min read
Growth Story of a Technical Lead: Building a One‑Stop Large‑Model Training and Inference Platform at Dewu
Baidu Geek Talk
Baidu Geek Talk
Apr 9, 2025 · Artificial Intelligence

Baidu's Wenxin X1 Large Model Officially Launches on Qianfan Platform

On April 2, Baidu released its Wenxin X1 large model on the Qianfan platform, offering enterprise users and developers a multimodal, deep‑thinking AI with superior math, coding, and reasoning scores, low token‑price API access, batch inference, one‑click distillation, and rapid RAG/Agent application building.

AIAPI ServiceBaidu
0 likes · 4 min read
Baidu's Wenxin X1 Large Model Officially Launches on Qianfan Platform
DataFunSummit
DataFunSummit
Dec 24, 2024 · Artificial Intelligence

Considerations and Practices for Domesticating Large‑Model Inference Engines

This article examines the importance of domestic large‑model inference engines, compares Chinese and international chips, evaluates four architectural approaches, discusses practical challenges such as performance loss and model support, and outlines future expectations for high‑performance, heterogeneous‑chip inference solutions.

AI infrastructureDomestic ChipInference Engine
0 likes · 9 min read
Considerations and Practices for Domesticating Large‑Model Inference Engines
DataFunSummit
DataFunSummit
Dec 23, 2024 · Artificial Intelligence

Huolala's Large Model Evaluation Framework (LaLaEval) and Application Practices

This article presents Huolala's comprehensive LaLaEval framework for evaluating large language models, detailing the challenges of model deployment, the five‑step assessment process, two real‑world case studies in freight and driver invitation, and future directions toward more automated, product‑driven evaluation.

AILogisticsevaluation
0 likes · 24 min read
Huolala's Large Model Evaluation Framework (LaLaEval) and Application Practices
AntTech
AntTech
Nov 13, 2024 · Information Security

Ant Group’s Large‑Model‑Based Security Parallel Plane and Intelligent Threat Detection System

The article details Ant Group’s AI‑driven security parallel plane and intelligent threat detection system, its DKCF‑based architecture, key modules for data correlation, unknown threat discovery, alarm reduction, and knowledge‑graph integration, and its recognition in the 2024 AI Pioneer Case Collection.

AI securityAnt GroupDKCF
0 likes · 5 min read
Ant Group’s Large‑Model‑Based Security Parallel Plane and Intelligent Threat Detection System
DataFunSummit
DataFunSummit
Nov 4, 2024 · Artificial Intelligence

Performance Optimization Techniques for Large Model Inference Frameworks

This article outlines four key optimization areas for large model inference frameworks—quantization, speculative sampling, TTFT/TPOT improvements, and communication optimization—detailing specific techniques, experimental results, and practical benefits such as reduced memory usage, lower latency, and higher throughput.

AIPerformanceinference optimization
0 likes · 12 min read
Performance Optimization Techniques for Large Model Inference Frameworks
AntTech
AntTech
Jul 9, 2024 · Artificial Intelligence

2024 Large Model Security Practice Whitepaper Unveiled at the World AI Conference

The jointly authored 2024 Large Model Security Practice whitepaper, released at the World AI Conference, outlines a comprehensive safety framework covering security, reliability, and controllability, presents industry case studies, and proposes a five‑dimensional governance model to guide high‑quality development of large AI models.

AI safetyWhitepaperindustry practice
0 likes · 7 min read
2024 Large Model Security Practice Whitepaper Unveiled at the World AI Conference
DataFunTalk
DataFunTalk
Jul 7, 2024 · Artificial Intelligence

Large Model Application Development: Architecture, Lifecycle, and Prompt Engineering

This article presents a comprehensive knowledge map for developing large‑model applications, covering a four‑layer technical architecture, the full development lifecycle, core elements such as prompt engineering and model fine‑tuning, evaluation methods, and practical case studies, offering guidance for both enterprises and startups.

AI Application DevelopmentPrompt Engineeringevaluation
0 likes · 15 min read
Large Model Application Development: Architecture, Lifecycle, and Prompt Engineering
DataFunTalk
DataFunTalk
Jun 30, 2024 · Artificial Intelligence

Application and Exploration of Large Audio Representation Models for Cold-Start Songs in QQ Music

This article presents a technical overview of how large‑scale audio representation models are fine‑tuned with I2I co‑occurrence and U2I interaction data to improve cold‑start song recommendation on QQ Music, describing the challenges, methodology, deployment scenarios, and experimental results.

I2I fine-tuningU2I fine-tuningaudio representation
0 likes · 17 min read
Application and Exploration of Large Audio Representation Models for Cold-Start Songs in QQ Music
AntTech
AntTech
Jun 21, 2024 · Artificial Intelligence

Ant Group’s Security Parallel Aspect Fusion AI: A Case Study Selected at the 2024 World Intelligent Industry Expo

The article details Ant Group’s security parallel aspect fusion AI solution, selected as an exemplary case at the 2024 World Intelligent Industry Expo, explaining its multi‑dimensional data collection, large‑model integration, baseline construction, knowledge‑graph generation, and superior threat‑detection performance.

AIAnt GroupCase Study
0 likes · 4 min read
Ant Group’s Security Parallel Aspect Fusion AI: A Case Study Selected at the 2024 World Intelligent Industry Expo
Baidu Tech Salon
Baidu Tech Salon
May 21, 2024 · Artificial Intelligence

Baidu Comate Code Knowledge Enhancement 2.0: First Domestic Intelligent Code Assistant with Real-time Retrieval

At the iTechClub summit, Baidu unveiled Comate Code Knowledge Enhancement 2.0—the nation’s first AI‑driven code assistant with real‑time web retrieval, over 10,000 built‑in GitHub repositories, API‑call generation, test case creation, and deep private‑doc understanding, now producing 27% of Baidu’s daily new code and topping VSCode and JetBrains plugin ratings.

AI code assistantBaiduComate
0 likes · 5 min read
Baidu Comate Code Knowledge Enhancement 2.0: First Domestic Intelligent Code Assistant with Real-time Retrieval
Efficient Ops
Efficient Ops
May 14, 2024 · Artificial Intelligence

How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations

This article explores why large‑model Agent technology is essential for AIOps, explains single‑ and multi‑Agent architectures, memory and tool integration, and demonstrates practical applications such as anomaly detection, fault diagnosis, automated remediation, ChatOps, and future directions for intelligent, autonomous operations.

AI AgentsAIOpsLLM
0 likes · 14 min read
How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations
DataFunTalk
DataFunTalk
Mar 11, 2024 · Artificial Intelligence

Challenges and Future Directions for Knowledge Graph Construction in the Era of Large Models

The article examines the high construction cost and lack of unified standards in knowledge graphs, explains why large language models cannot fully solve core issues such as hallucination and multi‑hop reasoning, and argues that a new, unified semantic framework integrating large models is essential for future progress.

AIdata integrationgraph database
0 likes · 5 min read
Challenges and Future Directions for Knowledge Graph Construction in the Era of Large Models
Tencent Tech
Tencent Tech
Oct 26, 2023 · Artificial Intelligence

Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips

This guide introduces Tencent Hunyuan's upgraded text‑to‑image model, explains its technical innovations, provides detailed prompt engineering advice, showcases example prompts and generated images across various styles, and highlights real‑world applications and performance metrics for developers and creators.

AI generationPrompt EngineeringTencent Hunyuan
0 likes · 12 min read
Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips
ByteDance Cloud Native
ByteDance Cloud Native
Jun 13, 2023 · Artificial Intelligence

How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference

This article explains the challenges of large‑model offline (batch) inference, such as GPU memory limits and distributed scheduling, and shows how Ray’s cloud‑native architecture, model partitioning, and Ray Datasets can be used to build efficient, elastic inference frameworks deployed with KubeRay.

Cloud NativeGPU memoryRay
0 likes · 18 min read
How Ray and Cloud‑Native Tech Supercharge Large‑Model Offline Inference
DataFunSummit
DataFunSummit
Dec 19, 2022 · Artificial Intelligence

Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications

This article introduces the rapid development of multimodal digital humans powered by large AI models, covering their background, current challenges, NeRF‑GAN based modeling methods, multimodal dialogue capabilities, and real‑world application cases such as virtual assistants, tourism guides, and sign‑language avatars.

AIGCDigital HumanHuman-Computer Interaction
0 likes · 14 min read
Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications