Tag

AI optimization

1 view collected around this technical thread.

Architect's Guide
May 13, 2025 · Artificial Intelligence

DeepSeek Model Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

This article provides a comprehensive overview of DeepSeek's model distillation technology, detailing its definition, key innovations, architecture, training methods, performance gains, and the remaining challenges such as the implicit performance ceiling and multimodal data distillation.

AI optimization · DeepSeek · Model Distillation
0 likes · 14 min read
JD Tech
May 6, 2025 · Artificial Intelligence

One4All Generative Recommendation Framework for CPS Advertising

This article reviews recent advances in applying large language models to CPS advertising recommendation, outlines business requirements and core technical challenges, proposes an extensible multi‑task generative framework with explicit intent perception and multi‑objective optimization, and presents offline and online performance gains along with future research directions.

AI optimization · CPS Advertising · LLM
0 likes · 13 min read
Beijing SF i-TECH City Technology Team
Apr 8, 2025 · Artificial Intelligence

Automatic Algorithm Design for Operations Optimization Using Large Language Models and Evolutionary Techniques

This document outlines how large language models can be combined with evolutionary algorithms such as genetic algorithms to automatically generate, evaluate, and iteratively improve operations‑optimization code for logistics, resource allocation, and staffing scenarios, reducing development cycles, enhancing adaptability, and achieving higher solution quality.

AI optimization · Large Models · automated-code-generation
0 likes · 21 min read
JD Tech Talk
Mar 24, 2025 · Artificial Intelligence

MaRCA: Multi‑Agent Reinforcement Learning Computation Allocation for Full‑Chain Ad Serving

This article presents MaRCA, a multi‑agent reinforcement learning framework that allocates computation resources across the full ad‑serving chain by modeling user value, compute consumption, and action rewards. It steers compute toward high‑quality traffic at a fine granularity and achieves significant business gains under strict latency constraints.

AI optimization · Load Balancing · ad serving
0 likes · 16 min read
58 Tech
Mar 11, 2025 · Artificial Intelligence

Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques

This article presents a comprehensive case study on how large language models are integrated into 58.com’s real‑estate recommendation platform, detailing challenges, data adaptation, prompt and parameter optimizations, embedding generation, conversational recommendation, and future directions for multimodal and generative recommendation systems.

AI optimization · Real Estate · Recommendation systems
0 likes · 14 min read
Bilibili Tech
Feb 25, 2025 · Artificial Intelligence

Design and Implementation of a Live Streaming Highlight System with AI Optimization

The article details a live‑streaming highlight system that integrates heterogeneous data sources and uses a three‑stage pipeline with MySQL/Redis storage. It applies sliding‑window interval optimization and AI‑driven title generation, scoring, and segment selection, all managed by a shared state machine, and outlines future stability and observability improvements.

AI optimization · Data Processing · Highlight System
0 likes · 22 min read
Top Architect
Feb 14, 2025 · Artificial Intelligence

DeepSeek Model Distillation: Principles, Innovations, Architecture, and Performance

This article provides an in‑depth overview of DeepSeek’s model distillation technology, covering its definition, core principles, innovative data‑model distillation integration, architecture design, training strategies, performance gains, and the challenges of scaling to multimodal data.

AI optimization · DeepSeek · Model Distillation
0 likes · 16 min read
DevOps
Feb 12, 2025 · Artificial Intelligence

A Comprehensive Guide to Prompt Engineering, RAG, and Optimization Techniques for Large Language Models

This article presents a systematic framework for crafting effective prompts, detailing the universal prompt template, role definition, task decomposition, RAG integration, few‑shot examples, memory handling, and parameter tuning to enhance large language model performance across diverse applications.

AI optimization · Prompt Templates · RAG
0 likes · 24 min read
IT Architects Alliance
Feb 10, 2025 · Artificial Intelligence

DeepSeek Distillation Technology: Principles, Innovations, Performance, and Future Outlook

The article explains DeepSeek's model distillation technique, covering its fundamental knowledge‑transfer principles, unique innovations such as data‑model fusion and task‑specific strategies, impressive benchmark results, practical applications in edge and online inference, existing challenges, and future research directions.

AI optimization · Edge Computing · Model Distillation
0 likes · 15 min read
DataFunSummit
Feb 4, 2025 · Artificial Intelligence

Training Optimization for Large-Scale Multimodal Models in Content Safety

This article examines the challenges of content safety, outlines the limitations of current task‑specific multimodal models, and proposes large‑model‑inspired training optimizations—including diversified data construction, automated annotation, parameter fine‑tuning, and multi‑task evaluation—to improve efficiency, accuracy, and scalability of multimodal AI systems.

AI optimization · Large Model Training · content safety
0 likes · 26 min read
DataFunTalk
Jan 4, 2024 · Artificial Intelligence

Using OpenLLM to Quickly Build and Deploy Large Language Model Applications

This presentation explains how OpenLLM, an open‑source LLM framework, together with BentoML, addresses the challenges of deploying large language models by offering model switching, memory optimizations, multi‑GPU support, observability, and easy containerized deployment for production AI applications.

AI optimization · BentoML · LLM deployment
0 likes · 18 min read
AntTech
Jan 3, 2024 · Cloud Native

Ant Group’s Green Computing Technology Recognized for Energy Efficiency and Carbon Reduction

The National Energy Center’s recent green computing evaluation highlighted Ant Group’s cloud‑native scheduling, AI‑driven prediction, and online‑offline workload co‑location technologies, which significantly improve data‑center energy efficiency, reduce carbon emissions, and show strong potential for adoption across major enterprises.

AI optimization · Ant Group · Green computing
0 likes · 4 min read
High Availability Architecture
Jun 15, 2023 · Artificial Intelligence

InferX Inference Framework: Challenges, Architecture, Optimizations, and Triton Integration

The article presents the background, challenges, and objectives of Bilibili's AI services, introduces the self‑developed InferX inference framework with its quantization and sparsity optimizations, details OCR‑specific enhancements, and describes how integrating InferX with NVIDIA Triton dramatically improves throughput, latency, and GPU utilization.

AI optimization · CUDA · Inference
0 likes · 10 min read
Kuaishou Large Model
Mar 31, 2023 · Artificial Intelligence

How Kuaishou Elevates Video Quality and AI Performance at NVIDIA GTC 2023

At NVIDIA GTC 2023, Kuaishou engineers unveiled solutions spanning video quality assessment and enhancement, 3D digital‑human live streaming, a custom TensorRT‑based performance framework, large‑scale recommendation model acceleration, and multimodal massive‑model deployment for short‑video scenarios.

AI optimization · Digital Human · Multimodal Models
0 likes · 9 min read
Kuaishou Audio & Video Technology
Mar 30, 2023 · Artificial Intelligence

How Kuaishou Elevates Short‑Video Quality and AI Performance at NVIDIA GTC 2023

At NVIDIA GTC 2023, Kuaishou engineers presented cutting‑edge solutions ranging from video quality assessment and enhancement to digital‑human live streaming, custom performance‑optimization frameworks, large‑scale recommendation model acceleration, and multimodal massive‑model deployment for short‑video applications.

AI optimization · Digital Human · large recommendation models
0 likes · 9 min read
DataFunTalk
Feb 20, 2023 · Artificial Intelligence

Low‑Cost Open‑Source Replication of ChatGPT Using Colossal‑AI

This article explains how researchers reproduced the full ChatGPT training pipeline—including supervised fine‑tuning, reward‑model training, and RLHF—using the open‑source Colossal‑AI system, dramatically reducing GPU memory and hardware requirements while providing ready‑to‑run code and performance benchmarks.

AI optimization · ChatGPT · Colossal-AI
0 likes · 10 min read
Architects' Tech Alliance
Sep 23, 2022 · Databases

Analysis of Chinese Database Product Strategies and Emerging Trends

This article summarizes recent Chinese database product strategy reports, outlining database definitions, management systems, design processes, product classifications, architectural layers, HTAP technology, compression methods, storage index structures, intelligent autonomous optimization, and deployment models, highlighting trends and future directions in the database industry.

AI optimization · Database Architecture · HTAP
0 likes · 8 min read
Tencent Cloud Developer
Apr 8, 2022 · Databases

Tencent Cloud Native Database AI Autonomy: SIGMOD Research and Intelligent Tuning System

Tencent Cloud’s native database team published SIGMOD research on embedding AI into MySQL to create an autonomous “database brain.” The system uses deep reinforcement learning, genetic pre‑heating, and a closed‑loop learner/actor architecture to automatically observe, analyze, and tune diverse workloads. It delivers rapid performance gains, anomaly detection, and self‑optimizing features while addressing adaptability, stability, and interpretability challenges.

AI optimization · CDBTune · Database Autonomy
0 likes · 8 min read
DataFunTalk
Apr 1, 2022 · Operations

Integrated Digital Supply Chain: JD Logistics' Intelligent Planning, Algorithm Platform, and Digital Twin Practices

This article explores JD Logistics' integrated digital supply chain, detailing its evolution, the construction of an algorithm middle‑platform, engineering platforms, digital twin system, real‑world case studies, and future talent and ecosystem directions, illustrating how AI and big‑data technologies drive end‑to‑end logistics optimization.

AI optimization · Algorithm Platform · Big Data
0 likes · 16 min read
Ctrip Technology
Sep 16, 2021 · Artificial Intelligence

Automated AI Model Optimization Platform for Travel Services

This article describes the design, automated workflow, functional modules, and performance results of a comprehensive AI model optimization platform built for Ctrip's travel business, covering operator libraries, graph optimization, model compression techniques (distillation, quantization, and pruning), and deployment integration.

AI optimization · AutoML · Inference Acceleration
0 likes · 16 min read