Tag

AI Engineering

1 views collected around this technical thread.

Java Architecture Diary
Java Architecture Diary
May 19, 2025 · Artificial Intelligence

How Ollama 0.7 Unlocks Local Multimodal AI with One Command

Ollama 0.7 introduces a fully re‑engineered core that brings seamless multimodal model support, lists top visual models, showcases OCR and image analysis capabilities, explains technical breakthroughs, and provides a quick three‑step guide to deploy powerful local AI vision.

AI EngineeringAI modelsImage Recognition
0 likes · 7 min read
How Ollama 0.7 Unlocks Local Multimodal AI with One Command
DevOps
DevOps
Apr 22, 2025 · Artificial Intelligence

How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph

This article critically examines popular agent frameworks, compares OpenAI and Anthropic definitions, highlights the core difficulty of maintaining proper context for reliable agents, and presents LangGraph’s declarative and imperative features along with practical guidance for building production‑grade agent systems.

AI EngineeringLangGraphagent frameworks
0 likes · 24 min read
How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph
DaTaobao Tech
DaTaobao Tech
Feb 21, 2025 · Artificial Intelligence

AI-Powered Face Swapping for the Spring Festival Gala: System Design and Deployment

The paper details the design and deployment of an AI‑driven face‑swap platform for the 2025 CCTV Spring Festival Gala, featuring a dual‑model SDXL pipeline with ControlNet and LoRA fine‑tuning, optimized preprocessing and GPU‑specific acceleration to achieve sub‑3‑second latency at over 10 k QPS, supporting scaling, throttling, and multi‑region load balancing, and ultimately serving ten million users and generating hundreds of millions of personalized gala images.

AI EngineeringAIGCSpring Festival Gala
0 likes · 28 min read
AI-Powered Face Swapping for the Spring Festival Gala: System Design and Deployment
Zhihu Tech Column
Zhihu Tech Column
Dec 9, 2024 · Artificial Intelligence

Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System

The article details Zhihu's technical salon on large‑model engineering, covering the RAG‑based Zhihu Direct Answer system, the open‑source ZhiLight inference framework, prompt engineering, agent research, and future plans for integrating AI into product and community workflows.

AI EngineeringInference FrameworkPrompt Engineering
0 likes · 8 min read
Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System
JD Tech
JD Tech
May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI AgentsAI EngineeringLLM
0 likes · 41 min read
Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications
DevOps
DevOps
Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringEnterprise AIFine-tuning
0 likes · 20 min read
Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning
HelloTech
HelloTech
Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringAgentsJavaScript
0 likes · 19 min read
An Overview of LangChain: Architecture, Core Components, and Code Examples
DataFunTalk
DataFunTalk
Mar 20, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Large Language Model Training

The article outlines the resource and efficiency challenges of scaling large language models, explains data and model parallelism strategies, and details practical I/O, communication, and stability optimizations—including high‑availability storage, RDMA networking, and fault‑tolerance measures—to improve training throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization
0 likes · 13 min read
Challenges and Optimization Techniques for Large Language Model Training
Architecture & Thinking
Architecture & Thinking
Jan 14, 2024 · Artificial Intelligence

How Baidu Scales Content Understanding to Trillions of Pages with AI Engineering

This article explains how Baidu processes internet‑scale content by applying deep AI‑driven understanding, detailing cost‑optimization, efficiency improvements, model‑service frameworks, resource‑scheduling systems, and batch‑compute platforms that together enable trillion‑level indexing and feature extraction.

AI EngineeringHTAP storageResource Scheduling
0 likes · 16 min read
How Baidu Scales Content Understanding to Trillions of Pages with AI Engineering
DataFunSummit
DataFunSummit
Oct 7, 2023 · Artificial Intelligence

MLOps Implementation in Network Intelligence: Jiutian Platform Overview

This article presents the Jiutian Network Intelligence platform’s MLOps implementation at China Mobile, detailing its AI engineering workflow, platform functional and technical architecture, technology selections, model deployment, monitoring, and operational challenges, and shares insights on scaling AI services across 31 provinces.

AI EngineeringNetwork Intelligencecloud-native
0 likes · 20 min read
MLOps Implementation in Network Intelligence: Jiutian Platform Overview
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 25, 2023 · Artificial Intelligence

DataFunSummit 2023: Recommendation Systems Online Summit

The DataFunSummit 2023 online summit (August 26‑27) will explore eight recommendation‑system topics—including core and engineering architecture, model training/inference, large models, graphs, cold start, and multi‑task scenarios—featuring Xiaohongshu leaders who will present on graph‑based business architecture, integrated training‑inference pipelines, and user/content cold‑start strategies.

AI EngineeringCold StartRecommendation systems
0 likes · 6 min read
DataFunSummit 2023: Recommendation Systems Online Summit
DataFunSummit
DataFunSummit
May 5, 2023 · Artificial Intelligence

Advances in Virtual Humans, Multimodal Technology, and General AI – Insights from OPPO

The article presents OPPO's latest research on virtual human audio‑lip and RGB driving, multimodal learning breakthroughs such as CETNETs and cross‑modal matching, and a reflective discussion on the challenges and future directions of general artificial intelligence, highlighting the interconnections among these three domains.

AI Engineeringaudio2lipgeneral AI
0 likes · 9 min read
Advances in Virtual Humans, Multimodal Technology, and General AI – Insights from OPPO
Efficient Ops
Efficient Ops
Jan 16, 2023 · Artificial Intelligence

How MLOps Is Transforming AI Production in China: Trends, Tools, and Standards

This report examines how MLOps is accelerating AI production in China, highlighting industry adoption across sectors, the booming tool ecosystem, the rise of feature platforms, enhanced observability, performance needs for large models, AI asset management, and the emerging national standards and evaluation results.

AI EngineeringAI standardsFeatureOps
0 likes · 8 min read
How MLOps Is Transforming AI Production in China: Trends, Tools, and Standards
GuanYuan Data Tech Team
GuanYuan Data Tech Team
Dec 1, 2022 · Artificial Intelligence

Why MLOps Is the Key to Scalable AI Projects

This article explains the concept, significance, and practical case studies of MLOps—showing how integrating DevOps principles with data and machine learning creates reliable, automated pipelines for data quality, model monitoring, error analysis, and continuous integration, ultimately accelerating AI delivery.

AI EngineeringContinuous IntegrationMachine Learning Operations
0 likes · 15 min read
Why MLOps Is the Key to Scalable AI Projects
Efficient Ops
Efficient Ops
Nov 29, 2022 · Artificial Intelligence

How MLOps is Revolutionizing AI Development: Baidu’s Flagship Platform Insights

This article examines how China’s AI strategy and newly released MLOps standards are driving AI engineering, featuring Baidu Cloud’s flagship-level platform, its evaluation results, practical benefits, challenges, and future directions for MLOps in enterprise AI development.

AI EngineeringAI standardsBaidu
0 likes · 10 min read
How MLOps is Revolutionizing AI Development: Baidu’s Flagship Platform Insights
Efficient Ops
Efficient Ops
Nov 7, 2022 · Artificial Intelligence

Unlocking AI Project Success with the New MLOps Maturity Assessment

This article outlines the background, standards, evaluation items, process, and registration details of a newly launched MLOps development management maturity assessment designed to accelerate AI model delivery and improve operational efficiency across teams.

AI EngineeringAI operationsMaturity Assessment
0 likes · 6 min read
Unlocking AI Project Success with the New MLOps Maturity Assessment
Efficient Ops
Efficient Ops
Oct 26, 2022 · Artificial Intelligence

Unveiling China’s AI Model Delivery Standard: Boosting MLOps and AI Engineering

China’s 14th Five-Year Plan and 2035 Vision prioritize AI, prompting a shift from proof‑of‑concept to product deployment; the newly released Model Delivery standard, part of the Model/MLOps maturity model, defines five maturity levels and a reusable pipeline to boost AI engineering across industries.

AIAI EngineeringChina
0 likes · 5 min read
Unveiling China’s AI Model Delivery Standard: Boosting MLOps and AI Engineering
Efficient Ops
Efficient Ops
Aug 2, 2022 · Artificial Intelligence

How MLOps Boosted AI Service Delivery at China Agricultural Bank

In a detailed interview, the Agricultural Bank of China's R&D center explains how its AI service platform achieved a Level‑3 leading rating in the national MLOps maturity assessment, and how MLOps practices have accelerated model development, improved quality, reduced risk, and driven scalable AI adoption across financial services.

AI EngineeringBanking TechnologyFinancial AI
0 likes · 10 min read
How MLOps Boosted AI Service Delivery at China Agricultural Bank
Efficient Ops
Efficient Ops
Apr 24, 2022 · Artificial Intelligence

How ModelOps and MLOps Accelerate AI Project Development

ModelOps and MLOps are transforming AI engineering by introducing continuous training, integration, and deployment, which streamline development cycles, standardize model management, and enable ongoing monitoring to enhance inference accuracy and maximize the business value generated by AI models.

AI EngineeringContinuous DeploymentModel Management
0 likes · 1 min read
How ModelOps and MLOps Accelerate AI Project Development