Tagged articles

118 articles

Page 2 of 2

Dec 12, 2024 · Artificial Intelligence

Airbnb Automation Platform v2: Enabling LLM‑Driven Conversational AI

Airbnb’s Automation Platform v2 replaces the rigid, workflow‑driven architecture of v1 with an LLM‑centric design that orchestrates context gathering, chain‑of‑thought reasoning, tool execution, and guardrails, enabling more natural, scalable, and safe conversational AI while preserving the reliability of traditional workflows.

AI ArchitectureAirbnbConversational AI

0 likes · 11 min read

Airbnb Automation Platform v2: Enabling LLM‑Driven Conversational AI

DataFunSummit

Nov 27, 2024 · Artificial Intelligence

Applying Large Language Models in Data Management and Risk Control at Ping An One Wallet

This presentation details how Ping An One Wallet leverages large language models across five key areas—current application status, data management, risk control, technical architecture, and a Q&A session—highlighting strategies such as vectorized rule storage, prompt engineering, RAG enhancements, and workflow agents to improve efficiency and accuracy in data governance and fraud detection.

AI ArchitectureData GovernanceVector Database

0 likes · 16 min read

Applying Large Language Models in Data Management and Risk Control at Ping An One Wallet

Tencent Docs Tech Team

Nov 13, 2024 · Artificial Intelligence

Technical Architecture and Practices of the AI Document Assistant

This article explores the challenges large language models bring to efficiency tools, outlines the AI document assistant's technical thinking and architecture, and details both application‑side and model‑side practices such as retrieval‑augmented generation, intent recognition, and code‑driven table handling, concluding with key lessons.

AIAI ArchitectureDocument Automation

0 likes · 16 min read

Technical Architecture and Practices of the AI Document Assistant

phodal

Oct 20, 2024 · Backend Development

Why Streaming BFF Is the Missing Glue for AI‑Native Apps

The article proposes a Streaming Backend‑for‑Frontend (BFF) layer to unify heterogeneous AI agents, handle Server‑Sent Events streams, and resolve interface inconsistencies, offering a practical architecture for generative‑AI‑native systems across IDEs, DevOps, and team‑AI scenarios.

AI ArchitectureBackend DevelopmentServer-Sent Events

0 likes · 13 min read

Why Streaming BFF Is the Missing Glue for AI‑Native Apps

JavaEdge

Jun 23, 2024 · Artificial Intelligence

Mapping the Generative AI Landscape: From Infrastructure to Applications

This article provides a comprehensive overview of the generative AI industry, detailing its upstream foundation layer, midstream large‑model and tool layers, downstream application scenarios, and an extensive glossary of models, techniques, platforms, and concepts.

AI ArchitectureIndustry OverviewMachine Learning

0 likes · 12 min read

Mapping the Generative AI Landscape: From Infrastructure to Applications

Baobao Algorithm Notes

May 9, 2024 · Artificial Intelligence

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

This article provides an in‑depth technical analysis of Deepseek‑V2, covering its 236B parameter size, Multi‑Head Latent Attention optimization that reduces KV‑cache memory, architectural details, training pipelines, infrastructure choices, and performance results on benchmarks such as MMLU and instruction following.

AI ArchitectureDeepSeekLarge Language Model

0 likes · 17 min read

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

Architect

Apr 16, 2024 · Artificial Intelligence

Unraveling Sora: How OpenAI Might Build a 60‑Second Video Generator

This article dissects the possible architecture of OpenAI's Sora video model, tracing its visual encoder‑decoder, Spacetime Latent Patch, transformer‑based diffusion backbone, long‑time consistency strategies, and training pipeline, while comparing alternatives such as MAGVIT‑v2, TECO, NaViT, and FDM to reveal why each design choice may have been made.

AI ArchitectureLatent DiffusionSora

0 likes · 51 min read

Unraveling Sora: How OpenAI Might Build a 60‑Second Video Generator

DataFunTalk

Mar 21, 2024 · Artificial Intelligence

A Detailed Technical Analysis of Sora: Architecture, Key Components, and Potential Implementation

This article provides a comprehensive, easy‑to‑understand breakdown of Sora’s possible architecture—including its visual encoder‑decoder, Spacetime Latent Patch, transformer‑based diffusion model, long‑time consistency strategies, training techniques, and how it supports variable resolution and duration video generation.

AI ArchitectureSoraSpacetime Patch

0 likes · 49 min read

A Detailed Technical Analysis of Sora: Architecture, Key Components, and Potential Implementation

Ops Development & AI Practice

Mar 17, 2024 · Artificial Intelligence

Why the Transformer Model Revolutionized AI and How It Works

This article explains the Transformer architecture, its self‑attention mechanism, encoder‑decoder design, and the profound impact it has had on natural language processing, computer vision, and large‑scale language models like GPT.

AI ArchitectureNLPSelf-Attention

0 likes · 6 min read

Why the Transformer Model Revolutionized AI and How It Works

JD Retail Technology

Feb 28, 2024 · Artificial Intelligence

Edge AI at JD Retail: Architecture, Challenges, and Business Practices

This article details JD Retail's edge AI (on‑device intelligence) platform, covering its definition, performance and security challenges, three‑layer cloud‑edge‑device architecture, key components such as high‑performance inference engine, data pipeline, Python VM container, and real‑world applications in traffic distribution and image recognition.

AI ArchitectureEdge AIJD Retail

0 likes · 15 min read

Edge AI at JD Retail: Architecture, Challenges, and Business Practices

AI Large Model Application Practice

Dec 28, 2023 · Artificial Intelligence

How AI Agents Can Transform Enterprise Operations and Architecture

The article examines the rise of AI Agents as a bridge to AGI, analyzes their value, application domains, and user groups in enterprises, and proposes a layered architecture with model, data, ops, and agent components to guide practical implementation and integration.

AI AgentAI ArchitectureAutomation

0 likes · 14 min read

How AI Agents Can Transform Enterprise Operations and Architecture

phodal

Sep 3, 2023 · Artificial Intelligence

Engineering LLM Applications: Architecture, Prompt Modeling, and Multi‑Language Strategies

This article shares practical insights from months of building LLM proof‑of‑concepts, covering language‑agnostic architectures, FFI integration, prompt engineering, RAG patterns, DSL design, and four core architectural principles for scalable AI applications.

AI ArchitectureDSLFFI

0 likes · 13 min read

Engineering LLM Applications: Architecture, Prompt Modeling, and Multi‑Language Strategies

HomeTech

Jul 26, 2023 · Artificial Intelligence

Practical Implementation of ChatGPT Technology Products: Architecture, Prompt Engineering, and Future Challenges

This article explores the practical deployment of ChatGPT‑based products, detailing the model fundamentals, technical architecture, engineering‑focused prompt design, real‑world application scenarios, and the challenges of model generalization, resource consumption, data privacy, interpretability, and ethical considerations.

AI ArchitectureChatGPTData Analysis

0 likes · 15 min read

Practical Implementation of ChatGPT Technology Products: Architecture, Prompt Engineering, and Future Challenges

phodal

Jun 2, 2023 · Artificial Intelligence

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

This comprehensive guide walks programmers through the fundamentals of large language model capabilities, prompt writing and management, new interaction and workflow designs, advanced scenario‑specific applications, and context engineering, offering practical strategies and architectural insights for AI‑native development.

AI ArchitectureContext EngineeringLLM

0 likes · 14 min read

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

Architects' Tech Alliance

May 15, 2023 · Artificial Intelligence

How Transformer Powers ChatGPT: A Deep Dive into Attention and Architecture

This article provides a comprehensive analysis of the Transformer model behind ChatGPT, covering its origin, core mechanisms such as embedding, positional encoding, self‑attention, multi‑head attention, a step‑by‑step translation example, and the broader implications for AI research and industry.

AI ArchitectureChatGPTTransformer

0 likes · 19 min read

How Transformer Powers ChatGPT: A Deep Dive into Attention and Architecture

DataFunSummit

Mar 30, 2023 · Artificial Intelligence

An Overview of ChatGPT’s Software Architecture and Technology Stack

The article examines ChatGPT’s underlying software architecture, detailing its cloud deployment on AWS and Azure, database choices like PostgreSQL and Redis, front‑end technologies such as TypeScript and React, core AI frameworks including PyTorch and Triton, as well as its container orchestration, monitoring, and programming language ecosystem.

AI ArchitectureChatGPTFrontend

0 likes · 6 min read

An Overview of ChatGPT’s Software Architecture and Technology Stack

21CTO

Feb 8, 2023 · Artificial Intelligence

Understanding ChatGPT: Architecture, Training, Limitations, and Future Directions

This article provides a comprehensive overview of ChatGPT, covering its origin, core GPT‑3.5 architecture, RLHF training pipeline, distinctive features, current limitations, and emerging research directions such as model compression and integration with symbolic engines.

AI ArchitectureArtificial IntelligenceChatGPT

0 likes · 18 min read

Understanding ChatGPT: Architecture, Training, Limitations, and Future Directions

Alibaba Cloud Developer

Sep 3, 2019 · Artificial Intelligence

Unlocking Scalable Private‑Domain Recommendations with a “4+N” Architecture

This article describes a systematic, standardized, and automated “4+N” recommendation framework that unifies features, samples, models, and pipelines to accelerate private‑domain marketing recommendations across multiple scenarios while improving accuracy, efficiency, and business impact.

AI ArchitectureModel Deploymentdeep learning

0 likes · 12 min read

Unlocking Scalable Private‑Domain Recommendations with a “4+N” Architecture