Tagged articles

2071 articles

Page 21 of 21

Aug 10, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM Applications in Python

This tutorial introduces LangChain, an open‑source Python framework that provides unified model access, prompt management, memory, retrieval, and tool integration, enabling developers to quickly prototype AI‑driven applications using large language models and various external data sources.

LLMLangChainPython

0 likes · 13 min read

Getting Started with LangChain: Building LLM Applications in Python

phodal

Aug 6, 2023 · Industry Insights

Can AI Really Supercharge Your IDE? Inside the AutoDev Open‑Source Copilot

This article examines the open‑source AutoDev project, outlining its assumptions about private LLMs, the three assistance modes it provides for IDEs, technical implementation details, integration challenges, and broader thoughts on LLMs as developer copilots.

AIIDELLM

0 likes · 8 min read

Can AI Really Supercharge Your IDE? Inside the AutoDev Open‑Source Copilot

JD Tech

Aug 4, 2023 · Artificial Intelligence

Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine

This article details a step‑by‑step guide to deploying the Vicuna open‑source LLM on a single server, covering model preparation, environment setup, dependency installation, GPU and CUDA configuration, inference commands, performance evaluation, and attempted fine‑tuning, while sharing practical observations and results.

Fine‑tuningGPULLM

0 likes · 16 min read

Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine

Architect

Jul 31, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM‑Powered Applications

This article introduces LangChain, explains why it’s useful for building applications with large language models, walks through installation, API‑key setup, model and embedding selection, prompt engineering, chaining, memory, agents, and vector‑store indexing, and provides runnable Python code examples throughout.

LLMLangChainPromptEngineering

0 likes · 16 min read

Getting Started with LangChain: Building LLM‑Powered Applications

Rare Earth Juejin Tech Community

Jul 30, 2023 · Artificial Intelligence

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

This article reviews the evolution of the GPT family from GPT‑1 to GPT‑3, comparing encoder‑decoder architectures, explaining the shift from supervised fine‑tuning to zero‑shot and few‑shot learning, and highlighting the architectural and training innovations that enabled large‑scale language models.

GPTLLMTransformer

0 likes · 13 min read

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

AI Large Model Application Practice

Jul 29, 2023 · Artificial Intelligence

How TypeChat Turns Natural Language into Structured JSON with LLMs

TypeChat, an open‑source project from Microsoft’s TypeScript creator, demonstrates how large language models can translate free‑form user requests into strongly‑typed JSON structures, enabling natural‑language interfaces for applications such as coffee ordering while reducing the complexity of traditional intent‑recognition pipelines.

AILLMTypeChat

0 likes · 7 min read

How TypeChat Turns Natural Language into Structured JSON with LLMs

MoonWebTeam

Jul 28, 2023 · Artificial Intelligence

Unlocking LangChain: A Complete Guide to Building LLM Applications

This article introduces LangChain, explains its architecture and core components, and provides step‑by‑step Python examples for chat models, embeddings, prompts, indexes, chains, memory, agents, and practical use‑cases such as QA bots, web search, summarization, and persistent vector stores.

LLMLangChainPython

0 likes · 34 min read

Unlocking LangChain: A Complete Guide to Building LLM Applications

Alibaba Terminal Technology

Jul 27, 2023 · Artificial Intelligence

How LLMs Transform Industrial Configuration Software: Architecture & Use Cases

This article explains how integrating large language models into industrial configuration tools creates AI‑driven features such as knowledge Q&A, automatic application generation, smart drawing, script and material generation, and outlines the three‑layer architecture, prompt engineering, and implementation lessons for developers.

AI integrationLLMLow‑code

0 likes · 30 min read

How LLMs Transform Industrial Configuration Software: Architecture & Use Cases

Alibaba Cloud Big Data AI Platform

Jul 26, 2023 · Artificial Intelligence

Build a Custom LLM Chatbot with Hologres Vector Store and LangChain

This guide shows how to create a domain‑specific chatbot by combining large language models, prompt‑tuning, and Alibaba Cloud's Hologres real‑time vector store, covering prerequisites, environment setup, code configuration, vector data preparation, verification, and optional DingTalk integration.

ChatbotHologresLLM

0 likes · 12 min read

Build a Custom LLM Chatbot with Hologres Vector Store and LangChain

Tencent Cloud Developer

Jul 24, 2023 · Artificial Intelligence

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

The author builds an internal code knowledge base for the TDesign Vue‑Next library by scraping documentation, chunking and embedding texts with OpenAI’s ada model into a vector store, then retrieving relevant chunks for LLM answers, and enhances context continuity using a JavaScript AST interpreter, achieving up to 90 % query accuracy and a 20 % productivity boost.

ASTEmbeddingKnowledge Base

0 likes · 11 min read

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

Baobao Algorithm Notes

Jul 23, 2023 · Artificial Intelligence

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

The article analyzes key challenges in large‑language‑model pipelines—including the necessity of cold‑start pretraining, the pitfalls of reward‑model hacking, efficiency‑effectiveness trade‑offs, evaluation difficulties, and downstream fine‑tuning limits—offering practical insights for more reliable LLM development.

EfficiencyLLMRLHF

0 likes · 9 min read

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

Volcano Engine Developer Services

Jul 21, 2023 · Artificial Intelligence

Build a Custom AI Q&A System Using Volcano Engine Cloud Search & LangChain

This guide explains how to create a domain‑specific intelligent Q&A system by leveraging prompt‑tuning, Volcano Engine Cloud Search vector store, and LangChain, covering model selection, data embedding, vector indexing, retrieval, and LLM integration with full code examples.

AI Q&AEmbeddingLLM

0 likes · 8 min read

Build a Custom AI Q&A System Using Volcano Engine Cloud Search & LangChain

Tencent Cloud Developer

Jul 19, 2023 · Artificial Intelligence

Build a Full‑Scale LLM from Scratch in 61 Lines of Python

This step‑by‑step tutorial shows how to set up a GPU environment, prepare custom text data, train a tokenizer, configure and train a GPT‑2‑based large language model, test its generation, and run the entire pipeline using only 61 lines of Python code.

AIDockerGPT-2

0 likes · 10 min read

Build a Full‑Scale LLM from Scratch in 61 Lines of Python

Java Backend Technology

Jul 19, 2023 · Artificial Intelligence

Unlocking LangChain: Build Powerful LLM Applications Step‑by‑Step

This article introduces LangChain, explains its modular architecture—including models, prompts, memory, chains, and agents—shows how to handle tokens and model selection, and provides complete Python code examples for building a conversational retrieval chatbot with OpenAI models.

EmbeddingLLMLangChain

0 likes · 20 min read

Unlocking LangChain: Build Powerful LLM Applications Step‑by‑Step

Alibaba Cloud Developer

Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Artificial IntelligenceFew-shotLLM

0 likes · 33 min read

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

NetEase Yanxuan Technology Product Team

Jul 17, 2023 · Artificial Intelligence

Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers

This article analyzes the evolving landscape of large‑model applications, detailing prompt engineering, engineering challenges, AI‑engineer roles, domain‑driven design, and agent frameworks, while offering practical guidance and references for building production‑grade LLM‑driven systems.

AI EngineerAgent FrameworkDomain‑Driven Design

0 likes · 14 min read

Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers

DaTaobao Tech

Jul 12, 2023 · Artificial Intelligence

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

The article details a workflow that converts the PyTorch ChatGLM‑6B model to MNN, splits and compresses embeddings, applies int4/int8 quantization, supports dynamic shapes, and uses hybrid GPU/CPU or CPU‑only loading to enable low‑memory edge inference on PCs and mobile devices with competitive token‑per‑second performance.

ChatGLMLLMMNN

0 likes · 16 min read

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

Alibaba Cloud Developer

Jul 12, 2023 · Artificial Intelligence

Can Large Language Models Transform Recommendation Systems?

This article reviews how recent large language models (LLMs) are reshaping recommendation systems, covering their emergence, in‑context learning, prompt‑based strategies, three main LLM‑driven recommendation paradigms, key research papers, experimental results, and future research directions.

In-Context LearningLLMcold start

0 likes · 20 min read

Can Large Language Models Transform Recommendation Systems?

Rare Earth Juejin Tech Community

Jul 10, 2023 · Artificial Intelligence

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

This article explains the limitations of large language models, introduces prompt engineering as a remedy, and provides a comprehensive guide to using the LangChain framework—including models, prompts, chains, agents, vector search, and practical Node.js code examples—to enable LLMs to interact with external tools and data sources.

AI developmentLLMLangChain

0 likes · 35 min read

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

phodal

Jul 9, 2023 · Artificial Intelligence

How LLMs Can Transform Software Architecture Governance and Code Generation

This article explores how large language models can be integrated into software architecture governance, turning architectural rules into code, enhancing design, development, and runtime phases, and improving code generation quality through explicit knowledge, DSLs, and full‑process AI assistance.

AI-assisted developmentDSLLLM

0 likes · 14 min read

Architect

Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory

LangChain is an innovative framework that simplifies building language‑model‑driven applications by providing modular components such as models, prompts, memory, chains, and agents, with examples of asynchronous LLM calls, custom prompt templates, vector stores, token handling, and a simple Gradio chatbot implementation.

AIChatbotLLM

0 likes · 21 min read

Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory

DaTaobao Tech

Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Concepts, Tools, and Applications

The article introduces LangChain, a framework that unifies language models, prompts, memory, retrieval, and tool‑driven agents into composable chains, illustrating its core components, code examples, end‑to‑end applications such as retrieval‑augmented QA and image generation, and outlining future uses in customer service, recommendation, and automated code review.

AILLMLangChain

0 likes · 25 min read

Introduction to LangChain: Concepts, Tools, and Applications

php Courses

Jul 6, 2023 · Artificial Intelligence

Microsoft Introduces Guidance: A Domain-Specific Language for Managing Large Language Models

Microsoft's newly released Guidance language unifies generation, prompting, and logical control for large language models, enabling seamless integration with providers like Hugging Face, enhancing prompt reliability, and simplifying complex LLM workflows for developers.

AILLMLangChain

0 likes · 3 min read

Microsoft Introduces Guidance: A Domain-Specific Language for Managing Large Language Models

ITPUB

Jul 5, 2023 · Databases

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Vector databases enable efficient similarity search and storage of high‑dimensional embeddings, allowing enterprises to combine large language models with proprietary knowledge assets to create domain‑specific, accurate, and up‑to‑date AI services, as illustrated with open‑source solutions Chroma and Milvus.

AIChromaEmbeddings

0 likes · 11 min read

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Tencent Tech

Jul 4, 2023 · Databases

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

This article explains what vectors and vector databases are, how they differ from traditional databases, their core technologies, their relationship with large language models, market trends, and details of Tencent Cloud VectorDB’s capabilities, architecture, real‑world applications, and future competitive challenges.

AIEmbeddingLLM

0 likes · 10 min read

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

php Courses

Jul 3, 2023 · Databases

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

Vector databases store and search large-scale vector data, and in 2023 the five leading solutions—Chroma, Pinecone, Weaviate, Milvus, and Faiss—offer scalable, high-performance options for applications such as LLM-driven services, audio search, recommendation systems, image/video analysis, and semantic retrieval across various industries.

AILLMMachine Learning

0 likes · 4 min read

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

phodal

Jul 2, 2023 · Industry Insights

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

This article explores how large language models can standardize software engineering practices by converting requirements and designs into DSL formats, enabling more automated and efficient code generation, while discussing the challenges of dynamic context building, DSL specification, and the evolving role of LLMs in development pipelines.

AI automationDSLLLM

0 likes · 14 min read

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

ByteFE

Jun 28, 2023 · Frontend Development

How GPT is Transforming Frontend Development and UI Interaction

The article examines the rapid rise of GPT models, their technical capabilities and limitations, and how their integration is reshaping software interaction from command‑line to GUI‑plus‑Language UI, offering frontend engineers new opportunities, practical examples, and guidance on leveraging large‑model AI in product design.

AI integrationGPTLLM

0 likes · 21 min read

How GPT is Transforming Frontend Development and UI Interaction

phodal

Jun 27, 2023 · Artificial Intelligence

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

This article presents a detailed reference architecture for building LLM‑driven applications, using the ArchGuard Co‑mate project to illustrate layered design, local model integration, DSL‑based orchestration, and streaming LLM interfaces, complete with code examples and practical implementation notes.

AI OpsKotlinLLM

0 likes · 10 min read

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

ITPUB

Jun 27, 2023 · Artificial Intelligence

Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI

Databricks' $1.3 billion acquisition of MosaicML brings the startup's open‑source MPT models and high‑efficiency training stack into the Lakehouse platform, reflecting a strategic push to embed generative AI across enterprises while emphasizing data control, cost reduction, and open‑source policies.

AIGCDatabricksLLM

0 likes · 8 min read

Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI

Kuaishou E-commerce Frontend Team

Jun 27, 2023 · Artificial Intelligence

Exploring AIGC in Enterprise: From OT Algorithms to 3D AI Innovations

The A2M Internet Architecture and AI Summit in Shanghai showcased 24 topics and 84 talks, highlighting AIGC applications across enterprise collaboration, large‑scale model integration, 3D visual content, music entertainment, and the broader impact of LLMs on software development and digital transformation.

3D AIAIGCCollaboration

0 likes · 10 min read

Exploring AIGC in Enterprise: From OT Algorithms to 3D AI Innovations

Alibaba Cloud Native

Jun 24, 2023 · Cloud Native

How to Deploy Distributed LLM Inference with DeepSpeed on Alibaba Cloud ACK

This guide walks through deploying a Bloom 7B1 large language model for distributed inference on Alibaba Cloud Container Service (ACK) using DeepSpeed, Arena, and Kubernetes, covering environment setup, model configuration, service launch, verification, and Ingress exposure.

ACKArenaCloud Native

0 likes · 14 min read

How to Deploy Distributed LLM Inference with DeepSpeed on Alibaba Cloud ACK

DataFunTalk

Jun 23, 2023 · Artificial Intelligence

DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction

DeepKE-LLM is an open‑source, extensible knowledge‑graph extraction framework that leverages large language models for entity, relation, and attribute extraction, supports multiple LLMs, provides installation scripts, various usage modes, fine‑tuning pipelines, and integrates with the KnowLM project for advanced instruction‑following capabilities.

DeepKELLMPython

0 likes · 8 min read

DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction

Alibaba Cloud Native

Jun 23, 2023 · Cloud Native

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

This guide explains how to deploy large language models on Alibaba Cloud's ACK using KServe for serverless inference, integrates Fluid for distributed data caching to cut cold‑start latency, provides step‑by‑step commands, performance benchmarks, and practical tips for production‑grade AI model serving.

Cloud NativeFluidKServe

0 likes · 22 min read

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

Architects' Tech Alliance

Jun 23, 2023 · Artificial Intelligence

Can GPT‑4 Really Design a Chip? Inside NYU’s AI‑Generated Processor Breakthrough

A NYU Tandon study shows that GPT‑4, using plain‑language dialogue, can generate Verilog code for a novel 8‑bit accumulator microprocessor, which was tape‑out on Skywater 130 nm, demonstrating AI’s emerging ability to perform hardware design tasks at near‑human level.

AGIAI chip designGPT-4

0 likes · 5 min read

Can GPT‑4 Really Design a Chip? Inside NYU’s AI‑Generated Processor Breakthrough

DataFunTalk

Jun 19, 2023 · Artificial Intelligence

Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview

The announcement introduces RPI's prestigious computer science department, its extensive GPU resources, collaborations with IBM Research, and detailed profiles of three incoming faculty members—highlighting their research areas in graph neural networks, trustworthy AI, data‑centric AI, drug‑design generative models, and neural‑symbolic reasoning—while inviting PhD and intern applicants to apply with full scholarships and funding support.

Artificial IntelligenceData‑Centric AILLM

0 likes · 8 min read

Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview

phodal

Jun 18, 2023 · Artificial Intelligence

How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns

The article outlines five practical patterns—natural‑language DSL, streaming DSL, DSL‑guided generation, explicit retry, and dynamic proxying—that enable developers to treat large‑language‑model interactions as first‑class APIs, improving efficiency, accuracy, and user experience across diverse scenarios.

DSLDynamic ProxyLLM

0 likes · 10 min read

How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns

21CTO

Jun 16, 2023 · Artificial Intelligence

Why Are LLM Stacks Becoming Essential for Modern Companies?

A comprehensive look at how companies are rapidly adopting large language model APIs, retrieval techniques, and custom model strategies, revealing key statistics, emerging toolchains, and the shifting balance between closed‑source LLM services and open‑source custom stacks.

AI adoptionCustom ModelsLLM

0 likes · 8 min read

Why Are LLM Stacks Becoming Essential for Modern Companies?

ByteFE

Jun 15, 2023 · Artificial Intelligence

Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies

This article explains how to craft efficient prompts by combining clear instructions and questions, discusses prompt injection risks and mitigation with delimiters, addresses hallucinations, and introduces zero‑shot, few‑shot, and chain‑of‑thought prompting techniques for large language models.

Few-shotLLMchain-of-thought

0 likes · 16 min read

Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies

phodal

Jun 14, 2023 · Industry Insights

What Are the Four Core Principles for LLM‑Powered Software Architecture?

This article outlines four foundational design principles—user‑intent‑driven design, context awareness, atomic capability mapping, and language‑API integration—for building LLM‑centric software architectures, illustrating each with DSL examples, Kotlin implementations, and practical insights on prompt engineering, dynamic context layering, and API evolution.

DSLDesign PrinciplesLLM

0 likes · 10 min read

What Are the Four Core Principles for LLM‑Powered Software Architecture?

phodal

Jun 11, 2023 · Artificial Intelligence

How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions

This article explains how ArchGuard 2.0 leverages large language models to transform tacit architectural knowledge into dynamic, DSL‑based governance functions, detailing the challenges of traditional tools, the design of the Co‑mate system, and the practical implementation using Kotlin REPL.

APIDSLGovernance

0 likes · 8 min read

How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions

dbaplus Community

Jun 10, 2023 · Artificial Intelligence

How Embeddings and Vector Databases Empower LLMs for Semantic Search

This article explains what embeddings and vector databases are, why they matter for large language models, and how they can be used to overcome token limits by storing and retrieving relevant text chunks for accurate, context‑aware responses.

EmbeddingsLLMVector Database

0 likes · 6 min read

How Embeddings and Vector Databases Empower LLMs for Semantic Search

Tencent Cloud Developer

Jun 7, 2023 · Artificial Intelligence

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

The article explains prompt‑engineering fundamentals—definitions, instruction, context, and output formatting—and showcases tricks like few‑shot, chain‑of‑thought, and ReAct, then demonstrates testing with OpenAI APIs, token management, LangChain integration, and low‑code applications such as AI‑generated SQL, API gateways, DSL‑driven UI, chart creation, and vector‑based semantic search.

AIGPTKnowledge Generation

0 likes · 30 min read

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

phodal

Jun 2, 2023 · Artificial Intelligence

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

This comprehensive guide walks programmers through the fundamentals of large language model capabilities, prompt writing and management, new interaction and workflow designs, advanced scenario‑specific applications, and context engineering, offering practical strategies and architectural insights for AI‑native development.

AI ArchitectureContext EngineeringLLM

0 likes · 14 min read

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

21CTO

Jun 2, 2023 · Artificial Intelligence

Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps

LangChain, launched in 2022, quickly evolved from a Python library to a multi‑environment framework that helps developers build chat‑based applications, agents, and memory‑aware LLM solutions, while integrating with major cloud and AI tooling ecosystems.

AI agentsAuto-GPTChat applications

0 likes · 8 min read

Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps

Architect

May 29, 2023 · Artificial Intelligence

Understanding Embeddings and Vector Databases for LLM Applications

This article explains what embeddings and vector databases are, how they are generated with models like OpenAI's Ada, why they enable semantic search and help overcome large language model token limits, and demonstrates a practical workflow for retrieving relevant document chunks using cosine similarity.

EmbeddingsLLMinformation retrieval

0 likes · 7 min read

Understanding Embeddings and Vector Databases for LLM Applications

DataFunSummit

May 24, 2023 · Artificial Intelligence

Digital Humans and XR Interaction Forum at DataFunCon2023

The DataFunCon2023 Digital Humans and XR Interaction Forum showcases the latest advances in digital human technology, multimodal interaction, emotion computing, and LLM‑enhanced AI beings through expert talks, detailed outlines, and audience benefits, highlighting both research progress and practical applications.

AILLMXR

0 likes · 8 min read

Digital Humans and XR Interaction Forum at DataFunCon2023

Network Intelligence Research Center (NIRC)

May 22, 2023 · Artificial Intelligence

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

Microsoft researchers fine‑tuned GPT‑3.x models with LoRA on over 40,000 cloud incident records, evaluated them with six NLP metrics and human interviews, and found that LLMs can generate root‑cause analyses and mitigation steps comparable to BERT models, especially for machine‑detected failures.

AI for operationsGPT-3LLM

0 likes · 8 min read

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

JD Retail Technology

May 16, 2023 · Artificial Intelligence

Deploying and Fine‑Tuning the Alpaca‑LoRA Large Language Model on a Multi‑GPU Server

This guide details the end‑to‑end process of installing GPU drivers, setting up a Python environment, deploying the open‑source Alpaca‑LoRA model, fine‑tuning it with Chinese data on a multi‑GPU server, and performing inference, while highlighting practical challenges and performance observations.

Alpaca-LoRAGPULLM

0 likes · 11 min read

Deploying and Fine‑Tuning the Alpaca‑LoRA Large Language Model on a Multi‑GPU Server

Alibaba Cloud Developer

May 16, 2023 · Artificial Intelligence

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

This article explains why combining large language models with vector databases enables enterprises to create specialized, up‑to‑date chatbots, outlines the underlying principles, describes the ADB‑PG vector‑search capabilities, and provides step‑by‑step implementation details including data processing, indexing, and query examples.

AnalyticDBChatbotLLM

0 likes · 17 min read

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

phodal

May 15, 2023 · Artificial Intelligence

How GitHub Copilot Builds Context: Inside Its Tree‑Sitter Architecture and Token Management

This article reverse‑engineers GitHub Copilot's context construction, detailing visible and invisible processes, prompt composition, token allocation strategies, fast response mechanisms, and future directions for LLM context engineering.

AI code assistantGitHub CopilotLLM

0 likes · 11 min read

How GitHub Copilot Builds Context: Inside Its Tree‑Sitter Architecture and Token Management

AsiaInfo Technology: New Tech Exploration

May 15, 2023 · Artificial Intelligence

How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development

This article analyzes the current landscape of large‑language‑model application frameworks—LangChain, Auto‑GPT, and HuggingGPT—detailing their core capabilities, workflow patterns, limitations, and how they enable developers to integrate LLMs into real‑world products while outlining future platform requirements.

AI frameworksApplication DevelopmentAuto-GPT

0 likes · 17 min read

How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development

DataFunSummit

May 4, 2023 · Artificial Intelligence

LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots

A recent study by the LMSYS organization introduces an Elo‑rated, 1v1 battle arena for large language models, ranking open‑source chatbots like Vicuna, Koala, and ChatGLM, while discussing the limitations of traditional benchmarks and the advantages of crowd‑sourced, scalable evaluation.

AI benchmarkingChatbot ArenaElo Rating

0 likes · 7 min read

LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots

Top Architect

Apr 21, 2023 · Artificial Intelligence

Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT

This article explains why and how to fine‑tune the LLaMA‑7B model using the cheap Alpaca‑LoRA approach, covering hardware requirements, dataset preparation, LoRA training, optional model merging and quantization, and provides ready‑to‑run code snippets for single‑ and multi‑GPU setups.

Alpaca-LoRAGPULLM

0 likes · 10 min read

Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT

Continuous Delivery 2.0

Apr 20, 2023 · Artificial Intelligence

AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide

AutoGPT, the latest autonomous AI system, can independently browse the web, use third‑party tools, and execute tasks without human intervention, exemplified by building a React website in minutes, and this article explains its principles, showcases demos, and provides step‑by‑step installation instructions.

AI toolsAgentGPTAutoGPT

0 likes · 10 min read

AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide

Programmer DD

Apr 19, 2023 · Artificial Intelligence

Is Auto-GPT a Breakthrough or Overhyped? Uncovering Its Real Limitations

This article examines Auto‑GPT’s hype versus reality, detailing its architecture, cost per task, looping pitfalls, limited memory management, and why its current capabilities fall short of practical production use.

AI agentsAuto-GPTGPT-4

0 likes · 17 min read

Is Auto-GPT a Breakthrough or Overhyped? Uncovering Its Real Limitations

Architect

Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explains their architectures, scaling trade‑offs, and then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca and ChatGLM, providing references for further study.

AIChatGPTGPT-3

0 likes · 27 min read

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

Programmer DD

Apr 7, 2023 · Artificial Intelligence

How Vicuna-13B Achieves ChatGPT‑Level Performance with Low‑Cost Open‑Source Training

The Vicuna-13B open‑source chatbot, fine‑tuned from LLaMA on 70k ShareGPT conversations, matches over 90% of ChatGPT and Google Bard quality while costing only about $300 to train, thanks to memory optimizations, multi‑turn dialogue handling, and cheap spot‑instance training.

AIChatbotLLM

0 likes · 8 min read

How Vicuna-13B Achieves ChatGPT‑Level Performance with Low‑Cost Open‑Source Training

DataFunTalk

Apr 6, 2023 · Artificial Intelligence

A Comprehensive Survey of Large Language Models: Background, Capabilities, Key Technologies, and Future Directions

This article reviews the rapid progress of large language models (LLMs), covering their historical development, scaling laws, emergent abilities, core technologies such as training and alignment, resource ecosystems, evaluation methods, safety concerns, and prospective research challenges.

AI researchLLMScaling

0 likes · 21 min read

A Comprehensive Survey of Large Language Models: Background, Capabilities, Key Technologies, and Future Directions

DataFunSummit

Mar 25, 2023 · Artificial Intelligence

How GPT‑4 Has Changed NLP Research: Community Perspectives

A collection of Zhihu answers reflects on how the release of GPT‑4 has reshaped NLP research, dividing the community into LLM‑enthusiasts and skeptics, discussing the relevance of parsing, resource‑driven research directions, and the existential challenges faced by researchers.

AIAcademic CommunityGPT-4

0 likes · 10 min read

How GPT‑4 Has Changed NLP Research: Community Perspectives

360 Tech Engineering

Mar 23, 2023 · Artificial Intelligence

Integrating Knowledge Graphs with ChatGPT: Opportunities, Challenges, and Applications

The article examines ChatGPT's rapid adoption, the rise of generative AI, and how integrating external knowledge sources such as knowledge graphs can improve factual accuracy, enhance downstream tasks, and enable a variety of ChatGPT+ applications ranging from intelligent search to office automation tools.

AI applicationsChatGPTLLM

0 likes · 9 min read

Integrating Knowledge Graphs with ChatGPT: Opportunities, Challenges, and Applications

Programmer DD

Mar 19, 2023 · Artificial Intelligence

How Visual ChatGPT Adds Image Interaction to ChatGPT – A Deep Dive

Microsoft's open‑source Visual ChatGPT extends ChatGPT with image send/receive capabilities, explains its multimodal architecture, demo scenarios, used visual models, and points to the arXiv paper, highlighting its rapid popularity growth on GitHub.

LLMMicrosoftVisual ChatGPT

0 likes · 4 min read

How Visual ChatGPT Adds Image Interaction to ChatGPT – A Deep Dive

DeWu Technology

Mar 17, 2023 · Artificial Intelligence

Prompt‑Ops and LangChain: Engineering LLM Applications

Prompt‑Ops frameworks like LangChain let developers turn pre‑trained LLMs into versatile applications by abstracting model calls, chaining prompts, integrating tools, managing memory, and handling private data, while addressing challenges such as nondeterminism, version control, and prompt injection in production environments.

AI applicationsAgentLLM

0 likes · 15 min read

Prompt‑Ops and LangChain: Engineering LLM Applications

Nightwalker Tech

Mar 9, 2023 · Artificial Intelligence

AI-Assisted Programming Tools: Research Findings, Comparisons, and Recommendations

This report investigates the impact of large language model‑based AI coding assistants such as Copilot, Tabnine, CodeGeeX, and ChatGPT on frontend and backend development, evaluates their efficiency gains, compares features, pricing, and IDE integration, and provides practical adoption recommendations for development teams.

AILLMcode completion

0 likes · 13 min read

AI-Assisted Programming Tools: Research Findings, Comparisons, and Recommendations

Top Architect

Feb 28, 2023 · Artificial Intelligence

The Economics of Large Language Models and Their Impact on Search

This article analyses the economic feasibility of integrating large language models (LLMs) into search, estimating inference and training costs, exploring hardware efficiency, scaling laws, and future trends, and concludes that while technically viable, the added expense may challenge profitability for major search providers.

AI economicsCloud ComputingLLM

0 likes · 25 min read

The Economics of Large Language Models and Their Impact on Search

Programmer DD

Feb 15, 2023 · Artificial Intelligence

Can LLM-Powered Search Rival Google? Uncovering the Economics of Large Language Models

This article examines whether large language model‑driven search can replace traditional engines like Google, analyzing the economic feasibility, training and inference costs, hardware considerations, and future scaling trends, while providing a detailed cost framework and insights into the profitability of LLM‑based services.

AI cost analysisCloud ComputingLLM

0 likes · 27 min read

Can LLM-Powered Search Rival Google? Uncovering the Economics of Large Language Models

ITPUB

Feb 6, 2023 · Artificial Intelligence

Can ChatGPT Replace a DBA? Real‑World Tests Reveal Its Strengths and Flaws

The article recounts a hands‑on exploration of ChatGPT’s abilities for database administration, highlighting impressive language understanding, notable mistakes such as confusing pg_resetwal with pg_resetxlog, and the broader implications of large language models for AI‑driven search and workflow automation.

AIChatGPTDatabase

0 likes · 8 min read

Can ChatGPT Replace a DBA? Real‑World Tests Reveal Its Strengths and Flaws

21CTO

Jan 26, 2023 · Artificial Intelligence

Open-Source ChatGPT Training: LAION, CarperAI, and Phil Wang’s RLHF Implementations

This article surveys recent open‑source projects—including LAION’s OpenAssistant, CarperAI’s trlX, and Phil Wang’s ChatGPT implementation—that provide RLHF‑based training pipelines for large language models, while highlighting community expectations, resource challenges, and future accessibility goals.

Artificial IntelligenceChatGPTLAION

0 likes · 7 min read

Open-Source ChatGPT Training: LAION, CarperAI, and Phil Wang’s RLHF Implementations

DataFunTalk

Jan 10, 2023 · Artificial Intelligence

Paradigm Shifts in Large Language Model Research and Future Directions

The article reviews the evolution of large language models from the pre‑GPT‑3 era to the present, analyzes the conceptual and technical gaps between Chinese and global research, and outlines key future research directions such as scaling laws, prompting techniques, multimodal training, and efficient model architectures.

AI researchChatGPTIn-Context Learning

0 likes · 73 min read

Paradigm Shifts in Large Language Model Research and Future Directions

IT Architects Alliance

Dec 13, 2022 · Artificial Intelligence

Technical Principles and Training Process of ChatGPT

The article explains ChatGPT’s underlying technology, detailing its three-stage training pipeline—supervised fine‑tuning, reward‑model learning, and reinforcement learning with PPO—while discussing its strengths, limitations, and potential integration with traditional search engines.

AIChatGPTLLM

0 likes · 14 min read

Technical Principles and Training Process of ChatGPT

Xiaohongshu Tech REDtech

Nov 11, 2022 · Artificial Intelligence

Language Model as a Service and Black‑Box Optimization: Insights from Prof. Qiu Xipeng’s Talk

Prof. Qiu Xipeng’s talk highlighted how large language models can be offered as a service and efficiently adapted via in‑context learning, lightweight label‑tuning, and gradient‑free black‑box optimization, showcasing a unified asymmetric Transformer (CPT) that handles understanding, generation, ABSA and NER tasks while reducing resource demands.

Black-Box OptimizationLLMNLP

0 likes · 15 min read

Language Model as a Service and Black‑Box Optimization: Insights from Prof. Qiu Xipeng’s Talk