Tagged articles
2071 articles
Page 21 of 21
Architect's Guide
Architect's Guide
Aug 10, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM Applications in Python

This tutorial introduces LangChain, an open‑source Python framework that provides unified model access, prompt management, memory, retrieval, and tool integration, enabling developers to quickly prototype AI‑driven applications using large language models and various external data sources.

LLMLangChainPython
0 likes · 13 min read
Getting Started with LangChain: Building LLM Applications in Python
JD Tech
JD Tech
Aug 4, 2023 · Artificial Intelligence

Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine

This article details a step‑by‑step guide to deploying the Vicuna open‑source LLM on a single server, covering model preparation, environment setup, dependency installation, GPU and CUDA configuration, inference commands, performance evaluation, and attempted fine‑tuning, while sharing practical observations and results.

Fine‑tuningGPULLM
0 likes · 16 min read
Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine
Architect
Architect
Jul 31, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM‑Powered Applications

This article introduces LangChain, explains why it’s useful for building applications with large language models, walks through installation, API‑key setup, model and embedding selection, prompt engineering, chaining, memory, agents, and vector‑store indexing, and provides runnable Python code examples throughout.

LLMLangChainPromptEngineering
0 likes · 16 min read
Getting Started with LangChain: Building LLM‑Powered Applications
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 30, 2023 · Artificial Intelligence

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

This article reviews the evolution of the GPT family from GPT‑1 to GPT‑3, comparing encoder‑decoder architectures, explaining the shift from supervised fine‑tuning to zero‑shot and few‑shot learning, and highlighting the architectural and training innovations that enabled large‑scale language models.

GPTLLMTransformer
0 likes · 13 min read
ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)
AI Large Model Application Practice
AI Large Model Application Practice
Jul 29, 2023 · Artificial Intelligence

How TypeChat Turns Natural Language into Structured JSON with LLMs

TypeChat, an open‑source project from Microsoft’s TypeScript creator, demonstrates how large language models can translate free‑form user requests into strongly‑typed JSON structures, enabling natural‑language interfaces for applications such as coffee ordering while reducing the complexity of traditional intent‑recognition pipelines.

AILLMTypeChat
0 likes · 7 min read
How TypeChat Turns Natural Language into Structured JSON with LLMs
MoonWebTeam
MoonWebTeam
Jul 28, 2023 · Artificial Intelligence

Unlocking LangChain: A Complete Guide to Building LLM Applications

This article introduces LangChain, explains its architecture and core components, and provides step‑by‑step Python examples for chat models, embeddings, prompts, indexes, chains, memory, agents, and practical use‑cases such as QA bots, web search, summarization, and persistent vector stores.

LLMLangChainPython
0 likes · 34 min read
Unlocking LangChain: A Complete Guide to Building LLM Applications
Alibaba Terminal Technology
Alibaba Terminal Technology
Jul 27, 2023 · Artificial Intelligence

How LLMs Transform Industrial Configuration Software: Architecture & Use Cases

This article explains how integrating large language models into industrial configuration tools creates AI‑driven features such as knowledge Q&A, automatic application generation, smart drawing, script and material generation, and outlines the three‑layer architecture, prompt engineering, and implementation lessons for developers.

AI integrationLLMLow‑code
0 likes · 30 min read
How LLMs Transform Industrial Configuration Software: Architecture & Use Cases
Tencent Cloud Developer
Tencent Cloud Developer
Jul 24, 2023 · Artificial Intelligence

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

The author builds an internal code knowledge base for the TDesign Vue‑Next library by scraping documentation, chunking and embedding texts with OpenAI’s ada model into a vector store, then retrieving relevant chunks for LLM answers, and enhances context continuity using a JavaScript AST interpreter, achieving up to 90 % query accuracy and a 20 % productivity boost.

ASTEmbeddingKnowledge Base
0 likes · 11 min read
Building an Internal Code Knowledge Base with Embedding and AST Interpreter
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 23, 2023 · Artificial Intelligence

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

The article analyzes key challenges in large‑language‑model pipelines—including the necessity of cold‑start pretraining, the pitfalls of reward‑model hacking, efficiency‑effectiveness trade‑offs, evaluation difficulties, and downstream fine‑tuning limits—offering practical insights for more reliable LLM development.

EfficiencyLLMRLHF
0 likes · 9 min read
Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training
Tencent Cloud Developer
Tencent Cloud Developer
Jul 19, 2023 · Artificial Intelligence

Build a Full‑Scale LLM from Scratch in 61 Lines of Python

This step‑by‑step tutorial shows how to set up a GPU environment, prepare custom text data, train a tokenizer, configure and train a GPT‑2‑based large language model, test its generation, and run the entire pipeline using only 61 lines of Python code.

AIDockerGPT-2
0 likes · 10 min read
Build a Full‑Scale LLM from Scratch in 61 Lines of Python
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Artificial IntelligenceFew-shotLLM
0 likes · 33 min read
Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
Jul 17, 2023 · Artificial Intelligence

Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers

This article analyzes the evolving landscape of large‑model applications, detailing prompt engineering, engineering challenges, AI‑engineer roles, domain‑driven design, and agent frameworks, while offering practical guidance and references for building production‑grade LLM‑driven systems.

AI EngineerAgent FrameworkDomain‑Driven Design
0 likes · 14 min read
Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers
DaTaobao Tech
DaTaobao Tech
Jul 12, 2023 · Artificial Intelligence

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

The article details a workflow that converts the PyTorch ChatGLM‑6B model to MNN, splits and compresses embeddings, applies int4/int8 quantization, supports dynamic shapes, and uses hybrid GPU/CPU or CPU‑only loading to enable low‑memory edge inference on PCs and mobile devices with competitive token‑per‑second performance.

ChatGLMLLMMNN
0 likes · 16 min read
Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 12, 2023 · Artificial Intelligence

Can Large Language Models Transform Recommendation Systems?

This article reviews how recent large language models (LLMs) are reshaping recommendation systems, covering their emergence, in‑context learning, prompt‑based strategies, three main LLM‑driven recommendation paradigms, key research papers, experimental results, and future research directions.

In-Context LearningLLMcold start
0 likes · 20 min read
Can Large Language Models Transform Recommendation Systems?
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 10, 2023 · Artificial Intelligence

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

This article explains the limitations of large language models, introduces prompt engineering as a remedy, and provides a comprehensive guide to using the LangChain framework—including models, prompts, chains, agents, vector search, and practical Node.js code examples—to enable LLMs to interact with external tools and data sources.

AI developmentLLMLangChain
0 likes · 35 min read
Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation
phodal
phodal
Jul 9, 2023 · Artificial Intelligence

How LLMs Can Transform Software Architecture Governance and Code Generation

This article explores how large language models can be integrated into software architecture governance, turning architectural rules into code, enhancing design, development, and runtime phases, and improving code generation quality through explicit knowledge, DSLs, and full‑process AI assistance.

AI-assisted developmentDSLLLM
0 likes · 14 min read
How LLMs Can Transform Software Architecture Governance and Code Generation
Architect
Architect
Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory

LangChain is an innovative framework that simplifies building language‑model‑driven applications by providing modular components such as models, prompts, memory, chains, and agents, with examples of asynchronous LLM calls, custom prompt templates, vector stores, token handling, and a simple Gradio chatbot implementation.

AIChatbotLLM
0 likes · 21 min read
Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory
DaTaobao Tech
DaTaobao Tech
Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Concepts, Tools, and Applications

The article introduces LangChain, a framework that unifies language models, prompts, memory, retrieval, and tool‑driven agents into composable chains, illustrating its core components, code examples, end‑to‑end applications such as retrieval‑augmented QA and image generation, and outlining future uses in customer service, recommendation, and automated code review.

AILLMLangChain
0 likes · 25 min read
Introduction to LangChain: Concepts, Tools, and Applications
ITPUB
ITPUB
Jul 5, 2023 · Databases

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Vector databases enable efficient similarity search and storage of high‑dimensional embeddings, allowing enterprises to combine large language models with proprietary knowledge assets to create domain‑specific, accurate, and up‑to‑date AI services, as illustrated with open‑source solutions Chroma and Milvus.

AIChromaEmbeddings
0 likes · 11 min read
Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications
Tencent Tech
Tencent Tech
Jul 4, 2023 · Databases

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

This article explains what vectors and vector databases are, how they differ from traditional databases, their core technologies, their relationship with large language models, market trends, and details of Tencent Cloud VectorDB’s capabilities, architecture, real‑world applications, and future competitive challenges.

AIEmbeddingLLM
0 likes · 10 min read
What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?
php Courses
php Courses
Jul 3, 2023 · Databases

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

Vector databases store and search large-scale vector data, and in 2023 the five leading solutions—Chroma, Pinecone, Weaviate, Milvus, and Faiss—offer scalable, high-performance options for applications such as LLM-driven services, audio search, recommendation systems, image/video analysis, and semantic retrieval across various industries.

AILLMMachine Learning
0 likes · 4 min read
Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)
phodal
phodal
Jul 2, 2023 · Industry Insights

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

This article explores how large language models can standardize software engineering practices by converting requirements and designs into DSL formats, enabling more automated and efficient code generation, while discussing the challenges of dynamic context building, DSL specification, and the evolving role of LLMs in development pipelines.

AI automationDSLLLM
0 likes · 14 min read
Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development
ByteFE
ByteFE
Jun 28, 2023 · Frontend Development

How GPT is Transforming Frontend Development and UI Interaction

The article examines the rapid rise of GPT models, their technical capabilities and limitations, and how their integration is reshaping software interaction from command‑line to GUI‑plus‑Language UI, offering frontend engineers new opportunities, practical examples, and guidance on leveraging large‑model AI in product design.

AI integrationGPTLLM
0 likes · 21 min read
How GPT is Transforming Frontend Development and UI Interaction
phodal
phodal
Jun 27, 2023 · Artificial Intelligence

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

This article presents a detailed reference architecture for building LLM‑driven applications, using the ArchGuard Co‑mate project to illustrate layered design, local model integration, DSL‑based orchestration, and streaming LLM interfaces, complete with code examples and practical implementation notes.

AI OpsKotlinLLM
0 likes · 10 min read
Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model
ITPUB
ITPUB
Jun 27, 2023 · Artificial Intelligence

Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI

Databricks' $1.3 billion acquisition of MosaicML brings the startup's open‑source MPT models and high‑efficiency training stack into the Lakehouse platform, reflecting a strategic push to embed generative AI across enterprises while emphasizing data control, cost reduction, and open‑source policies.

AIGCDatabricksLLM
0 likes · 8 min read
Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI
DataFunTalk
DataFunTalk
Jun 23, 2023 · Artificial Intelligence

DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction

DeepKE-LLM is an open‑source, extensible knowledge‑graph extraction framework that leverages large language models for entity, relation, and attribute extraction, supports multiple LLMs, provides installation scripts, various usage modes, fine‑tuning pipelines, and integrates with the KnowLM project for advanced instruction‑following capabilities.

DeepKELLMPython
0 likes · 8 min read
DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction
Alibaba Cloud Native
Alibaba Cloud Native
Jun 23, 2023 · Cloud Native

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

This guide explains how to deploy large language models on Alibaba Cloud's ACK using KServe for serverless inference, integrates Fluid for distributed data caching to cut cold‑start latency, provides step‑by‑step commands, performance benchmarks, and practical tips for production‑grade AI model serving.

Cloud NativeFluidKServe
0 likes · 22 min read
Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid
DataFunTalk
DataFunTalk
Jun 19, 2023 · Artificial Intelligence

Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview

The announcement introduces RPI's prestigious computer science department, its extensive GPU resources, collaborations with IBM Research, and detailed profiles of three incoming faculty members—highlighting their research areas in graph neural networks, trustworthy AI, data‑centric AI, drug‑design generative models, and neural‑symbolic reasoning—while inviting PhD and intern applicants to apply with full scholarships and funding support.

Artificial IntelligenceData‑Centric AILLM
0 likes · 8 min read
Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview
phodal
phodal
Jun 18, 2023 · Artificial Intelligence

How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns

The article outlines five practical patterns—natural‑language DSL, streaming DSL, DSL‑guided generation, explicit retry, and dynamic proxying—that enable developers to treat large‑language‑model interactions as first‑class APIs, improving efficiency, accuracy, and user experience across diverse scenarios.

DSLDynamic ProxyLLM
0 likes · 10 min read
How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns
21CTO
21CTO
Jun 16, 2023 · Artificial Intelligence

Why Are LLM Stacks Becoming Essential for Modern Companies?

A comprehensive look at how companies are rapidly adopting large language model APIs, retrieval techniques, and custom model strategies, revealing key statistics, emerging toolchains, and the shifting balance between closed‑source LLM services and open‑source custom stacks.

AI adoptionCustom ModelsLLM
0 likes · 8 min read
Why Are LLM Stacks Becoming Essential for Modern Companies?
ByteFE
ByteFE
Jun 15, 2023 · Artificial Intelligence

Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies

This article explains how to craft efficient prompts by combining clear instructions and questions, discusses prompt injection risks and mitigation with delimiters, addresses hallucinations, and introduces zero‑shot, few‑shot, and chain‑of‑thought prompting techniques for large language models.

Few-shotLLMchain-of-thought
0 likes · 16 min read
Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies
phodal
phodal
Jun 14, 2023 · Industry Insights

What Are the Four Core Principles for LLM‑Powered Software Architecture?

This article outlines four foundational design principles—user‑intent‑driven design, context awareness, atomic capability mapping, and language‑API integration—for building LLM‑centric software architectures, illustrating each with DSL examples, Kotlin implementations, and practical insights on prompt engineering, dynamic context layering, and API evolution.

DSLDesign PrinciplesLLM
0 likes · 10 min read
What Are the Four Core Principles for LLM‑Powered Software Architecture?
phodal
phodal
Jun 11, 2023 · Artificial Intelligence

How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions

This article explains how ArchGuard 2.0 leverages large language models to transform tacit architectural knowledge into dynamic, DSL‑based governance functions, detailing the challenges of traditional tools, the design of the Co‑mate system, and the practical implementation using Kotlin REPL.

APIDSLGovernance
0 likes · 8 min read
How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions
Tencent Cloud Developer
Tencent Cloud Developer
Jun 7, 2023 · Artificial Intelligence

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

The article explains prompt‑engineering fundamentals—definitions, instruction, context, and output formatting—and showcases tricks like few‑shot, chain‑of‑thought, and ReAct, then demonstrates testing with OpenAI APIs, token management, LangChain integration, and low‑code applications such as AI‑generated SQL, API gateways, DSL‑driven UI, chart creation, and vector‑based semantic search.

AIGPTKnowledge Generation
0 likes · 30 min read
Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain
phodal
phodal
Jun 2, 2023 · Artificial Intelligence

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

This comprehensive guide walks programmers through the fundamentals of large language model capabilities, prompt writing and management, new interaction and workflow designs, advanced scenario‑specific applications, and context engineering, offering practical strategies and architectural insights for AI‑native development.

AI ArchitectureContext EngineeringLLM
0 likes · 14 min read
Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI
21CTO
21CTO
Jun 2, 2023 · Artificial Intelligence

Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps

LangChain, launched in 2022, quickly evolved from a Python library to a multi‑environment framework that helps developers build chat‑based applications, agents, and memory‑aware LLM solutions, while integrating with major cloud and AI tooling ecosystems.

AI agentsAuto-GPTChat applications
0 likes · 8 min read
Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps
Architect
Architect
May 29, 2023 · Artificial Intelligence

Understanding Embeddings and Vector Databases for LLM Applications

This article explains what embeddings and vector databases are, how they are generated with models like OpenAI's Ada, why they enable semantic search and help overcome large language model token limits, and demonstrates a practical workflow for retrieving relevant document chunks using cosine similarity.

EmbeddingsLLMinformation retrieval
0 likes · 7 min read
Understanding Embeddings and Vector Databases for LLM Applications
DataFunSummit
DataFunSummit
May 24, 2023 · Artificial Intelligence

Digital Humans and XR Interaction Forum at DataFunCon2023

The DataFunCon2023 Digital Humans and XR Interaction Forum showcases the latest advances in digital human technology, multimodal interaction, emotion computing, and LLM‑enhanced AI beings through expert talks, detailed outlines, and audience benefits, highlighting both research progress and practical applications.

AILLMXR
0 likes · 8 min read
Digital Humans and XR Interaction Forum at DataFunCon2023
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
May 22, 2023 · Artificial Intelligence

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

Microsoft researchers fine‑tuned GPT‑3.x models with LoRA on over 40,000 cloud incident records, evaluated them with six NLP metrics and human interviews, and found that LLMs can generate root‑cause analyses and mitigation steps comparable to BERT models, especially for machine‑detected failures.

AI for operationsGPT-3LLM
0 likes · 8 min read
How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes
Alibaba Cloud Developer
Alibaba Cloud Developer
May 16, 2023 · Artificial Intelligence

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

This article explains why combining large language models with vector databases enables enterprises to create specialized, up‑to‑date chatbots, outlines the underlying principles, describes the ADB‑PG vector‑search capabilities, and provides step‑by‑step implementation details including data processing, indexing, and query examples.

AnalyticDBChatbotLLM
0 likes · 17 min read
How to Build a Company‑Specific Chatbot with LLMs and Vector Databases
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
May 15, 2023 · Artificial Intelligence

How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development

This article analyzes the current landscape of large‑language‑model application frameworks—LangChain, Auto‑GPT, and HuggingGPT—detailing their core capabilities, workflow patterns, limitations, and how they enable developers to integrate LLMs into real‑world products while outlining future platform requirements.

AI frameworksApplication DevelopmentAuto-GPT
0 likes · 17 min read
How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development
DataFunSummit
DataFunSummit
May 4, 2023 · Artificial Intelligence

LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots

A recent study by the LMSYS organization introduces an Elo‑rated, 1v1 battle arena for large language models, ranking open‑source chatbots like Vicuna, Koala, and ChatGLM, while discussing the limitations of traditional benchmarks and the advantages of crowd‑sourced, scalable evaluation.

AI benchmarkingChatbot ArenaElo Rating
0 likes · 7 min read
LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots
Top Architect
Top Architect
Apr 21, 2023 · Artificial Intelligence

Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT

This article explains why and how to fine‑tune the LLaMA‑7B model using the cheap Alpaca‑LoRA approach, covering hardware requirements, dataset preparation, LoRA training, optional model merging and quantization, and provides ready‑to‑run code snippets for single‑ and multi‑GPU setups.

Alpaca-LoRAGPULLM
0 likes · 10 min read
Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT
Continuous Delivery 2.0
Continuous Delivery 2.0
Apr 20, 2023 · Artificial Intelligence

AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide

AutoGPT, the latest autonomous AI system, can independently browse the web, use third‑party tools, and execute tasks without human intervention, exemplified by building a React website in minutes, and this article explains its principles, showcases demos, and provides step‑by‑step installation instructions.

AI toolsAgentGPTAutoGPT
0 likes · 10 min read
AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide
Architect
Architect
Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explains their architectures, scaling trade‑offs, and then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca and ChatGLM, providing references for further study.

AIChatGPTGPT-3
0 likes · 27 min read
Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques
DataFunSummit
DataFunSummit
Mar 25, 2023 · Artificial Intelligence

How GPT‑4 Has Changed NLP Research: Community Perspectives

A collection of Zhihu answers reflects on how the release of GPT‑4 has reshaped NLP research, dividing the community into LLM‑enthusiasts and skeptics, discussing the relevance of parsing, resource‑driven research directions, and the existential challenges faced by researchers.

AIAcademic CommunityGPT-4
0 likes · 10 min read
How GPT‑4 Has Changed NLP Research: Community Perspectives
Programmer DD
Programmer DD
Mar 19, 2023 · Artificial Intelligence

How Visual ChatGPT Adds Image Interaction to ChatGPT – A Deep Dive

Microsoft's open‑source Visual ChatGPT extends ChatGPT with image send/receive capabilities, explains its multimodal architecture, demo scenarios, used visual models, and points to the arXiv paper, highlighting its rapid popularity growth on GitHub.

LLMMicrosoftVisual ChatGPT
0 likes · 4 min read
How Visual ChatGPT Adds Image Interaction to ChatGPT – A Deep Dive
DeWu Technology
DeWu Technology
Mar 17, 2023 · Artificial Intelligence

Prompt‑Ops and LangChain: Engineering LLM Applications

Prompt‑Ops frameworks like LangChain let developers turn pre‑trained LLMs into versatile applications by abstracting model calls, chaining prompts, integrating tools, managing memory, and handling private data, while addressing challenges such as nondeterminism, version control, and prompt injection in production environments.

AI applicationsAgentLLM
0 likes · 15 min read
Prompt‑Ops and LangChain: Engineering LLM Applications
Nightwalker Tech
Nightwalker Tech
Mar 9, 2023 · Artificial Intelligence

AI-Assisted Programming Tools: Research Findings, Comparisons, and Recommendations

This report investigates the impact of large language model‑based AI coding assistants such as Copilot, Tabnine, CodeGeeX, and ChatGPT on frontend and backend development, evaluates their efficiency gains, compares features, pricing, and IDE integration, and provides practical adoption recommendations for development teams.

AILLMcode completion
0 likes · 13 min read
AI-Assisted Programming Tools: Research Findings, Comparisons, and Recommendations
Top Architect
Top Architect
Feb 28, 2023 · Artificial Intelligence

The Economics of Large Language Models and Their Impact on Search

This article analyses the economic feasibility of integrating large language models (LLMs) into search, estimating inference and training costs, exploring hardware efficiency, scaling laws, and future trends, and concludes that while technically viable, the added expense may challenge profitability for major search providers.

AI economicsCloud ComputingLLM
0 likes · 25 min read
The Economics of Large Language Models and Their Impact on Search
Programmer DD
Programmer DD
Feb 15, 2023 · Artificial Intelligence

Can LLM-Powered Search Rival Google? Uncovering the Economics of Large Language Models

This article examines whether large language model‑driven search can replace traditional engines like Google, analyzing the economic feasibility, training and inference costs, hardware considerations, and future scaling trends, while providing a detailed cost framework and insights into the profitability of LLM‑based services.

AI cost analysisCloud ComputingLLM
0 likes · 27 min read
Can LLM-Powered Search Rival Google? Uncovering the Economics of Large Language Models
ITPUB
ITPUB
Feb 6, 2023 · Artificial Intelligence

Can ChatGPT Replace a DBA? Real‑World Tests Reveal Its Strengths and Flaws

The article recounts a hands‑on exploration of ChatGPT’s abilities for database administration, highlighting impressive language understanding, notable mistakes such as confusing pg_resetwal with pg_resetxlog, and the broader implications of large language models for AI‑driven search and workflow automation.

AIChatGPTDatabase
0 likes · 8 min read
Can ChatGPT Replace a DBA? Real‑World Tests Reveal Its Strengths and Flaws
21CTO
21CTO
Jan 26, 2023 · Artificial Intelligence

Open-Source ChatGPT Training: LAION, CarperAI, and Phil Wang’s RLHF Implementations

This article surveys recent open‑source projects—including LAION’s OpenAssistant, CarperAI’s trlX, and Phil Wang’s ChatGPT implementation—that provide RLHF‑based training pipelines for large language models, while highlighting community expectations, resource challenges, and future accessibility goals.

Artificial IntelligenceChatGPTLAION
0 likes · 7 min read
Open-Source ChatGPT Training: LAION, CarperAI, and Phil Wang’s RLHF Implementations
DataFunTalk
DataFunTalk
Jan 10, 2023 · Artificial Intelligence

Paradigm Shifts in Large Language Model Research and Future Directions

The article reviews the evolution of large language models from the pre‑GPT‑3 era to the present, analyzes the conceptual and technical gaps between Chinese and global research, and outlines key future research directions such as scaling laws, prompting techniques, multimodal training, and efficient model architectures.

AI researchChatGPTIn-Context Learning
0 likes · 73 min read
Paradigm Shifts in Large Language Model Research and Future Directions
IT Architects Alliance
IT Architects Alliance
Dec 13, 2022 · Artificial Intelligence

Technical Principles and Training Process of ChatGPT

The article explains ChatGPT’s underlying technology, detailing its three-stage training pipeline—supervised fine‑tuning, reward‑model learning, and reinforcement learning with PPO—while discussing its strengths, limitations, and potential integration with traditional search engines.

AIChatGPTLLM
0 likes · 14 min read
Technical Principles and Training Process of ChatGPT
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 11, 2022 · Artificial Intelligence

Language Model as a Service and Black‑Box Optimization: Insights from Prof. Qiu Xipeng’s Talk

Prof. Qiu Xipeng’s talk highlighted how large language models can be offered as a service and efficiently adapted via in‑context learning, lightweight label‑tuning, and gradient‑free black‑box optimization, showcasing a unified asymmetric Transformer (CPT) that handles understanding, generation, ABSA and NER tasks while reducing resource demands.

Black-Box OptimizationLLMNLP
0 likes · 15 min read
Language Model as a Service and Black‑Box Optimization: Insights from Prof. Qiu Xipeng’s Talk