Tagged articles

2074 articles

Feb 19, 2024 · Artificial Intelligence

AI/ML Technology Articles Collection

This collection compiles technical articles that explore diverse AI/ML applications, from deploying large language models on MacBooks and building e‑commerce recommendation engines, to leveraging the LangChain framework, creating AIGC‑driven fashion solutions, and implementing Stable Diffusion for image generation.

AI · AIGC · Deployment

0 likes · 1 min read

DataFunTalk

Feb 19, 2024 · Artificial Intelligence

Large Language Model Inference Overview and Performance Optimizations

This article presents a comprehensive overview of large language model inference, detailing the prefill and decoding stages, key performance metrics such as throughput, latency and QPS, and a series of system-level optimizations—including pipeline parallelism, dynamic batching, specialized attention kernels, virtual memory allocation, KV‑cache quantization, and mixed‑precision strategies—to improve GPU utilization and overall inference efficiency.

GPU · LLM · Latency

0 likes · 24 min read

Java Tech Enthusiast

Feb 16, 2024 · Artificial Intelligence

Google's Gemini 1.5: Breakthrough in Long-Context Understanding and Multimodal Capabilities

Google’s Gemini 1.5, a new multimodal Mixture‑of‑Experts model, supports up to a million‑token context (10 million internally), can understand text, video, audio and code, learns a new language from a single prompt, and is already being used by Samsung, Jasper and Quora, positioning it as a direct challenger to OpenAI’s flagship models.

Gemini 1.5 · Google AI · LLM

0 likes · 7 min read

AI Large Model Application Practice

Feb 15, 2024 · Artificial Intelligence

How Generative AI is Transforming RPA: Three Powerful Integration Scenarios

This article explores three key ways large language models and multimodal generative AI can enhance robotic process automation, from cognition‑boosted RPA and AI‑Agent collaboration to visual‑intelligent navigation, illustrating practical examples and future prospects for smarter digital workers.

AI Agent · Automation · LLM

0 likes · 12 min read

NewBeeNLP

Feb 11, 2024 · Industry Insights

What 2023 Taught Us About LLMs and AI‑Guided Optimization

The author reviews a year of rapid progress in large language models, highlighting breakthrough papers such as Positional Interpolation, StreamingLLM, Deja Vu, and RLCD, and discusses how AI‑guided optimization techniques like SurCo, LANCER, and GenCo are reshaping research and industry applications.

LLM · Large Language Models · Transformers

0 likes · 13 min read

Rare Earth Juejin Tech Community

Feb 7, 2024 · Artificial Intelligence

Step-by-Step Guide to Building Multi‑Agent Applications with LangChain LangGraph in Google Colab

This tutorial walks through installing LangChain, LangGraph and related packages in Google Colab, configuring environment variables, defining search and Twitter‑writer tools, constructing a StateGraph workflow with supervisor logic, and executing a multi‑agent LLM pipeline using LangChain’s new multi‑agent capabilities.

AI · Google Colab · LLM

0 likes · 11 min read

21CTO

Feb 4, 2024 · Artificial Intelligence

Running Large Language Models on Raspberry Pi with Ollama: A Step‑by‑Step Guide

This tutorial walks you through installing Ollama on a Raspberry Pi, exploring TinyLlama, Phi, and LLaVA models, and demonstrates how to run and interact with these LLMs locally, including hardware requirements and practical command examples.

AI · Edge AI · LLM

0 likes · 5 min read

DataFunSummit

Feb 3, 2024 · Artificial Intelligence

Practical Application of Large Language Models in MaShang Consumer Finance: From Model Building to Deployment

This article details how MaShang Consumer Finance leverages large language models for sales, collection, and customer service, covering company background, AI research achievements, model training infrastructure, data‑quality and compliance challenges, prompt engineering, inference acceleration, evaluation methods, and lessons learned from real‑world deployment.

Compliance · Finance · LLM

0 likes · 21 min read

NewBeeNLP

Feb 2, 2024 · Artificial Intelligence

ControlRec: Aligning LLMs with IDs to Boost Personalized Recommendations

ControlRec introduces heterogeneous feature matching and instruction contrastive learning to bridge the semantic gap between language models and discrete user/item IDs, enabling more effective personalized recommendation across multiple tasks such as rating prediction, sequential recommendation, and explanation generation.

ControlRec · Heterogeneous Feature Matching · Instruction Contrast Learning

0 likes · 10 min read

Rare Earth Juejin Tech Community

Jan 28, 2024 · Artificial Intelligence

Building a Weibo Influencer Finder with LangChain and LLM

This article demonstrates how to use LangChain, LLMs, and SerpAPI to create a Weibo influencer‑search tool that extracts UID numbers, scrapes profile data, filters Chinese content, and prepares the information for automated marketing outreach.

Agent · LLM · LangChain

0 likes · 9 min read

Ctrip Technology

Jan 26, 2024 · Artificial Intelligence

Implementing Plugin Functionality for a Large Language Model Chatbot Using Function Calling and Asynchronous Execution

This article explains how Ctrip's security R&D team built a web‑based LLM chatbot with version‑2.0 features such as plugin support, function calling, synchronous and asynchronous execution, WebSocket/Socket.IO communication, and provides full Python code examples for defining and invoking plugins.

AI · Function Calling · LLM

0 likes · 15 min read

Rare Earth Juejin Tech Community

Jan 22, 2024 · Artificial Intelligence

Prompt Engineering and CAMEL: Role‑Playing AI Agents for Automated Prompt Generation

This article explains how Prompt Engineering combined with the CAMEL framework enables role‑playing AI agents to automatically generate and manage prompts, illustrates the concept with a stock‑trading example, and provides Python code using LangChain to build a marketing‑automation agent for a small business.

AI agents · CAMEL · Inception Prompting

0 likes · 11 min read

Rare Earth Juejin Tech Community

Jan 21, 2024 · Artificial Intelligence

Understanding Pretraining and Fine‑Tuning of Large Language Models: Methods, Resources, and Practical Applications

This article explains the concepts of pretraining and fine‑tuning for large language models, compares full‑parameter, LoRA and QLoRA approaches, discusses resource consumption, introduces the ModelScope SWIFT framework with code examples, and shows how fine‑tuning can improve data‑visualisation tasks while reducing token usage.

Data visualization · LLM · LoRA

0 likes · 24 min read

Rare Earth Juejin Tech Community

Jan 20, 2024 · Artificial Intelligence

Understanding LangChain Callback Mechanism, Custom Async Handlers, and Token Cost Management in Python

This article introduces LangChain's callback mechanism, demonstrates how to implement custom synchronous and asynchronous callbacks in Python, compares them with JavaScript async patterns, and shows how to monitor token usage and control costs using OpenAI callbacks.

LLM · LangChain · Python

0 likes · 10 min read

AI Large Model Application Practice

Jan 18, 2024 · Operations

How to Build an RPA Bot with Robot Framework and Compare It to AI Agents

This article explains the fundamentals of Robotic Process Automation (RPA), compares RPA with BPA and AI Agents, and provides a step‑by‑step tutorial for building and running an RPA robot using Robot Framework and the open‑source RPA Framework, including full code examples.

AI Agent · Automation · LLM

0 likes · 14 min read

Bitu Technology

Jan 17, 2024 · Artificial Intelligence

Rosetta Stone: Scalable ID Mapping System for Tubi's Content Library Using LLMs and Embeddings

This article describes how Tubi built the Rosetta Stone system—a flexible ID mapping workflow that leverages large language models, embedding similarity ranking, and K‑nearest‑neighbors to unify and enrich metadata across a 200,000‑title library, improve content recommendation, and streamline operations.

Big Data · Embeddings · LLM

0 likes · 10 min read

Tencent Cloud Developer

Jan 16, 2024 · Frontend Development

Frontend Technology Review 2023 and Outlook 2024

The 2023 frontend review highlights TypeScript’s size and speed gains, ECMAScript 2023 features, evolving frameworks like React, Vue, Svelte, Angular and emerging Qwik, while Rust tooling, Bun, browser changes, AI‑driven low‑code, and WASM progress set the stage for 2024’s LLM‑powered, Rust‑centric, cross‑platform development.

Bun · D2C · HarmonyOS

0 likes · 49 min read

Xiaohongshu Tech REDtech

Jan 12, 2024 · Artificial Intelligence

Negative Sample Assisted Distillation for Large Language Models

The AAAI‑2024 paper introduces a Negative Sample Assisted Distillation framework—comprising Negative Assistance Training, Negative Calibration Enhancement, and Adaptive Self‑Consistency—that leverages both correct and incorrect reasoning examples to train a compact LLaMA‑7B student, achieving up to 75.75 % accuracy gains over fine‑tuning on MATH and improving out‑of‑domain benchmarks.

Knowledge Distillation · LLM · chain-of-thought

0 likes · 13 min read

Data Thinking Notes

Jan 7, 2024 · Artificial Intelligence

Boost Text2SQL Accuracy with Retrieval‑Augmented Generation and LangChain

This article explains how Retrieval‑Augmented Generation (RAG) can improve LLM‑based Text2SQL conversion, covering RAG fundamentals, LangChain implementation steps, practical enhancements for SQL agents, and future directions for integrating domain knowledge.

AI agents · LLM · LangChain

0 likes · 16 min read

Rare Earth Juejin Tech Community

Jan 7, 2024 · Artificial Intelligence

A Comprehensive Guide to Generative AI Tools, Prompts, and Learning Resources

This article provides an extensive overview of generative AI concepts such as AIGC and AGI, evaluates various coding assistants and chat models, offers prompt engineering tips, and lists numerous free and paid AI tools and learning resources for developers and everyday users.

AI · AIGC · LLM

0 likes · 15 min read

Baobao Algorithm Notes

Jan 6, 2024 · Artificial Intelligence

How to Pick the Best Fine‑Tuning Data for LLMs with the Nuggets Method

This article explains the Nuggets approach for selecting a high‑quality subset of annotated instructions to fine‑tune large language models, describing its three inputs, the gold‑score computation based on perplexity improvement, empirical results on Alpaca, and practical considerations such as task‑set design.

LLM · Nuggets · data selection

0 likes · 7 min read

DaTaobao Tech

Jan 5, 2024 · Mobile Development

Edge Deployment and Performance Optimization of Large Language Models with MNN

The upgraded mnn‑llm framework adds a unified llm‑export pipeline, cross‑platform inference with tokenizers and disk‑embedding, and ARM‑focused linear‑layer optimizations—including SIMD, hand‑written assembly and 4‑bit quantization—that dramatically speed up prefilling and achieve real‑time LLM conversation on mobile devices within a 2 GB memory budget, outperforming llama.cpp, fastllm and mlc‑llm.

ARM CPU · LLM · MNN

0 likes · 17 min read

DataFunSummit

Jan 4, 2024 · Big Data

YY Live Business Metric Governance Practice

This presentation details YY Live’s data product team’s end‑to‑end business metric governance practice, covering problem background, analysis, governance objectives, multi‑team collaboration, implementation steps, achieved efficiencies, and future directions leveraging large language models.

Big Data · Data Platform · LLM

0 likes · 16 min read

Baobao Algorithm Notes

Jan 2, 2024 · Artificial Intelligence

Uncovering Mixtral‑8x7B: How MoE Experts Shape Performance and Training

This article analyses the Mixtral‑8x7B Mixture‑of‑Experts LLM, explains its gate‑driven 8‑expert architecture, presents a simplified PyTorch implementation, and reports a series of experiments that probe top‑2 gating during training, individual expert contributions, task‑specific pre‑training, the impact of expert count, and similarity with Mistral‑7B, ultimately offering hypotheses about its training pipeline.

LLM · Mixtral · Mixture of Experts

0 likes · 14 min read

Rare Earth Juejin Tech Community

Dec 29, 2023 · Artificial Intelligence

Overview of Major Benchmark Datasets for Evaluating Large Language Models

This article provides a comprehensive overview of major benchmark datasets—including CMMLU, MMLU, C‑Eval, GSM8K, Gaokao‑Bench, AGIEval, MATH, BBH, HumanEval, and MBPP—used to evaluate large language models' knowledge, reasoning, and coding abilities, and summarizes related leaderboards and evaluation tools.

Artificial Intelligence · LLM · dataset

0 likes · 14 min read

Huolala Tech

Dec 28, 2023 · Artificial Intelligence

How Huolala Built a Low‑Code LLM Platform to Accelerate AI Agent Deployment

Huolala created a visual, drag‑and‑drop LLM application platform that streamlines AI integration, reduces development costs, and enables rapid deployment of agents across marketing, invitation, advertising, and modeling scenarios, boosting efficiency by over 98% while cutting integration time from hours to minutes.

AI · Agent · LLM

0 likes · 13 min read

Alibaba Cloud Big Data AI Platform

Dec 28, 2023 · Big Data

How LLMs Can Revolutionize Data Warehouse ETL: From Push‑Pull to Stable Queries

This article explores the challenges of traditional data‑warehouse ETL, compares push and pull models, and presents an LLM‑driven architecture that generates both on‑demand SQL queries and streaming ETL code with automatic error‑feedback loops, dramatically improving cost, accuracy, and maintainability.

Big Data · Data Warehouse · ETL

0 likes · 16 min read

Alibaba Cloud Native

Dec 27, 2023 · Cloud Computing

One‑Click Deployment of LLMs to Alibaba Cloud Function Compute with SwingDeploy

This guide explains how to quickly select a ModelScope open‑source LLM, deploy it to Alibaba Cloud Function Compute using the SwingDeploy one‑click feature, enable reserved idle billing, and evaluate the cost savings compared with traditional GPU provisioning.

Function Compute · GPU · LLM

0 likes · 11 min read

DaTaobao Tech

Dec 27, 2023 · Artificial Intelligence

Deploying a Private LLM Knowledge Base on a MacBook

The guide walks through installing and quantizing the open‑source ChatGLM3‑6B model and the m3e‑base embedder on a MacBook, wrapping them with a FastAPI OpenAI‑compatible service, routing requests through a One‑API gateway, storing metadata in MongoDB and vectors in PostgreSQL pgvector, deploying FastGPT for RAG, ingesting data, and demonstrating 5‑7 second response times, while outlining future improvements.

ChatGLM3 · Deployment · FastAPI

0 likes · 23 min read

Rare Earth Juejin Tech Community

Dec 27, 2023 · Artificial Intelligence

Comprehensive Overview of Large Language Models: Capabilities, Limitations, Deployment, and Future Trends

This article provides a detailed examination of large language models, covering their underlying technologies, capabilities and constraints, model families, training processes, cloud and edge deployment challenges, agent architectures, and emerging trends, offering practical insights for developers, product managers, and researchers.

Artificial Intelligence · LLM · Model Deployment

0 likes · 43 min read

21CTO

Dec 15, 2023 · Artificial Intelligence

Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps

The article outlines five major AI engineering trends for 2024—including the rise of AI engineers, evolving LLM tech stacks, open‑source large models, vector databases, and AI agents—highlighting how these shifts will reshape application development and industry competition.

2024 trends · AI Engineering · AI agents

0 likes · 9 min read

DataFunSummit

Dec 15, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article explores how large language models can be incorporated into recommender systems, discussing background challenges, specific integration points across the recommendation pipeline, practical implementation methods, experimental results, and future research directions, while highlighting industrial considerations and potential improvements.

Industrial Applications · LLM · Model Fusion

0 likes · 20 min read

Data Thinking Notes

Dec 12, 2023 · Artificial Intelligence

Boosting Text‑to‑SQL Accuracy with Prompt Engineering and LLMs

This article examines the challenges of LLM‑based Text‑to‑SQL such as hallucinations, data‑security risks, and user input errors, and presents prompt‑engineering strategies, fine‑tuning comparisons, prompt types, code examples, and experimental results to improve reliability and cost‑effectiveness.

Artificial Intelligence · LLM · LangChain

0 likes · 15 min read

NetEase Cloud Music Tech Team

Dec 12, 2023 · Artificial Intelligence

How LangChain Powers AI Agents: Principles, Debugging, and Real‑World Optimizations

This article explains the concept of AI Agents in the large‑language‑model era, details LangChain's implementation mechanics, shares practical challenges and optimizations encountered by NetEase Cloud Music, and provides step‑by‑step code examples and performance insights for building robust AI Agents.

AI Agent · Debugging · LLM

0 likes · 20 min read

AI Large Model Application Practice

Dec 12, 2023 · Artificial Intelligence

Boost Enterprise LLM Performance: Solving Common RAG Challenges

This article explains Retrieval‑Augmented Generation for enterprise LLMs, outlines four production‑grade problems, and presents practical solutions such as parent‑child chunking, multi‑vector and multi‑query retrieval, and context‑aware question refinement with concrete prompts and workflow diagrams.

LLM · RAG

0 likes · 13 min read

Rare Earth Juejin Tech Community

Dec 8, 2023 · Artificial Intelligence

Simplifying Transformer Blocks: Removing Residual Connections, LayerNorm, and Other Components without Losing Performance

A recent ETH Zurich paper shows that standard Transformer blocks can be drastically simplified by removing residual connections, LayerNorm, projection and value parameters, and even MLP sub‑block components, achieving up to 16% fewer parameters and comparable training speed and downstream performance on both GPT‑style decoders and BERT models.

AI · LLM · Signal Propagation

0 likes · 11 min read

Sohu Tech Products

Dec 6, 2023 · Databases

GPTuner: LLM-Driven PostgreSQL Knob Tuning

GPTuner, an LLM‑driven system for PostgreSQL knob tuning developed by researchers at Sichuan University, demonstrates that knowledge processing, parameter selection, search‑range optimization, and a two‑stage Bayesian framework each significantly improve performance, while costing roughly 880 000 GPT‑4 tokens (≈ $30) with reusable knowledge.

Ablation Study · Database Tuning · GPTuner

0 likes · 9 min read

DataFunTalk

Dec 6, 2023 · Artificial Intelligence

Distributed Training Techniques and Quantitative Analysis for Large Language Models (GPT‑175B)

This article presents a comprehensive overview of state‑of‑the‑art distributed training methods for large language models, using GPT‑175B as a case study to analyze memory, communication, and compute overheads, and to recommend practical optimization strategies such as tensor, pipeline, and sequence parallelism, ZeRO‑1 optimizer, and selective activation checkpointing.

GPU memory optimization · LLM · Megatron

0 likes · 22 min read

Rare Earth Juejin Tech Community

Dec 6, 2023 · Artificial Intelligence

Multi-Agent Research Overview, Open-Source Implementations, and Design Considerations

This article reviews the background of multi‑agent systems, compares major open‑source frameworks such as AutoGen, MetaGPT, AgentVerse, and XAgent, discusses design principles, collaboration strategies, and offers conclusions on LLM‑driven versus SOP‑driven approaches for building multi‑agent applications.

AI · Agent Framework · AutoGen

0 likes · 15 min read

Alibaba Cloud Big Data AI Platform

Dec 5, 2023 · Artificial Intelligence

How to Efficiently Fine‑Tune Qwen LLMs on Alibaba Cloud PAI Lingjun

This guide walks you through setting up Alibaba Cloud PAI Lingjun resources, preparing Qwen‑7B/14B/72B models, preprocessing large‑scale WuDao data, configuring distributed training with Megatron‑LM, performing continued pre‑training and supervised fine‑tuning, and finally deploying the model as an online service via PAI‑EAS.

Alibaba Cloud · LLM · Megatron-LM

0 likes · 27 min read

Huawei Cloud Developer Alliance

Nov 30, 2023 · Artificial Intelligence

Mastering LLM Text Generation: Decoding Methods Explained

This review of the recent MindSpore NLP public class walks through the fundamentals of large language model text generation, detailing deterministic decoding such as greedy and beam search, stochastic sampling techniques like temperature, top‑k and top‑p, and advanced methods including constrained beam, contrastive, and assisted search, with illustrative examples.

Beam Search · Greedy Search · LLM

0 likes · 5 min read

Rare Earth Juejin Tech Community

Nov 29, 2023 · Artificial Intelligence

Building a Private LLM‑Powered Knowledge Base with LangChain and ChatGLM3

This article explains how to migrate personal notes into a private knowledge base by combining a large language model with an external vector store, detailing the concepts of tokenization, embedding, vector databases, and step‑by‑step deployment using LangChain‑Chatchat and the open‑source ChatGLM3 model.

ChatGLM3 · Embedding · Knowledge Base

0 likes · 10 min read

Data Thinking Notes

Nov 28, 2023 · Artificial Intelligence

Build a Text‑to‑SQL App with LangChain and OpenAI: Step‑by‑Step Guide

This article explains how to build a Text‑to‑SQL application using LangChain, OpenAI LLMs, and SQLDatabaseChain, covering the fundamentals of Text2SQL, LangChain components, code examples, and a practical SQLite case that transforms natural‑language questions into executable SQL queries.

LLM · LangChain · Python

0 likes · 12 min read

Open Source Tech Hub

Nov 25, 2023 · Artificial Intelligence

How to Deploy FastGPT Locally with Docker Compose: A Step‑by‑Step Guide

This guide walks you through installing Docker, configuring Docker‑Compose, setting up FastGPT’s config files, launching the containers, and creating a private knowledge base to enable AI‑driven question answering on your own server.

AI deployment · Docker-Compose · FastGPT

0 likes · 10 min read

DataFunSummit

Nov 20, 2023 · Artificial Intelligence

ModelScope Agents: Open‑Source LLM Agent Framework and Practical Guide

This article introduces ModelScope Agents, an open‑source LLM‑based agent framework that addresses limitations of the GPT Store, outlines its features, provides installation and usage instructions, showcases an RPG game example, and invites the community to contribute to its roadmap.

AI · Agent Framework · LLM

0 likes · 7 min read

Alibaba Cloud Big Data AI Platform

Nov 16, 2023 · Artificial Intelligence

How Alibaba Cloud’s AI-Powered OpenSearch Boosts Search Accuracy and Cuts Costs

Alibaba Cloud unveiled AI-driven upgrades to its OpenSearch and Elasticsearch services, highlighting LLM‑based conversational search, three‑fold vector retrieval speed gains, and up to 70% cost reductions through serverless architectures and extensive performance optimizations.

Elasticsearch · LLM · OpenSearch

0 likes · 6 min read

JD Retail Technology

Nov 14, 2023 · Artificial Intelligence

An Overview of LangChain: Core Concepts, Components, and Practical Applications

This article introduces LangChain—a Python framework for building LLM‑driven applications—explains its core components such as models, indexes, chains, memory, and agents, and provides practical code examples for document summarization, retrieval‑augmented QA, and future development directions.

LLM · LangChain · PromptTemplate

0 likes · 19 min read

Baobao Algorithm Notes

Nov 13, 2023 · Artificial Intelligence

Mastering LLM Fundamentals: Tokenizers, Layer Norm, and PEFT Explained

This article provides a comprehensive technical guide on large language model fundamentals, covering tokenizer construction methods such as BPE, WordPiece, and SentencePiece, detailed explanations of Layer Normalization variants, Deep Norm concepts with code, and an overview of parameter‑efficient fine‑tuning techniques like LoRA and PEFT.

Artificial Intelligence · LLM · Layer Normalization

0 likes · 36 min read

Data Thinking Notes

Nov 12, 2023 · Artificial Intelligence

Unlocking LLM Power: Semantic Search, Private Knowledge Bases, and Text‑to‑SQL for Data Teams

This article explores how large language models can boost data workflows by using embeddings for semantic retrieval, building domain‑specific knowledge bases for private Q&A, generating SQL code from natural language, and automating exploratory data analysis, offering practical steps and visual examples.

Embedding · Knowledge Base · LLM

0 likes · 7 min read

Baobao Algorithm Notes

Nov 7, 2023 · Artificial Intelligence

A Complete Technical Guide to LLM Foundations, Advanced Topics, Fine‑Tuning, and LangChain Applications

This article provides an in‑depth technical overview of large language models (LLMs), covering core model families, architectural differences, emergent abilities, common challenges such as repetition and token limits, detailed fine‑tuning strategies including PEFT, practical guidance for training custom models, and a thorough introduction to the LangChain framework with code examples, core concepts, and troubleshooting tips for building LLM‑powered applications.

LLM · LangChain · Vector Store

0 likes · 97 min read

Huawei Cloud Developer Alliance

Nov 3, 2023 · Artificial Intelligence

Can LLMs Master Lifelong Learning? Exploring MoE and Continuous Adaptation

This article explains how large language models can achieve continual lifelong learning, outlines the key properties required, reviews mixture‑of‑experts (MoE) techniques—including sparse MoE, GShard, Switch Transformer, GLaM and PanGu‑Sigma—and discusses the remaining challenges such as model complexity, expert balancing and distributed communication overhead.

Artificial Intelligence · LLM · Lifelong Learning

0 likes · 9 min read

JD Tech

Nov 2, 2023 · Artificial Intelligence

An Introduction to LangChain: Core Components, Usage Patterns, and Practical Code Examples

This article explains what LangChain is, outlines its core components such as Models, Indexes, Chains, Memory and Agents, and demonstrates how to build LLM‑driven applications with detailed Python code snippets, visual diagrams, and future development suggestions.

AI Framework · LLM · LangChain

0 likes · 20 min read

DataFunSummit

Nov 1, 2023 · Artificial Intelligence

Exploring Large Language Models for Recommendation Systems: Experiments and Insights

This article investigates how large language models can be applied to recommendation tasks, presenting two usage strategies, experimental evaluations on multiple datasets, comparisons with traditional baselines, and analyses of prompting methods, cost, and cold‑start performance.

Artificial Intelligence · LLM · Prompt Engineering

0 likes · 13 min read

DataFunSummit

Nov 1, 2023 · Artificial Intelligence

DataFunCon2023 Shenzhen: Program Overview and Session Highlights

DataFunCon2023 Shenzhen showcases a comprehensive program featuring expert talks on building Data+LLM applications, large-scale storage, cloud‑native architectures, metric systems, data governance, AB testing, and industry‑specific large language model use cases across finance, gaming, advertising, and more, providing valuable insights for practitioners and researchers alike.

@Data · AIGC · Artificial Intelligence

0 likes · 50 min read

Software Development Quality

Oct 27, 2023 · Artificial Intelligence

TestAgent: Open-Source 7B LLM for Multi-Language Test Generation

TestAgent introduces an open-source 7B large language model tailored for software testing, offering multi‑language test case generation, automatic assertion completion, and a lightweight engineering framework with quick‑start scripts, performance benchmarks, and deployment options for various hardware accelerators.

AI model · LLM · Multi-language Generation

0 likes · 10 min read

JD Retail Technology

Oct 26, 2023 · Artificial Intelligence

Leveraging Large Language Models for Text-to-SQL: Prompt Design and End-to-End Pipeline

This article explains how large language models can be used to convert natural language queries into SQL statements, describes two main approaches—direct generation and fine‑tuned open‑source models—details prompt engineering techniques, and outlines an end‑to‑end pipeline that executes the generated SQL and summarizes results.

ChatGLM · Data Privacy · LLM

0 likes · 7 min read

Alibaba Cloud Native

Oct 24, 2023 · Cloud Native

Deploy a Qwen‑Powered AI Assistant on Alibaba Cloud Function Compute in 5 Minutes

This tutorial walks you through quickly setting up a Qwen‑based AI assistant on Alibaba Cloud Function Compute, covering prerequisite API‑key acquisition, deployment steps, password protection, and how to access the running service.

AI · Cloud Native · Function Compute

0 likes · 4 min read

Open Source Tech Hub

Oct 22, 2023 · Artificial Intelligence

How to Integrate Xunfei Starfire Cognitive Model into PHP Projects – Step-by-Step Guide

This guide walks you through the background of Xunfei's Starfire large language model, its 2.0 features, account setup, obtaining API credentials, cloning the example repository, installing dependencies, configuring keys, and troubleshooting common errors for PHP integration.

AI · API · LLM

0 likes · 7 min read

phodal

Oct 19, 2023 · Operations

Can LLMs Revolutionize Code Review? Inside AutoDev’s AI‑Powered Approach

The article examines how rising code volume and AI‑generated snippets challenge traditional code review, proposes an LLM‑assisted workflow using AutoDev and DevOpsGenius, details prompt design, commit filtering, and implementation steps, and discusses the benefits and limitations for different team roles.

AI automation · Code Review · DevOps

0 likes · 9 min read

Alibaba Cloud Big Data AI Platform

Oct 19, 2023 · Artificial Intelligence

How to Build a Retrieval‑Augmented LLM Knowledge Base on Alibaba Cloud

This guide details a complete end‑to‑end solution for constructing a large‑language‑model knowledge‑base chatbot on Alibaba Cloud, covering background, modular architecture, vector database selection, text preprocessing, embedding models, LLM fine‑tuning, prompt engineering, deployment with PAI‑EAS and BladeLLM, and real‑world results.

AI · LLM · LangChain

0 likes · 37 min read

Baobao Algorithm Notes

Oct 19, 2023 · Artificial Intelligence

Efficient LLM Deployment: Low‑Precision, Flash Attention, and Architecture Tricks

This article reviews the main memory and compute challenges of deploying large language models and presents practical solutions—including low‑precision arithmetic, flash attention, advanced positional embeddings, key‑value caching, and quantization techniques—backed by code examples and performance measurements on models such as OctoCoder.

Flash Attention · LLM · Transformers

0 likes · 35 min read

Architect

Oct 18, 2023 · Artificial Intelligence

Code Understanding: Techniques, Applications, and AI‑Driven Solutions

This article explores the fundamentals of code understanding, including static, dynamic, and non‑code analysis, presents a three‑layer architecture for scalable code comprehension, and demonstrates practical AI‑enhanced applications such as intelligent unit testing, dead‑code detection, and AI‑based static analysis within CI/CD pipelines.

AI · LLM · ci/cd

0 likes · 16 min read

AI Large Model Application Practice

Oct 18, 2023 · Artificial Intelligence

How to Extract and Embed Tables and Images from PDFs for Multimodal RAG

This article explains a practical approach to parsing PDFs containing text, tables, and images, using the open‑source Unstructured library and LLaVA model, then embedding each modality into a vector store with multi‑vector retrieval to enable accurate semantic search in private‑knowledge RAG pipelines, with optional LangChain integration.

Embeddings · LLM · LangChain

0 likes · 12 min read

Ximalaya Technology Team

Oct 18, 2023 · Artificial Intelligence

The Evolution of AI Agents: From Philosophy to Modern Implementations

Tracing AI agents from Aristotle’s and Zhuangzi’s philosophical notions through the coining of “agent” in computer science to today’s learning‑based systems powered by large language models, the article outlines key milestones, core components—LLM brain, memory, planning, tool use—and showcases applications such as AlphaGo, Siri, and autonomous platforms, while forecasting their expanding, industry‑wide ubiquity.

AI agents · Artificial Intelligence · Autonomous Systems

0 likes · 21 min read

Model Perspective

Oct 15, 2023 · Artificial Intelligence

How to Use Large Language Models Ethically in Math Modeling Contests

COMAP’s new policy outlines why and how teams in mathematical modeling competitions should responsibly employ large language models and generative AI, detailing guiding principles, risks, citation requirements, and ethical considerations to ensure fairness, transparency, and academic integrity.

AI Policy · LLM · academic integrity

0 likes · 9 min read

phodal

Oct 15, 2023 · Artificial Intelligence

How Prompt Design Shapes AIGC Tool Architecture: Lessons from Copilot, Bloop, and JetBrains AI

This article analyzes how carefully crafted prompts influence the architecture of complex AIGC applications, comparing tools like GitHub Copilot, JetBrains AI Assistant, and Bloop, and presents practical strategies and design patterns for building robust, context‑aware AI‑driven development environments.

AIGC · IDE plugins · LLM

0 likes · 13 min read

dbaplus Community

Oct 14, 2023 · Artificial Intelligence

Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot

This guide explains the Retrieval‑Augmented Generation (RAG) technique, detailing how user queries are matched to private knowledge bases, how relevant passages are retrieved, and how large language models use those passages to generate context‑aware answers, complete with code examples and practical tips.

Chatbot · Embedding · LLM

0 likes · 19 min read

21CTO

Oct 12, 2023 · Frontend Development

How Vercel’s AI‑Powered v0 Tool Is Transforming Frontend Development

Vercel has launched v0, an AI‑driven tool that lets developers describe desired UI components in plain text and receive generated frontend code, streamlining creation, offering multiple design options, and shifting developer focus toward creativity and design.

AI · LLM · Vercel

0 likes · 4 min read

Zhuanzhuan Tech

Oct 11, 2023 · Artificial Intelligence

Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering

This article describes how to construct an intelligent customer‑service assistant using ChatGPT for natural‑language understanding, BERT for user‑question classification, and Sentence‑BERT for knowledge‑selection, detailing system architecture, prompt design, model training, performance results, and practical cost reductions.

BERT · ChatGPT · Intelligent Customer Service

0 likes · 16 min read

ByteFE

Oct 11, 2023 · Artificial Intelligence

CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base

This article describes the design and implementation of a code‑review assistant powered by open‑source large language models and a privately hosted knowledge base, covering background, pain points, system architecture, model selection, vector‑store integration, prompt engineering, diff parsing, and practical reflections.

AI · Code Review · Knowledge Base

0 likes · 24 min read

DataFunTalk

Oct 10, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article surveys how large language models can be incorporated into recommender systems, discussing their strengths and limitations, outlining where and how they can be applied across the recommendation pipeline, presenting recent research examples, and highlighting challenges and future directions for industrial deployment.

LLM · feature engineering · recommender systems

0 likes · 20 min read

Alibaba Cloud Big Data AI Platform

Oct 10, 2023 · Artificial Intelligence

Create a Custom Enterprise Conversational Search with Alibaba Cloud OpenSearch Vector & LLM

This guide walks you through setting up Alibaba Cloud OpenSearch Vector Search and LLM Intelligent Q&A editions, covering environment preparation, instance creation, data source configuration, field and index setup, document ingestion, query processing, and a complete Java SDK demo for building a flexible enterprise conversational search system.

Alibaba Cloud · Conversational AI · Java SDK

0 likes · 20 min read

Baidu Geek Talk

Oct 9, 2023 · Artificial Intelligence

Code Understanding Technology: Building White-Box Software Knowledge Graph at Baidu

Baidu’s white‑box code understanding platform combines static, dynamic, non‑code and LLM‑based analyses in a three‑layer architecture that accelerates C/C++ processing ninefold, supports multiple languages, and powers applications such as intelligent unit testing, orphan‑function cleanup and AI‑driven risk detection, while future integration with models like GPT‑4 aims to enable multi‑turn code Q&A, automated refactoring and predictive testing.

AST · Baidu · Code Understanding

0 likes · 15 min read

Baobao Algorithm Notes

Oct 8, 2023 · Interview Experience

Must‑Know Large‑Model Interview Questions for RLHF Candidates

The article shares a practitioner’s transition story from reinforcement‑learning‑focused game AI to large‑model work, outlines the challenges faced during job hunting at major Chinese tech firms, and provides a curated list of 23 technical interview questions covering PPO, RLHF, dataset evaluation, model fine‑tuning, and broader LLM concepts.

AI research · Interview preparation · LLM

0 likes · 10 min read

21CTO

Oct 4, 2023 · Artificial Intelligence

How LangStream Merges Data Streams with Generative AI for Real‑Time LLM Apps

LangStream, the new open‑source framework from DataStax, combines event‑driven data streaming with generative AI, offering seamless integration with vector databases like Astra DB, Milvus, and Pinecone, and providing a Kubernetes‑based runtime that enables real‑time LLM applications without extensive coding.

Data Streaming · Kubernetes · LLM

0 likes · 7 min read

AI Large Model Application Practice

Oct 4, 2023 · Artificial Intelligence

Controlling LLM‑Based AI Agents with the Open‑Source ‘Agents’ Framework

This article introduces the experimental open‑source project ‘Agents’, explains common challenges of LLM‑based AI agents, compares it with tools like AutoGPT, LangChain and MetaGPT, and demonstrates how its configuration‑driven SOP approach enables more controllable, multi‑agent interactions and easier deployment.

LLM · Open Source · SOP

0 likes · 14 min read

HomeTech

Sep 26, 2023 · Artificial Intelligence

Integrating Large Language Models with Search for Automotive Knowledge Retrieval

This article explores how combining traditional keyword search with large language models (LLMs) enhances understanding of user intent, builds a robust automotive knowledge base, and delivers more accurate, context‑aware answers through a multi‑stage retrieval and generation pipeline.

AI · Automotive · Knowledge Base

0 likes · 17 min read

AI Large Model Application Practice

Sep 25, 2023 · Artificial Intelligence

How LangSmith Turns LLM Debugging, Testing, and Production Monitoring into a Seamless Workflow

This article explores LangSmith, the experimental platform from the creators of LangChain, detailing how it tracks complex LLM reasoning, supports batch testing and evaluation of AI applications, and offers a community Hub for sharing prompts and chains, ultimately helping move LLM projects from prototype to production.

AI testing · Debugging · LLM

0 likes · 10 min read

phodal

Sep 24, 2023 · Artificial Intelligence

Designing a JVM‑Based LLM Framework: Insights from Chocolate Factory

This article explores the design principles, architectural decisions, and practical code examples behind the Chocolate Factory framework, a JVM‑centric LLM development platform inspired by LangChain, LlamaIndex, Spring AI, and PromptFlow, highlighting SDK construction, RAG workflows, and prompt engineering challenges.

AI development · Framework · JVM

0 likes · 11 min read

NetEase LeiHuo Testing Center

Sep 22, 2023 · Artificial Intelligence

Understanding Large Language Models and Prompt Engineering: A Practical Guide

This article provides an introductory overview of large language models (LLMs), compares popular models, explains their underlying principles, and offers practical guidance on prompt engineering, model evaluation, usage tips, and safety considerations, helping readers effectively select and apply LLMs in various scenarios.

AI · LLM · Large Language Models

0 likes · 44 min read

DataFunSummit

Sep 22, 2023 · Artificial Intelligence

Exploring Game AI Agents: Review, LLM‑Driven Exploration, and Future Directions

This article reviews the evolution of game AI agents, examines how large language models (LLMs) can drive new AI behaviors in games, and discusses practical case studies across genres such as Werewolf‑style, war‑SLG, and MOBA games, concluding with challenges and future research directions.

AI agents · Game Development · LLM

0 likes · 31 min read

21CTO

Sep 21, 2023 · Artificial Intelligence

Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?

This article compares the open‑source large language models Falcon 180B and Meta’s Llama 2, detailing their parameter sizes, training data, licensing, variants, infrastructure, language support, and safety policies, while providing links to official resources and a side‑by‑side feature table.

AI comparison · Falcon 180B · LLM

0 likes · 8 min read

Ximalaya Technology Team

Sep 18, 2023 · Artificial Intelligence

Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts

The article surveys the rise of LLM‑powered AI agents, defining them as LLM + memory + planning + tool use, contrasting fully autonomous agents with human‑guided autopilot/copilot variants, outlining their benefits, risks such as hallucinations and unsafe actions, and urging modular frameworks and oversight for reliable enterprise deployment.

AI agents · Agent Framework · Artificial Intelligence

0 likes · 27 min read

Yunxuetang Frontend Team

Sep 15, 2023 · Frontend Development

Front-End Insights: Architecture, Code Review, Storage, and New Tools

This article explores front‑end architecture decisions between Vue and React, shares a comprehensive code‑review methodology from 13 years at Tencent, evaluates the most robust local storage solutions for large offline data, details two screen‑adaptation approaches for Vue 3/Vite, and introduces emerging technologies such as LangChain’s RAG and Agents and the high‑performance Bun 1.0 JavaScript runtime.

Bun · Code Review · Frontend

0 likes · 4 min read

DaTaobao Tech

Sep 13, 2023 · Artificial Intelligence

Integrating Large Language Models with Recommendation Systems: Paradigms, Methods, and Experiments

The article surveys how large language models can be integrated into recommendation systems, either as feature extractors or as end‑to‑end recommenders, showing that LLM‑derived semantics improve recall, ranking, diversity, and user experience, and outlining future multimodal, efficiency, and re‑ranking directions.

Embedding · LLM · Multimodal

0 likes · 19 min read

Alibaba Cloud Infrastructure

Sep 13, 2023 · Artificial Intelligence

Pai‑Megatron‑Patch: Design Principles, Key Features, and End‑to‑End Usage for Large Language Model Training

This article introduces the open‑source Pai‑Megatron‑Patch tool from Alibaba Cloud, explains its non‑intrusive patch architecture, enumerates supported models and features such as weight conversion, Flash‑Attention 2.0, FP8 training with Transformer Engine, and provides detailed command‑line examples for model conversion, pre‑training, supervised fine‑tuning, inference, and RLHF pipelines.

FP8 · LLM · Megatron

0 likes · 19 min read

Baobao Algorithm Notes

Sep 12, 2023 · Artificial Intelligence

Why RTX 4090 Beats H100 for LLM Inference but Fails at Training

The article analyses the performance, memory, bandwidth and cost of NVIDIA H100, A100 and RTX 4090 GPUs, explains why the 4090 cannot handle large‑model training due to communication and memory limits, and shows how its high compute‑to‑price ratio makes it attractive for inference, backed by detailed parallelism calculations and cost‑per‑token estimates.

Cost · GPU · LLM

0 likes · 46 min read

Continuous Delivery 2.0

Sep 12, 2023 · Artificial Intelligence

Compression as a Measure of Intelligence in Large Language Models

The article argues that a large language model's ability to compress data through next‑token prediction reflects its intelligence, reviews theoretical and empirical evidence linking compression efficiency to model scale, and proposes a circuit‑competition framework to explain emergent capabilities, in‑context learning, and fine‑tuning effects.

GPT-4 · Intelligence · LLM

0 likes · 58 min read

AI Large Model Application Practice

Sep 6, 2023 · Artificial Intelligence

Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs

This article compares Prompt Engineering and Supervised Fine‑Tuning for large language models, explains their principles, showcases common prompt patterns such as Chain‑of‑Thought, ReAct and Self‑Ask, outlines fine‑tuning stages and trade‑offs, and provides practical guidance on selecting the most suitable approach for specific enterprise AI Agent scenarios.

AI Agent · LLM · Large Language Model

0 likes · 17 min read

phodal

Sep 3, 2023 · Artificial Intelligence

Engineering LLM Applications: Architecture, Prompt Modeling, and Multi‑Language Strategies

This article shares practical insights from months of building LLM proof‑of‑concepts, covering language‑agnostic architectures, FFI integration, prompt engineering, RAG patterns, DSL design, and four core architectural principles for scalable AI applications.

AI Architecture · DSL · FFI

0 likes · 13 min read

Rare Earth Juejin Tech Community

Sep 1, 2023 · Artificial Intelligence

Understanding Function Calling and ReAct for LLM Agents with LangChain

This article explains how large language models can act as agents by using OpenAI's Function Calling and the ReAct prompting paradigm, compares their trade‑offs, and demonstrates practical implementations with LangChain, including code examples for defining tools, invoking functions, and orchestrating multi‑step reasoning.

AI · Agent · Function Calling

0 likes · 21 min read

Architect

Aug 31, 2023 · Artificial Intelligence

Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2

This tutorial explains how to leverage generative AI tools—including LLMs, embedding models, vector databases, and the LangChain framework—to create a custom chatbot that answers user queries using a knowledge base, with step‑by‑step code examples for Google Colab.

Chatbot · Embedding · LLM

0 likes · 15 min read

21CTO

Aug 31, 2023 · Artificial Intelligence

Google Duet AI, IBM Mainframe AI, and NCSC LLM Warning: What’s Next for Enterprise Tech

The article reports on Google’s launch of Duet AI in Workspace, a Chinese firm’s apology over a VS Code‑based IDE, NCSC’s caution about LLMs, IBM’s generative‑AI tool for mainframe modernization, and OpenSSF’s new open‑source security manifesto.

AI · Enterprise Collaboration · LLM

0 likes · 9 min read

DataFunSummit

Aug 30, 2023 · Databases

Milvus: An AI‑Native Vector Database for Large Language Model Applications

This article introduces Milvus, an open‑source, cloud‑native vector database designed for AI workloads, explains how it helps mitigate large‑model hallucinations, outlines its CVP architecture, showcases performance benchmarks, and explores diverse application scenarios and future directions for LLM‑vector database integration.

AI · LLM · Milvus

0 likes · 13 min read

21CTO

Aug 28, 2023 · Artificial Intelligence

What Is Code Llama? Meta’s Open-Source LLM for Generating Code

Code Llama, Meta’s specialized extension of Llama 2, is a large language model fine‑tuned on code data that can generate, complete, and debug software across multiple languages, supports contexts of up to 100k tokens, and is freely available for research and commercial use.

Code Llama · LLM · Meta AI

0 likes · 5 min read

21CTO

Aug 26, 2023 · Artificial Intelligence

How MetaGPT Leverages SOP to Boost Multi‑Agent LLM Collaboration

MetaGPT is a meta‑programming framework that encodes standard operating procedures as prompts, enabling LLM‑driven multi‑agent systems to automatically generate software artifacts, coordinate roles, and build complex applications like a Blackjack CLI with near‑perfect task completion.

AI Collaboration · LLM · MetaGPT

0 likes · 4 min read

phodal

Aug 26, 2023 · Artificial Intelligence

How CoUnit Turns LLMs Into a Smart Team API for Faster Collaboration

CoUnit is an open‑source Rust‑based tool that uses local semantic search and LLMs to create a virtual team interface, enabling low‑cost, offline knowledge retrieval, API discovery, and cross‑team assistance for software development teams.

AI integration · LLM · Rust

0 likes · 7 min read

DataFunTalk

Aug 23, 2023 · Artificial Intelligence

Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems

This article investigates whether replacing traditional ID-based item encoders with massive LLMs such as GPT‑3 improves recommendation performance, by conducting extensive experiments on three real‑world datasets, analyzing performance limits, generality of item representations, and comparing against ID‑based and prompt‑based methods.

AI · GPT-3 · LLM

0 likes · 15 min read

Ximalaya Technology Team

Aug 22, 2023 · Artificial Intelligence

Guidelines and Best Practices for Prompt Engineering with Large Language Models

The guide outlines prompt‑engineering best practices for large language models, distinguishing base and instruction‑tuned LLMs, emphasizing clear, structured, step‑by‑step prompts, handling hallucinations, iterating through idea‑code‑data cycles, applying techniques to summarization, reasoning and expansion, managing token costs, and providing concrete OpenAI API examples.

AI · API Usage · LLM

0 likes · 14 min read

AI Large Model Application Practice

Aug 21, 2023 · Artificial Intelligence

Building an AI Sales Assistant: Enterprise LLM Architecture and Agent Workflow

This article outlines a practical enterprise architecture for integrating large language models into a sales assistant, detailing knowledge ingestion, vector embedding, task planning, tool usage, and iterative dialogue, while introducing AI Agent concepts and open‑source frameworks such as LangChain.

AI Agent · LLM · LangChain

0 likes · 10 min read