Tagged articles

2074 articles

Page 19 of 21

May 31, 2024 · Artificial Intelligence

Generating Custom QA Datasets with Large Language Models and Fine‑Tuning via LoRA

This article explains how to use a large language model to automatically convert long‑form texts into Alpaca‑style question‑answer pairs, build a LangChain processing chain, and then fine‑tune a model such as Phi‑3‑mini‑4k‑instruct with LoRA, providing full Python code examples.

LLMLangChainLoRA

0 likes · 11 min read

Generating Custom QA Datasets with Large Language Models and Fine‑Tuning via LoRA

21CTO

May 30, 2024 · Artificial Intelligence

Why AI Leaders Urge Students to Move Beyond Large Language Models

At VivaTech, Meta AI chief Yann LeCun warned students that building next‑generation AI systems means steering clear of large language model research, while other experts highlight emerging architectures and multimodal models like GPT‑4o as the future of artificial intelligence.

AIGPT-4oLLM

0 likes · 3 min read

Why AI Leaders Urge Students to Move Beyond Large Language Models

Practical DevOps Architecture

May 30, 2024 · Artificial Intelligence

Eight‑Week LLM and Large Model Training Course Outline

This article outlines an eight‑week curriculum covering LLM evolution, PyTorch fundamentals, CUDA training, large‑model fine‑tuning, LangChain application development, cloud‑based quantization, industry case studies, and a recruitment session, providing video resources for each topic.

AILLMLangChain

0 likes · 5 min read

Eight‑Week LLM and Large Model Training Course Outline

Continuous Delivery 2.0

May 30, 2024 · Artificial Intelligence

Meta’s TestGen‑LLM: AI‑Driven Automatic Unit Test Generation for Kotlin Code

In 2024 Meta introduced TestGen‑LLM, an AI‑powered tool that automatically generates Kotlin unit tests using large language models, improving test coverage through a multi‑stage pipeline of candidate generation, compilation filtering, execution filtering, coverage validation, refactoring, and engineer review, with reported coverage gains across Facebook and Instagram codebases.

AIKotlinLLM

0 likes · 6 min read

Meta’s TestGen‑LLM: AI‑Driven Automatic Unit Test Generation for Kotlin Code

Baobao Algorithm Notes

May 30, 2024 · Artificial Intelligence

What’s the Latest RLHF Landscape? From PPO to ORPO Explained

This article surveys the current RLHF ecosystem, comparing on‑policy methods like PPO with off‑policy approaches such as DPO, and examines recent variants—including ReMax, GRPO, DPOP, TDPO, and ORPO—highlighting their algorithmic differences, resource trade‑offs, and practical performance insights.

DPOLLMPPO

0 likes · 23 min read

What’s the Latest RLHF Landscape? From PPO to ORPO Explained

21CTO

May 29, 2024 · Artificial Intelligence

How AI PCs Are Redefining the Desktop: Inside Microsoft’s Copilot+ Vision

Microsoft’s vision of AI PCs, highlighted by the Copilot+ concept, details how integrated NPU hardware, local large‑language models, and the Windows Copilot Runtime enable on‑device AI inference, reducing data‑center load and offering developers a unified platform for building next‑generation AI applications.

AI PCCopilot+LLM

0 likes · 11 min read

How AI PCs Are Redefining the Desktop: Inside Microsoft’s Copilot+ Vision

JD Cloud Developers

May 29, 2024 · Artificial Intelligence

How Multi‑Agent AI Is Revolutionizing E‑Commerce Decision Making

This article explores JD Retail's AI‑driven multi‑agent system that mimics real‑world merchant decision processes, detailing the ReAct paradigm, agent roles, workflow, training methods, monitoring, and future directions for building intelligent e‑commerce assistants.

AIAgent ArchitectureLLM

0 likes · 21 min read

How Multi‑Agent AI Is Revolutionizing E‑Commerce Decision Making

21CTO

May 28, 2024 · Artificial Intelligence

13 Open‑Source AI Projects That Made the 2024 GitHub Accelerator – A Deep Dive

This article showcases the 13 award‑winning open‑source AI projects featured in the 2024 GitHub Accelerator, highlighting each project's purpose, founders, key technologies, and how they advance machine‑learning, model training, deployment, and innovative AI applications across various domains.

AI toolsGitHub AcceleratorLLM

0 likes · 9 min read

13 Open‑Source AI Projects That Made the 2024 GitHub Accelerator – A Deep Dive

NewBeeNLP

May 28, 2024 · Artificial Intelligence

How Generative Models Are Redefining Recommendation Systems

This article reviews recent advances in generative recommendation, highlighting challenges such as item representation and multimodal fusion, and summarizing four key research papers that propose novel tokenization, collaborative integration, and transformer-based multimodal approaches to improve recommendation performance.

AI researchGenerative RecommendationLLM

0 likes · 8 min read

How Generative Models Are Redefining Recommendation Systems

37 Interactive Technology Team

May 27, 2024 · Artificial Intelligence

Enhancing AI Code Review Quality with Contextual Embedding and Function Calling

The article explains how AI code reviews suffer from missing context, and improves them by embedding the codebase, using Retrieval‑Augmented Generation to fetch relevant snippets, and adding a function‑calling tool that lets the model autonomously request additional code, resulting in precise, bug‑detecting feedback.

AI code reviewEmbeddingFunction Calling

0 likes · 8 min read

Enhancing AI Code Review Quality with Contextual Embedding and Function Calling

DaTaobao Tech

May 27, 2024 · Artificial Intelligence

Sampling Strategies for Large Language Models: Greedy, Beam, Top‑K, Top‑p, and Temperature

The article explains how greedy search, beam search, Top‑K, Top‑p (nucleus) sampling, and temperature each shape large language model generation, comparing their effects on repetition, diversity, and creativity, and provides concise TensorFlow‑based code examples illustrating these inference‑time strategies.

AILLMPython

0 likes · 15 min read

Sampling Strategies for Large Language Models: Greedy, Beam, Top‑K, Top‑p, and Temperature

NewBeeNLP

May 24, 2024 · Artificial Intelligence

How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning

This article reviews the NoteLLM paper, which leverages Llama 2 to create richer text embeddings and automatically generate tags and categories for note recommendation, addressing cold‑start issues through a multitask prompt design, generative‑contrastive learning, and collaborative supervised fine‑tuning, and demonstrates strong offline and online gains.

EmbeddingGenerative Contrastive LearningLLM

0 likes · 14 min read

How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning

DevOps

May 23, 2024 · Information Security

Guidelines for Evaluating Large Language Models in Cybersecurity Tasks

The article examines the opportunities and risks of applying large language models (LLMs) to cybersecurity, outlines fourteen practical recommendations for assessing their real‑world capabilities, and concludes with an invitation to the upcoming R&D Efficiency Conference covering AI, product management, and related topics.

AI safetyInformation SecurityLLM

0 likes · 11 min read

Guidelines for Evaluating Large Language Models in Cybersecurity Tasks

Cognitive Technology Team

May 23, 2024 · Operations

eBPF + LLM: Building the Infrastructure for Observability Agents

The article explains how zero‑intrusion eBPF provides full‑stack, high‑quality observability data that, when combined with large language models, enables AI‑driven agents to automate ticket handling, change impact analysis, and vulnerability triage, dramatically improving operational efficiency.

AI AgentLLMObservability

0 likes · 17 min read

eBPF + LLM: Building the Infrastructure for Observability Agents

Rare Earth Juejin Tech Community

May 23, 2024 · Artificial Intelligence

Building a Multi‑Agent Bookmark Assistant Bot with Coze: From File Upload to AI‑Powered Search

This tutorial walks through creating a Coze bot that uses multi‑agent orchestration, memory variables, triggers, and large‑language‑model integration to upload bookmark files, extract and clean data, classify sites, generate importable HTML bookmarks, and provide AI‑driven search functionality, complete with Python code examples and deployment tips.

AI agentsBookmark ManagementBot Development

0 likes · 24 min read

Building a Multi‑Agent Bookmark Assistant Bot with Coze: From File Upload to AI‑Powered Search

Eric Tech Circle

May 22, 2024 · Artificial Intelligence

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

This article introduces Dify, an open‑source LLM application platform, outlines its core features such as workflows, model support, RAG pipelines, agents, and observability, compares it with alternatives, and provides step‑by‑step deployment instructions using Docker Compose and Helm for local and Kubernetes environments.

AI PlatformKubernetesLLM

0 likes · 7 min read

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

JD Tech

May 22, 2024 · Artificial Intelligence

AI Multi‑Agent System for E‑commerce Merchant Assistance: Design, ReAct Architecture, and Implementation

The article describes JD Retail's AI‑driven multi‑agent platform that models real‑world merchant decision‑making with ReAct‑based LLM agents, detailing the system architecture, agent roles, reasoning loops, workflow examples, training pipelines, monitoring, and future directions for e‑commerce support.

AILLMdecision support

0 likes · 21 min read

AI Multi‑Agent System for E‑commerce Merchant Assistance: Design, ReAct Architecture, and Implementation

Baidu Geek Talk

May 22, 2024 · Artificial Intelligence

How AI Can Auto‑Generate Perfect Git Commit Messages

This article explains how a large‑language‑model‑driven tool can automatically create standardized Git commit messages by extracting change summaries, applying customizable plugins, measuring performance with MSE and adoption rate, and optimizing prompts, data pipelines, and fine‑tuning strategies.

AICommitMessageDataProcessing

0 likes · 17 min read

How AI Can Auto‑Generate Perfect Git Commit Messages

JD Retail Technology

May 22, 2024 · Artificial Intelligence

AI Multi‑Agent System for E‑commerce: Design, Implementation, and Operational Insights

This article presents a comprehensive overview of JD Retail's AI‑driven multi‑agent architecture for e‑commerce assistance, detailing how real‑world merchant decision processes are modeled with ReAct‑based LLM agents, the hierarchical workflow, training pipelines, monitoring mechanisms, and future directions for scalable intelligent commerce support.

AIAgent ArchitectureKnowledge retrieval

0 likes · 20 min read

AI Multi‑Agent System for E‑commerce: Design, Implementation, and Operational Insights

NewBeeNLP

May 18, 2024 · Artificial Intelligence

How to Detect Test Set Contamination in Black‑Box Language Models

Researchers propose a black‑box method to expose test‑set leakage in large language models by comparing log‑probability shifts when test items are shuffled, using Monte‑Carlo estimation and a sharded likelihood test, and demonstrate its effectiveness on several models including Mistral‑7B.

LLMblack-box detectionevaluation

0 likes · 8 min read

How to Detect Test Set Contamination in Black‑Box Language Models

Open Source Tech Hub

May 16, 2024 · Artificial Intelligence

Deploy and Run Llama 3 Locally with Ollama in Minutes

This guide explains how to download a GGUF‑format Llama 3 model, create a Modelfile, use Ollama commands to build and run the model locally, test it, and interact via the built‑in REST API, including useful Docker and model‑management tips.

GGUFLLMLlama3

0 likes · 7 min read

Deploy and Run Llama 3 Locally with Ollama in Minutes

Alibaba Cloud Native

May 15, 2024 · Cloud Native

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

This article walks through setting up a cloud‑native test environment using the open‑source NextChat UI and Higress API gateway to let Qwen‑2.5 masquerade as GPT‑4o, enabling a side‑by‑side comparison of their responses while showcasing Higress’s streaming, hot‑update, and security features for AI workloads.

AI gatewayGPT-4oHigress

0 likes · 8 min read

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

Efficient Ops

May 14, 2024 · Artificial Intelligence

How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations

This article explores why large‑model Agent technology is essential for AIOps, explains single‑ and multi‑Agent architectures, memory and tool integration, and demonstrates practical applications such as anomaly detection, fault diagnosis, automated remediation, ChatOps, and future directions for intelligent, autonomous operations.

AI agentsLLMOperations Automation

0 likes · 14 min read

How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations

StarRocks

May 14, 2024 · Artificial Intelligence

How Tencent Games Boosted AI‑Generated SQL Accuracy to 89% with a Lakehouse Architecture

Tencent Games tackled the low accuracy of AI‑generated SQL in production by combining large language models with a StarRocks lake‑warehouse, introducing a semantic layer, async materialized views, and an agent‑based multi‑intelligence framework, ultimately raising one‑shot SQL correctness to 89% and cutting delivery time from 2 hours to 0.33 hours.

AIData EngineeringLLM

0 likes · 13 min read

How Tencent Games Boosted AI‑Generated SQL Accuracy to 89% with a Lakehouse Architecture

NewBeeNLP

May 13, 2024 · Artificial Intelligence

Why DPO Treats LLMs as Q‑Functions: A Deep Theoretical Dive

This article offers a detailed theoretical interpretation of the DPO algorithm, showing how large language models can be viewed as Q‑functions, unifying sequence‑wise and step‑wise decision perspectives, and discussing the resulting implications for reinforcement‑learning‑based alignment research.

DPOLLMPreference Optimization

0 likes · 14 min read

Why DPO Treats LLMs as Q‑Functions: A Deep Theoretical Dive

DataFunTalk

May 9, 2024 · Databases

ByteHouse Vector Search Technical Guide: Architecture, Design, and Performance Optimizations

This guide explains ByteHouse’s high‑performance vector search capabilities, covering the background of vector retrieval for LLMs, the limitations of its existing skip‑index architecture, the new vector‑index design with HNSW and IVF, query‑time optimizations, performance benchmarks against Milvus, and future development plans.

ByteHouseLLMVector Database

0 likes · 8 min read

ByteHouse Vector Search Technical Guide: Architecture, Design, and Performance Optimizations

Baidu App Technology

May 8, 2024 · Artificial Intelligence

How AI Can Auto‑Generate Standardized Git Commit Messages

This article details the design, implementation, and evaluation of an AI‑powered tool that automatically creates compliant Git commit messages by leveraging large language models, custom plugins, and performance‑focused optimizations to improve developer productivity and commit quality.

AILLMPerformance Optimization

0 likes · 16 min read

How AI Can Auto‑Generate Standardized Git Commit Messages

DataFunTalk

May 8, 2024 · Artificial Intelligence

Intelligent NPCs: Infusing Soul into Game Characters with AI and the Art and Science of Deep Model Inference Acceleration

This talk explores how large‑model AI can give game NPCs personality, outlines the opportunities and challenges of intelligent NPCs, presents a case study of the "Jue Zhi An Nuan" NPC, and discusses future directions, safety compliance, and real‑time multimodal interaction solutions.

AIGame DevelopmentGame NPC

0 likes · 3 min read

Intelligent NPCs: Infusing Soul into Game Characters with AI and the Art and Science of Deep Model Inference Acceleration

Java Backend Technology

May 8, 2024 · Artificial Intelligence

Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow

This article highlights four cutting‑edge open‑source AI initiatives—Meta’s Llama 3 large language model, the MaxKB knowledge‑base Q&A system, the Phidata framework for building AI assistants, and the RAGFlow retrieval‑augmented generation engine—detailing their capabilities, licensing, and where to access the code.

AIKnowledge BaseLLM

0 likes · 7 min read

Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow

Architect

May 5, 2024 · Artificial Intelligence

The Rise of Small Language Models (SLM) and Their Impact on AI Development

Amidst a growing trend that narrows performance gaps between large and small language models, researchers highlight the efficiency, adaptability, and specialized advantages of small language models (SLM), while also discussing the high costs, hallucinations, and security concerns that still challenge large‑scale LLMs.

AI efficiencyLLMModel Scaling

0 likes · 9 min read

The Rise of Small Language Models (SLM) and Their Impact on AI Development

Xiaohe Frontend Team

May 5, 2024 · Artificial Intelligence

SenseNova 5.0 Takes on GPT‑4 Turbo and Other AI Breakthroughs This Week

This roundup covers SenseTime's new SenseNova 5.0 model rivaling GPT‑4 Turbo, Apple's ReALM model that outperforms GPT‑4, the free‑to‑try Meshy 3 3D generator, Lamini's $25 M funding for enterprise generative AI, and OpenAI's upcoming ChatGPT‑powered search engine challenging Google.

AIFundingGenerativeAI

0 likes · 13 min read

SenseNova 5.0 Takes on GPT‑4 Turbo and Other AI Breakthroughs This Week

Baobao Algorithm Notes

May 5, 2024 · Artificial Intelligence

Deep Dive into Transformer Mechanics: Scaling, Q/K Projections, FFNs, and More

This article provides concise technical explanations for 25 common questions about Transformer models, covering scaled dot‑product attention scaling, separate Q/K projections, feed‑forward network design, attention variants, normalization, LoRA versus full‑parameter training, KV‑cache, pre‑ and post‑norm, computational cost analysis, and advanced position‑encoding techniques.

LLMLoRATransformer

0 likes · 25 min read

Deep Dive into Transformer Mechanics: Scaling, Q/K Projections, FFNs, and More

DataFunSummit

May 4, 2024 · Artificial Intelligence

Applications of Large Language Models in Recommendation Systems: Overview and Future Directions

This article provides a comprehensive overview of how large language models (LLMs) are integrated into recommendation systems, detailing two main paradigms—LLM as a component and LLM as a standalone system—while discussing their impact on retrieval, ranking, prompting, and outlining future research challenges such as multimodal recommendation, hallucination mitigation, bias reduction, and agent‑based approaches.

AIFuture DirectionsLLM

0 likes · 6 min read

Applications of Large Language Models in Recommendation Systems: Overview and Future Directions

AI Large Model Application Practice

May 3, 2024 · Artificial Intelligence

Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval

This article examines whether the rapid growth of large‑language‑model context windows can eliminate the need for retrieval‑augmented generation, presenting experimental needle‑in‑a‑haystack tests, analysis of model performance across token lengths and needle positions, and practical guidance using an open‑source evaluation tool.

AILLMNeedle-in-a-Haystack

0 likes · 13 min read

Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval

Rare Earth Juejin Tech Community

May 2, 2024 · Artificial Intelligence

Understanding Large Language Models: Principles, Training, Risks, and Application Security

This article provides a comprehensive overview of large language models (LLMs), explaining their core concepts, transformer architecture, training stages, known shortcomings such as hallucination and reversal curse, and highlights emerging security threats like prompt injection and jailbreaking, offering guidance for safe deployment.

AI safetyLLMLarge Language Models

0 likes · 21 min read

Understanding Large Language Models: Principles, Training, Risks, and Application Security

21CTO

Apr 29, 2024 · Artificial Intelligence

Fine‑Tuning vs. Context Learning: Building Apps with the Emerging LLM Tech Stack

This article explores how developers can integrate large language models into applications by comparing fine‑tuning and context learning, detailing each method’s advantages and drawbacks, and presenting a four‑layer LLM tech stack—data, model, orchestration, and operations—with practical tooling examples.

AI StackLLMLLMOps

0 likes · 16 min read

Fine‑Tuning vs. Context Learning: Building Apps with the Emerging LLM Tech Stack

Rare Earth Juejin Tech Community

Apr 29, 2024 · Artificial Intelligence

Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices

This comprehensive guide explores the complexities of building enterprise‑level Retrieval‑Augmented Generation (RAG) systems, detailing common failure points, architectural components such as authentication, input guards, query rewriting, document ingestion, indexing, storage, retrieval, generation, observability, caching, and multi‑tenant considerations, and provides actionable best‑practice recommendations for developers and technical leaders.

CachingEnterprise AILLM

0 likes · 32 min read

Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices

Baobao Algorithm Notes

Apr 27, 2024 · Artificial Intelligence

Qwen1.5-110B vs Llama‑3‑70B: Performance Insights of Alibaba’s 110B Model

Alibaba unveiled the 110‑billion‑parameter Qwen1.5‑110B model, featuring GQA, 32k context and multilingual support, and benchmark results show it matches or surpasses Llama‑3‑70B and Mixtral‑8x22B on a range of tasks, with notable gains in chat evaluations.

AILLMModel Scaling

0 likes · 7 min read

Qwen1.5-110B vs Llama‑3‑70B: Performance Insights of Alibaba’s 110B Model

NewBeeNLP

Apr 25, 2024 · Artificial Intelligence

How Apple’s OpenELM Redefines Efficient LLM Scaling with Layer‑Wise Design

Apple’s OpenELM introduces a layer‑wise scaling Transformer family ranging from 270 M to 3 B parameters, provides a full open‑source training framework, and demonstrates superior zero‑shot and few‑shot performance over existing open LLMs despite using less public data, while also analyzing inference bottlenecks and PEFT results.

LLMOpen-sourceOpenELM

0 likes · 8 min read

How Apple’s OpenELM Redefines Efficient LLM Scaling with Layer‑Wise Design

Huolala Tech

Apr 25, 2024 · Artificial Intelligence

How LLM‑Powered Multi‑Agent AI Boosts Vehicle Selection in HuoLala’s Customer Service

This article details the design and implementation of an LLM‑driven multi‑agent AI customer‑service assistant for vehicle selection at HuoLala, covering system architecture, algorithmic solutions, retrieval‑augmented generation, NLU/NLG agents, performance improvements, and future outlooks.

AI Customer ServiceLLMRAG

0 likes · 12 min read

How LLM‑Powered Multi‑Agent AI Boosts Vehicle Selection in HuoLala’s Customer Service

21CTO

Apr 24, 2024 · Artificial Intelligence

Microsoft’s Phi‑3 Mini: The Smallest LLM That Beats GPT‑3.5 on iPhone

Microsoft unveiled the open‑source Phi‑3 series, a lightweight family of large language models that outperform larger rivals, run offline on smartphones, and cost a fraction of comparable AI models, opening new possibilities for edge and mobile AI applications.

LLMPhi-3offline-inference

0 likes · 8 min read

Microsoft’s Phi‑3 Mini: The Smallest LLM That Beats GPT‑3.5 on iPhone

JavaEdge

Apr 22, 2024 · Artificial Intelligence

Why Large Language Models Still Struggle and How to Fix Them

Large language models still suffer from limited memory, constrained context windows, outdated knowledge, inability to control external systems, and poor domain expertise, but the article outlines two main remedies—fine‑tuning (Model‑as‑a‑Service) and prompt‑engineering—detailing their mechanisms, suitable scenarios, and trade‑offs.

Artificial IntelligenceLLMModel as a Service

0 likes · 9 min read

Why Large Language Models Still Struggle and How to Fix Them

21CTO

Apr 20, 2024 · Artificial Intelligence

How OpenAI Revitalized Microsoft: The AI Strategy Behind a Tech Giant’s Comeback

The article chronicles Microsoft’s transformation from a stagnant software behemoth to the world’s most valuable company by embracing artificial intelligence, detailing Sophia Velastegui’s pivotal role, the strategic partnership with OpenAI, Azure OpenAI services, product integrations, associated risks, and future outlook.

LLMOpenAI

0 likes · 9 min read

How OpenAI Revitalized Microsoft: The AI Strategy Behind a Tech Giant’s Comeback

dbaplus Community

Apr 19, 2024 · Backend Development

How Justine Tunney Built a Six‑OS C Web Server and Other Groundbreaking Projects

The article showcases Justine Tunney’s remarkable engineering feats—from the RedBean web server that runs the same binary on six operating systems, to the cosmopolitan libc, a 512‑byte sectorLisp, the Blinkenlights visual debugger, the RoseHub security effort, and the llamafile tool that packages large language models into a single portable executable.

CLLMLisp

0 likes · 9 min read

How Justine Tunney Built a Six‑OS C Web Server and Other Groundbreaking Projects

JD Tech

Apr 18, 2024 · Artificial Intelligence

Getting Started with LangChain: Overview, Core Components, and Python Code Samples

This article introduces the LangChain framework for large language model integration, explains its key components and advantages, and provides step‑by‑step Python examples for setting up environment variables, creating prompts, chaining models, and using embeddings, completions, and chat models.

ChatModelEmbeddingLLM

0 likes · 7 min read

Getting Started with LangChain: Overview, Core Components, and Python Code Samples

Alimama Tech

Apr 17, 2024 · Artificial Intelligence

Applying Large Language Models to Advertising Copy Generation

The article examines how large language models can streamline advertising copy creation by addressing format diversity, creativity, and new media demands, detailing model evaluation, fine‑tuning of Chinese‑adapted LLMs—ultimately selecting QWen 1.5‑7B—and showing that deployment boosts copy quality, click‑through and conversion rates while outlining future personalization and data‑efficient scaling.

AICopy GenerationLLM

0 likes · 18 min read

Applying Large Language Models to Advertising Copy Generation

DaTaobao Tech

Apr 17, 2024 · Artificial Intelligence

Challenges and Practices of LLM‑Based Knowledge Bases and Personal Assistants

The article examines how LLM‑driven knowledge‑base QA and personal‑assistant agents struggle with context management, token limits, multimodal data, and tool‑parameter parsing, reviews open‑source frameworks such as LangChain, AutoGen and MetaGPT, and argues that fine‑tuning (e.g., LoRA) is essential for domain‑specific, scalable solutions.

AgentKnowledge BaseLLM

0 likes · 11 min read

Challenges and Practices of LLM‑Based Knowledge Bases and Personal Assistants

dbaplus Community

Apr 17, 2024 · Operations

Boost IT Operations with Offline LLMs: A Step‑by‑Step RAG Guide Using LangChain

This article explains how to build an offline knowledge base for IT operations by combining large language models with Retrieval‑Augmented Generation (RAG) using LangChain, covering document loading, chunking, embedding, vector storage, and query‑time retrieval with concrete code examples.

EmbeddingLLMLangChain

0 likes · 6 min read

Boost IT Operations with Offline LLMs: A Step‑by‑Step RAG Guide Using LangChain

DevOps

Apr 14, 2024 · Artificial Intelligence

Exploring the Application of Large Language Models in DevOps: Practices, Principles, and Future Prospects

This article examines how large language models (LLMs) are being integrated into DevOps workflows, detailing practical implementations, organizational adoption, efficiency‑boosting techniques, underlying principles, limitations, and future directions for software engineers seeking to leverage AI as a reliable development partner.

Artificial IntelligenceAutomationLLM

0 likes · 22 min read

Exploring the Application of Large Language Models in DevOps: Practices, Principles, and Future Prospects

21CTO

Apr 12, 2024 · Artificial Intelligence

How I Built an AI‑Powered Resume Chatbot with LLMs and RAG

Senior developer Jon Olson shares how he created an AI resume assistant using GPT‑4/3.5, LangChain, LlamaIndex, and retrieval‑augmented generation, detailing prompt engineering, backend integration, and future routing features to help job seekers showcase their skills.

AI chatbotLLMLangChain

0 likes · 8 min read

How I Built an AI‑Powered Resume Chatbot with LLMs and RAG

Rare Earth Juejin Tech Community

Apr 12, 2024 · Artificial Intelligence

Typical Business and Technical Architectures for Large Language Model Applications

This article reviews the common business and technical architectures used in large language model (LLM) applications, explains AI Embedded, AI Copilot, and AI Agent modes—including single‑ and multi‑agent systems—and offers guidance on selecting appropriate technology stacks such as prompt‑only, function‑calling agents, RAG, and fine‑tuning.

AI AgentLLMRAG

0 likes · 9 min read

Typical Business and Technical Architectures for Large Language Model Applications

Rare Earth Juejin Tech Community

Apr 11, 2024 · Artificial Intelligence

AI Hackathon Journey: Building the "Novel Jump" Bot on Coze Platform

This article recounts the author's participation in a Shenzhen AI Hackathon, detailing the development of an interactive novel‑character chatbot using the Coze platform, describing the workflow design, technical challenges, model choices, knowledge‑base construction, and the final demo and award outcomes.

AIChatbotCoze

0 likes · 12 min read

AI Hackathon Journey: Building the "Novel Jump" Bot on Coze Platform

AI Large Model Application Practice

Apr 10, 2024 · Artificial Intelligence

What Is Self‑RAG? A Simple Guide to Self‑Reflective Retrieval‑Augmented Generation

This article explains the motivation behind Self‑RAG, describes its core workflow—including conditional retrieval, enhanced generation, and self‑evaluation tokens—details the four evaluation metrics (Retrieve, IsRel, IsSup, IsUse), and provides a Python scoring example using log‑probabilities.

LLMLogprobsPython

0 likes · 13 min read

What Is Self‑RAG? A Simple Guide to Self‑Reflective Retrieval‑Augmented Generation

Xiaohongshu Tech REDtech

Apr 10, 2024 · Artificial Intelligence

Early‑Stopping Self‑Consistency (ESC): Reducing Sampling Cost for Large Language Model Reasoning

Early‑Stopping Self‑Consistency (ESC) dynamically halts sampling once a sliding‑window answer distribution reaches zero entropy, cutting the number of required LLM reasoning samples by 34‑84 % across arithmetic, commonsense, and symbolic benchmarks while preserving accuracy and offering a theoretically‑bounded, robust, budget‑adaptive alternative to traditional Self‑Consistency.

AIEarly StoppingLLM

0 likes · 14 min read

Early‑Stopping Self‑Consistency (ESC): Reducing Sampling Cost for Large Language Model Reasoning

HelloTech

Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringJavaScriptLLM

0 likes · 19 min read

An Overview of LangChain: Architecture, Core Components, and Code Examples

DaTaobao Tech

Apr 10, 2024 · Artificial Intelligence

Survey of Popular AI Agent Frameworks and Their Architectures

The article surveys modern open‑source AI agent frameworks, defining agents as autonomous perception‑planning‑action systems, outlining core modules (inference, memory, tools, action), comparing single‑agent designs like BabyAGI and AutoGPT with multi‑agent platforms such as MetaGPT and AutoGen, and discussing their benefits, trade‑offs, and future research directions.

AI agentsAgent FrameworksLLM

0 likes · 28 min read

Survey of Popular AI Agent Frameworks and Their Architectures

NewBeeNLP

Apr 10, 2024 · Artificial Intelligence

What Scaling Laws Reveal About LLM Fine‑Tuning and RLHF Performance

This article reviews recent scaling‑law research on large‑language‑model fine‑tuning and RLHF, explaining how data quantity, model size, PET parameters, reward‑model size and KL‑penalty affect downstream performance and offering practical insights for efficient training.

Artificial IntelligenceLLMRLHF

0 likes · 11 min read

What Scaling Laws Reveal About LLM Fine‑Tuning and RLHF Performance

Alibaba Cloud Developer

Apr 10, 2024 · Artificial Intelligence

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

This guide walks AI engineers through a rapid 10‑minute boot‑strap of LangChain, explaining its purpose, core concepts, design questions, environment setup, and step‑by‑step code examples that cover APIs, chains, memory, retrieval‑augmented generation, tools, agents, and the overall architecture.

AI EngineeringLLMLangChain

0 likes · 28 min read

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

DataFunTalk

Apr 8, 2024 · Artificial Intelligence

LLM‑Based Agents: Architecture, Key Challenges, and Future Directions

This article surveys the emerging field of large‑language‑model (LLM) based agents, detailing their modular architecture—including profiling, memory, planning, and action components—while discussing critical challenges such as role‑playing, memory design, reasoning, multi‑agent collaboration, and outlining promising research directions and practical case studies.

AI AgentAgent ArchitectureLLM

0 likes · 11 min read

LLM‑Based Agents: Architecture, Key Challenges, and Future Directions

DataFunTalk

Apr 6, 2024 · Artificial Intelligence

Exploring Large Language Models for Recommendation Systems: Experiments and Insights

This article investigates how large language models can be applied to recommendation tasks, describing two usage strategies, various ranking approaches, experimental evaluations on multiple datasets, comparisons with traditional models, and analyses of prompt design, cost, and cold‑start capabilities.

LLMPrompt Engineeringranking

0 likes · 13 min read

Exploring Large Language Models for Recommendation Systems: Experiments and Insights

AI Large Model Application Practice

Apr 5, 2024 · Artificial Intelligence

Hands‑On Comparison of Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze LLM Platforms

This article provides a practical, side‑by‑side review of three major large‑model application development platforms—Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze—detailing their creation workflows, configuration options, SDK capabilities, plugin ecosystems, workflow orchestration, and overall strengths and limitations for building AI agents.

AI PlatformAppBuilderCoze

0 likes · 18 min read

Hands‑On Comparison of Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze LLM Platforms

dbaplus Community

Apr 4, 2024 · Artificial Intelligence

10 Guiding Principles for Building LLM‑Powered Software Applications

This article outlines ten practical principles for designing applications with large language models, emphasizing a model‑first mindset, precision through interactive disambiguation, clear division of code and model responsibilities, data quality, handling uncertainty, and recognizing the limits of LLMs to build robust, maintainable software.

AI designLLMPrompt Engineering

0 likes · 13 min read

10 Guiding Principles for Building LLM‑Powered Software Applications

NewBeeNLP

Apr 2, 2024 · Artificial Intelligence

Jamba: How AI21 Labs Merged Mamba and Transformer for 3× Faster 128k Contexts

Jamba, a hybrid Mamba‑Transformer model from AI21 Labs, combines state‑space and attention layers with Mixture‑of‑Experts to deliver up to three times the throughput of comparable 52‑billion‑parameter LLMs on 128k context windows while maintaining high output quality and low memory usage.

JambaLLMMamba

0 likes · 6 min read

Jamba: How AI21 Labs Merged Mamba and Transformer for 3× Faster 128k Contexts

Rare Earth Juejin Tech Community

Mar 30, 2024 · Artificial Intelligence

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

This article provides an in‑depth overview of the Coze low‑code AI bot platform, covering its core features, product comparisons, step‑by‑step bot creation, RAG implementation, plugin usage, memory mechanisms, cron jobs, agent design, advanced workflow techniques, quality management, and future prospects.

AI botCozeLLM

0 likes · 25 min read

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

Baobao Algorithm Notes

Mar 29, 2024 · Artificial Intelligence

Can Data Mixing Laws Predict LLM Performance? A Deep Dive into Scaling Laws

This article reviews the paper “Data Mixing Laws: Optimizing Data Mixture by Predicting Language Modeling Performance”, explaining how the authors quantify the impact of data mixture ratios on LLM loss, propose a simple predictive model, validate it on RedPajama and multi‑domain mixes, and outline a scaling‑law procedure for continual pre‑training.

Data MixingData SchedulingLLM

0 likes · 9 min read

Can Data Mixing Laws Predict LLM Performance? A Deep Dive into Scaling Laws

AI Large Model Application Practice

Mar 29, 2024 · Artificial Intelligence

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

This article examines the evolution of Retrieval‑Augmented Generation (RAG) architectures for large language models, outlines the challenges they face, introduces the modular RAG Flow concept with four workflow paradigms, and provides a step‑by‑step implementation using LangChain and LlamaIndex with code examples.

LLMLangChainRAG

0 likes · 15 min read

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

DataFunSummit

Mar 29, 2024 · Artificial Intelligence

Large Language Model (LLM) Revolution in Recommendation Systems: Overview, Techniques, and Future Directions

This article reviews how the rapid rise of large language models, exemplified by ChatGPT, is transforming recommendation systems by addressing traditional ID‑centric limitations, introducing prompt‑based and ID‑free representations, discussing recent research advances, practical challenges, and future research directions.

AILLMLarge Models

0 likes · 18 min read

Large Language Model (LLM) Revolution in Recommendation Systems: Overview, Techniques, and Future Directions

Bilibili Tech

Mar 26, 2024 · Frontend Development

Design and Implementation of the AutoMotion UI Automation Testing Platform

The AutoMotion platform streamlines UI automation by recording user actions through a Chrome extension, converting them into Cypress scripts, isolating test data in a sandbox, and employing LLM‑driven self‑healing selectors, while offering open‑API integration and scalable containerized execution for reliable, low‑maintenance testing.

CypressData SandboxLLM

0 likes · 27 min read

Design and Implementation of the AutoMotion UI Automation Testing Platform

Eric Tech Circle

Mar 24, 2024 · Artificial Intelligence

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

This guide compares Ollama and Hugging Face for running large language models locally, detailing API and local execution methods, installation steps, model selection, resource requirements, integration with AnythingLLM, container deployment, embedding and vector store setup, and practical observations on performance and limitations.

AnythingLLMEmbeddingHugging Face

0 likes · 15 min read

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

AI Large Model Application Practice

Mar 22, 2024 · Artificial Intelligence

How to Build a Real‑Time AI‑Powered 3D Digital Human with Unreal Engine

This guide explains the architecture of an interactive digital‑human system, walks through 3D avatar creation with Unreal Engine, details the AI controller that combines ASR, LLM and TTS, and provides step‑by‑step instructions for deploying the open‑source Fay project.

AI AvatarASRFay

0 likes · 14 min read

How to Build a Real‑Time AI‑Powered 3D Digital Human with Unreal Engine

Sohu Tech Products

Mar 20, 2024 · Artificial Intelligence

Comparison of Base LLM and Instruction Tuned LLM

The diagram contrasts a Base LLM, which merely predicts the next word from training data and can continue stories or answer simple facts but may generate unsafe text, with an Instruction‑Tuned LLM that is fine‑tuned via RLHF to understand and follow commands, delivering more accurate, useful, and safe responses.

AIAI applicationsBASE model

0 likes · 7 min read

Comparison of Base LLM and Instruction Tuned LLM

DeWu Technology

Mar 18, 2024 · Frontend Development

QCon Shanghai 2023: LLM-Powered Frontend Debugging, WebNN, AI-Native Development, and HarmonyOS Insights

QCon Shanghai 2023 highlighted LLM‑driven frontend debugging, the emerging WebNN API for accelerated browser inference, AI‑native UI patterns with evaluation‑driven development, LLM‑enhanced developer bots using RAG and fine‑tuning, and a HarmonyOS round‑table exploring ArkUI’s declarative framework and opportunities for frontend engineers.

AIFrontendHarmonyOS

0 likes · 18 min read

QCon Shanghai 2023: LLM-Powered Frontend Debugging, WebNN, AI-Native Development, and HarmonyOS Insights

NewBeeNLP

Mar 18, 2024 · Artificial Intelligence

Mastering RAG and LLM Techniques: From Retrieval to Fine‑Tuning

This article provides a comprehensive technical guide on Retrieval‑Augmented Generation (RAG), open‑source large language models such as LLaMA, fine‑tuning methods, evaluation metrics, memory‑optimization tricks, and attention‑related optimizations for modern AI systems.

LLMLangChainMemory Optimization

0 likes · 19 min read

Mastering RAG and LLM Techniques: From Retrieval to Fine‑Tuning

Baobao Algorithm Notes

Mar 18, 2024 · Industry Insights

Inside the 2024 KDD Cup ShopBench Challenge: Tasks, Data, and Evaluation Metrics

The 2024 KDD Cup introduces the ShopBench benchmark, a large‑scale LLM competition that simulates real‑world online shopping with 57 tasks, over 20,000 questions, and multiple tracks covering concept understanding, knowledge reasoning, user‑behavior alignment, multilingual ability, and an all‑round track, all evaluated with task‑specific metrics and a hidden test set.

KDD CupLLMShopBench

0 likes · 11 min read

Inside the 2024 KDD Cup ShopBench Challenge: Tasks, Data, and Evaluation Metrics

Baobao Algorithm Notes

Mar 17, 2024 · Artificial Intelligence

Why Role‑Playing LLMs Need More Than Assistant Fine‑Tuning

The article explains that current large language models lack true self‑awareness and act as assistants, so achieving convincing role‑playing behavior requires dedicated system prompts, specialized data, careful balance of continue pre‑training and general SFT, and evaluation methods to detect dissonance and preserve base capabilities.

AILLMPrompt Engineering

0 likes · 19 min read

Why Role‑Playing LLMs Need More Than Assistant Fine‑Tuning

Bilibili Tech

Mar 15, 2024 · Artificial Intelligence

Hardware Resource Estimation and Bottleneck Analysis for Large Language Models (LLMs)

The article analyzes the compute, memory, and communication resources required to train and run large language models, quantifies bottlenecks such as the massive FLOP demand, terabyte‑scale GPU memory, and high‑bandwidth interconnect needs, and evaluates parallelism strategies and bandwidth estimates to guide hardware and software design for scaling LLMs.

AI infrastructureHardwareLLM

0 likes · 53 min read

Hardware Resource Estimation and Bottleneck Analysis for Large Language Models (LLMs)

Sohu Tech Products

Mar 13, 2024 · Artificial Intelligence

Build a Minimal Retrieval‑Augmented Generation (Tiny‑RAG) from Scratch

This step‑by‑step guide explains how to implement a lightweight Retrieval‑Augmented Generation system—Tiny‑RAG—by creating embedding classes, loading and chunking documents, building a simple vector store, performing similarity search, and integrating a large language model for answer generation, complete with runnable Python code.

EmbeddingLLMPython

0 likes · 14 min read

Build a Minimal Retrieval‑Augmented Generation (Tiny‑RAG) from Scratch

Alipay Experience Technology

Mar 13, 2024 · Artificial Intelligence

Unlock LangChain: Build Powerful LLM Apps Like LEGO with Agents and Tools

This article explains how LangChain turns large language models into modular LEGO‑like building blocks, covering its core concepts, practical travel‑assistant and face‑recognition troubleshooting examples, and explores the rapid evolution of AI agents such as Gorilla, ToolLLaMa, MetaGPT and ChatDev.

LLMLangChainTool Integration

0 likes · 43 min read

Unlock LangChain: Build Powerful LLM Apps Like LEGO with Agents and Tools

Efficient Ops

Mar 13, 2024 · Operations

Why Traditional Ops Stalls and How AI‑Driven Solutions Can Revitalize It

The article examines common operational pain points such as cumbersome release processes, lack of standardization, and weak security controls, then explores how AI‑powered SRE tools and automation can address these challenges and guide teams toward more efficient, standardized, and resilient operations.

AILLMSRE

0 likes · 9 min read

Why Traditional Ops Stalls and How AI‑Driven Solutions Can Revitalize It

AI Large Model Application Practice

Mar 12, 2024 · Artificial Intelligence

How to Build a Corrective RAG Agent with LangGraph: A Step‑by‑Step Guide

This article explains how to use LangGraph—a graph‑based extension of LangChain—to implement a corrective RAG (C‑RAG) pipeline that evaluates retrieved documents, rewrites queries when needed, performs web search, and generates accurate answers, complete with code snippets and a runnable example.

Corrective RAGLLMLangChain

0 likes · 14 min read

How to Build a Corrective RAG Agent with LangGraph: A Step‑by‑Step Guide

AntTech

Mar 11, 2024 · Artificial Intelligence

Can Small Language Models be Good Reasoners in Recommender Systems?

This article presents SLIM, a knowledge‑distillation framework that transfers the reasoning abilities of large language models to compact models for sequential recommendation, enhancing item representation, user profiling, and bias mitigation while achieving comparable performance with far lower computational resources.

AIEfficiencyKnowledge Distillation

0 likes · 12 min read

Can Small Language Models be Good Reasoners in Recommender Systems?

Aikesheng Open Source Community

Mar 11, 2024 · Databases

Google Adds Vector Search to MySQL Service, Advancing LLM Capabilities

Google has introduced preview vector search to its Cloud SQL for MySQL service, positioning it ahead of Oracle, while industry analysts note the growing importance of vector capabilities for generative AI applications across major database platforms.

Artificial IntelligenceCloud SQLDatabase Trends

0 likes · 7 min read

Google Adds Vector Search to MySQL Service, Advancing LLM Capabilities

NewBeeNLP

Mar 10, 2024 · Industry Insights

What WWW'24 Papers Reveal About LLMs in Search & Recommendation

This overview summarizes six WWW 2024 industry papers that apply large language models to e‑commerce search, personalized query suggestion, article recommendation, collaborative filtering, and lifelong sequential behavior understanding, highlighting their methods, experimental results, deployment status, and emerging trends in LLM‑driven search and recommendation.

LLMLarge Language ModelsSearch

0 likes · 16 min read

What WWW'24 Papers Reveal About LLMs in Search & Recommendation

Alibaba Cloud Native

Mar 9, 2024 · Cloud Computing

Deploy Google Gemma LLM on Alibaba Cloud Function Compute GPU with Low‑Cost Idle Mode

This guide shows how to quickly and cheaply deploy the open‑source Google Gemma large language model on Alibaba Cloud Function Compute GPU using the new idle‑billing mode, covering prerequisites, Docker image creation, function setup, idle reservation, testing, monitoring, and cost estimation.

Cloud ComputingFunction ComputeGPU

0 likes · 10 min read

Deploy Google Gemma LLM on Alibaba Cloud Function Compute GPU with Low‑Cost Idle Mode

NewBeeNLP

Mar 8, 2024 · Industry Insights

Why Building LLMs Is Like Buying a Hardware Lottery – Lessons from a Startup

The article recounts Yi Tay’s experience founding Reka and building large language models from scratch, highlighting the unpredictable quality of GPU clusters, the challenges of multi‑cluster orchestration, code‑base choices, and how startups must rely on fast, intuition‑driven experimentation to succeed.

Cluster ManagementGPUHardware

0 likes · 12 min read

Why Building LLMs Is Like Buying a Hardware Lottery – Lessons from a Startup

Sohu Tech Products

Mar 6, 2024 · Mobile Development

On‑Device Deployment of Large Language Models Using Sohu’s Hybrid AI Engine and GPT‑2

The article outlines how Sohu’s Hybrid AI Engine enables on‑device deployment of a distilled GPT‑2 model by converting it to TensorFlow Lite, detailing the setup, customization with Keras, inference workflow, and core SDK calls, and argues that this approach offers fast, private, and cost‑effective AI for mobile devices despite typical LLM constraints.

GPT-2Hybrid AIKeras

0 likes · 9 min read

On‑Device Deployment of Large Language Models Using Sohu’s Hybrid AI Engine and GPT‑2

Alibaba Cloud Developer

Mar 6, 2024 · Artificial Intelligence

Unlocking LangChain: Build Powerful LLM Apps Like LEGO with Real-World Examples

This article explains how LangChain simplifies building and integrating large language model applications by providing modular components such as models, prompts, indexes, tools, memory, chains, and agents, illustrated with practical use cases like travel assistants, face‑recognition troubleshooting, and multi‑agent workflows.

AI agentsLLMLangChain

0 likes · 44 min read

Unlocking LangChain: Build Powerful LLM Apps Like LEGO with Real-World Examples

21CTO

Mar 5, 2024 · Artificial Intelligence

Can Generative AI Replace Human Programmers? LLM Insights & Future of Coding

The article examines why large language models (LLMs) cannot fully replace human programmers, compares major models like Gemma, Code Llama, GPT‑4 and Claude, discusses trust and copyright concerns, and explores how smaller, specialized LLMs may shape the future of software development.

AI ethicsLLMcode generation

0 likes · 7 min read

Can Generative AI Replace Human Programmers? LLM Insights & Future of Coding

JD Retail Technology

Mar 4, 2024 · Artificial Intelligence

How JD Retail Integrates LLMs with SFT, RAG, and AI Agents for Real-World Impact

This article examines JD Retail's end‑to‑end large language model framework that combines supervised fine‑tuning, retrieval‑augmented generation, and ReAct‑based AI agents to overcome retail‑specific challenges, improve model accuracy, reduce hallucinations, and enable autonomous multi‑step business workflows.

AI AgentArtificial IntelligenceIndustry Insights

0 likes · 20 min read

How JD Retail Integrates LLMs with SFT, RAG, and AI Agents for Real-World Impact

21CTO

Feb 29, 2024 · Artificial Intelligence

StarCoder2 Unveiled: Open-Source LLM That Outperforms Its Predecessor with Fewer Parameters

StarCoder2, the latest open-source large language model from ServiceNow, Hugging Face, and NVIDIA, offers three sizes—30B, 70B, and 150B parameters—delivering performance comparable to the original 150B StarCoder while being more efficient and freely accessible under the BigCode Open RAIL‑M license.

Artificial IntelligenceLLMStarCoder2

0 likes · 4 min read

StarCoder2 Unveiled: Open-Source LLM That Outperforms Its Predecessor with Fewer Parameters

Baobao Algorithm Notes

Feb 28, 2024 · Artificial Intelligence

Building a LLM‑Powered Theme‑Park Queue Planner with Baidu Qianfan AppBuilder

This article walks through using large language model agents to create a theme‑park queue planning assistant for Baidu's Qianfan competition, covering the problem definition, dynamic‑programming‑style solution, prompt engineering, Python code generation, and step‑by‑step deployment in AppBuilder.

AI AgentAppBuilderLLM

0 likes · 17 min read

Building a LLM‑Powered Theme‑Park Queue Planner with Baidu Qianfan AppBuilder

NewBeeNLP

Feb 27, 2024 · Artificial Intelligence

Boosting E‑Commerce AIGC with Knowledge Graphs: From Multimodal Inputs to Controlled LLMs

The article details how JD.com leverages domain‑specific and generic knowledge graphs to enhance multimodal product information, improve controlled text generation, and boost LLM performance for e‑commerce copywriting, covering model architecture, copy‑only mechanisms, token‑type encoding, experimental results, and practical deployment scenarios.

AIGCKnowledge GraphLLM

0 likes · 23 min read

Boosting E‑Commerce AIGC with Knowledge Graphs: From Multimodal Inputs to Controlled LLMs

Alibaba Cloud Big Data AI Platform

Feb 27, 2024 · Artificial Intelligence

Build a Knowledge‑Enhanced LLM Chatbot with Alibaba Cloud PAI: A Step‑by‑Step RAG Guide

This comprehensive guide walks AI developers through building a Retrieval‑Augmented Generation (RAG) chatbot on Alibaba Cloud PAI, covering architecture, vector store setup, model deployment, knowledge ingestion, multi‑modal retrieval, fusion, re‑ranking, prompt design, and end‑to‑end configuration with code examples.

Alibaba CloudChatbotLLM

0 likes · 26 min read

Build a Knowledge‑Enhanced LLM Chatbot with Alibaba Cloud PAI: A Step‑by‑Step RAG Guide

DataFunTalk

Feb 26, 2024 · Artificial Intelligence

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

With the rapid rise of ChatGPT and large language models, recommendation systems are undergoing a transformative shift, moving beyond traditional behavior‑based methods to leverage LLMs for improved generalization, representation, and prompt‑based learning, while addressing challenges such as scalability, interpretability, bias, and deployment costs.

AIGeneralizationLLM

0 likes · 19 min read

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

AI Large Model Application Practice

Feb 23, 2024 · Artificial Intelligence

How to Build a Text‑to‑SQL Chatbot with Vanna’s Open‑Source RAG Framework

This guide explains Vanna, an open‑source Python RAG framework for Text2SQL, covering its core concepts, RAG‑based architecture, step‑by‑step model training, code examples for customization, and how to deploy a conversational database chatbot with a Flask web UI.

ChatbotDatabaseLLM

0 likes · 11 min read

How to Build a Text‑to‑SQL Chatbot with Vanna’s Open‑Source RAG Framework

Ops Development & AI Practice

Feb 22, 2024 · Artificial Intelligence

Exploring GPT4All: Open-Source LLMs You Can Run Locally on Any Device

GPT4All, an open‑source LLM ecosystem from Nomic AI, lets users run and customize large language models locally on CPUs or GPUs, offering features like GGUF support, multi‑platform installers, API access, and community contribution guidelines, making it a versatile tool for AI enthusiasts and developers.

AIGPT4AllLLM

0 likes · 4 min read

Exploring GPT4All: Open-Source LLMs You Can Run Locally on Any Device

DaTaobao Tech

Feb 21, 2024 · Artificial Intelligence

An Overview of LangChain: Core Concepts and Practical Implementations

The article introduces LangChain as a framework that unifies LLM providers through model I/O, connects external data via retrievers, composes workflows with chains, maintains context with memory, and enables tool use through agents, and demonstrates Java examples for TongYi embeddings, a ChatGLM‑6B RetrievalQA chain, and discusses agent registration and micro‑service‑based agent factories.

EmbeddingJavaLLM

0 likes · 9 min read

An Overview of LangChain: Core Concepts and Practical Implementations

Rare Earth Juejin Tech Community

Feb 21, 2024 · Artificial Intelligence

Building a Personal Blog Knowledge Base with Coze AI Bot Platform

This guide explains how to use the Coze no‑code AI Bot platform to create a personal blog knowledge base by configuring plugins, data sources, persistence, workflow nodes, prompt engineering, and publishing the bot for seamless AI‑assisted content retrieval.

AI botCozeKnowledge Base

0 likes · 12 min read

Building a Personal Blog Knowledge Base with Coze AI Bot Platform

21CTO

Feb 20, 2024 · Artificial Intelligence

Which LLM Dominates Coding? GPT‑4 vs CodeLlama vs Mixtral vs Gemini

This article presents a head‑to‑head evaluation of four leading large language models—GPT‑4, CodeLlama 70B, CodeLlama 7B, and Mixtral 8x7B—across eight coding‑related tasks, revealing GPT‑4 as the overall winner while highlighting the trade‑offs of smaller models and emerging competitors like Google Gemini.

AI evaluationCodeLlamaGPT-4

0 likes · 9 min read

Which LLM Dominates Coding? GPT‑4 vs CodeLlama vs Mixtral vs Gemini

Alibaba Cloud Developer

Feb 20, 2024 · Artificial Intelligence

Boost LLM Inference Speed with KV‑Cache Reuse and Speculative Sampling

This article explains two production‑grade optimization techniques for large language model inference—KV‑cache reuse across multi‑turn dialogues and speculative sampling with a small draft model—detailing their design, implementation, and performance impact.

AIInference OptimizationKV Cache

0 likes · 14 min read

Boost LLM Inference Speed with KV‑Cache Reuse and Speculative Sampling