Tag

LLM

2 views collected around this technical thread.

Architecture & Thinking
Architecture & Thinking
Jun 11, 2025 · Artificial Intelligence

Accelerate LLM App Development with Eino: A Go Framework Walkthrough

Eino is an open‑source Golang framework for building large‑model applications, offering reusable components, robust orchestration, clean APIs, best‑practice templates, and full‑cycle DevOps tools, with code examples for both Ollama and OpenAI modes, plus streaming and normal output options.

AI DevelopmentGoLLM
0 likes · 10 min read
Accelerate LLM App Development with Eino: A Go Framework Walkthrough
DataFunSummit
DataFunSummit
Jun 8, 2025 · Artificial Intelligence

Mastering LLM Applications: Practical Agent Design and Implementation Strategies

This comprehensive guide explores the core implementation paths for large language model (LLM) applications, focusing on agent design, workflow orchestration, tool integration, memory management, multi‑agent architectures, and future trends, providing actionable methodologies and real‑world examples for practitioners.

AI AgentAgent DesignLLM
0 likes · 25 min read
Mastering LLM Applications: Practical Agent Design and Implementation Strategies
Youzan Coder
Youzan Coder
Jun 6, 2025 · Artificial Intelligence

How AI Agents Turn Manual Data Retrieval into Fully Automated Insights

This article examines the challenges of manual data extraction in data‑driven enterprises, explains why large language models alone fall short, and details how the Cursor‑Agent framework automates end‑to‑end querying, knowledge‑base integration, and result validation to become a self‑sufficient "data master" for both technical and non‑technical users.

AI AgentCursor-AgentData Automation
0 likes · 26 min read
How AI Agents Turn Manual Data Retrieval into Fully Automated Insights
DaTaobao Tech
DaTaobao Tech
Jun 6, 2025 · Artificial Intelligence

Redefining Business Core Assets in the LLM Era: Agent Evolution & Collaboration

This article examines how the rise of large language models reshapes core business assets, defines agents and tools, explores multi‑agent collaboration patterns, task allocation and conflict resolution mechanisms, and evaluates the MCP protocol and engineering requirements for building scalable, flexible agent platforms.

LLMMCP protocolMulti-Agent Collaboration
0 likes · 9 min read
Redefining Business Core Assets in the LLM Era: Agent Evolution & Collaboration
Didi Tech
Didi Tech
Jun 5, 2025 · Artificial Intelligence

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

This article surveys the evolution of AI applications, explains large language model fundamentals, outlines architectural challenges, and introduces three core patterns—Retrieval‑Augmented Generation (RAG), autonomous Agents, and Model Context Protocol (MCP)—while providing practical LangChain code snippets and integration guidance.

AILLMLangChain
0 likes · 28 min read
Unlocking Modern AI Application Architecture: From RAG to Agents and MCP
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 4, 2025 · Artificial Intelligence

From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN

MarkerGen introduces a novel, plug‑and‑play framework that decomposes length‑controllable text generation into four sub‑abilities—identifying, counting, planning, and aligning—integrates external tokenizers and dynamic markers, and achieves significantly lower length errors and higher quality across diverse models, tasks, and languages.

LLMLength-Controlled GenerationMarkerGen
0 likes · 14 min read
From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
DaTaobao Tech
DaTaobao Tech
Jun 4, 2025 · Artificial Intelligence

Understanding Large Language Model Architecture, Parameters, Memory, Storage, and Fine‑Tuning Techniques

This article provides a comprehensive overview of large language models (LLMs), covering their transformer architecture, parameter counts, GPU memory and storage requirements, and detailed fine‑tuning methods such as prompt engineering, data construction, LoRA, PEFT, RLHF, and DPO, along with practical deployment and inference acceleration strategies.

DPOFine-tuningLLM
0 likes · 17 min read
Understanding Large Language Model Architecture, Parameters, Memory, Storage, and Fine‑Tuning Techniques
Tencent Technical Engineering
Tencent Technical Engineering
May 28, 2025 · Artificial Intelligence

A Beginner-friendly Overview of LLMs, Transformers, Prompts, Function Calling, MCP and Agents

This article provides a concise, easy-to-understand introduction to large language models, the transformer architecture, prompt engineering, temperature settings, function calling, the Model Context Protocol (MCP), agent communication (A2A), and future AI programming trends, using simple analogies and illustrative examples.

AIFunction CallingLLM
0 likes · 11 min read
A Beginner-friendly Overview of LLMs, Transformers, Prompts, Function Calling, MCP and Agents
Architecture & Thinking
Architecture & Thinking
May 25, 2025 · Artificial Intelligence

Which AI Workflow Platform Wins? A Deep Dive into n8n, Dify, and Coze

This article compares three leading AI workflow tools—n8n, Dify, and Coze—by examining their origins, technical architectures, core advantages, typical use cases, real‑world case studies, and future deployment trends, helping developers and businesses choose the right "intelligent assistant" for their needs.

AIComparisonLLM
0 likes · 11 min read
Which AI Workflow Platform Wins? A Deep Dive into n8n, Dify, and Coze
JD Tech Talk
JD Tech Talk
May 22, 2025 · Artificial Intelligence

From Academic Research to Industrial Anti‑Fraud: Leveraging LLMs, Reinforcement Learning, and Model Distillation for Advertising Risk Detection

The article recounts Xiaoting’s journey from a PhD research background to leading JD.com’s ad‑fraud detection, detailing how large language models, reinforcement learning, and model distillation were applied to identify hidden address codes, reduce false‑positive rates to 0.3%, and balance accuracy with real‑time performance in a high‑traffic e‑commerce environment.

AILLMModel Distillation
0 likes · 11 min read
From Academic Research to Industrial Anti‑Fraud: Leveraging LLMs, Reinforcement Learning, and Model Distillation for Advertising Risk Detection
DeWu Technology
DeWu Technology
May 19, 2025 · Artificial Intelligence

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

This article presents a comprehensive AI-driven solution for automatically generating functional test cases, detailing the AI background, design scheme, core components such as PRD parsing, test‑point generation, test‑case creation, knowledge‑base construction, implementation results, and future development directions.

AILLMRAG
0 likes · 7 min read
AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans
Youzan Coder
Youzan Coder
May 16, 2025 · Artificial Intelligence

Intelligent Address Recognition: AI‑Assisted Hybrid Solution and Prompt Engineering

This article describes how a hybrid architecture that combines third‑party address‑recognition APIs with large‑language‑model (LLM) processing, along with carefully engineered prompts and a TSV output format, dramatically improves address parsing accuracy and latency in a retail checkout scenario.

AIHybrid ArchitectureLLM
0 likes · 12 min read
Intelligent Address Recognition: AI‑Assisted Hybrid Solution and Prompt Engineering
Java Architecture Diary
Java Architecture Diary
May 15, 2025 · Artificial Intelligence

What’s New in LangChain4j 1.0.0? A Deep Dive into Java AI SDK Features

LangChain4j 1.0.0 brings official OpenAI SDK support, GitHub Models integration, expanded database and vector store compatibility, customizable HTTP clients, and clear migration steps for renamed interfaces and streaming methods, marking a major milestone for Java AI development.

AI SDKJavaLLM
0 likes · 7 min read
What’s New in LangChain4j 1.0.0? A Deep Dive into Java AI SDK Features
Tencent Cloud Developer
Tencent Cloud Developer
May 13, 2025 · Artificial Intelligence

Function Calling and Model Context Protocol (MCP): Bridging Large Language Models with Real‑World Systems

The article reviews the shortcomings of traditional large language models, explains how function calling extends LLMs beyond pure text, introduces the Model Context Protocol (MCP) as a standardized USB‑C‑like interface for AI tools, and demonstrates a Python MCP example that integrates LLMs with Tencent Advertising APIs.

AI integrationAPIFunction Calling
0 likes · 16 min read
Function Calling and Model Context Protocol (MCP): Bridging Large Language Models with Real‑World Systems
Tencent Technical Engineering
Tencent Technical Engineering
May 12, 2025 · Artificial Intelligence

Comprehensive Summary and Expansion of Andrej Karpathy’s 7‑Hour LLM Lecture

This article provides a detailed Chinese‑to‑English summary of Andrej Karpathy’s 7‑hour LLM tutorial, covering chat process analysis, tokenization, pre‑training data pipelines, model architecture, training strategies, post‑training fine‑tuning, reinforcement learning, chain‑of‑thought reasoning, and current industry applications.

AILLMModel Architecture
0 likes · 25 min read
Comprehensive Summary and Expansion of Andrej Karpathy’s 7‑Hour LLM Lecture
Java Tech Enthusiast
Java Tech Enthusiast
May 12, 2025 · Artificial Intelligence

Chain‑of‑Recursive‑Thoughts (CoRT): Boosting LLM Reasoning with Recursive Self‑Critique

The article introduces Chain‑of‑Recursive‑Thoughts (CoRT), explains how recursive self‑evaluation enhances large language model reasoning, outlines its workflow, shares GitHub resources, compares it with existing CoT methods, and reports experimental results using Mistral 3.1 24B.

AIChain-of-Recursive-ThoughtsCoRT
0 likes · 6 min read
Chain‑of‑Recursive‑Thoughts (CoRT): Boosting LLM Reasoning with Recursive Self‑Critique
Youzan Coder
Youzan Coder
May 8, 2025 · Artificial Intelligence

Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons

The article details how Youzan’s Store Smart Assistant was built on the Feishu Aily platform, describing why Aily was chosen, the three‑stage development process, deep system integration, practical tips for knowledge‑base management and model stability, and the resulting efficiency gains such as handling 80% of routine queries.

AI AssistantAily platformLLM
0 likes · 24 min read
Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons
Architect
Architect
May 7, 2025 · Artificial Intelligence

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval‑Augmented Generation

The article reviews the RAG-MCP framework, which combines Retrieval‑Augmented Generation with Model Context Protocol to reduce prompt bloat and improve tool‑selection accuracy for large language models by first retrieving the most relevant tools before feeding them to the LLM.

Artificial IntelligenceLLMPrompt Bloat
0 likes · 11 min read
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval‑Augmented Generation
JD Tech
JD Tech
May 6, 2025 · Artificial Intelligence

One4All Generative Recommendation Framework for CPS Advertising

This article reviews recent advances in applying large language models to CPS advertising recommendation, outlines business requirements and core technical challenges, proposes an extensible multi‑task generative framework with explicit intent perception and multi‑objective optimization, and presents offline and online performance gains along with future research directions.

AI optimizationCPS AdvertisingLLM
0 likes · 13 min read
One4All Generative Recommendation Framework for CPS Advertising