AI Engineer Programming
Author

AI Engineer Programming

In the AI era, defining problems is often more important than solving them; here we explore AI's contradictions, boundaries, and possibilities.

74
Articles
0
Likes
44
Views
0
Comments
Recent Articles

Latest from AI Engineer Programming

74 recent articles
AI Engineer Programming
AI Engineer Programming
May 30, 2026 · Artificial Intelligence

Should You Pre‑filter or Post‑filter in RAG Vector Search?

The article examines RAG vector retrieval filtering strategies, comparing pre‑filtering (filter before vector search) and post‑filtering (filter after ANN search), and introduces single‑stage filtering, discussing their principles, trade‑offs, suitable scenarios, and architectural implications for accuracy and performance.

ANNRAGmetadata filtering
0 likes · 15 min read
Should You Pre‑filter or Post‑filter in RAG Vector Search?
AI Engineer Programming
AI Engineer Programming
May 29, 2026 · Artificial Intelligence

How to Build a Reliable RAG Test Dataset

The article explains why a structured test set is essential for Retrieval‑Augmented Generation systems, outlines failure modes, describes layered evaluation of retrieval and generation, details infrastructure like chunk IDs and manifests, and provides a complete annotation pipeline with cold‑start and adversarial strategies.

LLMRAGadversarial
0 likes · 24 min read
How to Build a Reliable RAG Test Dataset
AI Engineer Programming
AI Engineer Programming
May 28, 2026 · Artificial Intelligence

Claude Code Best Practices and Getting Started Guide for Large Codebases

This guide explains how Claude Code can be deployed in massive monorepos, legacy systems, and distributed repositories, detailing navigation methods, the limits of RAG, the benefits of agentic search, and a five‑layer support system—including CLAUDE.md, hooks, skills, plugins, and MCP servers—to help teams of thousands achieve reliable AI‑assisted coding.

AI codingCLAUDE.mdClaude Code
0 likes · 18 min read
Claude Code Best Practices and Getting Started Guide for Large Codebases
AI Engineer Programming
AI Engineer Programming
May 27, 2026 · Artificial Intelligence

MMR for RAG: Low-Cost Chunk Limits Balance Relevance and Diversity

When a long document is split into many highly similar chunks, vector‑based top‑k retrieval tends to return multiple pieces from the same source, causing document dominance; applying a per‑document chunk limit together with Maximal Marginal Relevance (MMR) re‑ranking introduces diversity while preserving relevance, offering a low‑cost way to improve RAG answer quality.

ChunkingDPPDiversity
0 likes · 17 min read
MMR for RAG: Low-Cost Chunk Limits Balance Relevance and Diversity
AI Engineer Programming
AI Engineer Programming
May 26, 2026 · Artificial Intelligence

What Exactly Makes a System AI‑Native?

The article defines AI‑native as a system whose existence depends on AI at every layer, contrasts it with AI‑enabled and AI‑first, explains the structural layers, role shifts, bottlenecks, and maturity stages, and offers concrete guidelines for building truly AI‑native engineering practices.

AI-nativeDevOpsSoftware Architecture
0 likes · 10 min read
What Exactly Makes a System AI‑Native?
AI Engineer Programming
AI Engineer Programming
May 25, 2026 · Artificial Intelligence

From Demo to Production: Building a Reliable Agent Development Lifecycle

The article outlines a four‑stage agent development lifecycle—Build, Test, Deploy, Monitor—explaining how early, iterative delivery, systematic testing, controlled deployment, and continuous monitoring transform experimental agents into reliable production systems while addressing governance, cost, and scalability challenges.

AgentDeploymentGovernance
0 likes · 16 min read
From Demo to Production: Building a Reliable Agent Development Lifecycle
AI Engineer Programming
AI Engineer Programming
May 24, 2026 · Artificial Intelligence

Why AI Agents Fail Beyond Hallucinations

The article catalogs dozens of AI agent failure modes—from one‑shot attempts and cold‑start amnesia to hidden harness control—and explains why these issues quickly overwhelm developers, then outlines concrete mitigation strategies and their trade‑offs.

AI agentsContext Managementagentic engineering
0 likes · 11 min read
Why AI Agents Fail Beyond Hallucinations
AI Engineer Programming
AI Engineer Programming
May 23, 2026 · Artificial Intelligence

Is the A2A Protocol Worth Using? An In‑Depth Technical Review

The article examines the emerging A2A (Agent‑to‑Agent) protocol, tracing its evolution from function calling to MCP and finally A2A, and evaluates its core concepts, security model, task lifecycle, transport options, design guidelines, and operational best practices for building interoperable AI agent systems.

A2AAgent CardJSON-RPC
0 likes · 15 min read
Is the A2A Protocol Worth Using? An In‑Depth Technical Review
AI Engineer Programming
AI Engineer Programming
May 22, 2026 · Artificial Intelligence

Is MCP Dead? From Protocol Design to Production

The article examines Model Context Protocol (MCP), introduced by Anthropic in November 2024, tracing its rapid adoption, architectural design—including Host/Client/Server roles, transport layers, security and observability practices—and outlines production guidelines, future roadmap, and current limitations.

AI integrationJSON-RPCMCP
0 likes · 19 min read
Is MCP Dead? From Protocol Design to Production
AI Engineer Programming
AI Engineer Programming
May 21, 2026 · Artificial Intelligence

RAG with Multimodal Inputs vs LLM + Toolchains: Handling Non‑Text Data

The article analyzes how large language models process only tokenized text, compares the traditional LLM‑plus‑toolchain pipeline with emerging multimodal models, evaluates their cost, speed, controllability, and hallucination risks, and proposes a hybrid architecture that matches each approach to specific document scenarios.

LLMMultimodalRAG
0 likes · 16 min read
RAG with Multimodal Inputs vs LLM + Toolchains: Handling Non‑Text Data