Author

AI Engineer Programming

In the AI era, defining problems is often more important than solving them; here we explore AI's contradictions, boundaries, and possibilities.

Articles

Likes

Views

Comments

Latest from AI Engineer Programming

74 recent articles

AI Engineer Programming

May 30, 2026 · Artificial Intelligence

Should You Pre‑filter or Post‑filter in RAG Vector Search?

The article examines RAG vector retrieval filtering strategies, comparing pre‑filtering (filter before vector search) and post‑filtering (filter after ANN search), and introduces single‑stage filtering, discussing their principles, trade‑offs, suitable scenarios, and architectural implications for accuracy and performance.

ANNRAGmetadata filtering

0 likes · 15 min read

Should You Pre‑filter or Post‑filter in RAG Vector Search?

AI Engineer Programming

May 29, 2026 · Artificial Intelligence

How to Build a Reliable RAG Test Dataset

The article explains why a structured test set is essential for Retrieval‑Augmented Generation systems, outlines failure modes, describes layered evaluation of retrieval and generation, details infrastructure like chunk IDs and manifests, and provides a complete annotation pipeline with cold‑start and adversarial strategies.

LLMRAGadversarial

0 likes · 24 min read

How to Build a Reliable RAG Test Dataset

AI Engineer Programming

May 28, 2026 · Artificial Intelligence

Claude Code Best Practices and Getting Started Guide for Large Codebases

This guide explains how Claude Code can be deployed in massive monorepos, legacy systems, and distributed repositories, detailing navigation methods, the limits of RAG, the benefits of agentic search, and a five‑layer support system—including CLAUDE.md, hooks, skills, plugins, and MCP servers—to help teams of thousands achieve reliable AI‑assisted coding.

AI codingCLAUDE.mdClaude Code

0 likes · 18 min read

Claude Code Best Practices and Getting Started Guide for Large Codebases

AI Engineer Programming

May 27, 2026 · Artificial Intelligence

MMR for RAG: Low-Cost Chunk Limits Balance Relevance and Diversity

When a long document is split into many highly similar chunks, vector‑based top‑k retrieval tends to return multiple pieces from the same source, causing document dominance; applying a per‑document chunk limit together with Maximal Marginal Relevance (MMR) re‑ranking introduces diversity while preserving relevance, offering a low‑cost way to improve RAG answer quality.

ChunkingDPPDiversity

0 likes · 17 min read

MMR for RAG: Low-Cost Chunk Limits Balance Relevance and Diversity

AI Engineer Programming

May 26, 2026 · Artificial Intelligence

What Exactly Makes a System AI‑Native?

The article defines AI‑native as a system whose existence depends on AI at every layer, contrasts it with AI‑enabled and AI‑first, explains the structural layers, role shifts, bottlenecks, and maturity stages, and offers concrete guidelines for building truly AI‑native engineering practices.

AI-nativeDevOpsSoftware Architecture

0 likes · 10 min read

AI Engineer Programming

May 25, 2026 · Artificial Intelligence

From Demo to Production: Building a Reliable Agent Development Lifecycle

The article outlines a four‑stage agent development lifecycle—Build, Test, Deploy, Monitor—explaining how early, iterative delivery, systematic testing, controlled deployment, and continuous monitoring transform experimental agents into reliable production systems while addressing governance, cost, and scalability challenges.

AgentDeploymentGovernance

0 likes · 16 min read

From Demo to Production: Building a Reliable Agent Development Lifecycle

AI Engineer Programming

May 24, 2026 · Artificial Intelligence

Why AI Agents Fail Beyond Hallucinations

The article catalogs dozens of AI agent failure modes—from one‑shot attempts and cold‑start amnesia to hidden harness control—and explains why these issues quickly overwhelm developers, then outlines concrete mitigation strategies and their trade‑offs.

AI agentsContext Managementagentic engineering

0 likes · 11 min read

Why AI Agents Fail Beyond Hallucinations

AI Engineer Programming

May 23, 2026 · Artificial Intelligence

Is the A2A Protocol Worth Using? An In‑Depth Technical Review

The article examines the emerging A2A (Agent‑to‑Agent) protocol, tracing its evolution from function calling to MCP and finally A2A, and evaluates its core concepts, security model, task lifecycle, transport options, design guidelines, and operational best practices for building interoperable AI agent systems.

A2AAgent CardJSON-RPC

0 likes · 15 min read

Is the A2A Protocol Worth Using? An In‑Depth Technical Review

AI Engineer Programming

May 22, 2026 · Artificial Intelligence

Is MCP Dead? From Protocol Design to Production

The article examines Model Context Protocol (MCP), introduced by Anthropic in November 2024, tracing its rapid adoption, architectural design—including Host/Client/Server roles, transport layers, security and observability practices—and outlines production guidelines, future roadmap, and current limitations.

AI integrationJSON-RPCMCP

0 likes · 19 min read

Is MCP Dead? From Protocol Design to Production

AI Engineer Programming

May 21, 2026 · Artificial Intelligence

RAG with Multimodal Inputs vs LLM + Toolchains: Handling Non‑Text Data

The article analyzes how large language models process only tokenized text, compares the traditional LLM‑plus‑toolchain pipeline with emerging multimodal models, evaluates their cost, speed, controllability, and hallucination risks, and proposes a hybrid architecture that matches each approach to specific document scenarios.

LLMMultimodalRAG

0 likes · 16 min read

RAG with Multimodal Inputs vs LLM + Toolchains: Handling Non‑Text Data