AI Engineer Programming
Author

AI Engineer Programming

In the AI era, defining problems is often more important than solving them; here we explore AI's contradictions, boundaries, and possibilities.

74
Articles
0
Likes
45
Views
0
Comments
Recent Articles

Latest from AI Engineer Programming

74 recent articles
AI Engineer Programming
AI Engineer Programming
May 20, 2026 · Artificial Intelligence

Why Chunk‑Based RAG Fails and How IdeaBlocks Improve Retrieval

The article argues that the common assumption that text chunks are the proper knowledge unit in RAG pipelines is flawed, leading to versioning, metadata, and redundancy problems, and demonstrates that replacing chunks with structured IdeaBlocks dramatically reduces corpus size, token usage, and improves vector relevance.

IdeaBlockLLMRAG
0 likes · 10 min read
Why Chunk‑Based RAG Fails and How IdeaBlocks Improve Retrieval
AI Engineer Programming
AI Engineer Programming
May 18, 2026 · Artificial Intelligence

Designing an Agent Gateway: Bridging Business Logic and Protocol Infrastructure

The article analyzes why traditional API gateways cannot meet the needs of stateful Agentic workflows and proposes a dedicated Agent gateway that handles access control, cross‑service execution tracing, and pre‑LLM security enforcement while addressing connection overhead, session fan‑out, and observability challenges.

A2AAI securityAgent Gateway
0 likes · 14 min read
Designing an Agent Gateway: Bridging Business Logic and Protocol Infrastructure
AI Engineer Programming
AI Engineer Programming
May 17, 2026 · Fundamentals

Why Are We Still Using Markdown?

The article analyses Markdown's minimalist design, its ambiguous syntax, security flaws such as ReDoS and XSS vulnerabilities, and the growing gap between its original simple transliteration goal and the complex compiler‑like features developers now demand.

CommonMarkMarkdownReDoS
0 likes · 14 min read
Why Are We Still Using Markdown?
AI Engineer Programming
AI Engineer Programming
May 17, 2026 · Artificial Intelligence

ReAct, Plan‑Execute, and Reflection: How Continuous Loops Make Agent Architecture Crucial

While a single LLM call is a stateless function, real‑world tasks require dynamic information gathering, hypothesis testing, and iterative refinement, so agents must operate in a continuous loop; the article analyzes core patterns such as ReAct, Plan‑Execute, Reflection, Multi‑Agent and HITL, highlighting state management, cost, debugging, and observability challenges.

Agent ArchitectureLLMObservability
0 likes · 21 min read
ReAct, Plan‑Execute, and Reflection: How Continuous Loops Make Agent Architecture Crucial
AI Engineer Programming
AI Engineer Programming
May 16, 2026 · Artificial Intelligence

How to Boost RAG Retrieval Quality: Real‑World Cost‑Benefit Analysis

This article examines practical ways to improve Retrieval‑Augmented Generation (RAG) retrieval quality—covering vector database choices, data chunking, embedding models, query expansion, and re‑ranking—while weighing performance gains against operational costs through multiple real‑world case studies.

LLMRAGRe‑ranking
0 likes · 16 min read
How to Boost RAG Retrieval Quality: Real‑World Cost‑Benefit Analysis
AI Engineer Programming
AI Engineer Programming
May 14, 2026 · Artificial Intelligence

RAG Retrieval: Comparing Bi-encoder and Cross-encoder Architectures

The article reviews the three‑step RAG pipeline, explains why retrieval quality hinges on fast, accurate semantic matching, contrasts Bi-encoder’s offline vector indexing and speed with Cross-encoder’s token‑level interaction and higher precision, and discusses hybrid solutions such as ColBERT and LLM rerankers with practical engineering guidelines.

Bi-EncoderColBERTCross-Encoder
0 likes · 10 min read
RAG Retrieval: Comparing Bi-encoder and Cross-encoder Architectures
AI Engineer Programming
AI Engineer Programming
May 13, 2026 · Artificial Intelligence

AI Agent Architecture Patterns: How to Choose the Right Solution for Your Workload

The article analyzes how AI agent architecture choices—single‑agent versus multi‑agent, ReAct, plan‑and‑execute, orchestrator‑worker, hierarchical teams, reflection, and HITL—affect cost, reliability, and scalability, providing quantitative trade‑offs and industry examples to guide workload‑specific selection.

AI AgentsArchitecture PatternsHuman-in-the-Loop
0 likes · 16 min read
AI Agent Architecture Patterns: How to Choose the Right Solution for Your Workload