Tagged articles

14 articles

Page 1 of 1

May 21, 2026 · Artificial Intelligence

Building Smarter, More Reliable, and Faster Claude Code Workflows with 19 OMC Agents

This article dissects the oh‑my‑claudecode (OMC) system, explaining Claude Code's four‑layer agentic workflow, OMC's Hooks‑Skills‑Agents‑State architecture, model‑routing strategies, installation steps, and three real‑world scenarios that demonstrate how 19 specialized agents can automate refactoring, bulk fixes, and multi‑module bug resolution.

AI agentsClaude CodeModel routing

0 likes · 24 min read

Building Smarter, More Reliable, and Faster Claude Code Workflows with 19 OMC Agents

Ops Development & AI Practice

May 19, 2026 · Artificial Intelligence

Does Locking Gemini CLI to Pro Really Drain Your Quota? A Deep Dive into Model Routing

The article explains how Gemini CLI’s Auto mode intelligently switches between the Pro and Flash models, why manually locking the Pro model does not cause extra quota consumption, presents benchmark comparisons, clarifies the meaning of the quota indicator, and offers practical model‑selection guidance.

Auto modeGemini CLIModel routing

0 likes · 6 min read

Does Locking Gemini CLI to Pro Really Drain Your Quota? A Deep Dive into Model Routing

21CTO

May 9, 2026 · Artificial Intelligence

Why Most AI Coding Feels Like Driving a Ferrari to Buy Milk

In an interview, Neel Sundaresan, the founding engineer behind GitHub Copilot and now lead of IBM Bob, explains how his API‑recommendation system evolved into an enterprise‑focused AI coding assistant, discusses the hidden costs of large models, and shares his view on the future of AI agents.

AI agentsAI codingIBM Bob

0 likes · 10 min read

Why Most AI Coding Feels Like Driving a Ferrari to Buy Milk

AI Engineering

May 4, 2026 · Artificial Intelligence

Why the Big‑Model Race Is Over: Where Real Value Lies in AI Infrastructure

The article argues that the competition over which large language model will dominate is outdated, explaining that true value now comes from building multi‑model routing, context engineering, standardized tool protocols, intelligent orchestration, and robust evaluation layers that turn models into reliable AI infrastructure.

AI infrastructureMCPModel routing

0 likes · 6 min read

Why the Big‑Model Race Is Over: Where Real Value Lies in AI Infrastructure

AI Architecture Path

May 4, 2026 · Artificial Intelligence

Free Open-Source Proxy Unlocks Unlimited Claude Code Usage

Claude Code offers powerful AI‑assisted coding but its official API is costly and hard to access; the open‑source free‑claude‑code proxy intercepts Claude’s requests, forwards them to free or low‑cost LLM providers, provides multi‑model routing, zero‑cost usage, detailed setup steps, advanced bot and voice features, and troubleshooting guidance.

AI programmingClaude CodeFastAPI

0 likes · 17 min read

Free Open-Source Proxy Unlocks Unlimited Claude Code Usage

AI Programming Lab

Apr 8, 2026 · Artificial Intelligence

Hermes Agent: An Open‑Source AI Agent Framework That Learns and Grows

Hermes Agent is a self‑evolving open‑source AI agent framework that supports over 200 models and 40 tools, offering persistent memory, skill auto‑generation, cross‑session retrieval, multi‑backend execution, and robust security, while providing a quick one‑line deployment and a direct comparison with OpenClaw.

AI agentsDeploymentHermes Agent

0 likes · 7 min read

Hermes Agent: An Open‑Source AI Agent Framework That Learns and Grows

James' Growth Diary

Apr 6, 2026 · Artificial Intelligence

10 Practical LangChain Performance Hacks to Speed Up and Cut Costs

This article presents ten concrete techniques—including in‑memory and Redis caching, semantic caching, parallel execution, batch processing, prompt compression, model routing, streaming output, and connection‑pool reuse—to dramatically reduce latency and token costs in production LangChain applications.

CachingLangChainModel routing

0 likes · 14 min read

10 Practical LangChain Performance Hacks to Speed Up and Cut Costs

AI Step-by-Step

Apr 1, 2026 · Artificial Intelligence

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

The article explains why routing every request to the most powerful LLM hurts cost, speed, and throughput, and presents a three‑layer task decomposition that assigns execution‑level tasks to cheap small models, intermediate tasks to mid‑size models, and high‑risk judgment tasks to large models, with concrete examples and a minimal routing strategy.

Agent DesignLLMModel routing

0 likes · 8 min read

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

DataFunSummit

Mar 20, 2026 · Artificial Intelligence

Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents

OpenClaw v2026.3.7 brings webhook compatibility fixes, private‑message typing feedback, a 33% token‑saving prompt‑cache, smarter model routing, seamless integration of domestic LLMs such as DeepSeek, Doubao and Qwen, and persistent bindings for Docker deployments, dramatically improving stability, cost efficiency and scalability for enterprise AI agents.

FeishuModel routingOpenClaw

0 likes · 10 min read

Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents

DataFunSummit

Mar 16, 2026 · Artificial Intelligence

Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents

OpenClaw v2026.3.7 introduces Feishu webhook compatibility fixes, prompt‑cache cost reductions, smarter model routing, domestic model connectors, and persistent binding for container deployments, turning the tool from a geek toy into a reliable enterprise‑grade AI‑agent platform.

AI agentsModel routingOpenClaw

0 likes · 10 min read

DataFunTalk

Mar 15, 2026 · Artificial Intelligence

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Efficiency and Cuts Costs

The OpenClaw v2026.3.7 upgrade introduces webhook compatibility fixes, typing‑feedback support, a 33% prompt‑caching cost reduction, smarter model routing with domestic model integration, and persistent bindings for container deployments, making the platform far more suitable for enterprise AI agent scenarios.

AI agentsContainer DeploymentModel routing

0 likes · 10 min read

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Efficiency and Cuts Costs

DataFunTalk

Mar 9, 2026 · Artificial Intelligence

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Performance and Cuts Costs by 33%

The OpenClaw v2026.3.7 release introduces Feishu webhook compatibility, typing‑feedback, prompt‑caching that reduces token usage by a third, smarter model routing, domestic model integrations and persistent bindings for Docker, offering enterprises a more stable, cost‑effective AI‑agent platform.

AI agentsContainer DeploymentModel routing

0 likes · 10 min read

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Performance and Cuts Costs by 33%

Alibaba Cloud Native

Oct 28, 2025 · Artificial Intelligence

How SOFA AI Gateway Transforms Cloud‑Native AI Service Management

The article explains how the SOFA AI Gateway, built on the open‑source Higress kernel, evolves traditional API gateways into specialized AI gateways by adding intelligent routing, model proxy, agent proxy, and MCP market features to meet the unique latency, resource, and security demands of AI workloads.

AI gatewayCloud NativeHigress

0 likes · 12 min read

How SOFA AI Gateway Transforms Cloud‑Native AI Service Management

DataFunTalk

Sep 14, 2025 · Artificial Intelligence

Why Modern LLMs Skip Thinking: Token Routing and Zero‑Compute Experts Explained

The article examines how large language models now use routing mechanisms and token‑level expert selection to reduce computation and cost, illustrating the trade‑offs with real‑world examples from OpenAI, LongCat, and DeepSeek while highlighting both the benefits and the pitfalls of this approach.

AIModel routingdeep learning

0 likes · 8 min read

Why Modern LLMs Skip Thinking: Token Routing and Zero‑Compute Experts Explained