Tagged articles
14 articles
Page 1 of 1
Shuge Unlimited
Shuge Unlimited
May 21, 2026 · Artificial Intelligence

Building Smarter, More Reliable, and Faster Claude Code Workflows with 19 OMC Agents

This article dissects the oh‑my‑claudecode (OMC) system, explaining Claude Code's four‑layer agentic workflow, OMC's Hooks‑Skills‑Agents‑State architecture, model‑routing strategies, installation steps, and three real‑world scenarios that demonstrate how 19 specialized agents can automate refactoring, bulk fixes, and multi‑module bug resolution.

AI agentsClaude CodeModel routing
0 likes · 24 min read
Building Smarter, More Reliable, and Faster Claude Code Workflows with 19 OMC Agents
Ops Development & AI Practice
Ops Development & AI Practice
May 19, 2026 · Artificial Intelligence

Does Locking Gemini CLI to Pro Really Drain Your Quota? A Deep Dive into Model Routing

The article explains how Gemini CLI’s Auto mode intelligently switches between the Pro and Flash models, why manually locking the Pro model does not cause extra quota consumption, presents benchmark comparisons, clarifies the meaning of the quota indicator, and offers practical model‑selection guidance.

Auto modeGemini CLIModel routing
0 likes · 6 min read
Does Locking Gemini CLI to Pro Really Drain Your Quota? A Deep Dive into Model Routing
21CTO
21CTO
May 9, 2026 · Artificial Intelligence

Why Most AI Coding Feels Like Driving a Ferrari to Buy Milk

In an interview, Neel Sundaresan, the founding engineer behind GitHub Copilot and now lead of IBM Bob, explains how his API‑recommendation system evolved into an enterprise‑focused AI coding assistant, discusses the hidden costs of large models, and shares his view on the future of AI agents.

AI agentsAI codingIBM Bob
0 likes · 10 min read
Why Most AI Coding Feels Like Driving a Ferrari to Buy Milk
AI Engineering
AI Engineering
May 4, 2026 · Artificial Intelligence

Why the Big‑Model Race Is Over: Where Real Value Lies in AI Infrastructure

The article argues that the competition over which large language model will dominate is outdated, explaining that true value now comes from building multi‑model routing, context engineering, standardized tool protocols, intelligent orchestration, and robust evaluation layers that turn models into reliable AI infrastructure.

AI infrastructureMCPModel routing
0 likes · 6 min read
Why the Big‑Model Race Is Over: Where Real Value Lies in AI Infrastructure
AI Architecture Path
AI Architecture Path
May 4, 2026 · Artificial Intelligence

Free Open-Source Proxy Unlocks Unlimited Claude Code Usage

Claude Code offers powerful AI‑assisted coding but its official API is costly and hard to access; the open‑source free‑claude‑code proxy intercepts Claude’s requests, forwards them to free or low‑cost LLM providers, provides multi‑model routing, zero‑cost usage, detailed setup steps, advanced bot and voice features, and troubleshooting guidance.

AI programmingClaude CodeFastAPI
0 likes · 17 min read
Free Open-Source Proxy Unlocks Unlimited Claude Code Usage
AI Programming Lab
AI Programming Lab
Apr 8, 2026 · Artificial Intelligence

Hermes Agent: An Open‑Source AI Agent Framework That Learns and Grows

Hermes Agent is a self‑evolving open‑source AI agent framework that supports over 200 models and 40 tools, offering persistent memory, skill auto‑generation, cross‑session retrieval, multi‑backend execution, and robust security, while providing a quick one‑line deployment and a direct comparison with OpenClaw.

AI agentsDeploymentHermes Agent
0 likes · 7 min read
Hermes Agent: An Open‑Source AI Agent Framework That Learns and Grows
James' Growth Diary
James' Growth Diary
Apr 6, 2026 · Artificial Intelligence

10 Practical LangChain Performance Hacks to Speed Up and Cut Costs

This article presents ten concrete techniques—including in‑memory and Redis caching, semantic caching, parallel execution, batch processing, prompt compression, model routing, streaming output, and connection‑pool reuse—to dramatically reduce latency and token costs in production LangChain applications.

CachingLangChainModel routing
0 likes · 14 min read
10 Practical LangChain Performance Hacks to Speed Up and Cut Costs
AI Step-by-Step
AI Step-by-Step
Apr 1, 2026 · Artificial Intelligence

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

The article explains why routing every request to the most powerful LLM hurts cost, speed, and throughput, and presents a three‑layer task decomposition that assigns execution‑level tasks to cheap small models, intermediate tasks to mid‑size models, and high‑risk judgment tasks to large models, with concrete examples and a minimal routing strategy.

Agent DesignLLMModel routing
0 likes · 8 min read
When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth
DataFunSummit
DataFunSummit
Mar 20, 2026 · Artificial Intelligence

Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents

OpenClaw v2026.3.7 brings webhook compatibility fixes, private‑message typing feedback, a 33% token‑saving prompt‑cache, smarter model routing, seamless integration of domestic LLMs such as DeepSeek, Doubao and Qwen, and persistent bindings for Docker deployments, dramatically improving stability, cost efficiency and scalability for enterprise AI agents.

FeishuModel routingOpenClaw
0 likes · 10 min read
Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents
DataFunSummit
DataFunSummit
Mar 16, 2026 · Artificial Intelligence

Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents

OpenClaw v2026.3.7 introduces Feishu webhook compatibility fixes, prompt‑cache cost reductions, smarter model routing, domestic model connectors, and persistent binding for container deployments, turning the tool from a geek toy into a reliable enterprise‑grade AI‑agent platform.

AI agentsModel routingOpenClaw
0 likes · 10 min read
Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents
DataFunTalk
DataFunTalk
Mar 15, 2026 · Artificial Intelligence

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Efficiency and Cuts Costs

The OpenClaw v2026.3.7 upgrade introduces webhook compatibility fixes, typing‑feedback support, a 33% prompt‑caching cost reduction, smarter model routing with domestic model integration, and persistent bindings for container deployments, making the platform far more suitable for enterprise AI agent scenarios.

AI agentsContainer DeploymentModel routing
0 likes · 10 min read
How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Efficiency and Cuts Costs
DataFunTalk
DataFunTalk
Mar 9, 2026 · Artificial Intelligence

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Performance and Cuts Costs by 33%

The OpenClaw v2026.3.7 release introduces Feishu webhook compatibility, typing‑feedback, prompt‑caching that reduces token usage by a third, smarter model routing, domestic model integrations and persistent bindings for Docker, offering enterprises a more stable, cost‑effective AI‑agent platform.

AI agentsContainer DeploymentModel routing
0 likes · 10 min read
How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Performance and Cuts Costs by 33%
Alibaba Cloud Native
Alibaba Cloud Native
Oct 28, 2025 · Artificial Intelligence

How SOFA AI Gateway Transforms Cloud‑Native AI Service Management

The article explains how the SOFA AI Gateway, built on the open‑source Higress kernel, evolves traditional API gateways into specialized AI gateways by adding intelligent routing, model proxy, agent proxy, and MCP market features to meet the unique latency, resource, and security demands of AI workloads.

AI gatewayCloud NativeHigress
0 likes · 12 min read
How SOFA AI Gateway Transforms Cloud‑Native AI Service Management
DataFunTalk
DataFunTalk
Sep 14, 2025 · Artificial Intelligence

Why Modern LLMs Skip Thinking: Token Routing and Zero‑Compute Experts Explained

The article examines how large language models now use routing mechanisms and token‑level expert selection to reduce computation and cost, illustrating the trade‑offs with real‑world examples from OpenAI, LongCat, and DeepSeek while highlighting both the benefits and the pitfalls of this approach.

AIModel routingdeep learning
0 likes · 8 min read
Why Modern LLMs Skip Thinking: Token Routing and Zero‑Compute Experts Explained