Tagged articles
6 articles
Page 1 of 1
Su San Talks Tech
Su San Talks Tech
May 31, 2026 · Artificial Intelligence

How Claude Code, Codex, and OpenCode Can Cut Token Usage by Up to 80%

The article breaks down why input tokens dominate 70‑90% of LLM costs and provides concrete, platform‑specific techniques—file filtering, context compression, documentation drives, memory caching, plan mode, output trimming, and model switching—that together can reduce token consumption by 20‑90% across Claude Code, Codex, and OpenCode.

AI coding assistantsClaude CodeCodex
0 likes · 10 min read
How Claude Code, Codex, and OpenCode Can Cut Token Usage by Up to 80%
Eric Tech Circle
Eric Tech Circle
Nov 21, 2025 · Artificial Intelligence

Unlock Gemini CLI: Free AI Power, Installation Guide, and Pro Tips

After three months of using the free Gemini CLI, this guide explains why the tool is valuable, shows how to install and enable the Gemini‑3‑Pro model, and provides practical tips such as resetting daily quotas, switching models, managing rules, adding context, and configuring MCP tools.

AI toolCommand LineContext Management
0 likes · 6 min read
Unlock Gemini CLI: Free AI Power, Installation Guide, and Pro Tips
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 10, 2025 · Artificial Intelligence

Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway

Learn how to deploy the new QwQ‑32B inference model alongside DeepSeek‑R1 using the Higress AI gateway, covering environment setup, model configuration, routing, token‑level rate limiting, content safety, semantic caching, and advanced features like automatic fallback and internet‑search integration.

DeepSeekHigressLLM integration
0 likes · 16 min read
Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway