Tagged articles
12 articles
Page 1 of 1
SuanNi
SuanNi
May 22, 2026 · Artificial Intelligence

How GLM‑5.1‑highspeed Achieves 7× Faster Inference to Become the World’s Fastest Flagship Model

On May 22, Zhipu launched the GLM‑5.1‑highspeed API, delivering 400 tokens per second—about 7× faster than the original model and twice as fast as Gemini 3.5 Flash—through a three‑layer optimization that rewrites the MoE inference path, introduces dynamic scheduling, and leverages TileRT’s AOT engine to cut latency while preserving full flagship capabilities.

GLM-5.1Inference OptimizationLarge Language Model
0 likes · 10 min read
How GLM‑5.1‑highspeed Achieves 7× Faster Inference to Become the World’s Fastest Flagship Model
JavaGuide
JavaGuide
May 12, 2026 · Artificial Intelligence

Unlock Claude Code Tokens and Access GLM‑5.1 with iFlytek’s Ultra‑Low‑Cost Coding Plan

The article reviews iFlytek’s Coding Plan, highlighting its three pricing tiers—including a 3.9 CNY entry tier that unlocks GLM‑5.1 for Claude Code—explains how request‑based billing reduces token anxiety, and provides step‑by‑step integration guides for Claude Code, OpenClaw and Cursor along with real‑world skill examples.

AI codingClaude CodeCursor
0 likes · 13 min read
Unlock Claude Code Tokens and Access GLM‑5.1 with iFlytek’s Ultra‑Low‑Cost Coding Plan
AI Engineer Programming
AI Engineer Programming
Apr 26, 2026 · Artificial Intelligence

2026 AI Model API Prices – DeepSeek V4 Flash Costs Only 1% of GPT‑5.5

The article provides a detailed April 2026 comparison of API pricing for six major AI model families—including DeepSeek, GLM‑5.1, Kimi, Claude, GPT‑5.5, and Gemini—covering official and proxy channels, context limits, discount periods, peak‑time surcharges, and practical selection recommendations for developers.

AI Model PricingClaudeDeepSeek
0 likes · 11 min read
2026 AI Model API Prices – DeepSeek V4 Flash Costs Only 1% of GPT‑5.5
Old Meng AI Explorer
Old Meng AI Explorer
Apr 23, 2026 · Artificial Intelligence

GLM-5.1 vs Qwen3.6 Plus vs MiniMax M2.7: In‑Depth 2026 Review of China’s Top AI Models

This article provides a detailed, data‑driven comparison of three 2026 Chinese flagship large language models—GLM-5.1, Qwen3.6 Plus, and MiniMax M2.7—covering knowledge, math, code, long‑task, multimodal performance, pricing, open‑source status, ecosystem support, and scenario‑based recommendations.

GLM-5.1Large Language ModelMiniMax M2.7
0 likes · 12 min read
GLM-5.1 vs Qwen3.6 Plus vs MiniMax M2.7: In‑Depth 2026 Review of China’s Top AI Models
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 12, 2026 · Industry Insights

How 1/10 Pricing Drives Chinese LLMs to 10× Market Share

The article analyzes how Chinese large language models like GLM‑5.1, Qianwen 3.6‑Plus and Gemma 4 achieve roughly one‑tenth the cost of GPT‑5.4, leading to dramatically higher profit margins, silent migration in Silicon Valley, and a rapid rise in market share backed by a maturing ecosystem.

AI model ecosystemChinese LLMGLM-5.1
0 likes · 10 min read
How 1/10 Pricing Drives Chinese LLMs to 10× Market Share
Coder Circle
Coder Circle
Apr 8, 2026 · Industry Insights

GLM‑5.1 Enables 8‑Hour Continuous Operation and Leads SWE‑bench; Tencent Unveils First Open‑Config AI Browser

The AI daily briefing highlights GLM‑5.1’s breakthrough 8‑hour continuous reasoning, its top performance on SWE‑bench and a 10% price hike, while Tencent’s QBotClaw introduces the first domestically free‑configurable large‑model API browser, signaling a shift toward open AI ecosystems in China.

AI ecosystemAI pricingGLM-5.1
0 likes · 6 min read
GLM‑5.1 Enables 8‑Hour Continuous Operation and Leads SWE‑bench; Tencent Unveils First Open‑Config AI Browser
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Apr 8, 2026 · Artificial Intelligence

Unlocking 8‑Hour Autonomous Coding: GLM‑5.1’s Leap with Kunlun XPU

The open‑source GLM‑5.1 model, adapted to Baidu Baige's Kunlun XPU via the vLLM‑Kunlun Plugin, delivers record‑breaking SWE‑bench scores, eight‑hour autonomous coding, long‑context handling up to 64K tokens, and scalable deployment across tens of thousands of chips, showcasing end‑to‑end AI acceleration.

GLM-5.1Kunlun XPUModel Deployment
0 likes · 8 min read
Unlocking 8‑Hour Autonomous Coding: GLM‑5.1’s Leap with Kunlun XPU
PaperAgent
PaperAgent
Apr 2, 2026 · Artificial Intelligence

Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?

Using the GLM‑5.1 large language model, the author automated the end‑to‑end development of an ontology‑based knowledge‑graph extraction and visualization platform—covering backend, frontend, and graph database—in just 2 hours 47 minutes, consuming 747 k tokens and self‑correcting multiple issues.

AI EngineeringFull-Stack DevelopmentGLM-5.1
0 likes · 12 min read
Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?
Su San Talks Tech
Su San Talks Tech
Apr 2, 2026 · Artificial Intelligence

How GLM-5.1 Beats Its Predecessor: A Hands‑On Test and Deep Dive

The article presents a detailed, hands‑on evaluation of the newly released GLM‑5.1 model, describing the rollout strategy, step‑by‑step testing on complex coding tasks, configuration details, observed performance improvements over previous versions, and practical guidance for developers seeking to leverage the model for real‑world projects.

AI coding assistantGLM-5.1Large Language Model
0 likes · 9 min read
How GLM-5.1 Beats Its Predecessor: A Hands‑On Test and Deep Dive
ShiZhen AI
ShiZhen AI
Mar 28, 2026 · Artificial Intelligence

GLM-5.1 Now Open to All: Performance vs Claude Opus, Pricing & Setup Guide

GLM-5.1 is now available to all Coding Plan subscribers, including the $10/month Lite tier, scoring 45.3 on SWE‑bench—just 5.4% below Claude Opus 4.6’s 47.9—while offering 20+ tool integrations and a manual switch from the default GLM‑4.7 model.

AI coding modelClaude OpusGLM-5.1
0 likes · 7 min read
GLM-5.1 Now Open to All: Performance vs Claude Opus, Pricing & Setup Guide