Tagged articles

12 articles

May 22, 2026 · Artificial Intelligence

How GLM‑5.1‑highspeed Achieves 7× Faster Inference to Become the World’s Fastest Flagship Model

On May 22, Zhipu launched the GLM‑5.1‑highspeed API, delivering 400 tokens per second—about 7× faster than the original model and twice as fast as Gemini 3.5 Flash—through a three‑layer optimization that rewrites the MoE inference path, introduces dynamic scheduling, and leverages TileRT’s AOT engine to cut latency while preserving full flagship capabilities.

GLM-5.1 · Inference Optimization · Large Language Model

0 likes · 10 min read

JavaGuide

May 12, 2026 · Artificial Intelligence

Unlock Claude Code Tokens and Access GLM‑5.1 with iFlytek’s Ultra‑Low‑Cost Coding Plan

The article reviews iFlytek’s Coding Plan, highlighting its three pricing tiers—including a 3.9 CNY entry tier that unlocks GLM‑5.1 for Claude Code—explains how request‑based billing reduces token anxiety, and provides step‑by‑step integration guides for Claude Code, OpenClaw and Cursor along with real‑world skill examples.

AI coding · Claude Code · Cursor

0 likes · 13 min read

AI Engineer Programming

Apr 26, 2026 · Artificial Intelligence

2026 AI Model API Prices – DeepSeek V4 Flash Costs Only 1% of GPT‑5.5

The article provides a detailed April 2026 comparison of API pricing for six major AI model families—including DeepSeek, GLM‑5.1, Kimi, Claude, GPT‑5.5, and Gemini—covering official and proxy channels, context limits, discount periods, peak‑time surcharges, and practical selection recommendations for developers.

AI Model Pricing · Claude · DeepSeek

0 likes · 11 min read

Old Meng AI Explorer

Apr 23, 2026 · Artificial Intelligence

GLM-5.1 vs Qwen3.6 Plus vs MiniMax M2.7: In‑Depth 2026 Review of China’s Top AI Models

This article provides a detailed, data‑driven comparison of three 2026 Chinese flagship large language models—GLM-5.1, Qwen3.6 Plus, and MiniMax M2.7—covering knowledge, math, code, long‑task, multimodal performance, pricing, open‑source status, ecosystem support, and scenario‑based recommendations.

GLM-5.1 · Large Language Model · MiniMax M2.7

0 likes · 12 min read

Lao Guo's Learning Space

Apr 12, 2026 · Industry Insights

How 1/10 Pricing Drives Chinese LLMs to 10× Market Share

The article analyzes how Chinese large language models like GLM‑5.1, Qwen3.6 Plus and Gemma 4 achieve roughly one‑tenth the cost of GPT‑5.4, leading to dramatically higher profit margins, silent migration in Silicon Valley, and a rapid rise in market share backed by a maturing ecosystem.

AI model ecosystem · Chinese LLM · GLM-5.1

0 likes · 10 min read

Coder Circle

Apr 8, 2026 · Industry Insights

GLM‑5.1 Enables 8‑Hour Continuous Operation and Leads SWE‑bench; Tencent Unveils First Open‑Config AI Browser

The AI daily briefing highlights GLM‑5.1’s breakthrough 8‑hour continuous reasoning, its top performance on SWE‑bench, and a 10% price increase. It also covers Tencent’s QBotClaw, described as the first browser in China that lets users freely configure large‑model APIs, signaling a shift toward open AI ecosystems in China.

AI ecosystem · AI pricing · GLM-5.1

0 likes · 6 min read

Baidu Intelligent Cloud Tech Hub

Apr 8, 2026 · Artificial Intelligence

Unlocking 8‑Hour Autonomous Coding: GLM‑5.1’s Leap with Kunlun XPU

The open‑source GLM‑5.1 model, adapted to Baidu Baige's Kunlun XPU via the vLLM‑Kunlun Plugin, delivers record‑breaking SWE‑bench scores, eight‑hour autonomous coding, long‑context handling up to 64K tokens, and scalable deployment across tens of thousands of chips, showcasing end‑to‑end AI acceleration.

GLM-5.1 · Kunlun XPU · Model Deployment

0 likes · 8 min read

AI Engineering

Apr 8, 2026 · Artificial Intelligence

How GLM-5.1 Tops Open‑Source Benchmarks and Generates Articles and Short Videos with a Single Prompt

GLM-5.1, the newly open‑sourced large language model, leads global code‑generation benchmarks, sustains continuous long‑horizon tasks for eight hours, can build a complete Linux desktop in eight hours, and can even create a short video from an article with a single prompt.

Claude Sonnet alternative · GLM-5.1 · benchmark

0 likes · 7 min read

Old Zhang's AI Learning

Apr 8, 2026 · Artificial Intelligence

GLM‑5.1 Outperforms Claude Opus in Benchmarks – The Open‑Source LLM’s Edge

GLM‑5.1, the new 744B‑parameter open‑source LLM from Zhipu, tops SWE‑Bench Pro with a score of 58.4, outpacing Claude Opus, GPT‑5.4 and Gemini, and excels at long‑duration autonomous tasks, yet shows gaps in single‑turn generation and pure mathematical reasoning.

Agent Programming · GLM-5.1 · Model Deployment

0 likes · 22 min read

PaperAgent

Apr 2, 2026 · Artificial Intelligence

Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?

Using the GLM‑5.1 large language model, the author automated the end‑to‑end development of an ontology‑based knowledge‑graph extraction and visualization platform (backend, frontend, and graph database) in just 2 hours 47 minutes, consuming 747k tokens, with the model self‑correcting multiple issues along the way.

AI Engineering · Full-Stack Development · GLM-5.1

0 likes · 12 min read

Su San Talks Tech

Apr 2, 2026 · Artificial Intelligence

How GLM-5.1 Beats Its Predecessor: A Hands‑On Test and Deep Dive

The article presents a detailed, hands‑on evaluation of the newly released GLM‑5.1 model, describing the rollout strategy, step‑by‑step testing on complex coding tasks, configuration details, observed performance improvements over previous versions, and practical guidance for developers seeking to leverage the model for real‑world projects.

AI coding assistant · GLM-5.1 · Large Language Model

0 likes · 9 min read

ShiZhen AI

Mar 28, 2026 · Artificial Intelligence

GLM-5.1 Now Open to All: Performance vs Claude Opus, Pricing & Setup Guide

GLM-5.1 is now available to all Coding Plan subscribers, including the $10/month Lite tier, scoring 45.3 on SWE‑bench—just 5.4% below Claude Opus 4.6’s 47.9—while offering 20+ tool integrations and a manual switch from the default GLM‑4.7 model.

AI coding model · Claude Opus · GLM-5.1

0 likes · 7 min read
