Tagged articles

18 articles

Page 1 of 1

Apr 11, 2026 · Artificial Intelligence

WildClawBench: 60 Real-World Agent Tasks Reveal How Far AI “Lobsters” Have Come

WildClawBench, a 60‑question, Docker‑based benchmark from Shanghai AI Lab’s InternLM team, evaluates AI agents across six multimodal categories, exposing low ceilings for top models like Claude Opus 4.6, highlighting cost‑performance trade‑offs and the rapid rise of Chinese models such as GLM 5.

AI AgentBenchmarkClaude Opus

0 likes · 9 min read

WildClawBench: 60 Real-World Agent Tasks Reveal How Far AI “Lobsters” Have Come

Machine Learning Algorithms & Natural Language Processing

Apr 8, 2026 · Artificial Intelligence

Dissecting Gemma‑4’s Architecture and Training Choices: A Technical Comparison with Qwen‑3 and GLM‑5

This article breaks down every architectural and training decision behind Gemma‑4—KV sharing, p‑RoPE, per‑layer embeddings, and a dual‑path MoE + dense MLP—while contrasting its efficiency and performance with Qwen‑3 and GLM‑5 across benchmarks, quantization strategies, and RL pipelines.

GLM-5Gemma 4LLM architecture

0 likes · 23 min read

Dissecting Gemma‑4’s Architecture and Training Choices: A Technical Comparison with Qwen‑3 and GLM‑5

Frontend AI Walk

Mar 12, 2026 · Artificial Intelligence

Configure OpenClaw Multi‑Agent: GLM‑5, Kimi K2.5, DeepSeek & GLM‑Flash Team

This step‑by‑step tutorial shows how to integrate domestic LLM providers (GLM‑5, GLM‑4.7, GLM‑Flash, Kimi K2.5, DeepSeek, Qwen3‑Coder‑Next, BGE‑M3) into OpenClaw, define model routing, create dedicated controller, writer and coder agents, and run a complete multi‑agent workflow.

AI configurationDeepSeekGLM-5

0 likes · 16 min read

Configure OpenClaw Multi‑Agent: GLM‑5, Kimi K2.5, DeepSeek & GLM‑Flash Team

Frontend AI Walk

Mar 11, 2026 · Artificial Intelligence

OpenClaw Full‑Domestic Model Stack: 6 Role‑Based Selections and Strategies

This guide outlines a role‑based selection strategy for building a fully domestic OpenClaw model stack, explains common pitfalls when replacing foreign models, details why specific Chinese models fit each role, presents three balanced configurations, and offers a step‑by‑step migration plan.

BGE‑M3DeepSeekGLM-5

0 likes · 15 min read

OpenClaw Full‑Domestic Model Stack: 6 Role‑Based Selections and Strategies

Old Zhang's AI Learning

Mar 10, 2026 · Artificial Intelligence

Install AutoClaw in One Minute: Quick Setup for a Local AI Assistant

AutoClaw wraps the open‑source OpenClaw client, turning a half‑day installation into three simple steps—download, install, and auto‑configure—while adding seamless Feishu integration, support for GLM‑5 and pony‑alpha‑2 models, built‑in skills, and security recommendations for custom skill creation.

AI AssistantAutoClawFeishu

0 likes · 6 min read

Install AutoClaw in One Minute: Quick Setup for a Local AI Assistant

Shuge Unlimited

Mar 5, 2026 · Artificial Intelligence

How to Build a Fully Automated Content Production Team with OpenClaw and 4 AI Agents

This guide shows how to use OpenClaw to create a four‑agent AI team—Content Lead, Topic Researcher, Content Writer, and Content Editor—connected by an Orchestrator‑Worker architecture, passing file paths, returning standardized JSON, and handling configuration, troubleshooting, and cost‑optimisation for end‑to‑end content automation.

AI agentsContent AutomationFeishu integration

0 likes · 22 min read

How to Build a Fully Automated Content Production Team with OpenClaw and 4 AI Agents

AI2ML AI to Machine Learning

Feb 24, 2026 · Artificial Intelligence

Optimizing Structured Processes in the Large‑Model Era: From Reasoning to Agentic RL

The article analyzes how large‑model development has moved from reasoning to the agentic stage, compares open‑source and closed‑source capabilities, details Reasoning RL versus Agentic RL designs, and proposes skill‑centric data and verification mechanisms to close the performance gap.

DeepSeekGLM-5Large Language Models

0 likes · 10 min read

Optimizing Structured Processes in the Large‑Model Era: From Reasoning to Agentic RL

SuanNi

Feb 23, 2026 · Artificial Intelligence

How GLM‑5 Breaks New Ground with Sparse Attention and Asynchronous RL

GLM‑5, the 744‑billion‑parameter open‑source LLM, introduces DeepSeek Sparse Attention, Multi‑latent Attention, Muon Split optimizer, and a fully asynchronous agentic reinforcement‑learning framework, achieving state‑of‑the‑art performance on long‑context, code, math, and multimodal benchmarks while running efficiently on domestic Chinese chips.

BenchmarkGLM-5Sparse Attention

0 likes · 12 min read

How GLM‑5 Breaks New Ground with Sparse Attention and Asynchronous RL

Old Zhang's AI Learning

Feb 19, 2026 · Artificial Intelligence

Inside GLM-5: Training Techniques, Architecture Innovations, and Benchmark Performance

The article dissects GLM-5’s 744B‑parameter MoE design, 28.5 T token training corpus, novel Muon Split and MLA‑256 optimizations, DSA sparse attention, a fully asynchronous RL pipeline, extensive domestic chip adaptation, and benchmark results that place it on par with Claude Opus 4.5 and ahead of Gemini 3 Pro.

AI ArchitectureBenchmarkDSA

0 likes · 13 min read

Inside GLM-5: Training Techniques, Architecture Innovations, and Benchmark Performance

Shuge Unlimited

Feb 13, 2026 · Artificial Intelligence

Which Chinese Open‑Source LLM Wins the Tech‑Selection Battle: GLM‑5, MiniMax‑M2.1 or Kimi‑K2.5?

The article evaluates three Chinese open‑source large language models—GLM‑5, MiniMax‑M2.1 and Kimi‑K2.5—for use with the OpenClaw AI‑Agent gateway, comparing core specifications, programming and agent benchmarks, multimodal abilities, deployment costs, and scenario‑specific recommendations, while also sharing practical pitfalls.

Agent SwarmGLM-5Kimi-K2.5

0 likes · 16 min read

Which Chinese Open‑Source LLM Wins the Tech‑Selection Battle: GLM‑5, MiniMax‑M2.1 or Kimi‑K2.5?

Old Zhang's AI Learning

Feb 12, 2026 · Artificial Intelligence

Testing the World's Most Powerful Open‑Source LLM: GLM‑5, Local Deployment & Free Ollama Cloud

The article evaluates GLM‑5, the claimed strongest open‑source large language model, comparing its benchmark scores to Claude Opus, Gemini and GPT, detailing its DeepSeek‑inspired architecture, quantized FP8 deployment requirements, and step‑by‑step usage of Ollama’s free cloud model with Agent, data‑analysis and document‑generation features.

AI benchmarkingData AnalysisGLM-5

0 likes · 7 min read

Testing the World's Most Powerful Open‑Source LLM: GLM‑5, Local Deployment & Free Ollama Cloud

Baidu Intelligent Cloud Tech Hub

Feb 12, 2026 · Artificial Intelligence

Deploying GLM-5 on Baidu Kunlun P800 XPU with vLLM‑Kunlun Plugin

This article explains how Baidu's new GLM-5 large model is adapted to the Kunlun P800 XPU, detailing the async reinforcement learning framework Slime, optimization techniques like INT8 quantization and tensor‑parallelism, and provides step‑by‑step deployment commands using the open‑source vLLM‑Kunlun plugin.

AI accelerationGLM-5INT8 Quantization

0 likes · 6 min read

Deploying GLM-5 on Baidu Kunlun P800 XPU with vLLM‑Kunlun Plugin

AI Insight Log

Feb 12, 2026 · Artificial Intelligence

GLM-5 Unveiled: 744B Parameters, Claude Opus 4.5‑Level Performance, Epic Agent Upgrade

Z.ai released the open‑source GLM‑5 model with 744 billion parameters, 28.5 T tokens of training data, and new Sparse Attention and Slime RL infrastructure, achieving top open‑source rankings and near‑Claude Opus 4.5 performance on Vending Bench 2 and CC‑Bench‑V2 while adding multi‑scenario agent capabilities.

BenchmarkGLM-5Large Language Model

0 likes · 6 min read

GLM-5 Unveiled: 744B Parameters, Claude Opus 4.5‑Level Performance, Epic Agent Upgrade

Baobao Algorithm Notes

Feb 12, 2026 · Artificial Intelligence

GLM-5 Unleashed: How the New Chinese LLM Tackles Full‑Stack Architecture and Complex System Design

The article reviews the newly released GLM-5 model, highlighting its ability to generate end‑to‑end system designs, write and debug backend code, and solve large‑scale engineering problems through detailed prompts, positioning it alongside GPT‑5.3 and Claude Opus in the competitive LLM landscape.

AI programmingBackend ArchitectureGLM-5

0 likes · 9 min read

GLM-5 Unleashed: How the New Chinese LLM Tackles Full‑Stack Architecture and Complex System Design

AI Engineering

Feb 12, 2026 · Artificial Intelligence

GLM-5 Unveiled: 744B‑Parameter Model Takes on Claude in Complex Tasks

GLM-5, the new 744‑billion‑parameter open‑source LLM, expands on GLM‑4.5 with GlmMoeDsa architecture, achieves higher HLE benchmark scores than Claude Opus 4.5, demonstrates strong long‑context and agent capabilities, supports vLLM/SGLang, runs on various Chinese chips, and can directly generate Office documents.

AI benchmarksChinese chipsClaude

0 likes · 5 min read

GLM-5 Unveiled: 744B‑Parameter Model Takes on Claude in Complex Tasks

Open Source Tech Hub

Feb 12, 2026 · Artificial Intelligence

How GLM-5 Advances AI with Bigger Scale, Sparse Attention, and Agent Capabilities

GLM-5, a new large language model with 744 B parameters and 28.5 T tokens of training data, introduces DeepSeek sparse attention and an asynchronous RL system called slime, delivering strong benchmark gains on complex system engineering, long‑horizon agent tasks, and surpassing many open‑source competitors.

AIBenchmarkingGLM-5

0 likes · 6 min read

How GLM-5 Advances AI with Bigger Scale, Sparse Attention, and Agent Capabilities

Machine Learning Algorithms & Natural Language Processing

Feb 10, 2026 · Artificial Intelligence

Inside GLM-5: 745B Parameters, DeepSeek‑style Sparse Attention, and a 60% Stock Surge

The GLM-5 architecture, uncovered from a GitHub PR, doubles the previous model to 745 B parameters, adopts DeepSeek‑V3 sparse attention and multi‑token prediction, features a 78‑layer MoE with 256 experts, supports a 202K‑token context window, and its rumored test model "Pony Alpha" sparked a 60% rise in Zhipu AI's stock amid a crowded AI release season.

AI Stock ImpactDeepSeekGLM-5

0 likes · 6 min read

Inside GLM-5: 745B Parameters, DeepSeek‑style Sparse Attention, and a 60% Stock Surge

Old Zhang's AI Learning

Feb 9, 2026 · Artificial Intelligence

GLM-5 Emerges First, Built on DeepSeek Tech, Triggering a 40% Stock Surge

An anonymous OpenRouter model dubbed "Pony Alpha" was verified as the new 745B‑parameter GLM-5, which reuses DeepSeek‑V3 architecture, supports sparse attention and multi‑token prediction, and has already caused a near‑40% jump in Zhipu AI’s stock while hinting at upcoming integration into the Transformers library.

DeepSeekGLM-5Large Language Model

0 likes · 3 min read

GLM-5 Emerges First, Built on DeepSeek Tech, Triggering a 40% Stock Surge