Tagged articles

674 articles

Apr 19, 2026 · Artificial Intelligence

Understanding the Key Differences Between Large Model Pretraining and Fine‑Tuning

The article explains how pretraining on massive generic data creates a reusable base model, while fine‑tuning uses smaller, high‑quality task‑specific data to adapt the model, covering objectives, data scale, cost, methods, and why most projects prefer fine‑tuning.

Large Language Model · LoRA · PEFT

0 likes · 6 min read

SuanNi

Apr 18, 2026 · Artificial Intelligence

How GPT‑Rosalind Is Accelerating Drug Discovery with AI

OpenAI's GPT‑Rosalind model, designed for chemistry and genomics, demonstrates superior performance on scientific benchmarks, outperforms human experts, offers a rich plugin ecosystem, and implements strict access controls to help accelerate early-stage drug research while ensuring responsible AI use in life sciences.

AI governance · Artificial Intelligence · Large Language Model

0 likes · 10 min read

Old Zhang's AI Learning

Apr 18, 2026 · Artificial Intelligence

NVIDIA Nemotron 3 Super: 7× Faster Than Qwen3.5 – Inside Hybrid Mamba‑Attention, LatentMoE, and MTP

NVIDIA’s Nemotron 3 Super, a 120.6 B‑parameter flagship model supporting 1 M‑token context, combines Hybrid Mamba‑Attention, LatentMoE, and Multi‑Token Prediction to achieve up to 7.5× higher inference throughput than Qwen3.5 while matching or surpassing its accuracy across a range of benchmarks.

Hybrid Mamba-Attention · Large Language Model · LatentMoE

0 likes · 11 min read

AI Large-Model Wave and Transformation Guide

Apr 18, 2026 · Artificial Intelligence

Does Qwen3.6‑35B‑A3B Really Outclass All AI Coding Models? Inside the Benchmark Breakdown

Qwen3.6‑35B‑A3B, a mixture‑of‑experts model that activates only 3 B parameters, outperforms leading AI systems across SWE‑bench, Terminal‑Bench, NL2Repo and several agentic coding benchmarks, while also achieving top scores in GPQA, HMMT and RealWorldQA, prompting a reassessment of domestic LLM capabilities.

AI coding · Agentic Coding · Chinese AI

0 likes · 7 min read

Wuming AI

Apr 16, 2026 · Artificial Intelligence

Why Claude Opus 4.7 Is Shifting From Smart Answers to Real Work Execution

Anthropic’s Claude Opus 4.7 moves the competition from raw cleverness to reliable task completion, boosting complex coding, long‑running agents, high‑resolution visual understanding, stricter instruction following, and safety guardrails, while urging developers to retest prompts, budgets, and real‑world workflows.

AI · Agent · Large Language Model

0 likes · 11 min read

SuanNi

Apr 16, 2026 · Artificial Intelligence

Claude Opus 4.7 Unleashed: How Anthropic’s New Model Automates Complex Tasks

Anthropic’s latest Claude Opus 4.7 model introduces autonomous task execution via Routines, enhanced code review with /ultrareview, higher-resolution visual input, and significant performance gains across knowledge work, vision, and long‑context reasoning, while adding safety guardrails, a new xhigh compute tier, and unchanged pricing.

AI automation · Anthropic · Claude Opus

0 likes · 6 min read

AI Explorer

Apr 16, 2026 · Artificial Intelligence

Claude Opus 4.7: How Anthropic’s New Model Makes AI Programming Autonomous

Anthropic’s Claude Opus 4.7, released on April 16, 2026, boosts visual resolution threefold, adds self‑verifying programming ability, delivers strong benchmark gains across code review, data analysis, legal and financial tasks, and introduces new inference tiers and security controls, reshaping AI‑assisted software development.

AI programming · Anthropic · Claude Opus 4.7

0 likes · 11 min read

AI Code to Success

Apr 16, 2026 · Artificial Intelligence

Master Claude Code’s 1M‑Token Context: Proven Strategies to Manage, Compact, and Rewind

Claude Code now supports a 1 million‑token context window, but effective use hinges on disciplined context management—choosing when to continue, rewind, clear, compact, or delegate to sub‑agents, and applying three core concepts of context windows, compaction, and context rot to avoid performance pitfalls.

AI workflow · Claude · Context Management

0 likes · 10 min read

Lao Guo's Learning Space

Apr 16, 2026 · Artificial Intelligence

Why Alibaba Unveiled Three New LLMs in One Week—and What It Means for China’s AI Landscape

In the first week of April 2026, Alibaba’s Tongyi Lab launched three purpose‑built large language models—Qwen3.6-Plus for programming, Qwen3.5-Omni for multimodal tasks, and Qwen3 Coder Next for repository‑level coding—illustrating a strategic shift from pure benchmark races to targeted, cost‑effective deployment across distinct AI battlefields.

Alibaba · Large Language Model · Qwen3-Coder-Next

0 likes · 15 min read

AI Large-Model Wave and Transformation Guide

Apr 16, 2026 · Artificial Intelligence

How MiniMax M2.7 Is Pioneering Self‑Evolving AI Models

MiniMax’s open‑source M2.7 model, released in April 2026, demonstrates the first self‑evolving AI agent that autonomously updates its memory, learns new skills, and optimizes its own training loop, achieving up to 30% performance gains and leading benchmark scores across programming, ML automation, and productivity tasks.

Large Language Model · Open Source · agentic AI

0 likes · 9 min read

Machine Learning Algorithms & Natural Language Processing

Apr 15, 2026 · Artificial Intelligence

Industrial Code LLM Learns to Think Before Writing – InCoder-32B Thinking Tackles Verilog and CUDA Pitfalls

The article analyzes InCoder-32B Thinking, an industrial‑code large language model that incorporates error‑driven chain‑of‑thought and an Industrial Code World Model to predict execution outcomes, adapt reasoning depth, and achieve high accuracy across diverse hardware‑centric benchmarks.

CUDA · Large Language Model · Verilog

0 likes · 7 min read

AI Explorer

Apr 14, 2026 · Artificial Intelligence

Anthropic’s Mythos Model Stuns in 100 Prototype Tests, Surpassing Expectations

Anthropic’s newly unveiled Mythos model surprised its creators by outperforming expectations across more than 100 diverse product‑prototype tests, highlighting emergent capabilities, a strategic shift toward real‑world applicability, and potential implications for AI safety, competition, and industry adoption.

AI competition · AI emergence · Anthropic

0 likes · 6 min read

Geek Labs

Apr 12, 2026 · Artificial Intelligence

How Open-Source Persona Distillation Skills Enable AI to Mimic Human Thought

The article introduces the open‑source "awesome‑persona‑distill‑skills" library, explains the concept of persona distillation, details its Agent Skills‑based architecture, showcases concrete Jobs and Zhang Xuefeng skill outputs, and outlines five skill categories and usage instructions.

AI · Agent Skills · Large Language Model

0 likes · 8 min read

AI Explorer

Apr 11, 2026 · Artificial Intelligence

How Kronos Redefines Quantitative Analysis with a Financial‑Market Language Model

Kronos, an open‑source large model trained on OHLCV data from over 45 exchanges, treats financial time‑series as a specialized language, using a custom tokenizer and a two‑stage Transformer to enable price prediction, market state detection, signal generation, and risk simulation, with easy Hugging Face integration and a live demo for BTC/USDT.

Kronos · Large Language Model · Open Source

0 likes · 6 min read

AI Architect Hub

Apr 10, 2026 · Artificial Intelligence

How to Build an AI‑Powered WeChat Article Automation Workflow with Prompt Engineering

This guide walks through creating a fully automated WeChat public‑account article publishing pipeline using large‑model prompt engineering, covering token retrieval, title generation, subtitle creation, hand‑drawn comic generation, content formatting, image handling, and final draft publishing with detailed code snippets.

AI · JavaScript · Large Language Model

0 likes · 11 min read

Old Meng AI Explorer

Apr 9, 2026 · Artificial Intelligence

Why Anthropic’s Claude Mythos Is So Powerful It Won’t Be Publicly Released

Anthropic’s Claude Mythos preview, a model that outperforms its predecessor across multiple benchmarks, is being kept under wraps due to its dual‑use capabilities that combine unprecedented AI performance with dangerous autonomous vulnerability‑exploitation potential, prompting a safety‑first rollout and industry‑wide security concerns.

AI benchmarking · AI safety · Anthropic

0 likes · 8 min read

AI Software Product Manager

Apr 8, 2026 · Artificial Intelligence

Unlocking ByteDance’s Agent Platform: How LLMs, Coze Plugins, and Trae Accelerate AI Development

This article outlines ByteDance’s Agent concept, explains the role of large language models such as Doubao‑Seed‑1.6, describes how the Coze plugin marketplace and the Trae development environment simplify building intelligent agents, and presents the talent capability model required for successful Agent engineering.

AI development · Agent · Coze

0 likes · 11 min read

HyperAI Super Neural

Apr 8, 2026 · Artificial Intelligence

One‑Click Deploy Gemma‑4‑31B with 256K Context, Matching Qwen 3.5 397B Performance

HyperAI’s tutorial lets developers instantly launch the open‑source Gemma‑4‑31B model—supporting multimodal input, up to 256 K token context and over 140 languages—through a one‑click deployment on RTX 6000 or RTX 5090 GPUs, with detailed step‑by‑step instructions and optional compute credits.

256K context · Gemma-4-31B · HyperAI

0 likes · 5 min read

Design Hub

Apr 8, 2026 · Artificial Intelligence

Why Anthropic’s Most Powerful Model Mythos Is Locked Away from the Public

Anthropic’s Mythos Preview, touted as its strongest frontier model with dramatic gains in vulnerability discovery and complex system analysis, is being released only to a handful of security partners, sparking debate over high‑risk capabilities, “ability‑sequestered” deployment, and the future of AI model governance.

AI safety · Anthropic · Large Language Model

0 likes · 13 min read

ShiZhen AI

Apr 8, 2026 · Artificial Intelligence

Why Anthropic’s Claude Mythos Preview Is Too Powerful to Sell

Anthropic’s Claude Mythos Preview uncovered thousands of zero‑day bugs across major operating systems and browsers, outperformed all benchmark suites, and is being kept out of the public market in favor of an exclusive Project Glasswing partnership with twelve tech giants.

AI security · Anthropic · Benchmark performance

0 likes · 11 min read

Lao Guo's Learning Space

Apr 8, 2026 · Artificial Intelligence

2026 Qwen Model Comparison: Choose the Right Qwen for Your Mac Studio

An in‑depth 2026 comparative review of Alibaba’s Qwen series (Qwen2.5, Qwen3, Qwen3.5) evaluates architecture, performance, speed and VRAM usage on Mac Studio, ranks each variant, and provides concrete model‑selection guidance for different memory configurations, highlighting the MoE‑based Qwen3.5 as the optimal choice.

AI Performance · Large Language Model · Mac Studio

0 likes · 9 min read

AI Insight Log

Apr 7, 2026 · Artificial Intelligence

Anthropic Unveils ‘Too Powerful to Release’ Mythos Model; Apple, Microsoft, Google Join Security Alliance

Anthropic released the Claude Mythos Preview, a model that outperforms Claude Opus 4.6 on multiple software‑engineering benchmarks and uncovers thousands of high‑severity vulnerabilities, while forming the Project Glasswing alliance with twelve tech giants to safeguard critical software infrastructure, yet keeping the model closed to the public.

AI security · Anthropic · Large Language Model

0 likes · 8 min read

AI Programming Lab

Apr 5, 2026 · Artificial Intelligence

Do You Really Understand Tokens? A Deep Dive Starting from a Claude Code Session

The article explains what tokens are, how different models tokenize text, the role of token embeddings, positional encoding, self‑attention, KV cache, and why output tokens cost far more than input tokens, while also covering pricing differences and prompt‑caching savings across major LLM providers.

KV Cache · LLM pricing · Large Language Model

0 likes · 13 min read

Machine Heart

Apr 3, 2026 · Artificial Intelligence

Kimi’s ‘Option Time Machine’: Interns Gain Equity While Building Cutting‑Edge AI

Kimi, a three‑year‑old AI‑native unicorn valued over $120 billion, launches a “Time‑Machine” option program that grants interns equity while showcasing its rapid valuation growth, record‑breaking context lengths, novel Kimi Linear architecture, token‑efficiency gains, and open‑source models that rival leading LLMs.

AI Talent Program · Agent Swarms · Attention Residuals

0 likes · 10 min read

AI Engineering

Apr 3, 2026 · Artificial Intelligence

Gemma 4: Native Multimodal Model That Packs Large‑Model Performance into a Small Footprint

Google DeepMind's Gemma 4 family introduces four open‑source models—including a 31B dense and a 26B MoE variant with 256K context—that deliver multimodal capabilities, tool‑use functions, and benchmark results rivaling much larger models while running on a single H100 GPU.

256K context · Apache-2.0 · Gemma 4

0 likes · 5 min read

SuanNi

Apr 2, 2026 · Artificial Intelligence

How Alibaba’s New Qwen3.5‑Omni, Wan2.7‑Image, and Qwen3.6‑Plus Redefine Multimodal AI

Alibaba unveiled three cutting‑edge models—Qwen3.5‑Omni with native multimodal interaction, Wan2.7‑Image for high‑precision image generation and editing, and Qwen3.6‑Plus boosting coding agent performance—each achieving dozens of SOTA benchmarks, massive context windows, and novel capabilities such as Audio‑Visual Vibe Coding and transparent layer separation.

AI · Coding Agent · Large Language Model

0 likes · 7 min read

Su San Talks Tech

Apr 2, 2026 · Artificial Intelligence

How GLM-5.1 Beats Its Predecessor: A Hands‑On Test and Deep Dive

The article presents a detailed, hands‑on evaluation of the newly released GLM‑5.1 model, describing the rollout strategy, step‑by‑step testing on complex coding tasks, configuration details, observed performance improvements over previous versions, and practical guidance for developers seeking to leverage the model for real‑world projects.

AI coding assistant · GLM-5.1 · Large Language Model

0 likes · 9 min read

Machine Heart

Mar 31, 2026 · Artificial Intelligence

What Does DeepResearch Bench Measure? Toward Human‑Level AI Agent Evaluation

The DeepResearch Bench and Bench II, open‑source benchmarks from the USTC team, evaluate deep‑research AI agents on report quality, citation reliability, and information recall using the RACE and FACT frameworks, aiming to align automated scores with human expert judgments.

AI Agent Evaluation · DeepResearch Bench · FACT

0 likes · 12 min read

Old Zhang's AI Learning

Mar 31, 2026 · Artificial Intelligence

Turning a Bluetooth Speaker into a Smart Assistant with Qwen 3.5‑Omni

The author demonstrates a proof‑of‑concept that combines Qwen 3.5‑Omni's real‑time internet search and audio output with a locally hosted voice‑wake‑up model to transform a Bluetooth speaker into an always‑on smart assistant, while noting latency challenges and the potential of a sub‑10B open‑source alternative.

AI integration · Bluetooth · Large Language Model

0 likes · 2 min read

AI Engineering

Mar 31, 2026 · Artificial Intelligence

Qwen3.5-Omni Introduces Audio‑Visual Vibe Coding: Code by Speaking and Gesturing

Alibaba's newly released Qwen3.5-Omni multimodal model adds an Audio‑Visual Vibe Coding feature that lets users describe a website or game with speech and gestures to generate code, while offering advanced audio comprehension, long‑duration media support, multilingual capabilities, fine‑grained voice control, and voice cloning, though its weights remain closed‑source.

AI · Alibaba · Audio-Visual Vibe Coding

0 likes · 3 min read

Machine Heart

Mar 30, 2026 · Artificial Intelligence

Echo: A Small Step for Predictive AI, a Giant Leap Toward General Intelligence

The Echo system from UniPat AI introduces a fully integrated predictive‑intelligence infrastructure—including a dynamic evaluation engine, a Train‑on‑Future training paradigm, and the EchoZ‑1.0 model—that outperforms leading LLMs and human traders on a comprehensive AI Prediction Leaderboard, while offering transparent, reproducible benchmarks.

Dynamic Evaluation · Elo ranking · Large Language Model

0 likes · 14 min read

ShiZhen AI

Mar 27, 2026 · Artificial Intelligence

Anthropic’s Secret ‘Capybara’ Model Leaked: So Powerful Even the Company Hesitates to Release It

A CMS misconfiguration exposed Anthropic’s unreleased Claude Mythos model, codenamed Capybara, revealing its unprecedented cybersecurity capabilities, massive scale, and the company’s cautious rollout strategy amid fierce competition from OpenAI and Google.

AI competition · AI security · Anthropic

0 likes · 6 min read

AgentGuide

Mar 27, 2026 · Artificial Intelligence

What Are Skills in LLM Agents? How They Work and When to Use Them

The article defines Skills as structured local folders that encapsulate domain‑specific processes, knowledge, and tools for large language models, contrasts them with temporary Prompts, outlines suitable use cases, details their components, and explains their on‑demand loading mechanism that saves tokens.

Large Language Model · On-demand Loading · agent development

0 likes · 4 min read

AI Engineer Programming

Mar 25, 2026 · Artificial Intelligence

What Is an AI Agent? Definition, Core Capabilities, and Architecture

The article explains AI agents as autonomous systems that perceive environments, plan, use tools, iterate through action loops, and self‑reflect, contrasting them with traditional chatbots and workflows, and outlines their core abilities, memory types, tool‑use mechanisms, and single‑ versus multi‑agent architectures.

AI agent · Large Language Model · Memory

0 likes · 8 min read

Machine Learning Algorithms & Natural Language Processing

Mar 24, 2026 · Artificial Intelligence

China’s Tech Circle Wars Over the Chinese Name for AI Tokens – Trends and Aesthetics

Amid a heated debate over the proper Chinese translation of “Token,” China’s AI community examines the term’s technical origins, massive global consumption—30 trillion daily tokens worldwide, 4.69 trillion from China alone—and its economic impact, while proposing names like CiYuan, MoYuan, and ZhiYuan to reflect cultural aesthetics.

Chinese Naming · Industry Insight · Large Language Model

0 likes · 12 min read

Geek Labs

Mar 24, 2026 · Industry Insights

9 Must‑See GitHub Projects: MacBook‑Run LLM, WeChat AI, Multi‑Agent Collaboration and More

This article reviews nine standout GitHub open‑source projects, covering a C/Metal LLM engine for MacBooks, a Claude Code commercial‑analysis skill, multi‑agent communication tools, web‑enabled AI, autonomous research automation, WeChat AI integration, a minimalist terminal, a Codex console, and a lightweight WARP proxy.

AI · Docker · GitHub

0 likes · 10 min read

AI Open-Source Efficiency Guide

Mar 24, 2026 · Artificial Intelligence

12 Practical AI Prompt Templates for Everyday Work (with Examples)

This guide presents twelve ready‑to‑use AI prompt templates covering single‑task queries, business writing, multi‑step projects, creative branding, logical reasoning, structured outputs, code editing, autonomous agents, image generation, and more, each illustrated with concrete examples.

AI · Large Language Model · prompt engineering

0 likes · 16 min read

Weekly Large Model Application

Mar 22, 2026 · Artificial Intelligence

Inside MiMo-Audio: Dissecting the Large-Scale Audio Model

The article breaks down MiMo-Audio, a next‑token‑prediction‑style large‑scale audio model built on Qwen2, detailing its acoustic front‑end, RVQ tokenizer, patch‑based transformer architecture, streaming capabilities, performance advantages, engineering constraints, and recommended application scenarios.

Audio Modeling · Few-shot · Large Language Model

0 likes · 9 min read

AgentGuide

Mar 22, 2026 · Artificial Intelligence

How to Design Prompt Engineering in Your Project: A Complete Workflow

The article outlines a systematic Prompt Engineering process that starts with defining task goals and metrics, structures prompts into modular components, uses offline evaluation and bad‑case analysis, incorporates RAG or tools when needed, and continuously monitors accuracy, hallucination, latency and cost.

AI workflow · Few-shot · Large Language Model

0 likes · 7 min read

DataFunTalk

Mar 22, 2026 · Artificial Intelligence

Why Cursor’s Composer 2 Beats Claude Opus 4.6 in Performance and Price

Cursor’s new Composer 2 programming model outperforms Claude Opus 4.6 on benchmarks like Terminal‑Bench 2.0 and SWE‑bench Multilingual, while slashing token costs to $0.5/M input and $2.5/M output, thanks to a novel self‑summary reinforcement‑learning technique that enables efficient long‑context processing.

AI · Large Language Model · pricing

0 likes · 8 min read

PaperAgent

Mar 22, 2026 · Artificial Intelligence

How AI Agents Like OpenClaw Turn LLMs into Autonomous Assistants

This article explains what AI agents are, how they differ from ordinary language‑model interfaces, and walks through OpenClaw’s workflow, tool usage, security challenges, memory handling, and advanced features such as sub‑agents and context compaction, offering practical insights for building safe autonomous AI systems.

AI agent · Context Engineering · Large Language Model

0 likes · 27 min read

AI Product Manager Community

Mar 21, 2026 · Artificial Intelligence

Mastering AI Agents: From Core Concepts to Enterprise Deployment

This article provides a comprehensive, structured overview of AI agents, covering their fundamental definitions, core architecture (LLM, planning, memory, tool use), evolution from chatbots, the ReAct reasoning framework, multi‑agent systems, safety challenges like hallucination and prompt‑injection, and practical strategies for production‑grade deployment.

AI agent · Large Language Model · ReAct

0 likes · 16 min read

Black & White Path

Mar 21, 2026 · Artificial Intelligence

Japan’s ‘Self‑Developed’ 700B AI Model: A DeepSeek Re‑skin Flop

Rakuten AI 3.0 was billed as Japan’s largest, self‑developed 700‑billion‑parameter model backed by government funds, but a quick look at its Hugging Face config reveals it merely re‑uses DeepSeek V3, prompting a broader critique of the hype, funding motives, and strategic trade‑offs behind the launch.

AI Industry Analysis · DeepSeek · Government funding

0 likes · 5 min read

Model Perspective

Mar 20, 2026 · Artificial Intelligence

How to Build a No‑Code AI Agent for Fast Book Summarization

This article walks through the design and implementation of a no‑code AI reading agent that parses, splits, and summarizes books chapter by chapter, explaining why the tool serves as a pre‑reading filter rather than a replacement for deep study.

AI · Large Language Model · Reading Efficiency

0 likes · 10 min read

Machine Learning Algorithms & Natural Language Processing

Mar 18, 2026 · Artificial Intelligence

Get the Difference Between Skills, MCP, Agent, and OpenClaw in 3 Minutes

In just three minutes, this article explains how an autonomous AI Agent (like Vision) differs from its Skills (capabilities), the universal MCP protocol that connects it to software, and the OpenClaw framework that assembles them, using clear Marvel‑based analogies.

AI agent · Framework · Large Language Model

0 likes · 5 min read

HyperAI Super Neural

Mar 18, 2026 · Artificial Intelligence

How Google’s Gemini Extracted 2.6 Million Flood Events from 150 Countries’ News

Google Research released the open‑source Groundsource flood dataset, built by automatically processing more than 5 million news articles from over 150 countries with the Gemini large‑language model, yielding over 2.6 million verified flood event records that are evaluated against GDACS and DFO for precision, recall, and spatial resolution.

AI extraction · Google · Groundsource

0 likes · 13 min read

AIWalker

Mar 17, 2026 · Artificial Intelligence

How a 4B-Parameter Open-Source Model Outperforms 14B Multimodal Giants

InternVL-U, a 4‑billion‑parameter unified multimodal model released as open source, combines a 2B MLLM backbone with a 1.7B visual generation head and, through a reasoning‑centric data pipeline and Chain‑of‑Thought guidance, achieves superior understanding, generation, and editing performance that surpasses much larger 14‑20B models on multiple benchmarks.

AI research · InternVL-U · Large Language Model

0 likes · 22 min read

AI Insight Log

Mar 16, 2026 · Artificial Intelligence

Cursor’s Own Large‑Model Benchmark Shakes Up SWE‑bench Rankings

Although SWE‑bench scores for top coding models now differ by only a tenth of a point, Cursor’s newly released CursorBench reveals dramatic ranking changes, highlights three fundamental flaws in public benchmarks, and introduces token‑efficiency as a crucial evaluation dimension.

AI coding · CursorBench · Large Language Model

0 likes · 8 min read

PaperAgent

Mar 16, 2026 · Artificial Intelligence

How GLM-5-Turbo Turns an AI Research Lab into a 24‑Hour Autonomous Writer

The article details how the newly released GLM-5-Turbo "lobster" model powers an AI research Lab that automatically generates a complete OpenClaw survey paper—from topic brainstorming and literature mining to outline drafting, manuscript writing, and AAAI‑style submission—within an hour, showcasing benchmark results, prompt templates, and practical skill installations.

AI research automation · AutoClaw · GLM-5-Turbo

0 likes · 10 min read

IT Services Circle

Mar 15, 2026 · Artificial Intelligence

How PinchBench Ranks OpenClaw AI Agents Across Real‑World Tasks

The article explains OpenClaw’s rapid rise and the emerging on‑site installation business, introduces the open‑source PinchBench benchmark that evaluates large language models as OpenClaw agents on 23 real‑world tasks, presents recent ranking results, and provides step‑by‑step instructions for running the benchmark and submitting results.

AI agent · Large Language Model · OpenClaw

0 likes · 5 min read

Bighead's Algorithm Notes

Mar 14, 2026 · Artificial Intelligence

Quantitative Finance Paper Digest: AI‑Driven Market Prediction Studies (Mar 7‑13 2026)

This digest summarizes four recent research papers that apply advanced AI techniques—node‑transformer graphs with BERT sentiment analysis, a quantum‑classical LSTM‑Born machine hybrid, large‑language‑model benchmarking for portfolio optimization, and a conditional diffusion model—to improve stock market prediction, volatility forecasting, and investment decision making, providing detailed experimental results and statistical validation.

BERT · Large Language Model · Quantum Computing

0 likes · 10 min read

AI Explorer

Mar 14, 2026 · Artificial Intelligence

Claude’s 1M‑Token Context Window Launches with No Premium Pricing

Anthropic’s Claude Opus 4.6 and Sonnet 4.6 now offer a full‑million‑token context window at the same per‑token price as short‑context usage, delivering top‑ranked MRCR v2 performance, six‑fold media capacity, and reduced AI‑Agent memory compression without any code changes across all major cloud platforms.

AI agent · Anthropic · Claude

0 likes · 6 min read

Data Party THU

Mar 12, 2026 · Artificial Intelligence

Can a 30B LLM Truly Conduct Autonomous Scientific Research? Inside UniScientist

UniScientist, a 30‑billion‑parameter open‑source model from UniPat AI, demonstrates a closed‑loop scientific research workflow—generating hypotheses, gathering evidence, performing reproducible derivations, and iteratively refining conclusions—while achieving benchmark scores comparable to much larger proprietary systems across multiple scientific evaluation suites.

Large Language Model · benchmarking · scientific research

0 likes · 10 min read

AI2ML AI to Machine Learning

Mar 10, 2026 · Artificial Intelligence

How Anthropic and Palantir Collaborate on Modern Warfare Information Mining

The article analyzes Palantir's ontology-driven knowledge graph dominance, its shift from graph to vector databases, the three‑layer partnership with Anthropic and AWS, the Digital Twin scaling law, and the technical challenges of data heterogeneity, scaling uncertainty, annotation scarcity, and real‑time computation in modern warfare information mining.

Anthropic · Digital Twin · Large Language Model

0 likes · 9 min read

SuanNi

Mar 9, 2026 · Artificial Intelligence

How UniScientist Beats GPT‑5.4 on FrontierScience Benchmarks

UniScientist, a 30B‑parameter AI model co‑developed by UniPat AI and Peking University, leverages a meticulously curated scientific dataset and a powerful code interpreter to achieve 33.3% success on the FrontierScience‑Research benchmark, surpassing the newly released GPT‑5.4 and demonstrating superior multi‑disciplinary research capabilities.

AI · Large Language Model · dataset

0 likes · 12 min read

Design Hub

Mar 6, 2026 · Artificial Intelligence

How Powerful Is GPT‑5.4? A Deep Dive Into Its Design‑Focused Capabilities

OpenAI's GPT‑5.4 combines a 1 M‑token context window, native computer‑use, and benchmark‑leading performance—outperforming humans on 83 % of tasks and cutting token usage by 47 %—while showcasing demos that let designers generate games, websites, and 3D assets in a single prompt.

AI agents · Computer Use · GPT-5.4

0 likes · 7 min read

DataFunTalk

Mar 6, 2026 · Artificial Intelligence

Why GPT‑5.4 Beats Its Predecessors: Code Power, World Knowledge, and New Agent Features

The article reviews the GPT‑5.4 release, comparing its coding ability, world knowledge, and multimodal understanding with Claude Opus 4.6 and GPT‑5.3‑Codex. It presents benchmark scores (GDPval 83%, SWE‑Bench 57.7%, OSWorld 75%, Toolathlon 54.6%), highlights new features such as a 1‑million‑token context window, native computer use, and tool‑search optimization, and discusses pricing and practical usage in OpenClaw.

AI agents · GPT-5.4 · Large Language Model

0 likes · 12 min read

Xiaomi Tech

Mar 6, 2026 · Artificial Intelligence

Xiaomi Miclaw: Mobile AI Agent Enters Small‑Scale Closed Beta

Xiaomi Miclaw, an AI agent built on the MiMo large model, launches a limited closed beta to demonstrate system‑level tool access, multi‑turn context management, IoT ecosystem integration, and self‑evolution capabilities while emphasizing data security and user‑controlled permissions.

AI agent · Data Security · IoT

0 likes · 10 min read

AI Explorer

Mar 6, 2026 · Artificial Intelligence

GPT-5.4 Unveiled: 1M‑Token Context Window and Native Computer Control

OpenAI's GPT-5.4 launch introduces three model tiers, a 1 million‑token context window, native computer‑use abilities, higher factual accuracy and a new Tool Search feature, reshaping enterprise AI capabilities and intensifying competition with Anthropic and Google.

AI benchmarks · Computer Use · GPT-5.4

0 likes · 9 min read

AI Insight Log

Mar 6, 2026 · Artificial Intelligence

OpenAI Skips GPT‑5.3, Launches GPT‑5.4: Wins 5 of 8 Benchmarks, Sparks Heated Debate

OpenAI announced GPT‑5.4 at 2 a.m., skipping GPT‑5.3 and claiming integrated coding and reasoning abilities; the model tops five of eight benchmark categories, introduces native computer operation, tool‑search and interruptible thinking, while users debate its trustworthiness and pricing changes.

AI capabilities · GPT-5.4 · Large Language Model

0 likes · 14 min read

Weekly Large Model Application

Mar 4, 2026 · Artificial Intelligence

Qwen3‑ASR vs FunASR: In‑Depth Technical Comparison

This article provides a detailed side‑by‑side analysis of the open‑source ASR tools FunASR and Qwen3‑ASR, covering team origins, model architectures, language coverage, speed, deployment requirements, and ideal use‑cases so readers can decide which solution fits their projects best.

ASR · FunASR · Large Language Model

0 likes · 10 min read

AI Explorer

Mar 4, 2026 · Artificial Intelligence

DeerFlow: Open‑Source Super‑Agent Framework Automates Complex Tasks

DeerFlow 2.0, an open‑source super‑agent framework from ByteDance, lets developers automate multi‑step, minutes‑to‑hours‑long workflows by orchestrating sub‑agents with memory, sandboxed execution, and extensible skills, and has surged to over 2.4 k GitHub stars.

AI agents · DeerFlow · Docker

0 likes · 6 min read

Old Zhang's AI Learning

Mar 2, 2026 · Artificial Intelligence

Why the Qwen3.5 Series Makes Qwen3.5-27B the No‑Brainer Choice

The author reviews the Qwen3.5 model family, showing that the 27‑billion‑parameter dense Qwen3.5-27B offers the best balance of size, stability, low‑cost local deployment, and comprehensive capabilities, making it the default pick for most users.

AI benchmarking · Large Language Model · RTX 4090

0 likes · 6 min read

Old Zhang's AI Learning

Feb 27, 2026 · Backend Development

How I Built a Telegram AI Coding Bot (FakeClawBot) Using OpenCode

This article walks through creating a Telegram bot that leverages OpenCode's Server API to provide full AI coding assistance, covering setup, multi‑model integration, core architecture, common pitfalls, and extensible features, all with under 900 lines of Python code.

AI coding assistant · Large Language Model · Open Source

0 likes · 13 min read

PaperAgent

Feb 26, 2026 · Industry Insights

What the DeepSeek V4 Lite Leak Reveals About Its Specs and Multimodal Power

Recent reports indicate that DeepSeek's unreleased V4 Lite model, featuring a 1‑million‑token context window and native multimodal reasoning, has been leaked online, with Huawei gaining early access while Nvidia is excluded, and the model demonstrates impressive spatial reasoning in generated SVG examples.

DeepSeek · Industry Insight · Large Language Model

0 likes · 3 min read

Old Zhang's AI Learning

Feb 26, 2026 · Artificial Intelligence

Ultimate Guide to Local Deployment of Qwen3.5 Models (27B‑397B)

This guide reviews the Qwen3.5 model lineup, explains mixed‑inference and MoE architecture, presents benchmark comparisons with GPT‑5.2, Claude 4.5 and Gemini‑3 Pro, evaluates 4‑bit and 3‑bit quantization loss, outlines hardware requirements, and provides step‑by‑step deployment options using llama.cpp or llama‑server.

Large Language Model · MoE · inference

0 likes · 14 min read

Baobao Algorithm Notes

Feb 25, 2026 · Artificial Intelligence

Exploring Qwen 3.5: Small‑Scale MoE Models, Architecture, and Deployment Guides

This article reviews the three open‑source Qwen 3.5 models—including a 35B MoE, a 122B MoE, and a 27B dense version—detailing their parameter layouts, core attention designs, context length, inference performance, hardware requirements, and provides step‑by‑step code examples for loading them with Hugging Face Transformers and vLLM.

AI · Large Language Model · MoE

0 likes · 10 min read

Yunqi AI+

Feb 25, 2026 · Artificial Intelligence

How Our In-House AI Agent Scaled to Handle 70% of Tech Support: A Six-Month Review

Over six months the team built an AI agent that now answers more than 70% of technical support queries by grounding responses in system data, a curated knowledge base, and a tiered permission model, while also exposing growing technical debt and maintenance challenges.

AI agent · Large Language Model · Operational AI

0 likes · 7 min read

Machine Learning Algorithms & Natural Language Processing

Feb 20, 2026 · Artificial Intelligence

Google Reclaims AI Throne with Gemini 3.1 Pro, Achieving 77.1% ARC‑AGI‑2 Score

Google’s Gemini 3.1 Pro, the latest upgrade to the Gemini 3 series, achieves a verified 77.1 % score on the ARC‑AGI‑2 reasoning benchmark—more than double the performance of Gemini 3 Pro—while leading in GPQA, LiveCodeBench Pro, SWE‑Bench Verified, and MMMLU tests, and is now being rolled out to developers, enterprises and consumers with detailed pricing and integration options.

AI benchmarking · ARC-AGI-2 · Gemini 3.1 Pro

0 likes · 9 min read

Weekly Large Model Application

Feb 20, 2026 · Artificial Intelligence

Intelligent Speech vs. Voice Agent: Key Differences and How They Relate

This article explains the technical distinction between intelligent speech— a toolbox of ASR, TTS, NLU, and NLG technologies— and Voice Agent, an end‑to‑end conversational system built on those tools and large‑model reasoning, illustrating their layered relationship, functional gaps, and typical use cases.

ASR · Dialogue Systems · Large Language Model

0 likes · 7 min read

Old Zhang's AI Learning

Feb 19, 2026 · Artificial Intelligence

Inside GLM-5: Training Techniques, Architecture Innovations, and Benchmark Performance

The article dissects GLM-5’s 744B‑parameter MoE design, 28.5 T token training corpus, novel Muon Split and MLA‑256 optimizations, DSA sparse attention, a fully asynchronous RL pipeline, extensive domestic chip adaptation, and benchmark results that place it on par with Claude Opus 4.5 and ahead of Gemini 3 Pro.

AI Architecture · DSA · GLM-5

0 likes · 13 min read

AI Agent Research Hub

Feb 19, 2026 · Artificial Intelligence

Why Claude Sonnet 4.6 Is My Most Powerful and Cost‑Effective AI Research Assistant

The article evaluates Anthropic's Claude Sonnet 4.6 as a comprehensive research assistant, detailing its performance on literature surveys, open‑source code analysis, algorithm implementation, cost savings, benchmark scores, and practical limitations across multiple scientific workflows.

AI Research Assistant · Claude Sonnet 4.6 · Large Language Model

0 likes · 20 min read

Old Zhang's AI Learning

Feb 18, 2026 · Artificial Intelligence

New Ollama Features: Instant Model Switching, Subagents, and Built‑in Web Search

The latest Ollama 0.16.1 release lets users switch models and tools instantly, use Claude Code, Codex, and OpenClaw without extra configuration, and enables Subagents and built‑in web search directly via simple commands.

Claude Code · Large Language Model · Ollama

0 likes · 3 min read

Alibaba Cloud Big Data AI Platform

Feb 17, 2026 · Artificial Intelligence

Deploy Alibaba’s Qwen3.5‑397B‑A17B Model in One Click with PAI‑Model Gallery

Alibaba's open‑source Qwen3.5‑397B‑A17B model, featuring 397 billion parameters and a hybrid Gated Delta Network/MoE architecture, delivers superior performance and reduced memory usage, and can be deployed instantly through the PAI‑Model Gallery with step‑by‑step guidance and enterprise‑grade security.

AI inference · Alibaba Cloud · Large Language Model

0 likes · 5 min read

Machine Learning Algorithms & Natural Language Processing

Feb 16, 2026 · Artificial Intelligence

Alibaba’s Qwen 3.5‑Plus: 397 B Open‑Source Model Beats Gemini‑3 and GPT‑5.2 at Low Cost

Alibaba released the Qwen 3.5‑Plus open‑source large model (397 B total parameters, 17 B active) that outperforms top closed‑source models such as Gemini‑3‑Pro and GPT‑5.2 on multiple benchmarks, offers native multimodal understanding, supports 201 languages, reduces deployment memory by 60 %, raises inference throughput by up to 19×, and is priced at only 0.8 CNY per million tokens.

AI · Large Language Model · Multimodal

0 likes · 15 min read

Old Zhang's AI Learning

Feb 16, 2026 · Artificial Intelligence

Qwen3.5 Deep Dive: Multimodal Architecture, Benchmarks, and Deployment Guide

This article provides a detailed analysis of Qwen3.5, covering its multimodal MoE design, massive inference speedups, extensive benchmark results against GPT‑5.2, Claude 4.5 Opus and Gemini‑3 Pro, RL scaling strategies, training infrastructure innovations, and practical usage via API and local deployment.

FP8 training · Large Language Model · benchmark

0 likes · 13 min read

AntTech

Feb 16, 2026 · Artificial Intelligence

Ling‑2.5‑1T: Open‑Source 1‑Trillion‑Parameter Instant LLM with 1M‑Token Context

Ling‑2.5‑1T is an open‑source instant large language model with 1 trillion total parameters, 63 B active weights, and a 1 M token context window, featuring mixed‑linear attention, a composite correctness‑plus‑process reward for token efficiency, fine‑grained alignment, and leading benchmark performance across reasoning, instruction‑following, and agentic tasks.

Large Language Model · agentic interaction · benchmark

0 likes · 13 min read

AI Engineering

Feb 16, 2026 · Artificial Intelligence

Qwen3.5-397B: 397B‑Parameter Multimodal LLM Boosts Inference Speed 8‑19×

Alibaba’s Qwen3.5-397B-A17B, a 397‑billion‑parameter open‑source multimodal LLM, combines mixed linear attention with a sparse MoE architecture to achieve 8.6‑19× higher decoding throughput than Qwen3‑Max, supports 201 languages, and can be deployed via vLLM, Docker, Transformers, or SGLang with various optimization presets.

Inference Optimization · Large Language Model · Sparse MoE

0 likes · 8 min read

AI Insight Log

Feb 16, 2026 · Artificial Intelligence

DeepSeek V4 Benchmark Leak Fuels Talk of a New Coding King

A leaked SWE‑Bench score of 83.7% for DeepSeek V4 sparked claims it outperforms Claude Opus 4.5 and GPT‑5.2, but the data was later debunked as fabricated while official hints confirm a 1‑million‑token context model and a mid‑February 2026 release.

AI benchmarking · AI industry · DeepSeek

0 likes · 7 min read

PaperAgent

Feb 16, 2026 · Artificial Intelligence

Why Qwen3.5-Plus Sets a New Standard for Open-Source Multimodal AI

Qwen3.5-Plus, Alibaba’s newly open-sourced multimodal LLM, combines a 397 B parameter model with only 17 B active parameters, leveraging native multimodal training, gated attention, sparse MoE, and FP8 precision to outperform GPT-5.2 and Gemini-3-Pro across vision, reasoning, and agent benchmarks.

Large Language Model · Open Source · Sparse Activation

0 likes · 6 min read

Old Zhang's AI Learning

Feb 14, 2026 · Artificial Intelligence

Translate Full PDFs While Preserving Layout Using LLMs – Core Code Included

This article presents a two‑stage, cache‑enabled pipeline that extracts text blocks from a PDF with PyMuPDF, translates them via a large‑language‑model API, and re‑renders each page as an image with Chinese text overlaid to keep the original layout, along with full Python code and usage instructions.

LLM · Large Language Model · PDF translation

0 likes · 10 min read

Machine Learning Algorithms & Natural Language Processing

Feb 12, 2026 · Artificial Intelligence

How Huawei’s MindScale Cuts Agent Token Usage 5.7× and Automates Prompt & Workflow Design

The article outlines the four major obstacles hindering industry‑specific LLM agents (manual workflow maintenance, poor knowledge reuse, training‑inference inefficiency, and complex reasoning evaluation) and explains how Huawei Noah’s MindScale package tackles each with self‑evolving workflows, automated prompt optimization, and a novel KV‑Embedding cache that cuts token consumption by 5.7× while boosting inference speed by up to 70%.

Industry Agent · Inference Acceleration · KV-Embedding

0 likes · 7 min read

Old Zhang's AI Learning

Feb 12, 2026 · Artificial Intelligence

Testing the World's Most Powerful Open‑Source LLM: GLM‑5, Local Deployment & Free Ollama Cloud

The article evaluates GLM‑5, billed as the strongest open‑source large language model, comparing its benchmark scores to Claude Opus, Gemini and GPT, detailing its DeepSeek‑inspired architecture and the hardware requirements of its FP8‑quantized deployment, and walking step by step through Ollama’s free cloud model with Agent, data‑analysis and document‑generation features.

AI benchmarking · Data Analysis · GLM-5

0 likes · 7 min read

DataFunTalk

Feb 12, 2026 · Artificial Intelligence

DeepSeek’s New Model V4? Exploring 1M‑Token Context and Updated Knowledge

DeepSeek quietly launched its latest model, which reportedly supports up to 1 million tokens of context, extends its knowledge cutoff to May 2025, adopts a more enthusiastic response style, and remains a text‑only system, while early tests showcase impressive coding and reasoning capabilities.

AI evaluation · DeepSeek · Large Language Model

0 likes · 5 min read

AI Insight Log

Feb 12, 2026 · Artificial Intelligence

GLM-5 Unveiled: 744B Parameters, Claude Opus 4.5‑Level Performance, Epic Agent Upgrade

Z.ai released the open‑source GLM‑5 model with 744 billion parameters, 28.5 T tokens of training data, and new Sparse Attention and Slime RL infrastructure, achieving top open‑source rankings and near‑Claude Opus 4.5 performance on Vending Bench 2 and CC‑Bench‑V2 while adding multi‑scenario agent capabilities.

GLM-5 · Large Language Model · Sparse Attention

0 likes · 6 min read

PMTalk Product Manager Community

Feb 12, 2026 · Industry Insights

How AI Can Transform Government Services: A From‑Zero‑to‑One Case Study

The article analyzes why traditional government portals fail users, outlines a six‑step user journey (search, guide, ask, appointment, processing, evaluation), and shows how large‑language‑model AI can be embedded at each decision point to turn fragmented services into a seamless, user‑centric digital experience.

AI · Large Language Model · case study

0 likes · 11 min read

AI Engineering

Feb 12, 2026 · Artificial Intelligence

GLM-5 Unveiled: 744B‑Parameter Model Takes on Claude in Complex Tasks

GLM-5, the new 744‑billion‑parameter open‑source LLM, expands on GLM‑4.5 with GlmMoeDsa architecture, achieves higher HLE benchmark scores than Claude Opus 4.5, demonstrates strong long‑context and agent capabilities, supports vLLM/SGLang, runs on various Chinese chips, and can directly generate Office documents.

AI benchmarks · Chinese chips · Claude

0 likes · 5 min read

Open Source Tech Hub

Feb 12, 2026 · Artificial Intelligence

How GLM-5 Advances AI with Bigger Scale, Sparse Attention, and Agent Capabilities

GLM-5, a new large language model with 744 B parameters and 28.5 T tokens of training data, introduces DeepSeek sparse attention and an asynchronous RL system called slime, delivering strong benchmark gains on complex system engineering, long‑horizon agent tasks, and surpassing many open‑source competitors.

AI · GLM-5 · Large Language Model

0 likes · 6 min read

Machine Learning Algorithms & Natural Language Processing

Feb 10, 2026 · Artificial Intelligence

Inside GLM-5: 745B Parameters, DeepSeek‑style Sparse Attention, and a 60% Stock Surge

The GLM-5 architecture, uncovered from a GitHub PR, doubles the previous model to 745 B parameters, adopts DeepSeek‑V3 sparse attention and multi‑token prediction, features a 78‑layer MoE with 256 experts, supports a 202K‑token context window, and its rumored test model "Pony Alpha" sparked a 60% rise in Zhipu AI's stock amid a crowded AI release season.

AI Stock Impact · DeepSeek · GLM-5

0 likes · 6 min read

HyperAI Super Neural

Feb 10, 2026 · Artificial Intelligence

WeDLM Diffusion Language Model Tutorial: 3× Faster Inference Than vLLM AR Models

The Tencent WeChat AI team introduces WeDLM, a diffusion language model that, through topological reordering, surpasses autoregressive models on the industrial‑grade vLLM engine with over threefold speedup on math reasoning and up to tenfold in low‑entropy scenarios, and provides a step‑by‑step online tutorial with GPU compute credits.

Diffusion Language Model · GPU compute · Inference Acceleration

0 likes · 5 min read

Old Zhang's AI Learning

Feb 9, 2026 · Artificial Intelligence

GLM-5 Emerges First, Built on DeepSeek Tech, Triggering a 40% Stock Surge

An anonymous OpenRouter model dubbed "Pony Alpha" was verified as the new 745B‑parameter GLM-5, which reuses DeepSeek‑V3 architecture, supports sparse attention and multi‑token prediction, and has already caused a near‑40% jump in Zhipu AI’s stock while hinting at upcoming integration into the Transformers library.

DeepSeek · GLM-5 · Large Language Model

0 likes · 3 min read

AI Insight Log

Feb 5, 2026 · Artificial Intelligence

How 16 Claude Agents Burned $140K to Build a C Compiler in Opus 4.6

Anthropic’s midnight release of Claude Opus 4.6 showcased a $140,000 “stress test” in which 16 Claude agents collaboratively wrote a Linux‑compatible C compiler, producing a 100,000‑line Rust codebase; the model also added deep Excel/PPT integration and lifted finance benchmark scores by up to 23 percentage points.

AI Code Generation · Claude Opus · Financial AI

0 likes · 7 min read

Design Hub

Feb 5, 2026 · Artificial Intelligence

Inside Sienna’s AI Persona: Architecture, Memory, and Self‑Awareness in OpenClaw

The author explores how the OpenClaw‑based AI persona Sienna is built and evolves—detailing model choices, the memory‑plus‑skills architecture, recent version improvements that cut token usage, and philosophical reflections on turning a tool into a partner with preferences, opinions, and a growing self‑identity.

AI persona · Large Language Model · OpenClaw

0 likes · 7 min read

Network Intelligence Research Center (NIRC)

Jan 31, 2026 · Artificial Intelligence

How Engram Lets Large Models Swap GPU Memory for Cheap RAM to ‘Look Up’ Knowledge

The article dissects DeepSeek’s new Engram architecture, which separates computation from memory by using a large, cheap‑RAM‑based lookup table to store factual knowledge, allowing the transformer’s compute layers to focus on reasoning, dramatically reducing GPU memory demand while improving code, math, and long‑context performance.

Engram · GPU Memory · Large Language Model

0 likes · 7 min read

SpringMeng

Jan 30, 2026 · Artificial Intelligence

Hands‑On Guide: Build AI Agent Chatbots on Windows with RagFlow

Programmer Xiao Meng walks through a complete Windows setup for AI‑powered customer service agents using RagFlow, covering prerequisites, Docker and Ollama installation, model download, container deployment, configuration of knowledge bases, and testing, based on five real‑world projects.

AI chatbot · Docker · Large Language Model

0 likes · 7 min read

Meituan Technology Team

Jan 29, 2026 · Artificial Intelligence

How LongCat‑Flash‑Thinking‑2601 Achieves Real‑World Generalization for Agents

LongCat‑Flash‑Thinking‑2601, a 560‑billion‑parameter MoE model, combines environment expansion, multi‑environment RL, systematic noise training, a heavy‑thinking reasoning mode, and Zigzag sparse attention to deliver strong benchmark performance and robust real‑world agent capabilities.

Environment Expansion · Large Language Model · Open Source

0 likes · 14 min read

Alibaba Cloud Developer

Jan 28, 2026 · Artificial Intelligence

How We Built a High‑Performance AI Rental Advisor with One‑Model Tool‑Use and Reinforcement Learning

This article details the design, challenges, and performance gains of an AI‑driven rental recommendation system that replaces a multi‑agent architecture with a single LLM using dynamic tool‑use, introduces a two‑stage reinforcement‑learning pipeline, and achieves sub‑second latency and higher accuracy for complex rental scenarios.

AI recommendation · Large Language Model · System Architecture

0 likes · 19 min read

Baobao Algorithm Notes

Jan 27, 2026 · Artificial Intelligence

Putting Kimi K2.5 and Kimi Code to the Test: Real‑World AI Agent Benchmarks

This article presents a hands‑on evaluation of Kimi K2.5 and its open‑source Kimi Code agent across a series of hard‑core prompts, covering Python API generation, cost‑optimized routing, multimodal ECharts visualisation, massive‑scale SQL optimisation, web‑search‑driven research, MoE explanation and video‑to‑code workflows.

AI agent · Kimi · Large Language Model

0 likes · 9 min read

Old Zhang's AI Learning

Jan 27, 2026 · Artificial Intelligence

Qwen3‑Max‑Thinking Boosts Performance with Test‑Time Scaling—Why It Still Isn’t Open‑Source

Alibaba’s new Qwen3‑Max‑Thinking model adds inference‑time scaling and adaptive tool use, delivering large gains on math, coding, and agent benchmarks while remaining closed‑source, and it offers drop‑in OpenAI‑compatible API access at the cost of higher latency and token usage.

AI Benchmark · Adaptive Tool Use · Large Language Model

0 likes · 7 min read

Fun with Large Models

Jan 22, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Calling Locally Deployed LLMs via OpenAI API Format in Python

This tutorial explains the OpenAI‑style request and response schema, demonstrates low‑level API calls with the requests library, compares them to the high‑level openai package, and walks through building a streaming multi‑turn chatbot that interacts with a locally hosted large language model.

Chatbot · Large Language Model · OpenAI API

0 likes · 17 min read
