Tagged articles
22 articles
Alibaba Cloud Developer
Jun 15, 2026 · Cloud Native

A Kernel‑Embedded ‘Perspective Mirror’: Achieving Full‑Stack Observability with CloudMonitor 2.0 Without Code Changes

The article explains how OpenTelemetry eBPF Instrumentation (OBI) leverages Linux kernel eBPF probes to provide zero‑code, cross‑language observability for applications, networks, logs, and GPU workloads, detailing its protocol detection, deep runtime integration, data‑pipeline architecture, deployment options, and practical considerations.

Cloud Monitoring · Distributed Tracing · GPU Tracing
0 likes · 22 min read
Alibaba Cloud Native
Jun 14, 2026 · Operations

From API to AI Agent: Alibaba Cloud Monitoring CLI + Agent Skill in Action

The article explains how Alibaba Cloud Monitoring CLI (aliyun cms2) and its Agent Skill turn traditional API‑based operations into AI‑driven, natural‑language workflows, enabling secure, auditable, and automated observability tasks such as resource onboarding, alarm management, and data queries.

Alibaba Cloud · Automation · CLI
0 likes · 18 min read
Alibaba Cloud Native
Jun 7, 2026 · Cloud Native

Eliminate Complex Integration: AI Agent Skill Powers Cloud Monitoring

The article shows how Alibaba Cloud's CMS CLI and the AI‑driven alibabacloud‑cms‑manage Skill turn a multi‑step observability setup into a single natural‑language command, detailing the six‑step CLI workflow, the two‑stage confirmation safety, and a full K8s LangChain auto‑integration demo.

Automation · CLI · Cloud Monitoring
0 likes · 10 min read
Alibaba Cloud Native
May 31, 2026 · Cloud Native

Why Alibaba Cloud’s AI Agent Observability Platform Is the Enterprise‑Grade Choice for Full‑Stack Monitoring

The article analyzes the rapid growth of AI Agents, outlines the four core challenges of production‑grade agents—cost overruns, fault‑location inefficiency, security risks, and quality measurement—and presents Alibaba Cloud’s AI Agent Observability solution with a four‑layer architecture, end‑to‑end tracing, real‑time health dashboards, and Agentic Ops capabilities to address these issues.

Agentic Ops · Cloud Monitoring · GenAI
0 likes · 14 min read
Alibaba Cloud Observability
Apr 6, 2026 · Artificial Intelligence

How OpenClaw’s New Plugin Reveals Every LLM Decision Step

The OpenClaw CMS plugin 0.1.2 upgrades observability for AI agents by fully restoring multi‑round execution traces, stabilizing concurrent chains, adding STEP spans, and quantifying agent metrics, turning raw trace graphs into actionable insights for debugging, testing, cost control, and cross‑team collaboration.

AI Operations · Cloud Monitoring · LLM
0 likes · 8 min read
Alibaba Cloud Developer
Sep 12, 2025 · Operations

How to Build End‑to‑End Observability for Large‑Model Applications on Alibaba Cloud

This guide explains how to design and implement a complete observability solution for large‑model AI services on Alibaba Cloud, covering architecture, core metrics, logging standards, demo code, log collection, dashboard design, alerting, monitoring tools, troubleshooting SOPs, and recovery procedures.

AI Operations · Alibaba Cloud · Cloud Monitoring
0 likes · 21 min read
Alibaba Cloud Developer
Jan 2, 2025 · Operations

Mastering Error and Latency Diagnosis for Online Applications

This article presents a systematic root‑cause diagnosis framework for online applications, covering how to identify and resolve both error ("wrong") and performance ("slow") problems using trace links, associated data, high‑quality observability, and large‑language‑model‑driven intelligence.

Cloud Monitoring · Large Language Model · Root Cause Analysis
0 likes · 12 min read
Alibaba Cloud Observability
Dec 9, 2024 · Cloud Native

How to Design and Use Cloud Monitoring Event Subscriptions on Alibaba Cloud

This guide explains the purpose, design, and step‑by‑step configuration of Alibaba Cloud's cloud‑monitor event subscription feature, covering typical multi‑team and application‑group scenarios, flexible filtering, aggregation, custom notifications, and integration with external services for robust cloud‑native operations.

Alibaba Cloud · Cloud Monitoring · Cloud Native
0 likes · 10 min read
Alibaba Cloud Observability
Jun 13, 2024 · Cloud Native

Kickstart Your Observability Journey with Alibaba Cloud Monitoring

This guide introduces new Alibaba Cloud users to the fundamentals of cloud observability, explaining the metric‑trace‑log stack, the layered monitoring pyramid, and step‑by‑step how to set up out‑of‑the‑box resource monitoring, dashboards, alerts, and advanced integration options.

Alibaba Cloud · Cloud Monitoring · Cloud Native
0 likes · 7 min read
Tencent Cloud Developer
Jan 9, 2024 · Operations

Tencent Cloud APM Full-Link Tracing Implementation and Best Practices

The article explains how Tencent Cloud APM implements full‑link tracing using OpenTelemetry standards, addresses challenges such as protocol compatibility, massive trace storage, and bytecode overhead with solutions like conversion gateways, tail sampling and thread profiling, and showcases best‑practice scenarios for topology analysis, front‑end/back‑end integration, and log‑trace correlation within the broader TCOP observability suite.

APM · Cloud Monitoring · Full‑Link Tracing
0 likes · 11 min read
Alibaba Cloud Native
Nov 3, 2022 · Cloud Native

How to Leverage Alibaba Cloud Prometheus for Fine‑Grained Cloud Product Monitoring

This guide explains why native cloud monitoring falls short, how building custom Prometheus exporters adds overhead, and how Alibaba Cloud's fully managed Prometheus service—through enterprise cloud‑monitoring and self‑monitoring integration modes—provides ready‑to‑use exporters, agents, Grafana dashboards, and alert templates for dozens of cloud products.

Alibaba Cloud · Cloud Monitoring · Cloud Native
0 likes · 12 min read
Alibaba Cloud Native
Oct 19, 2022 · Cloud Native

How to Monitor Non‑Kubernetes ECS Apps with Alibaba Cloud Managed Prometheus

This guide explains how to use Alibaba Cloud's fully managed Prometheus service to collect and visualize metrics from ECS‑based applications across pure VPC, hybrid VPC‑IDC, and multi‑cloud scenarios, detailing the pain points of self‑built solutions and providing step‑by‑step configuration instructions.

Alibaba Cloud · Cloud Monitoring · ECS
0 likes · 11 min read
Efficient Ops
Mar 29, 2022 · Big Data

How Tencent Cloud Boosted APM Metric Computation Speed 2‑3× with Flink Optimizations

This article explains how Tencent Cloud's APM metric calculation, which transforms massive Span data into aggregated metrics using Flink, faced performance bottlenecks and was optimized through job splitting, batch merging, and dimension pruning, ultimately achieving a 2‑3× speed increase and cutting resource usage to about 30% of the original.

APM · Cloud Monitoring · Flink
0 likes · 10 min read
Huawei Cloud Developer Alliance
Nov 24, 2021 · Cloud Computing

How Proactive Link Monitoring Transforms Cloud Network Reliability

This article explains Huawei Cloud Stack's proactive link monitoring system, detailing its point‑line‑plane architecture, golden metrics of packet loss and latency, detection techniques, system components, and key innovations such as strategy optimization, alarm aggregation, and visualized performance dashboards for cloud data‑center networks.

Cloud Monitoring · Data Center · Network Reliability
0 likes · 13 min read
Efficient Ops
Jun 23, 2021 · Operations

Agent vs Network Data: Choosing the Right Cloud Performance Monitoring Approach

This article compares agent‑based and network‑data approaches to cloud‑native application performance monitoring, discussing their architectures, advantages, challenges, and how combining white‑box and black‑box techniques can improve fault detection, scalability, and operational efficiency in complex cloud environments.

Agent · Cloud Monitoring · Operations
0 likes · 10 min read
Tencent Cloud Developer
May 14, 2020 · Operations

Tencent Classroom Monitoring Practices: Challenges, Strategies, and Future Directions

During the pandemic’s “classes suspended, learning continues” surge, Tencent Classroom handled a 120‑fold traffic jump by rapidly deploying Grafana dashboards, Kibana logs, the internal Moniter tool, and cloud monitoring tools, and by establishing a three‑layer feedback‑alert‑on‑call model. It now plans automation, unified visualizations, and chaos engineering to further improve observability and service reliability.

Cloud Monitoring · SRE · Tencent Classroom
0 likes · 14 min read