Tag

performance troubleshooting


Aikesheng Open Source Community
Dec 13, 2023 · Operations

Troubleshooting High System Load Caused by Stuck NFS Processes and Zabbix Monitoring

This article details a step‑by‑step analysis of a server experiencing sustained high load, uncovering a stuck NFS mount and an overactive Zabbix monitoring job, and explains how targeted process termination and monitoring adjustments reduced the load from 85 to normal levels.

Linux · NFS · Zabbix
0 likes · 6 min read
Tencent Cloud Developer
Nov 9, 2023 · Backend Development

Troubleshooting Golang GC Performance Issues Causing Request Timeout Spikes

The article details how a Go service’s default GOGC setting caused overly frequent garbage‑collection pauses that spiked request timeouts, and how adjusting GOGC dynamically with debug.SetGCPercent and setting memory limits reduced GC CPU usage, extended pause intervals, and eliminated timeout spikes.

GOGC · Go Memory Management · Runtime Trace
0 likes · 20 min read
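
As a rough illustration of why the default GOGC value can lead to frequent collections: under Go's documented pacing rule, the next cycle starts once the heap has grown by roughly GOGC percent over the live heap left by the previous cycle. The sketch below is plain Python arithmetic, not code from the article. The function names, heap size, and allocation rate are made-up assumptions, and the model ignores GC roots and memory limits.

```python
def next_gc_target_mb(live_heap_mb: float, gogc: int) -> float:
    """Approximate heap size (MB) at which the next GC cycle starts."""
    return live_heap_mb * (1 + gogc / 100)


def seconds_between_gcs(live_heap_mb: float, gogc: int, alloc_mb_per_s: float) -> float:
    """Time needed to allocate from the live heap up to the next GC target."""
    headroom_mb = next_gc_target_mb(live_heap_mb, gogc) - live_heap_mb
    return headroom_mb / alloc_mb_per_s


# Hypothetical service: 200 MB live heap, allocating 400 MB per second.
for gogc in (100, 400):
    target = next_gc_target_mb(200, gogc)
    interval = seconds_between_gcs(200, gogc, 400)
    print(f"GOGC={gogc}: target {target:.0f} MB, one GC about every {interval:.1f} s")
```

In this simplified model, raising GOGC from 100 to 400 stretches the interval between collections from 0.5 s to 2.0 s, at the cost of a larger peak heap. That trade-off is why a memory limit is usually set alongside a higher GOGC.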
vivo Internet Technology
Apr 19, 2023 · Backend Development

Investigation of Midnight Interface Timeout in Vivo E‑commerce Activity System

The article details how a midnight interface timeout in Vivo’s e‑commerce activity system was traced to a logging bottleneck: a synchronous Log4j call blocked all threads while a cron‑driven log‑rotation script copied a 2.6 GB file, and the issue was resolved by switching to asynchronous logging with a non‑blocking appender.

Tomcat · backend · java
0 likes · 17 min read
Aikesheng Open Source Community
Sep 20, 2022 · Databases

Analyzing MySQL Slow Log: Identifying Disk I/O and Semi‑Sync Replication Bottlenecks

This article walks through a systematic analysis of MySQL slow‑log entries, correlating them with error‑log flush events, disk‑I/O monitoring, and semi‑synchronous replication traces to pinpoint performance bottlenecks and provide practical troubleshooting steps.

Database Monitoring · Disk I/O · MySQL
0 likes · 20 min read
Aikesheng Open Source Community
May 25, 2022 · Operations

Diagnosing High CPU Load Caused by Frequent Short‑Lived Processes in a MongoDB Environment Using execsnoop

The article describes how a MongoDB test environment on a single VM experienced persistent high CPU load despite low visible QPS, how the root cause was traced to thousands of short‑lived processes spawned by Zabbix monitoring, and how execsnoop was used to identify and eliminate the offending processes.

CPU Load · MongoDB · Zabbix
0 likes · 6 min read
Efficient Ops
Jan 18, 2018 · Operations

Understanding Linux Load Average: Reading, Interpreting, and Using It for Troubleshooting

This article explains what Linux load average measures, how to view the 1‑, 5‑, and 15‑minute values, interprets the numbers using traffic analogies, presents stress‑test scenarios across different CPU cores, and shows how load average guides effective troubleshooting of CPU and I/O bottlenecks.

Linux · load average · operations
0 likes · 8 min read
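
A minimal sketch of the per-core reading of load average that the summary above alludes to. The helper names and the 0.7 and 1.0 thresholds are illustrative rules of thumb, not values taken from the article. Note also that on Linux the load average counts tasks in uninterruptible (typically I/O) wait as well as runnable tasks, so a high ratio does not by itself prove a CPU bottleneck.

```python
import os


def load_per_core(load_avg: float, cores: int) -> float:
    """Normalize a load average by the number of CPU cores."""
    if cores < 1:
        raise ValueError("cores must be at least 1")
    return load_avg / cores


def describe(load_avg: float, cores: int) -> str:
    """Classify a load average using illustrative (assumed) thresholds."""
    ratio = load_per_core(load_avg, cores)
    if ratio < 0.7:
        return "headroom"
    if ratio <= 1.0:
        return "near capacity"
    return "overloaded"


if __name__ == "__main__":
    # os.getloadavg() returns the 1-, 5-, and 15-minute averages on Unix-like systems.
    one, five, fifteen = os.getloadavg()
    cores = os.cpu_count() or 1
    for label, value in (("1 min", one), ("5 min", five), ("15 min", fifteen)):
        print(f"{label}: {value:.2f} on {cores} cores -> {describe(value, cores)}")
```

For example, a load of 85 on an 8-core machine is more than ten times the core count, which this sketch classifies as overloaded.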