Ops Community
Author

Ops Community

A leading IT operations community where professionals share and grow together.

189
Articles
0
Likes
590
Views
0
Comments
Recent Articles

Latest from Ops Community

100 recent articles max
Ops Community
Ops Community
Apr 19, 2026 · Databases

How to Diagnose and Resolve MySQL CPU Spikes: A Complete Step‑by‑Step Guide

This guide walks you through identifying why MySQL CPU usage jumps, from confirming the MySQL process consumes CPU to checking connection counts, slow queries, lock waits, configuration settings, and business‑level traffic, and then provides short‑term mitigations and long‑term solutions such as read‑write splitting, sharding, and caching.

CPUDatabaseMonitoring
0 likes · 17 min read
How to Diagnose and Resolve MySQL CPU Spikes: A Complete Step‑by‑Step Guide
Ops Community
Ops Community
Apr 18, 2026 · Operations

Master Linux Host Monitoring: Prometheus, Node Exporter, Thresholds & Scripts

This comprehensive guide walks you through building a robust Linux host monitoring system with Prometheus and node_exporter, covering CPU, memory, disk, and network metrics, practical threshold formulas, ready‑to‑run Bash scripts, Alertmanager rules, Grafana dashboards, and best‑practice recommendations for reliable operations.

AlertmanagerGrafanaLinux monitoring
0 likes · 49 min read
Master Linux Host Monitoring: Prometheus, Node Exporter, Thresholds & Scripts
Ops Community
Ops Community
Apr 15, 2026 · Operations

Why Do Linux Processes Vanish? A Complete Troubleshooting Guide

This article systematically explains why Linux processes may disappear, covering OOM Killer, signal termination, cgroup limits, systemd timeouts, manual kills, and provides step‑by‑step diagnostic commands and preventive measures for RHEL, AlmaLinux, and Ubuntu environments.

LinuxMemoryOOM killer
0 likes · 26 min read
Why Do Linux Processes Vanish? A Complete Troubleshooting Guide
Ops Community
Ops Community
Apr 10, 2026 · Databases

How to Diagnose and Fix MySQL Too Many Connections Errors in Production

When MySQL reports 'Too many connections', this guide walks you through emergency assessment, step‑by‑step diagnostics, quick mitigation scripts, root‑cause analysis of slow queries, connection leaks, short‑connection spikes, and long‑term solutions including parameter tuning, connection‑pool configuration, and Prometheus‑based monitoring to prevent future outages.

AlertmanagerConnection PoolConnection leak
0 likes · 40 min read
How to Diagnose and Fix MySQL Too Many Connections Errors in Production
Ops Community
Ops Community
Apr 9, 2026 · Operations

Mastering Nginx Reverse Proxy: From Basics to Advanced Load Balancing and High Availability

This comprehensive guide explains the fundamentals of reverse proxy, walks through Nginx configuration, load‑balancing algorithms, health‑check setups, caching strategies, session‑persistence methods, high‑availability designs, performance tuning, monitoring, and troubleshooting, providing practical code snippets for real‑world deployments.

health checkhigh availabilityload balancing
0 likes · 30 min read
Mastering Nginx Reverse Proxy: From Basics to Advanced Load Balancing and High Availability
Ops Community
Ops Community
Apr 8, 2026 · Operations

Recover Accidentally Deleted Linux Files: Kernel Secrets & Practical Tools

This article explains the kernel‑level mechanics of file deletion on Linux, compares hard and soft links, shows how processes keep deleted files open, and provides step‑by‑step guides for using extundelete, testdisk, photorec, debugfs, XFS tools, snapshot strategies, scripts and preventive measures to reliably restore lost data.

File RecoveryLinuxShell Scripts
0 likes · 29 min read
Recover Accidentally Deleted Linux Files: Kernel Secrets & Practical Tools
Ops Community
Ops Community
Apr 5, 2026 · Operations

Choosing the Right Ingress Controller: Nginx, Traefik, or Envoy?

This guide provides a deep technical comparison of Nginx Ingress Controller, Traefik, and Envoy Proxy, covering architecture, configuration, performance, feature sets, deployment patterns, security hardening, monitoring, and troubleshooting to help operators select the best solution for their Kubernetes clusters.

EnvoyKubernetesMonitoring
0 likes · 28 min read
Choosing the Right Ingress Controller: Nginx, Traefik, or Envoy?
Ops Community
Ops Community
Apr 2, 2026 · Operations

Build a Production‑Ready Prometheus + Grafana Monitoring Stack in Minutes

Learn how to quickly set up a complete, production‑grade monitoring system using Prometheus 3.x and Grafana 11, covering installation, service discovery, PromQL queries, recording rules, Alertmanager routing, Grafana dashboards, best‑practice configurations, and troubleshooting for environments of any size.

Grafanaalertingcloud-native
0 likes · 55 min read
Build a Production‑Ready Prometheus + Grafana Monitoring Stack in Minutes