Tag

system monitoring

1 views collected around this technical thread.

Raymond Ops
Raymond Ops
Jun 6, 2025 · Operations

Master Linux Process Management: Essential Commands and Monitoring Tools

This guide explains the fundamentals of Linux programs and processes, their lifecycle, creation methods, and provides detailed usage of key commands such as ps, top, pgrep, pstree, lsof, vmstat, free, iostat, iftop, dstat, as well as background job control, at, and crontab for scheduling tasks.

LinuxProcess Managementcommand line
0 likes · 11 min read
Master Linux Process Management: Essential Commands and Monitoring Tools
DevOps Operations Practice
DevOps Operations Practice
Jun 3, 2025 · Operations

7 Practical Shell Scripts for Linux Automation

This article presents seven practical Bash shell scripts for Linux system administration, covering automatic file backup, disk usage alerts, old log cleanup, server reachability checks, DNS resolution monitoring, HTTP service health checks, and random password generation, each with explanations and code examples.

BackupDisk usage alertLinux automation
0 likes · 6 min read
7 Practical Shell Scripts for Linux Automation
Cognitive Technology Team
Cognitive Technology Team
May 26, 2025 · Operations

Getting Started with OSHI: A Java Library for Cross‑Platform System Information

This article introduces OSHI, a free open‑source Java library that uses JNA to provide cross‑platform hardware and operating‑system details, outlines its main features, advantages, and limitations, and supplies complete Maven dependency and Java code examples for building system‑monitoring tools.

Hardware InfoJNAJava
0 likes · 14 min read
Getting Started with OSHI: A Java Library for Cross‑Platform System Information
IT Xianyu
IT Xianyu
Apr 19, 2025 · Operations

Comprehensive Guide to System Monitoring, Text Editing, Permission Management, and Log Analysis for Library Operations

This article provides a detailed, step‑by‑step tutorial on using Linux commands such as top, grep, awk, and bash scripts for performance bottleneck detection, memory‑leak investigation, automated alerts, efficient Vim editing, permission matrix configuration, multi‑condition log analysis, and quick troubleshooting in a library management environment.

Log AnalysisPermissionsbash scripting
0 likes · 9 min read
Comprehensive Guide to System Monitoring, Text Editing, Permission Management, and Log Analysis for Library Operations
Raymond Ops
Raymond Ops
Feb 27, 2025 · Operations

Unlock Linux Secrets: Exploring /proc and /proc/self for Process Insight

This article explains the Linux /proc virtual file system and its /proc/self shortcut, detailing how to read process information such as command line, working directory, executable path, environment variables, memory maps, and memory image using simple shell commands.

LinuxUnixoperations
0 likes · 5 min read
Unlock Linux Secrets: Exploring /proc and /proc/self for Process Insight
DataFunSummit
DataFunSummit
Jan 23, 2025 · Artificial Intelligence

Improving Observability in Multi‑Agent Systems: Analysis and Extension of OpenAI Swarm

This article examines the research‑oriented topic of observability in multi‑agent systems, reviews existing open‑source MAS frameworks such as Swarm, MetaGPT, AutoGen, and AutoGPT, identifies their observability challenges, and proposes extensions and visualization techniques to enhance debugging, testing, and control of OpenAI Swarm‑based applications.

AIObservabilityOpenAI Swarm
0 likes · 26 min read
Improving Observability in Multi‑Agent Systems: Analysis and Extension of OpenAI Swarm
Efficient Ops
Efficient Ops
Jan 22, 2025 · Operations

Essential Ops Metrics Every Engineer Should Monitor

Operations engineers need to track a comprehensive set of system, application, fault, security, and backup metrics—such as CPU and memory usage, response time, alert counts, incident rates, and recovery objectives—to quickly assess health, anticipate problems, and ensure reliable performance.

Backup and RecoveryPerformance MetricsReliability
0 likes · 5 min read
Essential Ops Metrics Every Engineer Should Monitor
Test Development Learning Exchange
Test Development Learning Exchange
Dec 23, 2024 · Fundamentals

Linux Fundamentals: Directory Structure, Kernel Architecture, and Essential Commands

This article provides a comprehensive overview of Linux fundamentals, covering directory structure, kernel architecture, essential commands, and advanced piping techniques for system administration and development.

Directory StructureLinux fundamentalsNetwork Administration
0 likes · 10 min read
Linux Fundamentals: Directory Structure, Kernel Architecture, and Essential Commands
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Nov 7, 2024 · Operations

Understanding and Calculating QPS (Queries Per Second) with JMeter Performance Testing

This article explains what QPS (Queries Per Second) is, presents formulas and examples for calculating overall and per‑interface QPS, and shows how to use JMeter to create a test plan, add thread groups and listeners for accurate performance measurement.

Backend MetricsJMeterQPS
0 likes · 5 min read
Understanding and Calculating QPS (Queries Per Second) with JMeter Performance Testing
Test Development Learning Exchange
Test Development Learning Exchange
Oct 31, 2024 · Operations

Using top and htop for Real‑Time System Resource Monitoring and Performance Analysis

This guide explains how to use the Linux utilities top and htop to monitor CPU, memory, disk I/O and network usage in real time, record performance data, analyze bottlenecks, and apply advanced techniques such as per‑process tracking, logging, chart generation and optimization recommendations.

LinuxPerformance analysishtop
0 likes · 9 min read
Using top and htop for Real‑Time System Resource Monitoring and Performance Analysis
Python Programming Learning Circle
Python Programming Learning Circle
Sep 15, 2024 · Operations

Using Python Scripts for Operations Automation: Remote Execution, Log Parsing, Monitoring, Deployment, and Backup

This article explains how operations engineers can leverage Python scripts and popular libraries such as paramiko, regex, psutil, fabric, and shutil to automate tasks like remote command execution, log analysis, system monitoring with alerts, batch software deployment, and file backup and recovery, enhancing efficiency and reducing manual errors.

Remote Executionoperations automationscripting
0 likes · 9 min read
Using Python Scripts for Operations Automation: Remote Execution, Log Parsing, Monitoring, Deployment, and Backup
Test Development Learning Exchange
Test Development Learning Exchange
Mar 30, 2024 · Operations

Monitoring macOS and Windows System Resources with Python

This guide explains why and how to monitor CPU, memory, and disk I/O on macOS or Windows using Python's psutil, matplotlib, and numpy libraries, covering performance analysis, troubleshooting, capacity planning, automated alerts, and includes a complete example script that visualizes resource usage over time.

Performance analysisautomationpsutil
0 likes · 6 min read
Monitoring macOS and Windows System Resources with Python
DevOps Operations Practice
DevOps Operations Practice
Mar 15, 2024 · Operations

Practical Linux Command-Line Techniques: Using xargs, ps, and netstat for File Classification, Resource Monitoring, and Network Inspection

This tutorial demonstrates how to leverage the xargs command for batch file processing, employ ps to list high‑memory and high‑CPU processes, and use netstat combined with awk to examine TCP connection states and identify the most active IP addresses, all illustrated with concrete command examples.

Linuxcommand linenetstat
0 likes · 4 min read
Practical Linux Command-Line Techniques: Using xargs, ps, and netstat for File Classification, Resource Monitoring, and Network Inspection
Java Tech Enthusiast
Java Tech Enthusiast
Jan 7, 2024 · Operations

Using the Linux top Command for Real-Time System Monitoring

The Linux top command offers a dynamic, real‑time view of system processes and resource usage—showing overall statistics, CPU and memory breakdowns, and detailed process columns—while supporting customizable refresh intervals, batch mode, and interactive shortcuts for sorting, column selection, and monitoring crucial metrics like %idle, %wa, and %steal.

CPULinuxOps
0 likes · 7 min read
Using the Linux top Command for Real-Time System Monitoring
Practical DevOps Architecture
Practical DevOps Architecture
Dec 15, 2023 · Operations

How to Configure Zabbix Agent to Monitor CPU Temperature on Linux

This guide explains how to install lm_sensors, write a Bash script to retrieve CPU temperature, configure a Zabbix agent UserParameter, restart the agent, and set up a corresponding monitoring item on the Zabbix server, enabling automated CPU temperature monitoring.

CPU monitoringLinuxZabbix
0 likes · 3 min read
How to Configure Zabbix Agent to Monitor CPU Temperature on Linux
Aikesheng Open Source Community
Aikesheng Open Source Community
Dec 13, 2023 · Operations

Troubleshooting High System Load Caused by Stuck NFS Processes and Zabbix Monitoring

This article details a step‑by‑step analysis of a server experiencing sustained high load, uncovering a stuck NFS mount and an overactive Zabbix monitoring job, and explains how targeted process termination and monitoring adjustments reduced the load from 85 to normal levels.

LinuxNFSZabbix
0 likes · 6 min read
Troubleshooting High System Load Caused by Stuck NFS Processes and Zabbix Monitoring
Architecture and Beyond
Architecture and Beyond
Nov 25, 2023 · Operations

Designing and Implementing an Effective Log System for Internet Startups

The article explains why comprehensive logging is essential for internet startups, outlines the three stages of a log system, details log levels, required fields, best‑practice principles, collection architectures such as local files and ELK, and how collected logs support monitoring, debugging, and analytics.

DevOpsELKObservability
0 likes · 12 min read
Designing and Implementing an Effective Log System for Internet Startups
Efficient Ops
Efficient Ops
Sep 5, 2023 · Operations

Master Linux Performance: Boost Throughput, Cut Latency, and Optimize CPU & Memory

This guide explains how high concurrency and fast response depend on throughput and latency, defines key performance metrics, shows how to interpret average load, CPU context switches, and memory usage, and provides practical Linux tools and command‑line examples for diagnosing and tuning system performance.

CPU optimizationLinux performanceMemory Management
0 likes · 44 min read
Master Linux Performance: Boost Throughput, Cut Latency, and Optimize CPU & Memory
HomeTech
HomeTech
Aug 16, 2023 · Operations

Webeye: A Low‑Code UI Automation Platform for Comprehensive Web Monitoring

This article introduces the Webeye low‑code platform, explains the fundamentals of system monitoring—including logging, metrics, and tracing—describes layered monitoring across hardware to UI, details Webeye’s actions and validators, presents a real‑world use case, and discusses extensibility, selector flexibility, and impact mitigation.

UI AutomationWebeyeheadless browser
0 likes · 14 min read
Webeye: A Low‑Code UI Automation Platform for Comprehensive Web Monitoring