Tag

Fault Prediction

1 views collected around this technical thread.

Efficient Ops
Efficient Ops
Apr 22, 2025 · Operations

How AI Agents Are Transforming IT Operations and Fault Management

This article explores how AI agents powered by large models can predict failures, perform root‑cause analysis, enhance knowledge‑based Q&A, automate change releases, and enable intelligent decision‑making, dramatically improving efficiency and reliability in modern IT operations.

AI opsFault Predictionautomation
0 likes · 7 min read
How AI Agents Are Transforming IT Operations and Fault Management
Qunar Tech Salon
Qunar Tech Salon
May 8, 2019 · Operations

Intelligent Fault Prediction and Application Health Management Practices at Qunar

This article presents the goals, methods, and evolution of Qunar's operations team in reducing application failures, improving reliability through fault definition, rapid repair, automation, and AIOps-driven fault prediction, while sharing lessons from industrial PHM and outlining future challenges.

AIOpsFault PredictionPHM
0 likes · 24 min read
Intelligent Fault Prediction and Application Health Management Practices at Qunar
Efficient Ops
Efficient Ops
May 5, 2019 · Operations

How Qunar Uses AI-Driven Fault Prediction to Boost System Reliability

This article outlines Qunar's operational strategy for reducing failures and extending uptime through precise fault detection, rapid recovery, and AI-powered predictive health management, detailing the evolution of their OPS processes, practical implementations, and future challenges in applying PHM to internet services.

AIOpsFault PredictionPHM
0 likes · 18 min read
How Qunar Uses AI-Driven Fault Prediction to Boost System Reliability
Qunar Tech Salon
Qunar Tech Salon
Jan 10, 2019 · Operations

Applying AIOps for Zero‑Downtime Operations at China Aviation Information

The talk by chief architect Luo Hao explains how China Aviation Information tackles heavy legacy systems, non‑standard architectures, and zero‑downtime requirements by using AIOps techniques such as automated configuration discovery, cluster analysis, fault prediction, anomaly detection, event compression and rapid root‑cause automation.

AIOpsFault Predictionautomation
0 likes · 22 min read
Applying AIOps for Zero‑Downtime Operations at China Aviation Information
Efficient Ops
Efficient Ops
Dec 4, 2018 · Operations

How AIOps Transforms Zero‑Downtime Operations at China Aviation Information

This talk explains how China Aviation Information applies practical AIOps techniques—such as automated configuration management, cluster analysis, fault prediction, anomaly detection, and event compression—to achieve near‑zero downtime in a complex, legacy‑heavy ticketing and travel system.

AIAIOpsFault Prediction
0 likes · 24 min read
How AIOps Transforms Zero‑Downtime Operations at China Aviation Information