Operations 21 min read

Master Keepalived: Build Reliable Linux Load‑Balancing and HA

This guide explains Keepalived’s role in Linux load‑balancing and high‑availability, covering its VRRP‑based architecture, core modules, layered operation, configuration syntax, practical deployment with Nginx, common split‑brain issues, and advanced settings such as nopreempt and multicast conflict resolution.

Ops Development Stories

Sep 17, 2021

Master Keepalived: Build Reliable Linux Load‑Balancing and HA

Keepalived Introduction

Keepalived is a routing software written in C that provides simple, robust load‑balancing and high‑availability for Linux systems. It monitors server health, automatically removes failed nodes and adds recovered nodes without manual intervention.

Originally designed for LVS, it also implements the VRRP protocol to protect services such as Nginx, HAProxy, MySQL and others.

Application Scenarios

Manage LVS clusters

Implement high‑availability via VRRP

Health check and automatic failover

Two main uses are load‑balancing (LVS + Keepalived) and active‑passive high‑availability.

Working Principle

VRRP Protocol

VRRP (Virtual Router Redundancy Protocol) creates a virtual router IP shared by multiple physical routers or servers. The master handles ARP, ICMP and traffic forwarding, while backups listen and take over when the master fails.

VRRP uses a virtual router ID (VRID) and a priority (0‑255) to elect the master; the highest priority becomes master.

Core Components

Modules: core (process management), check (health checks), VRRP (protocol implementation). Additional modules include system call, watchdog, libipfwc and libipvs.

Layered Operation

Load‑balancing works at TCP/IP layers 3‑5, while high‑availability operates at layers 3, 4, 7 using appropriate health‑check protocols: ICMP for layer 3, TCP for layer 4, and HTTP for layer 7.

Process Model

When started, Keepalived launches three processes: a parent, a VRRP child and a checkers child. Health‑check failures cause the VRRP child to withdraw the virtual IP and switch to BACKUP state.

Configuration and Usage

Configuration files are hierarchical, using braces and include statements. Key sections include global_defs, vrrp_instance and virtual_server.

global_defs {
    router_id LVS_DEVEL
}
vrrp_instance VI_1 {
    state MASTER
    interface ens33
    virtual_router_id 51
    priority 100
    advert_int 1
    authentication {
        auth_type PASS
        auth_pass 1111
    }
    virtual_ipaddress {
        192.168.119.130
    }
}
virtual_server 192.168.119.130 80 {
    delay_loop 6
    lb_algo rr
    lb_kind DR
    protocol TCP
    real_server 192.168.119.120 80 {
        weight 1
        TCP_CHECK {
            connect_timeout 10
            retry 3
            delay_before_retry 3
            connect_port 80
        }
    }
}

Typical deployment steps: install Keepalived and Nginx, create health‑check scripts, configure master and backup instances, start services, verify VIP movement, and test failover.

Common Issues – Split‑Brain

When heartbeat links fail, both nodes may think they are master, leading to resource contention. Causes include network failures, firewall blocks, mismatched VRID, or mis‑configured heartbeat interfaces.

Mitigation methods: dual heartbeat links, STONITH devices, monitoring tools, disk locking, arbitration via external gateway, or automatic reboot of the isolated node.

Preventing Split‑Brain with Nginx Monitoring

A script checks Nginx port and process; if unhealthy, it stops Keepalived to avoid split‑brain.

Preventing VIP Flapping After Recovery

Set the nopreempt parameter so the original master does not automatically reclaim the virtual IP after recovery; manual restart is required.

Multicast Address Conflict

Multiple Keepalived groups in the same LAN may clash on the default multicast address 224.0.0.18. Configure a unique vrrp_mcast_group4 address in each group to avoid conflicts.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

failover high-availability HA VRRP keepalived load-balancing

Written by

Ops Development Stories

Maintained by a like‑minded team, covering both operations and development. Topics span Linux ops, DevOps toolchain, Kubernetes containerization, monitoring, log collection, network security, and Python or Go development. Team members: Qiao Ke, wanger, Dong Ge, Su Xin, Hua Zai, Zheng Ge, Teacher Xia.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.