Cloud Native 21 min read

Reflections on CAP Theory, ACID, BASE, and Cloud‑Native Fault Tolerance

Reflecting on reading, the author reviews CAP theory’s consistency‑availability‑partition trade‑offs, extends ACID and BASE concepts, proposes modernizing CAP objects to consistency, fault and disaster tolerance, and examines how cloud‑native architectures, micro‑services, and SLA‑driven designs reshape fault tolerance and future self‑healing systems.

DaTaobao Tech

Aug 15, 2022

Reflections on CAP Theory, ACID, BASE, and Cloud‑Native Fault Tolerance

“Believe everything you read is not as good as having no book.” – The author reflects on reading and proposes to discuss CAP theory, data consistency, fault tolerance, and disaster recovery.

CAP Theory Overview

CAP states that a distributed system can satisfy at most two of Consistency, Availability, and Partition tolerance. The article explains the three possible trade‑offs (CA without P, CP without A, AP without C) and gives examples from traditional databases, NoSQL systems, and large‑scale internet services.

ACID Extension

ACID consists of Atomicity, Consistency, Isolation, Durability. The author describes how consistency in ACID ensures logical data rules and mentions that systems like Google Spanner provide both ACID and CAP guarantees via two‑phase commit and Paxos replication.

BASE Extension

BASE (Basically Available, Soft state, Eventual consistency) is presented as a practical alternative to strong consistency, used by many distributed systems such as asynchronous MySQL replication.

Rethinking CAP Objects

The author argues that the original CAP objects (database, software, network) are outdated for modern distributed systems and suggests upgrading the concepts to Consistency, Fault tolerance, and Disaster tolerance.

Impact of Cloud‑Native

Cloud‑native infrastructure changes how fault tolerance and disaster recovery are designed. The article discusses the shift from over‑design to SLA‑driven resilience, the role of micro‑services, containers, and DevOps, and how cloud providers’ SLAs affect disaster‑recovery strategies.

Fault‑Tolerance Basics

Fault tolerance is defined as the ability to tolerate errors (code bugs, system failures, user misuse). The author provides a C code example that checks the return value of fork() and logs an error.

if ((pid = fork()) < 0){
    // strerror returns a string describing the error.
    fprintf(stderr, "fork error: %s
", strerror(errno));
    exit(0)
}

Various fault‑tolerance techniques (retry, graceful degradation, user prompts) are compared, and the importance of identifying uncertainty (errors vs. disasters) is emphasized.

Future Outlook

The author envisions algorithm‑driven self‑healing systems that learn from large amounts of exception data to automatically design fault‑tolerance measures.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Distributed Systems CAP theorem disaster recovery fault tolerance ACID BASE

Written by

DaTaobao Tech

Official account of DaTaobao Technology

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.