Clustering Fundamentals: Scaling Through Unity

A Cluster is a collection of inter-connected servers (nodes) that work together to act as a single, more powerful system. Clustering is the backbone of horizontal scaling and high availability.

1. Why do we need Clusters?

Individual servers have limits (CPU, RAM, Network). When a single server can no longer handle the load, we "Cluster" multiple servers to share the burden.

Key Benefits:

Scalability: Add more nodes to increase capacity (Horizontal Scaling).
Availability: If one node fails, others take over, preventing downtime.
Cost-Efficiency: It's often cheaper to use multiple mid-range servers than one extremely high-end "supercomputer."

2. Types of Clusters

A. Load Balancing Clusters

These focus on performance. They distribute incoming user requests across all nodes to ensure no single server is overwhelmed.

Best for: Web servers, API gateways.

B. High Availability (HA) Clusters

These focus on reliability. They use redundancy to ensure that if the "Active" node fails, a "Passive" node takes over immediately (Failover).

Core Concept: "Heartbeat" (Nodes check if others are still alive).
Best for: Databases, Critical infrastructure.

C. Compute (HPC) Clusters

These focus on raw power. They break a single massive task (like weather simulation or financial modeling) into tiny pieces and solve them in parallel across all nodes.

Best for: Scientific research, AI training.

3. High-Level Architecture

In a typical web-scale cluster, a Load Balancer sits in front, and a Shared Storage system (or State) sits behind to ensure every node sees the same data.

4. Implementation Example: Server-Level Clustering

While infrastructure clustering (like Kubernetes) handles multiple machines, modern languages like Node.js allow you to cluster multiple processes on a single machine to utilize every CPU core.

javascript

const cluster = require("cluster");
const http = require("http");
const os = require("os");

if (cluster.isMaster) {
  const numCPUs = os.cpus().length;
  console.log(`Master process is running. Spawning ${numCPUs} workers...`);

  // Create a worker for each CPU core
  for (let i = 0; i < numCPUs; i++) {
    cluster.fork();
  }

  cluster.on("exit", (worker, code, signal) => {
    console.log(`Worker ${worker.process.pid} died. Spawning a replacement...`);
    cluster.fork();
  });
} else {
  // Workers can share any TCP connection
  // In this case, it is an HTTP server
  http
    .createServer((req, res) => {
      res.writeHead(200);
      res.end(`Hello from Worker process ${process.pid}\n`);
    })
    .listen(8000);

  console.log(`Worker ${process.pid} started.`);
}

5. Summary: Key Trade-offs

Feature	Single Server	Cluster
Complexity	Low	High (Need LB, State Management)
Cost	Fixed	Pay for what you use
Fault Tolerance	Zero	High (Redundant nodes)
Maintenance	Single point of failure	Nodes can be updated one by one (Rolling updates)

Clustering Fundamentals: Scaling Through Unity ​

1. Why do we need Clusters? ​

Key Benefits: ​

2. Types of Clusters ​

A. Load Balancing Clusters ​

B. High Availability (HA) Clusters ​

C. Compute (HPC) Clusters ​

3. High-Level Architecture ​

4. Implementation Example: Server-Level Clustering ​

5. Summary: Key Trade-offs ​