Container Runtime Hierarchy

NOTE This module explores the core principles of Container Runtime Hierarchy, deriving solutions from first principles and hardware constraints to build world-class, production-ready expertise.

1. The Monolith Problem (A War Story)

Before 2016, Docker was a monolith. One binary (dockerd) did everything: downloading, unpacking, networking, and running containers.

The Real-World Problem: Imagine you were running a production server in 2015 with 50 critical containers. The Docker daemon (dockerd) needed a security patch. You upgraded Docker, which required restarting the daemon. The Disaster: Because dockerd was a single massive monolith, restarting it immediately killed all 50 running containers. A simple upgrade caused a total system outage.

To fix this, Docker was violently split into smaller, specialized components. This separation of concerns allows you to upgrade or crash the main Docker daemon without affecting the running containers.

2. Anatomy of the Stack (The Restaurant Analogy)

Today, Docker is a modular stack. To understand how it works, let’s map the components to a high-end restaurant:

Docker CLI (The Waiter): The user interface. You say “I want a container” (the order). It sends an HTTP JSON request to the daemon.
Dockerd (The Manager): The high-level manager. Handles the front-of-house: seating (networks), pantry (images), and storage (volumes).
Containerd (The Head Chef): The industry standard container runtime. Manages the kitchen lifecycle (Start/Stop/Kill). It doesn’t cook the food itself; it delegates.
Containerd-Shim (The Heat Lamp): A tiny process that sits between the Chef and the Cook. If the Head Chef (containerd) leaves or crashes, the Shim stays behind to hold the container’s standard I/O (keep the food warm) so the container doesn’t die.
Runc (The Line Cook): The low-level runtime. It rapidly spawns the process, sets up the physical isolation (Namespaces/Cgroups), and immediately exits to save memory.
Kernel (The Stove): The underlying Linux operating system providing the actual isolation features.

3. Interactive: Runtime Flow Simulator

Trace the path of a docker run command.

Docker CLI

→

Dockerd

→

Containerd

→

Shim

→

Runc

→

Kernel

      Waiting for command...
  

4. Code Examples

1. Go Implementation (Containerd Client)

This is how Kubernetes (via CRI) talks to containerd directly, bypassing Docker entirely.

Go

package main

import (
	"context"
	"fmt"
	"github.com/containerd/containerd"
	"github.com/containerd/containerd/namespaces"
)

func main() {
	// 1. Connect to Containerd Socket
	client, err := containerd.New("/run/containerd/containerd.sock")
	if err != nil {
		panic(err)
	}
	defer client.Close()

	ctx := namespaces.WithNamespace(context.Background(), "default")

	// 2. Pull Image
	image, err := client.Pull(ctx, "docker.io/library/redis:alpine", containerd.WithPullUnpack)
	if err != nil {
		panic(err)
	}

	// 3. Create Container (Metadata)
	container, err := client.NewContainer(ctx, "redis-server", containerd.WithNewSnapshot("redis-snapshot", image))
	if err != nil {
		panic(err)
	}

	// 4. Create Task (Process) -> Calls Runc
	task, err := container.NewTask(ctx, containerd.Stdio)
	if err != nil {
		panic(err)
	}

	fmt.Printf("Task PID: %d\n", task.Pid())
	task.Start(ctx)
}

2. Java Implementation (Docker Socket)

Docker exposes a Unix Domain Socket at /var/run/docker.sock. Java can talk to it directly using HTTP. This is how tools like Testcontainers work.

Java

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.net.UnixDomainSocketAddress; // Java 16+

public class DockerClient {

    public static void main(String[] args) throws Exception {
        // 1. Create HTTP Client over Unix Socket
        HttpClient client = HttpClient.newBuilder().build();

        // 2. Construct Request (List Containers)
        // Note: Java 16 introduced UnixDomainSocketAddress
        // For older Java, use a library like 'junixsocket'

        HttpRequest request = HttpRequest.newBuilder()
            .uri(new URI("http://localhost/containers/json"))
            .header("Content-Type", "application/json")
            .GET()
            .build();

        // This is pseudo-code for the transport layer
        // In reality, you need a custom BodyPublisher for Unix Sockets
        System.out.println("Sending GET /containers/json to Docker Daemon...");

        // Response would be JSON: [{"Id": "a1b2...", "Image": "redis", ...}]
    }
}

NOTE Why the Shim? The shim allows containerd to exit (for upgrades) without killing the containers it started. The shim becomes the new parent of the container process, holding its STDIN/STDOUT open.

5. First Principles: Why Layers?

Why separate dockerd from containerd from runc?

Standardization (OCI): runc is the reference implementation of the OCI Runtime Spec. Because the stack is layered, anyone can write a replacement for the lowest level (e.g., kata-runtime for hardware VM isolation, gvisor for user-space sandboxing) without having to rewrite the Docker CLI or Image management.
Stability (Daemonless Containers): The massive Docker Daemon can crash or upgrade, but containerd and the shims keep the containers running. By using fork/exec syscalls, the Shim decouples the parent-child process tree.
Memory Efficiency: runc exits immediately after setting up the kernel primitives. If you have 1,000 containers, you do not have 1,000 runc processes consuming RAM. You only have lightweight shims.

Container Runtime Hierarchy

Container Runtime Hierarchy

1. The Monolith Problem (A War Story)

2. Anatomy of the Stack (The Restaurant Analogy)

3. Interactive: Runtime Flow Simulator

4. Code Examples

1. Go Implementation (Containerd Client)

Go

2. Java Implementation (Docker Socket)

Java

5. First Principles: Why Layers?

Found this lesson helpful?