Introduction to Arrays & Lists
[!NOTE] This module derives the behavior of Arrays & Lists from first principles and hardware constraints: memory layout, O(1) access, geometric resizing, and amortized analysis.
1. The Hook: The Physics of Continuous Memory
To understand Arrays, you must first understand your computer’s memory (RAM). Imagine RAM as a single, infinitely long street of houses.
- Every house has a unique number (Address).
- Every house is exactly the same size (e.g., 1 byte or 8 bits).
- You can teleport to any house instantly if you know its number.
This “teleportation” is why Arrays are fast. It’s not magic; it’s physics.
2. The 4 Pillars of Array Mastery
| Pillar | Focus | Why it matters |
|---|---|---|
| 1. Intuition | Continuous Blocks | Visualizing data as a packed line of houses. |
| 2. Rigor | Amortized Proofs | Proving that “expensive” resizes don’t ruin performance. |
| 3. Hardware | Cache Locality | Leveraging the CPU’s ability to fetch neighbors. |
| 4. Patterns | Windows & Pointers | Techniques to avoid O(N²) brute force. |
3. The O(1) Access Formula
When you declare an array `int[] arr = new int[5]`, the runtime reserves a contiguous block of empty houses. Let’s say it starts at memory address 1000.
If an integer takes 4 bytes, where is arr[3]?
We don’t need to search. We just do the math:
For arr[3]:
Address = 1000 + (3 × 4) = 1012
The CPU calculates 1012 in one cycle and teleports there. This is why Array Access is O(1). It doesn’t matter if the array has 5 elements or 5 billion. The math takes the same amount of time.
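The address arithmetic above can be written out directly. This is a minimal sketch (`elementAddress` is a hypothetical helper, not a real API; languages perform this math for you inside the indexing operator):

```go
package main

import "fmt"

// elementAddress computes where arr[index] lives, given the array's
// base address and the size of each element in bytes.
// This mirrors what the CPU does for every array access.
func elementAddress(base, index, elemSize int) int {
	return base + index*elemSize
}

func main() {
	// 4-byte ints starting at address 1000, as in the example above.
	fmt.Println(elementAddress(1000, 3, 4)) // 1012
	// One multiply and one add, regardless of how long the array is.
	fmt.Println(elementAddress(1000, 1_000_000, 4))
}
```

Note that the cost of the computation depends only on the formula, not on the array length: that is the whole content of the O(1) claim.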
4. The Static Problem
Standard arrays are Static. You must decide their size before you use them.
“I want 5 slots.” → OS gives you addresses 1000-1019.
The Crisis: What happens when you want to add a 6th item?
You can’t just take address 1020. That “house” might already be owned by your web browser, or Spotify, or the OS itself. Memory is crowded.
To fix this, we need a smarter data structure: the Dynamic Array (Java ArrayList, Go Slice, Python List, C++ std::vector).
5. The Dynamic Solution: Geometric Expansion
A Dynamic Array is just a Static Array that knows how to grow. But it can’t grow in place. It must move.
The Algorithm
When the array is full (Size == Capacity):
- Allocate: Find a new contiguous block of memory with double the capacity (2N).
- Copy: Copy all N existing elements to the new block.
- Free: Delete the old block.
- Insert: Add the new element.
Why Double? (The Intuition)
Why not just add 10 extra slots? Or 100? If we only added a constant amount (e.g., +10), we would run out of space constantly: once the array is large, every 10 insertions would force a full O(N) copy. Over N insertions that adds up to O(N²) total work, turning our “fast” array into a disaster.
By Doubling (Geometric Progression), we ensure that resizes happen rarely. As the array gets bigger, the time between resizes gets longer.
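The difference between constant and geometric growth can be measured with a small simulation. This sketch (the helper name `copiesForGrowth` is my own, not from any library) counts only the element copies triggered by resizes:

```go
package main

import "fmt"

// copiesForGrowth simulates inserting n elements and returns the total
// number of element copies caused by resizes, for a given growth rule.
func copiesForGrowth(n int, grow func(capacity int) int) int {
	size, capacity, copies := 0, 1, 0
	for i := 0; i < n; i++ {
		if size == capacity {
			copies += size // a resize copies every existing element
			capacity = grow(capacity)
		}
		size++
	}
	return copies
}

func main() {
	n := 100_000
	constant := copiesForGrowth(n, func(c int) int { return c + 10 })
	doubling := copiesForGrowth(n, func(c int) int { return 2 * c })
	fmt.Println("constant +10:", constant) // on the order of N²/20
	fmt.Println("doubling:    ", doubling) // stays below 2N
}
```

With doubling, total copies stay bounded by roughly 2N; with a fixed +10 increment they grow quadratically, which is exactly the O(N²) disaster described above.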
6. Mathematical Proof: Amortized Analysis
“Wait,” you say. “Copying N elements takes O(N) time. Doesn’t that make insertion slow?” Yes, occasionally. But on average, it is still O(1).
Let’s prove it using the Aggregate Method. Imagine we start with an empty array and insert N elements (where N is a power of 2).
The Cost:
- Normal Insert: Cost is 1 (just writing the value).
- Resize: Cost is K (copying K elements).
Let’s trace the costs:
- Insert 1: Cost 1. (Size 1, Cap 1)
- Insert 2: Resize (Copy 1) + Insert 1 = Cost 2. (Size 2, Cap 2)
- Insert 3: Resize (Copy 2) + Insert 1 = Cost 3. (Size 3, Cap 4)
- Insert 4: Cost 1.
- Insert 5: Resize (Copy 4) + Insert 1 = Cost 5. (Size 5, Cap 8) …
- Insert 9: Resize (Copy 8) + Insert 1 = Cost 9.
Total Cost for N insertions = (Sum of all Normal Inserts) + (Sum of all Copies)
- Normal Inserts: N (Every element is inserted once).
- Copies: 1 + 2 + 4 + 8 + … + N/2.
This is a Geometric Series: 1 + 2 + 4 + 8 + … + N/2 = N − 1 ≈ N
Total Work ≈ N (inserts) + N (copies) = 2N.
Average Cost per Insert:
2N / N = 2
Therefore, the Amortized Time Complexity is O(1).
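The aggregate argument above can be checked empirically. This sketch (the function name `totalInsertCost` is illustrative) counts one unit per write plus one unit per copied element, exactly as in the trace:

```go
package main

import "fmt"

// totalInsertCost counts the aggregate cost of n appends into a doubling
// array: 1 unit per write, plus 1 unit per element copied during a resize.
func totalInsertCost(n int) int {
	size, capacity, cost := 0, 1, 0
	for i := 0; i < n; i++ {
		if size == capacity {
			cost += size // copy all existing elements to the new block
			capacity *= 2
		}
		cost++ // the write itself
		size++
	}
	return cost
}

func main() {
	for _, n := range []int{1 << 10, 1 << 16, 1 << 20} {
		total := totalInsertCost(n)
		// For N a power of two: N writes + (N - 1) copies = 2N - 1.
		fmt.Printf("n=%d total=%d avg=%.4f\n", n, total, float64(total)/float64(n))
	}
}
```

For every power-of-two N the total is exactly 2N − 1, so the average cost per insert converges to 2, matching the proof.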
7. Interactive Visualizer: The Resize Cost
See the “Copy Cost” in action.
- Green: Free space.
- Blue: Occupied space.
- Flash: The expensive copy operation.
8. Code: Implementation from Scratch
How do we actually build an `ArrayList`? We wrap a static array and manage the resizing logic manually.
Java
```java
public class DynamicArray<T> {
    private Object[] data;
    private int size;
    private int capacity;

    public DynamicArray(int initialCapacity) {
        this.capacity = initialCapacity;
        this.size = 0;
        this.data = new Object[initialCapacity];
    }

    @SuppressWarnings("unchecked")
    public T get(int index) {
        if (index < 0 || index >= size) throw new IndexOutOfBoundsException();
        return (T) data[index]; // O(1) access: pure address arithmetic
    }

    public void add(T element) {
        if (size == capacity) {
            resize(); // The expensive O(N) part, but it is rare
        }
        data[size] = element; // O(1) insert
        size++;
    }

    private void resize() {
        capacity *= 2; // Double the capacity (geometric growth)
        Object[] newData = new Object[capacity];
        // O(N) copy: move every element to the new block
        for (int i = 0; i < size; i++) {
            newData[i] = data[i];
        }
        data = newData; // Pointer swap; the old block becomes garbage
    }
}
```
Go
```go
package main

import "fmt"

type DynamicArray struct {
	data []int // Slice used as the underlying storage for the demo, but treated as a static block
	size int
	cap  int
}

func NewDynamicArray(initialCap int) *DynamicArray {
	return &DynamicArray{
		data: make([]int, initialCap), // Allocate the "static" block
		size: 0,
		cap:  initialCap,
	}
}

func (da *DynamicArray) Add(element int) {
	if da.size == da.cap {
		da.resize() // The expensive O(N) part, but it is rare
	}
	da.data[da.size] = element // O(1) insert
	da.size++
}

func (da *DynamicArray) resize() {
	newCap := da.cap * 2 // Double the capacity (geometric growth)
	newData := make([]int, newCap) // Allocate the new block
	// O(N) copy: move every element to the new block
	for i := 0; i < da.size; i++ {
		newData[i] = da.data[i]
	}
	da.data = newData
	da.cap = newCap
}

func main() {
	da := NewDynamicArray(2)
	da.Add(10)
	da.Add(20)
	da.Add(30) // Triggers a resize from capacity 2 to 4
	fmt.Println(da.data) // [10 20 30 0]
}
```
[!NOTE] Java Specific: `java.util.ArrayList` uses a default initial capacity of 10 and grows by 50% (`newCapacity = oldCapacity + (oldCapacity >> 1)`) when full.
[!NOTE] Go Specific: Go slices are small headers (pointer, length, capacity) referring to an underlying array. When `append` exceeds the capacity, Go typically doubles the capacity for small slices, then switches to a more gradual growth factor above a certain threshold.
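Go's growth behavior can be observed directly. The exact growth factors are a runtime implementation detail that varies between Go versions, so this sketch only prints what it sees rather than asserting specific capacities:

```go
package main

import "fmt"

func main() {
	// Append repeatedly and report each time the capacity jumps.
	// The specific capacities printed depend on the Go runtime version.
	s := make([]int, 0)
	prevCap := cap(s)
	for i := 0; i < 2000; i++ {
		s = append(s, i)
		if cap(s) != prevCap {
			fmt.Printf("len=%d: cap grew %d -> %d\n", len(s), prevCap, cap(s))
			prevCap = cap(s)
		}
	}
}
```

Whatever the exact factors, two invariants always hold: capacity never shrinks during appends, and it always stays at least as large as the length.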
9. Summary
- O(1) Access is due to RAM’s physical address math: Address = Base + (Index × Size).
- Static Arrays are fixed because memory neighbors are occupied.
- Dynamic Arrays solve this by moving to a larger neighborhood (Doubling).
- Amortized Analysis proves that despite occasional O(N) copies, the average insert is O(1).
10. Deep Dive Strategy Lab: Introduction
Intuition Through Analogy
Think of this chapter like organizing items on a warehouse shelf. The goal is not to memorize a fixed trick, but to repeatedly answer:
- What is the bottleneck?
- Which constraint dominates (time, memory, latency, correctness)?
- Which representation makes the bottleneck easier to eliminate?