
Master LRU cache design: O(1) get/put with hash map + doubly linked list. Critical for interviews and production caching systems.

TL;DR

An LRU cache pairs a hash map (key to node) with a doubly linked list (access order) to deliver O(1) get and put. Dummy head/tail sentinels eliminate edge cases. On access, remove the node from its current position and re-insert at the front. On eviction, remove tail.prev and delete its key from the hash map. Python’s OrderedDict gives you this for free with move_to_end() and popitem(last=False). This problem is a classic exercise in linked list pointer manipulation for interviews, and the same pattern powers real-world systems like prediction caches, feature stores, and CDN edge caching. See also add two numbers for more linked list practice.

Problem Statement

Design a data structure that follows the constraints of a Least Recently Used (LRU) cache.

Implement the LRUCache class:

  • LRUCache(int capacity) Initialize the LRU cache with positive size capacity.
  • int get(int key) Return the value of the key if the key exists, otherwise return -1.
  • void put(int key, int value) Update the value of the key if the key exists. Otherwise, add the key-value pair to the cache. If the number of keys exceeds the capacity from this operation, evict the least recently used key.

The functions get and put must each run in O(1) average time complexity.

Examples

Input:
["LRUCache", "put", "put", "get", "put", "get", "put", "get", "get", "get"]
[[2], [1, 1], [2, 2], [1], [3, 3], [2], [4, 4], [1], [3], [4]]

Output:
[null, null, null, 1, null, -1, null, -1, 3, 4]

Explanation:
LRUCache lRUCache = new LRUCache(2);
lRUCache.put(1, 1); // cache is {1=1}
lRUCache.put(2, 2); // cache is {1=1, 2=2}
lRUCache.get(1); // return 1
lRUCache.put(3, 3); // LRU key was 2, evicts key 2, cache is {1=1, 3=3}
lRUCache.get(2); // returns -1 (not found)
lRUCache.put(4, 4); // LRU key was 1, evicts key 1, cache is {4=4, 3=3}
lRUCache.get(1); // return -1 (not found)
lRUCache.get(3); // return 3
lRUCache.get(4); // return 4

Constraints

  • 1 <= capacity <= 3000
  • 0 <= key <= 10^4
  • 0 <= value <= 10^5
  • At most 2 * 10^5 calls will be made to get and put

Understanding LRU Cache

What is LRU?

Least Recently Used (LRU) is a cache eviction policy that removes the least recently accessed item when the cache is full.

Access = Read or Write

When you:

  • get(key) - The key becomes most recently used
  • put(key, value) - The key becomes most recently used
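
To see the policy in action before building it, Python’s built-in functools.lru_cache applies the same eviction rule to function results (a quick demo, separate from the implementation we build below):

from functools import lru_cache

@lru_cache(maxsize=2)  # keep only the 2 most recently used results
def square(n):
    print(f"computing {n}")
    return n * n

square(1)  # computing 1
square(2)  # computing 2
square(1)  # cache hit: nothing printed; 1 becomes most recently used
square(3)  # computing 3 -> evicts 2, the least recently used
square(2)  # computing 2 again: it was evicted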

Why LRU? The Intuition

Imagine you’re organizing files on your desk. You have limited space, so you stack recent documents on top. When you need space for a new document, you remove the one at the bottom (least recently used).

The temporal locality principle: Recently accessed data is likely to be accessed again soon.

Example scenario:

You visit websites: A → B → C → D → A → B

Notice that A and B are accessed multiple times.
LRU keeps these "hot" items in cache, evicting C or D if space is needed.

Why not other policies?

Policy   How it works                  Drawback
FIFO     Remove oldest inserted        Doesn’t consider access patterns
Random   Remove random item            No intelligence, unpredictable
LFU      Remove least frequently used  Complex to implement, doesn’t adapt to changing patterns
LRU      Remove least recently used    ✅ Simple, adaptive, good hit rates
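
To make the comparison concrete, here is a small simulator (the hits helper is an illustrative sketch written for this comparison, not a library function) showing LRU beating FIFO on a trace with one "hot" key:

from collections import deque

def hits(trace, capacity, policy):
    """Count cache hits for a FIFO or LRU eviction policy."""
    cache, order, h = set(), deque(), 0
    for item in trace:
        if item in cache:
            h += 1
            if policy == 'LRU':
                order.remove(item)   # LRU refreshes recency on a hit
                order.append(item)
        else:
            if len(cache) == capacity:
                # Evict the front: oldest insertion (FIFO) or least recent access (LRU)
                cache.discard(order.popleft())
            cache.add(item)
            order.append(item)
    return h

trace = list("ABACADAE")  # 'A' is hot; B, C, D, E are one-offs
print(hits(trace, 2, 'FIFO'))  # 2 hits
print(hits(trace, 2, 'LRU'))   # 3 hits: LRU keeps the hot key cached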

Real-World Examples

1. Browser Cache When you visit websites, your browser caches images and assets. If the cache fills up, it removes pages you haven’t visited in a while (LRU).

2. Database Query Cache Databases cache query results. Popular queries (accessed recently) stay in cache, while old queries are evicted.

3. CDN Edge Caching Content Delivery Networks cache content at edge locations. Popular content (recently accessed) stays cached close to users.

4. Operating System Memory When RAM is full, OS moves least recently used pages to disk (swap/page file).

The Challenge: Achieving O(1) Operations

The problem: We need both:

  • Fast lookup - O(1) to check if key exists
  • Fast reordering - O(1) to move item to “most recent”
  • Fast eviction - O(1) to remove “least recent”

Why is this hard?

If we use only one data structure:

  • Array: Lookup O(n), reordering O(n) ❌
  • Hash Map: Lookup O(1), but no ordering ❌
  • Linked List: Reordering O(1), but lookup O(n) ❌

The solution: Combine both!

  • Hash Map: For O(1) lookup
  • Doubly Linked List: For O(1) reordering

Understanding the Data Structure Choice

Why Doubly Linked List?

Let’s think through what we need:

  1. Add to front (most recent): O(1)
    • Need to know the front ✓
  2. Remove from back (least recent): O(1)
    • Need to know the back ✓
    • Need prev pointer to update second-to-last node ✓
  3. Remove from middle (when accessed): O(1)
    • Need prev pointer to update previous node ✓
    • Need next pointer to update next node ✓

Singly linked list won’t work because:

  • Can’t update prev.next when removing a node from middle
  • Would need to traverse from head to find previous node: O(n) ❌

Doubly linked list gives us:

prev ← [Node] → next

Can update both directions without traversal!
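
As a sketch of what those two pointers buy us (assuming nodes shaped like the Node class defined later), removing a node we already hold a reference to is just two pointer updates:

def unlink(node):
    node.prev.next = node.next  # predecessor skips over node
    node.next.prev = node.prev  # successor points back past node

# With a singly linked list we would first have to walk from the head
# to find node's predecessor: O(n), which defeats the O(1) goal.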

Why Hash Map?

We need O(1) lookup by key. Hash map provides this:

cache[key] # O(1) average case

Without hash map, we’d need to traverse the list: O(n) ❌

The Dummy Node Trick

One of the most important patterns in linked list problems!

Problem without dummy nodes:

# Edge cases everywhere!
if self.head is None:
    ...  # Special handling for empty list
if node == self.head:
    ...  # Special handling for removing head
if node == self.tail:
    ...  # Special handling for removing tail

Solution with dummy nodes:

# Dummy head and tail always exist
self.head = Node() # Dummy
self.tail = Node() # Dummy
self.head.next = self.tail
self.tail.prev = self.head

# Now we NEVER have to check for None!
# Always have prev and next pointers

Why this works:

Before (without dummy):
head=None OR head → [1] → [2] → None
 ↑ Special case!

After (with dummy):
Dummy Head ↔ [1] ↔ [2] ↔ Dummy Tail
 ↑ Always have structure!

Empty cache:
Dummy Head ↔ Dummy Tail
 ↑ Still valid!

Benefits:

  1. No null checks needed
  2. Same logic for all operations
  3. Fewer bugs
  4. Cleaner code

Visualization

Initial: capacity=3, cache is empty

put(1, 'a')
Cache: [1='a']
 ↑ most recent

put(2, 'b')
Cache: [2='b'] -> [1='a']
 ↑ most recent

put(3, 'c')
Cache: [3='c'] -> [2='b'] -> [1='a']
 ↑ most recent ↑ least recent

get(1) # Access 1, move to front
Cache: [1='a'] -> [3='c'] -> [2='b']
 ↑ most recent ↑ least recent

put(4, 'd') # Cache full, evict 2 (LRU)
Cache: [4='d'] -> [1='a'] -> [3='c']
 ↑ most recent ↑ least recent

Solution 1: Hash Map + Doubly Linked List (Optimal)

Intuition

To achieve O(1) for both get and put:

  1. Hash Map - O(1) lookup by key
  2. Doubly Linked List - O(1) insertion/deletion from any position

Why doubly linked list?

  • Move node to front: O(1) (unlink it, then insert after head)
  • Remove from back: O(1) (tail.prev gives the last node directly)
  • Remove from middle: O(1) (prev and next pointers let us bypass the node)

Data Structure Design

┌─────────────────────────────────────────────┐
│                  LRU CACHE                  │
├─────────────────────────────────────────────┤
│                                             │
│  Hash Map (key -> node)                     │
│  ┌───────────────────────────────────────┐  │
│  │ key=1 -> Node(1, 'a')                │  │
│  │ key=2 -> Node(2, 'b')                │  │
│  │ key=3 -> Node(3, 'c')                │  │
│  └───────────────────────────────────────┘  │
│                                             │
│  Doubly Linked List (access order)          │
│  ┌───────────────────────────────────────┐  │
│  │ Dummy Head <-> [3] <-> [2] <-> [1]    │  │
│  │             <-> Dummy Tail            │  │
│  │                ↑ MRU           ↑ LRU  │  │
│  └───────────────────────────────────────┘  │
│                                             │
└─────────────────────────────────────────────┘

Implementation

Before we dive into code, let’s understand the step-by-step approach:

Step 1: Define the Node Each node needs:

  • key: To identify the item
  • value: The cached data
  • prev: Pointer to previous node (for doubly linked list)
  • next: Pointer to next node (for doubly linked list)

Why store key in the node? When we evict the LRU node, we need its key to delete it from the hash map!

class Node:
    """
    Doubly linked list node

    This is the building block of our cache.
    Each node stores a key-value pair and links to neighbors.

    Why doubly linked?
    - `prev` lets us remove nodes from anywhere in O(1)
    - `next` lets us traverse forward

    Think of it like a train car:
        [prev car] ← [this car] → [next car]
        """
    def __init__(self, key=0, value=0):
        self.key = key # Needed to delete from hash map when evicting
        self.value = value # The cached data
        self.prev = None # Link to previous node
        self.next = None # Link to next node

class LRUCache:
    """
    LRU Cache with O(1) get and put

    Data structures:
        - Hash map: key -> node (for O(1) lookup)
        - Doubly linked list: nodes in access order (for O(1) reorder)

    Layout:
        head <-> [MRU] <-> ... <-> [LRU] <-> tail

    Most recently used (MRU) is after head
    Least recently used (LRU) is before tail
    """

    def __init__(self, capacity: int):
        """
        Initialize LRU cache

        Time: O(1)
        Space: O(capacity)
        """
        self.capacity = capacity
        self.cache = {} # key -> node

        # Dummy head and tail for easier operations
        self.head = Node()
        self.tail = Node()

        # Connect head and tail
        self.head.next = self.tail
        self.tail.prev = self.head

    def get(self, key: int) -> int:
        """
        Get value by key

        If exists, move to front (most recently used)

        Time: O(1)
        Space: O(1)

        Why move to front on get()?
        Because accessing the key makes it "recently used"!
        LRU means "least RECENTLY USED", so we need to track when things are accessed.
        """
        # Step 1: Check if key exists (O(1) hash map lookup)
        if key not in self.cache:
            return -1 # Not found

        # Step 2: Get the node from hash map (O(1))
        node = self.cache[key]

        # Step 3: Move to front (O(1))
        # Why? Because we just accessed it, making it "most recently used"
        #
        # Before: head ↔ [A] ↔ [B] ↔ [C] ↔ tail
        #                       ↑ We want this
        #
        # After:  head ↔ [B] ↔ [A] ↔ [C] ↔ tail
        #                 ↑ Most recent now!

        self._remove(node)        # Remove from current position
        self._add_to_front(node)  # Add at front (most recent)

        return node.value

    def put(self, key: int, value: int) -> None:
        """
        Put key-value pair

        If key exists, update value and move to front
        If key doesn't exist, add to front
        If capacity exceeded, evict LRU (before tail)

        Time: O(1)
        Space: O(1)

        This is the heart of the LRU cache!
        """
        # Case 1: Key already exists
        # We need to UPDATE the value and move to front
        if key in self.cache:
            node = self.cache[key]
            node.value = value # Update value

            # Move to front (most recently used)
            # Why? Because we just wrote to it!
            self._remove(node)
            self._add_to_front(node)

        # Case 2: Key doesn't exist - NEW insertion
        else:
            # Create new node
            node = Node(key, value)

            # Add to hash map (for O(1) lookup)
            self.cache[key] = node

            # Add to front of list (most recently used)
            self._add_to_front(node)

            # CRITICAL: Check if we exceeded capacity
            if len(self.cache) > self.capacity:
                # We have ONE TOO MANY items!
                # Must evict the LRU item (the one before tail)

                # Why tail.prev?
                # Because dummy tail is at the end, so tail.prev is the actual LRU item
                lru_node = self.tail.prev

                # Remove from list
                self._remove(lru_node)

                # IMPORTANT: Also remove from hash map!
                # Many people forget this step in interviews!
                del self.cache[lru_node.key] # lru_node.key tells us which key to delete

    def _remove(self, node: Node) -> None:
        """
        Remove node from doubly linked list

        Time: O(1)

        This is the magic of doubly linked lists!
        We can remove ANY node in O(1) if we have a reference to it.
        """
        # Get neighbors
        prev_node = node.prev
        next_node = node.next

        # Connect neighbors to each other, bypassing node
        # Before: prev ↔ node ↔ next
        # After: prev ↔↔↔↔↔↔ next
        prev_node.next = next_node
        next_node.prev = prev_node

        # Note: We don't need to set node.prev or node.next to None
        # because we'll reuse this node or it will be garbage collected

    def _add_to_front(self, node: Node) -> None:
        """
        Add node to front (after head, before first real node)

        Time: O(1)

        We always add to the front because that's the "most recently used" position.

        Visual:
            Before: head ↔ [A] ↔ [B] ↔ tail
            After: head ↔ [node] ↔ [A] ↔ [B] ↔ tail
                           ↑ Most recent!
        """
        # Step 1: Set node's pointers
        node.prev = self.head
        node.next = self.head.next  # This is the old first node

        # Step 2: Update neighbors to point to node
        # Order matters here! Update in the right sequence:
        # First: old first node's prev should point to new node
        self.head.next.prev = node
        # Second: head's next should point to new node
        self.head.next = node

        # Why this order?
        # If we did head.next = node first, we'd lose the reference to old first node!

    def __repr__(self):
        """String representation for debugging"""
        items = []
        current = self.head.next

        while current != self.tail:
            items.append(f"{current.key}={current.value}")
            current = current.next

            return f"LRUCache({self.capacity}): [" + " -> ".join(items) + "]"

# Example usage
cache = LRUCache(2)

cache.put(1, 1)
print(cache)        # [1=1]

cache.put(2, 2)
print(cache)        # [2=2 -> 1=1]

print(cache.get(1)) # Returns 1
print(cache)        # [1=1 -> 2=2] (1 moved to front)

cache.put(3, 3)     # Evicts key 2
print(cache)        # [3=3 -> 1=1]

print(cache.get(2)) # Returns -1 (not found)

cache.put(4, 4)     # Evicts key 1
print(cache)        # [4=4 -> 3=3]

print(cache.get(1)) # Returns -1 (not found)
print(cache.get(3)) # Returns 3
print(cache.get(4)) # Returns 4

Complexity Analysis

Time Complexity:

  • get: O(1) - Hash map lookup + linked list reorder
  • put: O(1) - Hash map insert + linked list operations

Space Complexity:

  • O(capacity) - Store up to capacity nodes

Solution 2: OrderedDict (Python Built-in)

Python’s collections.OrderedDict maintains insertion order and provides move_to_end() for O(1) reordering.

from collections import OrderedDict

class LRUCache:
    """
    LRU Cache using OrderedDict

    Simpler implementation but same complexity
    """

    def __init__(self, capacity: int):
        self.cache = OrderedDict()
        self.capacity = capacity

    def get(self, key: int) -> int:
        """
        Get value and move to end (most recent)

        Time: O(1)
        """
        if key not in self.cache:
            return -1

        # Move to end (most recently used)
        self.cache.move_to_end(key)

        return self.cache[key]

    def put(self, key: int, value: int) -> None:
        """
        Put key-value and move to end

        Time: O(1)
        """
        if key in self.cache:
            # Existing key: refresh its position
            self.cache.move_to_end(key)

        self.cache[key] = value

        # Evict LRU if over capacity
        if len(self.cache) > self.capacity:
            # popitem(last=False) removes first (oldest) item
            self.cache.popitem(last=False)

Pros:

  • Clean and concise
  • Good for interviews if allowed

Cons:

  • Less educational (hides implementation details)
  • May not be allowed in interviews
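
If OrderedDict is not allowed but plain dicts are, regular dicts preserve insertion order in Python 3.7+; a minimal sketch of the same idea, where pop-and-reinsert plays the role of move_to_end():

class LRUCacheDict:
    def __init__(self, capacity: int):
        self.cache = {}  # insertion-ordered in Python 3.7+
        self.capacity = capacity

    def get(self, key: int) -> int:
        if key not in self.cache:
            return -1
        self.cache[key] = self.cache.pop(key)  # reinsert -> most recent
        return self.cache[key]

    def put(self, key: int, value: int) -> None:
        if key in self.cache:
            self.cache.pop(key)
        self.cache[key] = value
        if len(self.cache) > self.capacity:
            del self.cache[next(iter(self.cache))]  # first key = least recent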

Detailed Walkthrough

Let’s trace through a complete example step by step:

def trace_lru_cache():
    """
    Trace LRU cache operations with detailed output
    """
    cache = LRUCache(3)

    operations = [
    ('put', 1, 'apple'),
    ('put', 2, 'banana'),
    ('put', 3, 'cherry'),
    ('get', 1, None),
    ('put', 4, 'date'),
    ('get', 2, None),
    ('get', 3, None),
    ('put', 5, 'elderberry'),
    ]

    print("="*60)
    print("LRU CACHE TRACE (capacity=3)")
    print("="*60)

    for op in operations:
        if op[0] == 'put':
            _, key, value = op
            print(f"\nput({key}, '{value}')")
            cache.put(key, value)
        else:
            _, key, _ = op
            result = cache.get(key)
            print(f"\nget({key}) -> {result}")

        # Print cache state after every operation
        print(f" Cache: {cache}")
        print(f" Size: {len(cache.cache)}/{cache.capacity}")

trace_lru_cache()

Output:

============================================================
LRU CACHE TRACE (capacity=3)
============================================================

put(1, 'apple')
 Cache: [1=apple]
 Size: 1/3

put(2, 'banana')
 Cache: [2=banana -> 1=apple]
 Size: 2/3

put(3, 'cherry')
 Cache: [3=cherry -> 2=banana -> 1=apple]
 Size: 3/3

get(1) -> apple
 Cache: [1=apple -> 3=cherry -> 2=banana]
 Size: 3/3
 (1 moved to front)

put(4, 'date')
 Cache: [4=date -> 1=apple -> 3=cherry]
 Size: 3/3
 (2 evicted - was LRU)

get(2) -> -1
 Cache: [4=date -> 1=apple -> 3=cherry]
 Size: 3/3
 (2 not found)

get(3) -> cherry
 Cache: [3=cherry -> 4=date -> 1=apple]
 Size: 3/3
 (3 moved to front)

put(5, 'elderberry')
 Cache: [5=elderberry -> 3=cherry -> 4=date]
 Size: 3/3
 (1 evicted - was LRU)

Common Mistakes & Edge Cases

Mistake 1: Forgetting to Update on Get

# ❌ WRONG: Not moving the key to the front on get
def get(self, key):
    if key in self.cache:
        return self.cache[key].value  # Not moving to front!
    return -1

# ✅ CORRECT: Move to front on get
def get(self, key):
    if key not in self.cache:
        return -1

    node = self.cache[key]
    self._remove(node)
    self._add_to_front(node)
    return node.value

Mistake 2: Not Using Dummy Nodes

# ❌ WRONG: No dummy nodes - many edge cases
class LRUCache:
    def __init__(self, capacity):
        self.head = None # Can be None!
        self.tail = None # Can be None!

# ✅ CORRECT: Dummy nodes simplify logic
class LRUCache:
    def __init__(self, capacity):
        self.head = Node() # Always exists
        self.tail = Node() # Always exists
        self.head.next = self.tail
        self.tail.prev = self.head

Mistake 3: Incorrect Removal Logic

# ❌ WRONG: Forgetting to update both prev and next
def _remove(self, node):
    node.prev.next = node.next # Only updates next!

# ✅ CORRECT: Update both connections
def _remove(self, node):
    node.prev.next = node.next
    node.next.prev = node.prev

Edge Cases to Test

def test_edge_cases():
    """Test important edge cases"""

    # Edge case 1: Capacity of 1
    print("Test 1: Capacity 1")
    cache = LRUCache(1)
    cache.put(1, 1)
    cache.put(2, 2) # Should evict 1
    assert cache.get(1) == -1
    assert cache.get(2) == 2
    print("✓ Passed")

    # Edge case 2: Update existing key
    print("\nTest 2: Update existing key")
    cache = LRUCache(2)
    cache.put(1, 1)
    cache.put(2, 2)
    cache.put(1, 10) # Update 1
    assert cache.get(1) == 10
    print("✓ Passed")

    # Edge case 3: Get non-existent key
    print("\nTest 3: Get non-existent key")
    cache = LRUCache(2)
    assert cache.get(999) == -1
    print("✓ Passed")

    # Edge case 4: Fill to capacity
    print("\nTest 4: Fill to capacity")
    cache = LRUCache(3)
    cache.put(1, 1)
    cache.put(2, 2)
    cache.put(3, 3)
    cache.put(4, 4) # Should evict 1
    assert cache.get(1) == -1
    print("✓ Passed")

    # Edge case 5: Access pattern
    print("\nTest 5: Complex access pattern")
    cache = LRUCache(2)
    cache.put(2, 1)
    cache.put(1, 1)
    cache.put(2, 3)
    cache.put(4, 1)
    assert cache.get(1) == -1
    assert cache.get(2) == 3
    print("✓ Passed")

    print("\n" + "="*40)
    print("All edge case tests passed!")
    print("="*40)

test_edge_cases()

Production-Ready Implementation

Thread-Safe LRU Cache

import threading
from collections import OrderedDict

class ThreadSafeLRUCache:
    """
    Thread-safe LRU cache for concurrent access

    Uses RLock (reentrant lock) to protect cache operations
    """

    def __init__(self, capacity: int):
        self.cache = OrderedDict()
        self.capacity = capacity
        self.lock = threading.RLock()

        # Metrics
        self.hits = 0
        self.misses = 0

    def get(self, key: int) -> int:
        """Thread-safe get"""
        with self.lock:
            if key not in self.cache:
                self.misses += 1
                return -1

            self.hits += 1
            self.cache.move_to_end(key)
            return self.cache[key]

    def put(self, key: int, value: int) -> None:
        """Thread-safe put"""
        with self.lock:
            if key in self.cache:
                self.cache.move_to_end(key)

            self.cache[key] = value

            if len(self.cache) > self.capacity:
                self.cache.popitem(last=False)

    def get_stats(self):
        """Get cache statistics"""
        with self.lock:
            total = self.hits + self.misses
            hit_rate = self.hits / total if total > 0 else 0

            return {
                'hits': self.hits,
                'misses': self.misses,
                'total': total,
                'hit_rate': hit_rate,
                'size': len(self.cache),
                'capacity': self.capacity,
            }

# Test thread safety
import time
import random

def worker(cache, worker_id, operations=1000):
    """Worker thread that performs cache operations"""
    for _ in range(operations):
        key = random.randint(1, 20)

        if random.random() < 0.7:
            # 70% reads
            cache.get(key)
        else:
            # 30% writes
            cache.put(key, worker_id)

# Create cache and threads
cache = ThreadSafeLRUCache(capacity=10)
threads = []

print("Testing thread safety with 10 concurrent threads...")
start = time.time()

for i in range(10):
    t = threading.Thread(target=worker, args=(cache, i, 1000))
    threads.append(t)
    t.start()

for t in threads:
    t.join()

elapsed = time.time() - start

print(f"\nCompleted in {elapsed:.2f}s")
print("\nCache Statistics:")
stats = cache.get_stats()
for key, value in stats.items():
    if key == 'hit_rate':
        print(f" {key}: {value:.2%}")
    else:
        print(f" {key}: {value}")

LRU Cache with TTL (Time To Live)

import time
from collections import OrderedDict

class LRUCacheWithTTL:
    """
    LRU cache with time-based expiration

    Items expire after TTL seconds
    """

    def __init__(self, capacity: int, ttl: int = 300):
        """
        Args:
            capacity: Max number of items
            ttl: Time to live in seconds
        """
        self.capacity = capacity
        self.ttl = ttl
        self.cache = OrderedDict()
        self.timestamps = {}  # key -> insertion time

    def get(self, key: int) -> int:
        """Get with expiration check"""
        if key not in self.cache:
            return -1

        # Check if expired
        if time.time() - self.timestamps[key] > self.ttl:
            # Expired, remove
            del self.cache[key]
            del self.timestamps[key]
            return -1

        # Not expired, move to end
        self.cache.move_to_end(key)
        return self.cache[key]

    def put(self, key: int, value: int) -> None:
        """Put with timestamp"""
        if key in self.cache:
            self.cache.move_to_end(key)

        self.cache[key] = value
        self.timestamps[key] = time.time()

        # Evict LRU if over capacity
        if len(self.cache) > self.capacity:
            lru_key = next(iter(self.cache))
            del self.cache[lru_key]
            del self.timestamps[lru_key]

    def cleanup_expired(self):
        """Remove all expired items"""
        current_time = time.time()
        expired_keys = [
            key for key, timestamp in self.timestamps.items()
            if current_time - timestamp > self.ttl
        ]

        for key in expired_keys:
            del self.cache[key]
            del self.timestamps[key]

        return len(expired_keys)

# Example
cache = LRUCacheWithTTL(capacity=3, ttl=5)

cache.put(1, 'apple')
cache.put(2, 'banana')

print("Immediately after insert:")
print(f"get(1) = {cache.get(1)}")  # 'apple'

print("\nWait 6 seconds...")
time.sleep(6)

print("After TTL expired:")
print(f"get(1) = {cache.get(1)}")  # -1 (expired)

print(f"\nCleaned up {cache.cleanup_expired()} expired items")

Performance Benchmarking

import time
import random
import matplotlib.pyplot as plt
from collections import OrderedDict

class LRUCacheOrderedDict:
    """OrderedDict implementation for comparison"""
    def __init__(self, capacity):
        self.cache = OrderedDict()
        self.capacity = capacity

    def get(self, key):
        if key not in self.cache:
            return -1
        self.cache.move_to_end(key)
        return self.cache[key]

    def put(self, key, value):
        if key in self.cache:
            self.cache.move_to_end(key)
        self.cache[key] = value
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)

def benchmark_lru_implementations():
    """
    Compare performance of different LRU implementations
    """
    capacities = [10, 100, 1000, 10000]
    implementations = {
        'Custom (Doubly Linked List)': LRUCache,
        'OrderedDict': LRUCacheOrderedDict,
    }

    results = {name: [] for name in implementations}

    print("Benchmarking LRU Cache Implementations")
    print("="*60)

    for capacity in capacities:
        print(f"\nCapacity: {capacity}")

        for name, impl_class in implementations.items():
            cache = impl_class(capacity)

            # Generate workload: 70% reads, 30% writes
            num_operations = 10000
            operations = []
            for _ in range(num_operations):
                if random.random() < 0.7:
                    key = random.randint(0, capacity * 2)
                    operations.append(('get', key))
                else:
                    key = random.randint(0, capacity * 2)
                    value = random.randint(0, 1000)
                    operations.append(('put', key, value))

            # Benchmark
            start = time.perf_counter()
            for op in operations:
                if op[0] == 'get':
                    cache.get(op[1])
                else:
                    cache.put(op[1], op[2])
            elapsed = (time.perf_counter() - start) * 1000  # ms

            results[name].append(elapsed)
            print(f" {name:30s}: {elapsed:6.2f}ms")

    # Plot results
    plt.figure(figsize=(10, 6))
    for name, times in results.items():
        plt.plot(capacities, times, marker='o', label=name)
    plt.xlabel('Capacity')
    plt.ylabel('Time (ms) for 10,000 operations')
    plt.title('LRU Cache Performance Comparison')
    plt.legend()
    plt.grid(True)
    plt.xscale('log')
    plt.savefig('lru_benchmark.png')
    plt.close()

    print("\n" + "="*60)
    print("Benchmark complete! Plot saved to lru_benchmark.png")

benchmark_lru_implementations()

Connection to ML Systems

1. Model Prediction Cache

class ModelPredictionCache:
    """
    Cache model predictions to avoid recomputation

    Use case: Frequently requested predictions
    """

    def __init__(self, model, capacity=1000):
        self.model = model
        self.cache = LRUCache(capacity)

    def predict(self, features):
        """
        Predict with caching

        Args:
            features: tuple of feature values (must be hashable)

        Returns:
            prediction
        """
        # Create cache key from features
        cache_key = hash(features)

        # Try cache
        cached_prediction = self.cache.get(cache_key)
        if cached_prediction != -1:
            return cached_prediction

        # Cache miss: compute prediction
        prediction = self.model.predict([features])[0]

        # Store in cache
        self.cache.put(cache_key, prediction)

        return prediction

# Example
from sklearn.ensemble import RandomForestClassifier
import numpy as np

# Train model
X_train = np.random.randn(100, 5)
y_train = (X_train.sum(axis=1) > 0).astype(int)
model = RandomForestClassifier(n_estimators=10)
model.fit(X_train, y_train)

# Create cached predictor
cached_model = ModelPredictionCache(model, capacity=100)

# Make predictions
for _ in range(1000):
    features = tuple(np.random.randint(0, 10, size=5))
    prediction = cached_model.predict(features)

print("Cache statistics:")
print(f" Capacity: {cached_model.cache.capacity}")
print(f" Current size: {len(cached_model.cache.cache)}")

2. Feature Store Cache

class FeatureStoreCache:
    """
    Cache feature store lookups

    Feature stores can be slow (database/API calls)
    LRU cache reduces latency
    """

    def __init__(self, feature_store, capacity=10000):
        self.feature_store = feature_store
        self.cache = ThreadSafeLRUCache(capacity)

    def get_features(self, entity_id):
        """
        Get features for entity with caching

        Args:
            entity_id: Unique entity identifier

        Returns:
            Feature dictionary
        """
        # Try cache
        cached_features = self.cache.get(entity_id)
        if cached_features != -1:
            return cached_features

        # Cache miss: query feature store
        features = self.feature_store.query(entity_id)

        # Store in cache
        self.cache.put(entity_id, features)

        return features

    def get_cache_stats(self):
        """Get cache performance statistics"""
        return self.cache.get_stats()

# Example usage
class MockFeatureStore:
    """Mock feature store with latency"""
    def query(self, entity_id):
        time.sleep(0.001)  # Simulate 1ms latency
        return {'feature1': entity_id * 2, 'feature2': entity_id ** 2}

feature_store = MockFeatureStore()
cached_store = FeatureStoreCache(feature_store, capacity=1000)

# Simulate requests
start = time.time()
for _ in range(10000):
    entity_id = random.randint(1, 500)
    features = cached_store.get_features(entity_id)

elapsed = time.time() - start

print(f"Total time: {elapsed:.2f}s")
print("\nCache stats:")
stats = cached_store.get_cache_stats()
for key, value in stats.items():
    if key == 'hit_rate':
        print(f" {key}: {value:.2%}")
    else:
        print(f" {key}: {value}")

3. Embedding Cache

class EmbeddingCache:
    """
    Cache embeddings for frequently queried items

    Useful for recommendation systems, search, etc.
    """

    def __init__(self, embedding_model, capacity=10000):
        self.model = embedding_model
        self.cache = LRUCache(capacity)

    def get_embedding(self, item_id):
        """
        Get embedding with caching

        Computing embeddings can be expensive (neural network inference)
        """
        # Try cache
        cached_embedding = self.cache.get(item_id)
        if cached_embedding != -1:
            return cached_embedding

        # Cache miss: compute embedding
        embedding = self.model.encode(item_id)

        # Store in cache
        self.cache.put(item_id, embedding)

        return embedding

    def batch_get_embeddings(self, item_ids):
        """
        Get embeddings for multiple items

        Separate cache hits from misses for efficient batch processing
        """
        embeddings = {}
        cache_misses = []

        # Check cache
        for item_id in item_ids:
            cached = self.cache.get(item_id)
            if cached != -1:
                embeddings[item_id] = cached
            else:
                cache_misses.append(item_id)

        # Batch compute misses
        if cache_misses:
            batch_embeddings = self.model.batch_encode(cache_misses)

            for item_id, embedding in zip(cache_misses, batch_embeddings):
                self.cache.put(item_id, embedding)
                embeddings[item_id] = embedding

        return [embeddings[item_id] for item_id in item_ids]

Interview Tips

Discussion Points

  1. Why not just use a hash map?
    • Hash map gives O(1) lookup but doesn’t track access order
    • Need additional data structure for ordering
  2. Why doubly linked list instead of array?
    • Array: O(n) to remove from middle (see the timing sketch after this list)
    • Doubly linked list: O(1) to remove from any position
  3. Why dummy head/tail?
    • Simplifies edge cases
    • No null checks needed
    • Consistent operations
  4. Can you use a single data structure?
    • No, you need both:
      • Fast lookup: Hash map
      • Fast reordering: Linked list
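
For point 2, a quick timing sketch (illustrative; numbers vary by machine) of why an array-backed recency list breaks O(1): removing from the middle of a Python list shifts every later element.

import timeit

for n in (10_000, 100_000, 1_000_000):
    t = timeit.timeit(
        "xs.pop(len(xs) // 2); xs.append(0)",  # remove from middle, keep size stable
        setup=f"xs = list(range({n}))",
        number=1000,
    )
    print(f"n={n:>9,}: {t * 1000:.1f} ms for 1000 middle removals")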

Follow-up Questions

Q: How would you implement LFU (Least Frequently Used)?

from collections import OrderedDict

class LFUCache:
    """
    Least Frequently Used cache

    Evicts item with lowest access frequency
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.cache = {} # key -> (value, freq)
        self.freq_map = {} # freq -> OrderedDict of keys
        self.min_freq = 0

    def get(self, key):
        if key not in self.cache:
            return -1

        value, freq = self.cache[key]

        # Increment frequency
        self._increment_freq(key, value, freq)

        return value

    def put(self, key, value):
        if self.capacity == 0:
            return

        if key in self.cache:
            # Update existing
            _, freq = self.cache[key]
            self._increment_freq(key, value, freq)
        else:
            # Add new
            if len(self.cache) >= self.capacity:
                # Evict LFU
                self._evict()

            self.cache[key] = (value, 1)

            if 1 not in self.freq_map:
                self.freq_map[1] = OrderedDict()

            self.freq_map[1][key] = None
            self.min_freq = 1

    def _increment_freq(self, key, value, freq):
        """Move key to higher frequency bucket"""
        # Remove from current frequency bucket
        del self.freq_map[freq][key]

        if not self.freq_map[freq] and freq == self.min_freq:
            self.min_freq += 1

        # Add to next frequency bucket
        new_freq = freq + 1
        if new_freq not in self.freq_map:
            self.freq_map[new_freq] = OrderedDict()

        self.freq_map[new_freq][key] = None
        self.cache[key] = (value, new_freq)

    def _evict(self):
        """Evict least frequently used (and least recently used within that frequency)"""
        # Get first key from min frequency bucket
        key_to_evict = next(iter(self.freq_map[self.min_freq]))

        del self.freq_map[self.min_freq][key_to_evict]
        del self.cache[key_to_evict]

Q: How would you handle cache invalidation?

class LRUCacheWithInvalidation(LRUCache):
    """
    LRU cache with manual invalidation

    Useful when data changes externally
    """

    def invalidate(self, key):
        """Remove key from cache"""
        if key not in self.cache:
            return False

        node = self.cache[key]
        self._remove(node)
        del self.cache[key]

        return True

    def invalidate_pattern(self, pattern):
        """
        Invalidate keys matching pattern

        Example: invalidate_pattern('user_*')
        """
        import fnmatch

        keys_to_remove = [
            key for key in self.cache.keys()
            if fnmatch.fnmatch(str(key), pattern)
        ]

        for key in keys_to_remove:
            self.invalidate(key)

        return len(keys_to_remove)

Key Takeaways

  • ✅ O(1) operations - Hash map + doubly linked list
  • ✅ Dummy nodes - Simplify edge case handling
  • ✅ Update on access - Both get() and put() update recency
  • ✅ Thread safety - Use locks for concurrent access
  • ✅ Real-world use - Prediction cache, feature store, embeddings

Core Pattern:

  • Hash map for O(1) lookup
  • Doubly linked list for O(1) reordering
  • MRU at head, LRU at tail

FAQ

How does an LRU cache achieve O(1) time for get and put?

It combines two data structures: a hash map for O(1) key lookup and a doubly linked list for O(1) node removal and insertion. On get, the hash map finds the node instantly, then the doubly linked list moves it to the front (most recently used). On put, the same operations apply, plus eviction removes the tail node in O(1) when capacity is exceeded.

Why use a doubly linked list instead of a singly linked list?

A doubly linked list allows O(1) removal of any node because each node stores a prev pointer. With a singly linked list, removing a node from the middle requires traversing from the head to find its predecessor, making removal O(n) and defeating the purpose of the cache.

What are dummy head and tail nodes and why use them?

Dummy (sentinel) head and tail nodes are placeholder nodes that always exist at the list boundaries. They eliminate null checks and special-case handling for empty lists, single-element lists, or operations at the boundaries. Every real node always has valid prev and next pointers, making the code simpler and less prone to bugs.

How does LRU cache differ from LFU cache?

LRU evicts the least recently used item based on the time of last access. LFU evicts the least frequently used item based on total access count. LRU is simpler (hash map + doubly linked list) and adapts faster to changing patterns, while LFU requires tracking frequency buckets and handles ties by LRU within each frequency level.

Why do you need to store the key in each linked list node?

When evicting the least recently used node (tail.prev), you need its key to delete the corresponding entry from the hash map. Without the key stored in the node, you would have no way to find and remove the hash map entry in O(1) time, which would break the overall O(1) guarantee.


Originally published at: arunbaby.com/dsa/0011-lru-cache

If you found this helpful, consider sharing it with others who might benefit.