A restaurant has one chef who must prepare every course of a five-course meal alone. She boils the pasta, waits, plates it, then starts the soup, waits, then works on the dessert. The oven sits idle while she chops vegetables. The grill goes cold while she stirs sauce. The meal takes two hours.

Now add a second chef. Both work in the same kitchen, sharing the same refrigerator, the same pantry, the same oven. One handles hot dishes, the other handles cold prep. They communicate constantly. The meal is done in forty-five minutes.

Threads are those two chefs. They share the same process resources — the same memory space, the same open files, the same code — but run independently on separate CPU cores (or take turns on the same core). This is concurrency, and it is one of the most powerful ideas in systems programming.

Why Threads?

Single-threaded programs are simple but limited. Consider a web browser:

The UI must stay responsive while a page loads
Six images must download simultaneously
JavaScript executes in one tab while another tab renders HTML
A video plays while you type in the address bar

All of these would be impossible with a single thread. The UI would freeze every time a network request was made. Threads are the solution.

Reasons to use threads:

Responsiveness — keep a UI alive while background work happens
Resource sharing — threads share process memory cheaply (no IPC overhead)
Economy — creating a thread is ~100× cheaper than creating a process
Parallelism — on multi-core CPUs, threads literally run at the same time

Thread vs. Process

Feature	Process	Thread
Address space	Own private address space	Shares process address space
Code segment	Private	Shared with all process threads
Data / Heap	Private	Shared with all process threads
Stack	Own stack	Own private stack
Registers / PC	Own (saved in PCB)	Own (saved in Thread Control Block)
Open files	Private	Shared
Creation cost	High (full memory copy via fork)	Low (just a new stack + TCB)
Context switch	Expensive (TLB flush, page table swap)	Cheap (same address space)
Crash isolation	Process crash doesn't affect others	Thread crash kills entire process

Thread Components

Each thread has its own private elements and shares the rest of the process with its siblings.

Process Memory Space
+-----------------------------------------------+
|              TEXT (code) - SHARED             |
+-----------------------------------------------+
|              DATA + HEAP - SHARED             |
|    (global vars, malloc'd memory, objects)    |
+-----------------------------------------------+
|  Stack T1  |  Stack T2  |  Stack T3           |
|  (private) |  (private) |  (private)          |
+-----------------------------------------------+

Each Thread has its own:
  - Stack (local variables, function call frames)
  - Program Counter (where it is executing)
  - CPU Registers (its own register snapshot)
  - Thread ID (TID)

All threads in a process share:
  - Code (text segment)
  - Global variables (data segment)
  - Heap memory
  - Open file descriptors
  - Signals and signal handlers

Thread Lifecycle

Threads go through the same states as processes:

  [New] ──────► [Runnable] ◄──── [Waiting/Blocked]
                    │                    ▲
                    │ scheduled          │ I/O / lock
                    ▼                    │
                [Running] ───────────────┘
                    │
                    ▼
              [Terminated]

User Threads vs. Kernel Threads

User-level threads are managed entirely by a user-space library. The kernel sees only one process. They are fast to create and switch, but if one blocks on I/O, the whole process blocks.

Kernel-level threads are known to the OS. The kernel can schedule them independently on different cores. Modern systems (Linux, Windows, macOS) use kernel threads.

Threading Models

Many-to-One:                One-to-One:           Many-to-Many:
  UU UU UU                   U U U                  UU UU UU
     |                       | | |                   | | | |
  [Kernel]                [K][K][K]              [K][K][K][K]

All user threads       Each user thread     M user threads map to
share one kernel       maps to one kernel   N kernel threads (M >= N)
thread (no true        thread (true         (flexible, used in some
parallelism)           parallelism)         POSIX implementations)

Linux implements the One-to-One model using clone() — every thread is a kernel thread that shares the parent's address space.

Python Threading Example

import threading
import time

def download_file(name, duration):
    print(f"Starting download: {name}")
    time.sleep(duration)  # simulate network I/O
    print(f"Finished download: {name}")

# Without threads: sequential, takes 1+2+3 = 6 seconds
# With threads: parallel, takes max(1,2,3) = 3 seconds

threads = []
for name, dur in [("file_a.zip", 1), ("file_b.mp4", 2), ("file_c.tar", 3)]:
    t = threading.Thread(target=download_file, args=(name, dur))
    threads.append(t)
    t.start()

for t in threads:
    t.join()  # wait for all threads to finish

print("All downloads complete")

The Python GIL Caveat

Python has a Global Interpreter Lock (GIL) — a mutex that allows only one thread to execute Python bytecode at a time. This means Python threads do NOT achieve true parallelism for CPU-bound tasks. However, they are still useful for I/O-bound tasks (networking, disk) because threads release the GIL while waiting for I/O.

For CPU parallelism in Python, use multiprocessing (separate processes) or concurrent.futures.ProcessPoolExecutor.

Data Races: The Dark Side of Shared Memory

Because threads share memory, they can interfere with each other in dangerous ways. A data race occurs when two threads access the same variable concurrently and at least one is writing.

import threading

counter = 0  # shared variable

def increment():
    global counter
    for _ in range(100000):
        counter += 1  # NOT atomic! read → add → write

t1 = threading.Thread(target=increment)
t2 = threading.Thread(target=increment)
t1.start(); t2.start()
t1.join(); t2.join()

print(counter)  # Expected: 200000
                # Actual: could be 143829 or 178001 — unpredictable!

The counter += 1 operation is three CPU instructions: READ, ADD, WRITE. If Thread 1 reads the value (say, 50000) and Thread 2 also reads it (50000) before either writes back, both write 50001 — and one increment is lost forever.

This is why synchronization (mutexes, semaphores) is essential — covered in the next lesson.

Real-World Thread Usage

Google Chrome — each browser tab is a separate process (for crash isolation), but within each process, multiple threads handle:

The renderer (HTML/CSS layout)
The JavaScript engine (V8)
Network requests (fetching resources)
The compositor (GPU painting)

Linux web servers (nginx) — uses an event-driven, single-threaded model per worker process, relying on non-blocking I/O rather than spawning threads per connection.

Java — java.lang.Thread and java.util.concurrent are foundational; the JVM maps Java threads 1:1 to OS kernel threads on Linux and Windows.

Windows — CreateThread() in Win32 API; .NET's System.Threading.Thread wraps it.

Checking Thread Count on Linux

# Number of threads in a process
cat /proc/<PID>/status | grep Threads

# Show threads for all processes
ps -eLf | head -20

# htop: press H to toggle thread display

Key Takeaways

A thread is a unit of execution within a process — it shares code, heap, and open files but has its own stack, registers, and thread ID.
Threads are much cheaper to create and switch between than processes because no memory duplication is needed.
The One-to-One model (used by Linux and Windows) maps each thread to a kernel thread, enabling true parallelism on multi-core CPUs.
Python's GIL limits CPU-bound thread parallelism; for I/O-bound work, threads are effective.
Data races are the critical hazard of shared-memory threads — unsynchronized concurrent writes produce unpredictable results.
Browsers, servers, game engines, and virtually every modern application use threads extensively.

💬 DiscussionPowered by GitHub Discussions

📱

Get this course's notes on Telegram!

Free cheat sheets, summaries & practice exercises

Get Notes Free →

28 minLesson 4 of 17

Course Contents(17 lessons)

▾

Chapter 1: OS Foundations

What Is an Operating System? Goals and Architecture22 min

Kernel, System Calls, and User vs Kernel Mode28 min

Chapter 2: Processes and Threads

Processes: Programs in Execution30 min

Threads: Lightweight Processes and Concurrency28 min

Synchronization: Race Conditions, Mutex, Semaphores35 min

Inter-Process Communication (IPC)28 min

Chapter 3: CPU Scheduling

CPU Scheduling: Goals and Criteria22 min

Scheduling Algorithms: FCFS, SJF, Round Robin, Priority38 min

Multilevel Queue and Real-Time Scheduling28 min

Chapter 4: Memory Management

Memory Management: Contiguous Allocation and Fragmentation28 min

Paging: How Virtual Memory Maps to Physical Memory35 min

Virtual Memory, Page Replacement Algorithms35 min

Chapter 5: Deadlocks

Deadlocks: Conditions, Resource Allocation Graphs28 min

Deadlock Prevention, Avoidance (Banker's Algorithm), Detection35 min

Chapter 6: File Systems + Project

File Systems: Structures, Directories, and Allocation32 min

Disk Scheduling and I/O Management28 min

Final Project: OS Concepts Simulation in Python45 min