Picture a high school in 1975. Every student has a paper folder in a filing cabinet. Inside: their name, address, grades, attendance record, and course enrollments. The principal wants a list of every student with a GPA above 3.5 for the honor roll ceremony — happening tomorrow morning.

A secretary pulls open the first drawer. Hundreds of folders. She opens one, checks the GPA field, writes it down if it qualifies, moves to the next. Three hours later, after going through 600 folders, she has her list — and two missed a student whose folder was misfiled under the wrong last name.

A modern database returns that same list in under a millisecond.

That gap — between paper folders and a database — is what this entire course is about.

What Is a Database?

A database is an organized collection of structured data stored so it can be efficiently accessed, managed, and updated.

The key word is organized. A pile of papers is data. A library catalog is a database. Organization is what makes data useful.

A Database Management System (DBMS) is the software that sits between you and the data. It handles storing, retrieving, securing, and managing the data on your behalf. You don't talk to the disk — you talk to the DBMS.

Think of it this way: Data is the books. The database is the library's filing system. The DBMS is the librarian who knows where everything is, enforces the rules, and handles multiple visitors at once.

Why Not Just Use Files?

Before databases, developers stored data in flat files — plain text or CSV files on disk. This approach breaks down fast.

Feature	Flat File System	DBMS
Data Integrity	No enforcement — any value allowed	Constraints enforce valid data
Concurrent Access	Two users writing simultaneously corrupts data	Transactions handle concurrency safely
Querying	Read entire file, filter manually in code	SQL returns only what you need
Security	File-level permissions only	Table/row/column-level access control
Redundancy	Same data duplicated across files	Normalization eliminates duplication
Relationships	Managed manually in application code	Foreign keys enforced by DBMS
Backup & Recovery	Manual, error-prone	Built-in backup and point-in-time recovery

With flat files, every application that touches the data must implement its own validation, querying logic, and concurrency handling. A DBMS does all of that once, correctly, for every application.

A Brief History of Databases

Understanding where databases came from explains why they work the way they do.

1950s–60s  →  Paper records + magnetic tape (sequential access only)
1960s      →  Hierarchical databases (IBM IMS) — tree structure, fast but rigid
1970s      →  Network databases — more flexible but complex navigation
1970       →  Edgar F. Codd publishes "A Relational Model of Data" at IBM
              This paper changes everything
1979       →  Oracle becomes the first commercial relational DBMS
1986       →  SQL becomes an ANSI standard
1990s      →  MySQL, PostgreSQL emerge as open-source options
2000s      →  Web scale demands → NoSQL databases emerge
2010s      →  NewSQL, cloud databases, multi-model systems

Edgar F. Codd's 1970 paper is the foundation of modern databases. He proposed organizing data into relations (tables) and accessing it with a mathematical query language. His idea separated what you want from how to retrieve it — you describe the result, the DBMS figures out the fastest path to get there.

Types of Databases

Not all problems need the same solution. Different database types are optimized for different use cases.

Relational (SQL) Data organized into tables with rows and columns. Relationships defined through foreign keys. Queried with SQL. Strong consistency guarantees. Best for structured data with complex relationships.

Examples: MySQL, PostgreSQL, SQL Server, Oracle, SQLite

Document Data stored as JSON-like documents. No fixed schema — each document can have different fields. Best for hierarchical or variable-structure data.

Examples: MongoDB, CouchDB, Firestore

Key-Value Simplest model: a key maps to a value. Extremely fast lookups. Best for caching, session storage, and simple associations.

Examples: Redis, DynamoDB, Memcached

Graph Data stored as nodes and edges. Optimized for traversing relationships. Best for social networks, recommendation engines, fraud detection.

Examples: Neo4j, Amazon Neptune

Time-Series Optimized for data points indexed by time. Best for metrics, monitoring, IoT sensor data.

Examples: InfluxDB, TimescaleDB, Prometheus

Real-World Database Deployments

These are not abstract concepts. Every time you use the web, you're touching a database.

System	Used By	Why
MySQL	WordPress, Facebook (historically), Twitter	Fast reads, open-source, massive community
PostgreSQL	Instagram, Shopify, GitHub	Advanced SQL features, reliability, extensibility
MongoDB	Airbnb, eBay, Forbes	Flexible schema for varying content structures
Redis	Twitter, Stack Overflow, GitHub	Sub-millisecond caching and session storage
Cassandra	Netflix, Apple, Uber	Massive write throughput across distributed nodes

When Netflix recommends a show, Redis caches your preferences. When you check out on Shopify, PostgreSQL records the transaction. When you search on eBay, MongoDB retrieves product listings with varying attributes.

What You'll Build in This Course

Throughout this course, you'll design and build a complete library database system from scratch. By the final lesson, you'll have:

An ER diagram modeling the real-world library domain
Normalized tables for Books, Authors, Members, Loans, and Reservations
SQL queries that answer real business questions
Indexes that make slow queries fast
Transactions that keep data consistent

You'll start with a blank page and end with a production-ready schema — learning every concept by applying it to the same system.

The Core Idea

A database isn't magic. It's organized data plus software that enforces rules about that data. The rules are what matter. Without them, data rots — duplicates appear, invalid values sneak in, relationships break. The DBMS is the enforcer.

Every lesson builds on this: the goal of a database is not to store data, it's to store data correctly.

Next lesson: Before you write a single line of SQL, you draw an ER diagram — the blueprint for your database.

💬 DiscussionPowered by GitHub Discussions

📱

Get this course's notes on Telegram!

Free cheat sheets, summaries & practice exercises

Get Notes Free →

20 minLesson 1 of 16

Course Contents(16 lessons)

▾

Chapter 1: Database Fundamentals

What Is a Database? DBMS vs File Systems20 min

ER Diagrams: Designing Databases Visually35 min

The Relational Model: Tables, Keys, and Constraints28 min

Chapter 2: SQL

SQL DDL: Creating Tables, Constraints, and Schemas28 min

SQL DML: SELECT, INSERT, UPDATE, DELETE35 min

SQL JOINs: Combining Tables38 min

SQL Advanced: Subqueries, Aggregates, Window Functions40 min

Chapter 3: Normalization

Functional Dependencies, 1NF, and 2NF32 min

3NF, BCNF: Removing All Redundancy30 min

Chapter 4: Transactions and Concurrency

Transactions and ACID Properties32 min

Concurrency Control: Locks, Isolation Levels30 min

Deadlocks: Detection, Prevention, and Recovery28 min

Chapter 5: Indexing and Optimization

Indexes and B-Trees: Making Queries Fast32 min

Query Optimization and Execution Plans30 min

Chapter 6: NoSQL and Final Project

NoSQL Databases: Types, Use Cases, Tradeoffs30 min

Final Project: Design a Complete Database System45 min

Chapter 1: Database Fundamentals

What Is a Database? DBMS vs File Systems

What Is a Database?

A School in Chaos

A modern database returns that same list in under a millisecond.

That gap — between paper folders and a database — is what this entire course is about.