Difference between primary key and unique key

database

SQL

primary key

unique key

database design

Difference between primary key and unique key

Master System Design with Codemia

Enhance your system design skills with over 120 practice problems, detailed solutions, and hands-on exercises.

Start Practicing Learn More

In the context of relational databases, keys play a crucial role in maintaining data integrity and enabling efficient data retrieval. Two essential types of keys are the primary key and the unique key. While both have similarities, they serve distinct purposes and have unique characteristics. This article delves into the technical differences between primary keys and unique keys.

Technical Definition

Primary Key

A primary key is a column or a set of columns in a database table that uniquely identifies each row in that table. The primary key ensures that no two rows have the same primary key value, thereby enforcing the entity integrity of the database.

Characteristics of a Primary Key:

It must contain unique values; no duplicate values are allowed.
A primary key column cannot contain NULL values. This is because a NULL would imply a lack of a unique identifier, which violates the basic tenet of a primary key.
A table can have only one primary key, though that primary key can consist of multiple columns (composite key).

Unique Key

A unique key is a constraint that guarantees that all values in a column or a group of columns are distinct from one another. Unlike primary keys, unique keys allow for the presence of NULL (but only once, since multiple NULLs are treated as equal).

Characteristics of a Unique Key:

Like the primary key, it enforces uniqueness, but NULL values are permitted (with certain restrictions).
A table can have multiple unique keys.
Unique keys are often used to enforce a secondary level of uniqueness within the data, governing columns that need to maintain isolated unique values apart from the primary key.

Technical Differences

Constraints on Values

Primary Key: No duplicates, no NULLs.
Unique Key: No duplicates, single NULL allowed (depending on the database system).

Number of Keys Per Table

Primary Key: One per table.
Unique Key: Multiple per table.

Purpose and Use Cases

Primary Key: Used to uniquely identify each record in a table.
Unique Key: Used to ensure data uniqueness in one or more columns other than the primary key column(s).

Indexing

Both primary and unique keys generate indexes, which improve search operations. However, primary keys often use more efficient clustered indexes by default, while unique keys generally create non-clustered indexes.

Comparison Table

Characteristic	Primary Key	Unique Key
Uniqueness	Ensures all key values are unique	Ensures all key values are unique
NULLs Allowed	No	Yes (but only once, if at all)
Number Per Table	One	Multiple
Index Type	Often creates a clustered index	Creates a non-clustered index
Purpose	Uniquely identifies each table record	Ensures data uniqueness in columns
Mandatory	Yes, typically needed for entity integrity (e.g., primary identification)	No, used as needed (e.g., to enforce specific constraints)

Example

Consider a simple Students table where StudentID is the primary key, which uniquely identifies each student:

sql

1CREATE TABLE Students (
2    StudentID INT PRIMARY KEY,
3    FirstName VARCHAR(100),
4    LastName VARCHAR(100),
5    Email VARCHAR(100) UNIQUE
6);

In this example:

StudentID is the primary key, ensuring no two students can have the same ID and each ID is non-null.
Email serves as a unique key, meaning no two students should have the same email address, but theoretically, a NULL email could exist once.

Use Cases and Considerations

Primary Key: Best used for columns that naturally describe the row with some inherent uniqueness, like national IDs, product IDs, etc.
Unique Key: Suitable for additional properties like email addresses or phone numbers that should be unique among entries but are not the primary identifier for a record.

Understanding the differences between primary and unique keys is vital for database design and management. Both constraints help maintain database integrity, improve query performance, and support logical data modeling. Having a clear grasp of their functionalities allows developers to build robust, efficient, and reliable databases.