Database Concepts for a java Dev: Database Normalization
In this part, I will be briefing about different types of Database Normalizations using a sample data model.
What is Database Normalization?
Normalization is the process of efficiently organizing data in the database.
Primary Goal of Normalization?
Eliminating redundant data & ensuring meaningful data dependencies.
Types of Normalization
The following are the three most common normal forms in the database normalization process
- First Normal Form (1NF)
- Second Normal Form (2NF)
- Third Normal Form (3NF)
Sample Data Model for Demonstration
The following data model will be used to demonstrate all the three normal forms
First Normal Form (1NF)
First Normal Form (1NF) sets the very basic rules for an organized database:
- Create separate set of tables for each group of related data and identify each row with a unique columns [primary key] or set of columns [composite key]
- Eliminate duplicate columns from the table
The following data model depicts the tables after 1NF rules are applied -
Second Normal Form (2NF)
Second Normal Form (2NF) further addresses the concept of removing duplicate data:
- Meet all the requirements of the first normal form
- Remove subsets of data that apply to multiple rows of a table and place them in separate tables
- Create relationships between these new tables and their predecessors through the use of foreign keys
So basically the objective of the Second Normal Form is to take that is only partly dependent on the primary key and enter that data into another table.
The following data model depicts the tables after 2NF rules are applied.
Data from EMPLOYEE_TABLE is split into 2 tables – EMPLOYEE_TABLE and EMPLOYEE_HR_TABLE.
Similarly data from CUSTOMER_TABLE is moved to CUSTOMER_TABLE and CUSTOMER_ORDER table
Third Normal Form (3NF)
Third normal form (3NF) goes one large step further:
- Meet all the requirements of the second normal form.
- Remove columns that are not dependent upon the primary key.
The following data model depicts the tables after 3NF rules are applied.
Further state and country details are moved to their own tables because they are not dependent on the primary key.
Advantages of Normalizing the Database
There are several advantages of normalization -
- Data can be stored as small atomic pieces
- Saves space
- Increases speed
- Reduces data anomalies
- Easy maintenance
Other parts of this series include:
Part 1 – ACID Properties
Part 2 – Keys
Part 4 – Database Transactions [coming soon]
Part 5 – Indexes [coming soon]