Normalization in DBMS
A large database defined as a single relation may result in data duplication. This
repetition of data may result in:
○ Data becomes hard to maintain and update, because a change involves searching
many records in the relation.
To handle these problems, we should analyze and decompose the relations with
redundant data into smaller, simpler, and well-structured relations that satisfy
desirable properties. Normalization is the process of decomposing relations into
relations with fewer attributes.
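The decomposition described above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: the relation, its attribute names (emp_id, emp_name, dept_id, dept_name), and the helper functions decompose and rejoin are all hypothetical and chosen only for this example.

```python
# Hypothetical wide relation: every row repeats the department's name.
employees_flat = [
    {"emp_id": 1, "emp_name": "Asha", "dept_id": 10, "dept_name": "Sales"},
    {"emp_id": 2, "emp_name": "Ben", "dept_id": 10, "dept_name": "Sales"},
    {"emp_id": 3, "emp_name": "Chen", "dept_id": 20, "dept_name": "Support"},
]


def decompose(rows):
    """Split the wide relation into two relations with fewer attributes."""
    employees = [
        {"emp_id": r["emp_id"], "emp_name": r["emp_name"], "dept_id": r["dept_id"]}
        for r in rows
    ]
    # Keying on dept_id stores each department name exactly once.
    departments = {r["dept_id"]: r["dept_name"] for r in rows}
    return employees, departments


def rejoin(employees, departments):
    """Rebuild the original rows by following the dept_id link."""
    return [dict(e, dept_name=departments[e["dept_id"]]) for e in employees]


employees, departments = decompose(employees_flat)
```

In this sketch the name "Sales" is stored once instead of twice, and rejoin(employees, departments) reproduces the original rows, so no information is lost by the split.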
What is Normalization?
○ Normalization divides a larger table into smaller tables and links them using
relationships.
○ Normal forms are used to reduce redundancy in database tables.
○ Insertion Anomaly: An insertion anomaly occurs when a new tuple cannot be
inserted into a relation because some required data is missing.
○ Deletion Anomaly: A deletion anomaly occurs when deleting some data
unintentionally removes other important data as well.
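The two anomalies above can be reproduced with Python's built-in sqlite3 module. This is a sketch under stated assumptions: the enrollment table, its columns, and the sample values are hypothetical, and the table deliberately mixes student and course facts in one relation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical unnormalized table: course facts live only inside enrollment rows.
conn.execute(
    """
    CREATE TABLE enrollment (
        student_id INTEGER NOT NULL,
        course_id  TEXT    NOT NULL,
        instructor TEXT    NOT NULL,
        PRIMARY KEY (student_id, course_id)
    )
    """
)
conn.execute("INSERT INTO enrollment VALUES (1, 'DB101', 'Rao')")
conn.execute("INSERT INTO enrollment VALUES (2, 'OS201', 'Kim')")

# Insertion anomaly: a course with no students yet cannot be recorded,
# because student_id is part of the key and may not be NULL.
try:
    conn.execute("INSERT INTO enrollment VALUES (NULL, 'AI301', 'Lee')")
    insert_rejected = False
except sqlite3.IntegrityError:
    insert_rejected = True

# Deletion anomaly: removing the only student in OS201 also erases
# the fact that Kim teaches OS201.
conn.execute("DELETE FROM enrollment WHERE student_id = 2")
remaining_courses = [
    row[0] for row in conn.execute("SELECT DISTINCT course_id FROM enrollment")
]
```

Moving the course facts into their own table keyed on course_id would avoid both problems in this example, since a course could then exist independently of any enrollment row.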
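2NF A relation will be in 2NF if it is in 1NF and all non-key attributes are fully
functionally dependent on the primary key.
3NF A relation will be in 3NF if it is in 2NF and no transitive dependency
exists.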
4NF A relation will be in 4NF if it is in Boyce-Codd normal form (BCNF) and has no
multivalued dependency.
5NF A relation is in 5NF if it is in 4NF and does not contain any join dependency.
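The 2NF condition in the table above rules out non-key attributes that depend on only part of a composite key. The sketch below tests such dependencies against sample rows. The helper fd_holds, the enrollment rows, and the attribute names are hypothetical. A check like this can only show that a dependency is not contradicted by the given rows; it cannot prove that the dependency holds for the schema in general.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if no two rows agree on lhs but differ on rhs."""
    seen = {}
    for row in rows:
        left = tuple(row[a] for a in lhs)
        right = tuple(row[a] for a in rhs)
        # setdefault stores the first rhs value seen for each lhs value.
        if seen.setdefault(left, right) != right:
            return False
    return True


# Hypothetical relation with composite key (student_id, course_id).
enrollment = [
    {"student_id": 1, "course_id": "DB101", "instructor": "Rao", "grade": "A"},
    {"student_id": 2, "course_id": "DB101", "instructor": "Rao", "grade": "B"},
    {"student_id": 1, "course_id": "OS201", "instructor": "Kim", "grade": "B"},
]

key = ["student_id", "course_id"]

# instructor is determined by course_id alone: a partial dependency,
# which is what 2NF forbids.
partial_dependency = fd_holds(enrollment, ["course_id"], ["instructor"])

# grade is determined by the whole key but by neither part on its own.
grade_on_full_key = fd_holds(enrollment, key, ["grade"])
grade_on_part = fd_holds(enrollment, ["course_id"], ["grade"]) or fd_holds(
    enrollment, ["student_id"], ["grade"]
)
```

In this sample, instructor would move to a separate relation keyed on course_id, while grade stays with the full key.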
Advantages of Normalization
○ You cannot start building the database before knowing what the user needs.