Informal Design Guidelines for Relation Schemas

The document outlines four informal guidelines for assessing the quality of relation schema design in databases: ensuring clear semantics of attributes, minimizing redundant information and NULL values, and preventing spurious tuples. Each guideline emphasizes the importance of clarity, data integrity, and proper normalization to avoid anomalies and maintain accurate relationships within the data. Strategies for implementing these guidelines include using default values, proper join conditions, and maintaining referential integrity.

Uploaded by

nithyarevathi

Informal Design Guidelines for Relation Schemas

Dr. Gowthami V
Four informal guidelines that may be used as measures to determine the
quality of relation schema design:

• Making sure that the semantics of the attributes is clear in the schema

• Minimizing the redundant information in tuples

• Minimizing the NULL values in tuples

• Disallowing the possibility of generating spurious tuples

1. Imparting Clear Semantics to Attributes in Relations
• Whenever we group attributes to form a relation schema, we assume
that attributes belonging to one relation have certain real-world
meaning and a proper interpretation associated with them. The
semantics of a relation refers to its meaning resulting from the
interpretation of attribute values in a tuple.
Guideline 1
• Design a relation schema so that it is easy to explain its meaning. Do
not combine attributes from multiple entity types and relationship
types into a single relation. Intuitively, if a relation schema
corresponds to one entity type or one relationship type, it is
straightforward to interpret and to explain its meaning. Otherwise, if
the relation corresponds to a mixture of multiple entities and
relationships, semantic ambiguities will result and the relation cannot
be easily explained.
• Consider two relations, EMP_DEPT and EMP_PROJ. In EMP_PROJ, each
tuple relates an employee to a project but also includes the
employee name (Ename), project name (Pname), and project location
(Plocation). Although there is nothing logically wrong with these two
relations, they violate Guideline 1 by mixing attributes from distinct
real-world entities: EMP_DEPT mixes attributes of employees and
departments, and EMP_PROJ mixes attributes of employees, projects,
and the WORKS_ON relationship. Hence, they fare poorly against the
above measure of design quality.
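A minimal sketch of Guideline 1, using Python's built-in sqlite3 module. Only Ename, Pname, and Plocation come from the text above; the columns Ssn, Pnumber, and Hours, and all sample rows, are assumptions added for illustration. The sketch splits the mixed EMP_PROJ relation into one relation per entity type or relationship type.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Mixed design: employee, project, and WORKS_ON facts in one relation.
# Ssn, Pnumber, and Hours are hypothetical columns.
conn.execute("""
    CREATE TABLE EMP_PROJ (
        Ssn TEXT, Pnumber INTEGER, Hours REAL,
        Ename TEXT, Pname TEXT, Plocation TEXT,
        PRIMARY KEY (Ssn, Pnumber)
    )
""")
conn.executemany(
    "INSERT INTO EMP_PROJ VALUES (?, ?, ?, ?, ?, ?)",
    [
        ("111", 1, 20.0, "Asha", "ProductX", "Chennai"),
        ("111", 2, 10.0, "Asha", "ProductY", "Mysuru"),
        ("222", 1, 15.0, "Ravi", "ProductX", "Chennai"),
    ],
)

# One relation per entity type (EMPLOYEE, PROJECT) and one for the
# relationship type (WORKS_ON), each filled from the mixed relation.
conn.executescript("""
    CREATE TABLE EMPLOYEE (Ssn TEXT PRIMARY KEY, Ename TEXT);
    CREATE TABLE PROJECT (
        Pnumber INTEGER PRIMARY KEY, Pname TEXT, Plocation TEXT
    );
    CREATE TABLE WORKS_ON (
        Ssn TEXT REFERENCES EMPLOYEE(Ssn),
        Pnumber INTEGER REFERENCES PROJECT(Pnumber),
        Hours REAL,
        PRIMARY KEY (Ssn, Pnumber)
    );
    INSERT INTO EMPLOYEE SELECT DISTINCT Ssn, Ename FROM EMP_PROJ;
    INSERT INTO PROJECT SELECT DISTINCT Pnumber, Pname, Plocation FROM EMP_PROJ;
    INSERT INTO WORKS_ON SELECT Ssn, Pnumber, Hours FROM EMP_PROJ;
""")

employee_rows = conn.execute("SELECT COUNT(*) FROM EMPLOYEE").fetchone()[0]
project_rows = conn.execute("SELECT COUNT(*) FROM PROJECT").fetchone()[0]
works_on_rows = conn.execute("SELECT COUNT(*) FROM WORKS_ON").fetchone()[0]
print(employee_rows, project_rows, works_on_rows)  # 2 2 3
```

Each employee name and each project name and location is now stored once, and every table can be explained in one sentence.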
2. Redundant Information in Tuples and Update Anomalies
Guideline 2
• Design the base relation schemas so that no insertion, deletion, or
modification anomalies are present in the relations. If any anomalies
are present, note them clearly and make sure that the programs that
update the database will operate correctly.
• In database design, redundant information refers to the unnecessary
repetition of data within a database. Redundancy can lead to several
problems, often referred to as anomalies:
• Update Anomalies: If redundant data exists in multiple places,
updating the data in one place but not in others can lead to
inconsistencies. For example, if a customer's address is stored in
several tables and the customer moves, failing to update all instances
of the address can result in having multiple, conflicting addresses for
the same customer.
• Insertion Anomalies: These occur when you cannot add data to the
database due to the absence of other data. For instance, if a new
student must be added to a database but there is no course record
yet, and if the database design requires a course record for each
student, then the student cannot be added without also creating a
placeholder for the course.
• Deletion Anomalies: These occur when deleting some data
inadvertently results in losing other valuable data. For example, if a
student record is deleted, and this record also stores unique
information about a course (and if this information is only stored in
the student record), then deleting the student record would result in
losing the course information as well.
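The update anomaly described above can be sketched with sqlite3. The table name ORDERS, its columns, and the sample rows are hypothetical, and the address is repeated across rows of one table here to keep the sketch short. Updating only one copy leaves two conflicting addresses for the same customer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE ORDERS (
        order_id INTEGER PRIMARY KEY,
        customer_id INTEGER,
        address TEXT   -- customer fact repeated on every order row
    )
""")
conn.executemany(
    "INSERT INTO ORDERS VALUES (?, ?, ?)",
    [(1, 7, "12 Lake Road"), (2, 7, "12 Lake Road"), (3, 7, "12 Lake Road")],
)

# The customer moves, but only one of the three copies is updated.
conn.execute("UPDATE ORDERS SET address = '5 Hill Street' WHERE order_id = 1")

addresses = sorted(
    row[0]
    for row in conn.execute(
        "SELECT DISTINCT address FROM ORDERS WHERE customer_id = 7"
    )
)
print(addresses)  # ['12 Lake Road', '5 Hill Street']
```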
Why Avoid Redundant Information?
• Consistency is maintained: Changes to data only need to be made in
one place, preventing conflicting data entries.
• Data Integrity is preserved: The database accurately reflects real-
world entities and their relationships without inconsistencies.
• Efficiency is improved: Reducing the size of the database by
eliminating duplicate entries saves storage space and makes data
management more straightforward.
• The three main types of anomalies are defined below:
• Update Anomalies:
• Definition: Occur when data is duplicated in multiple places, and a
change to the data in one place does not automatically propagate to
all instances of that data.
• Example: Suppose a customer's phone number is stored in multiple
rows of a table because the customer has multiple orders. If the
customer changes their phone number and the update is made in
only one row, the other rows will have outdated information. This
inconsistency can cause confusion and errors.
• Insertion Anomalies:
• Definition: Occur when certain data cannot be added to the database
without the presence of other data.
• Example: Consider a table that stores both student and course
information. If the database design requires that each student must
be associated with a course, then it becomes impossible to add a new
student who hasn't yet enrolled in any course. This restriction is due
to the poor structure of the table, where unrelated data is combined
in a single table.
• Deletion Anomalies:
• Definition: Occur when the deletion of some data inadvertently
causes the loss of other data.
• Example: Suppose a database table stores both employee information
and department details. If the last employee in a department is
deleted from the table, the department information might also be
lost, even though the department still exists independently of its
employees. This happens because the department information is tied
directly to the employee record, and there’s no separate record for
departments.
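A sketch of the deletion anomaly from the last example, and of how a separate department table avoids it. The column names and sample rows are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Single-table design: department facts exist only inside employee rows.
conn.execute("""
    CREATE TABLE EMP_DEPT (
        emp_id INTEGER PRIMARY KEY, ename TEXT, dname TEXT, dlocation TEXT
    )
""")
conn.execute("INSERT INTO EMP_DEPT VALUES (1, 'Meena', 'Research', 'Block A')")
conn.execute("DELETE FROM EMP_DEPT WHERE emp_id = 1")  # last employee leaves
depts_after_delete = conn.execute(
    "SELECT COUNT(*) FROM EMP_DEPT WHERE dname = 'Research'"
).fetchone()[0]

# Split design: the department has its own record.
conn.execute("CREATE TABLE DEPARTMENT (dname TEXT PRIMARY KEY, dlocation TEXT)")
conn.execute("""
    CREATE TABLE EMPLOYEE (
        emp_id INTEGER PRIMARY KEY,
        ename TEXT,
        dname TEXT REFERENCES DEPARTMENT(dname)
    )
""")
conn.execute("INSERT INTO DEPARTMENT VALUES ('Research', 'Block A')")
conn.execute("INSERT INTO EMPLOYEE VALUES (1, 'Meena', 'Research')")
conn.execute("DELETE FROM EMPLOYEE WHERE emp_id = 1")
depts_in_split_design = conn.execute(
    "SELECT COUNT(*) FROM DEPARTMENT WHERE dname = 'Research'"
).fetchone()[0]

print(depts_after_delete, depts_in_split_design)  # 0 1
```

The split design also removes the matching insertion anomaly: a department can be recorded before it has any employees.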
3. NULL Values in Tuples
• In relational databases, a null value represents the absence of a value
or that the value is unknown. While null values can be useful in
certain situations, overusing them or allowing them to be scattered
throughout the database can lead to various problems, such as
complications in query results, misunderstandings in data
interpretation, and difficulties in maintaining data integrity.
Guideline 3: Minimize the Use of Null Values
• Clarity and Meaning:
• Null values can make it unclear what data is missing. For instance, if
an employee's "end_date" in a database is null, it could mean that the
employee is still employed, the end date is not yet determined, or
that it is simply unknown whether the employee has left.
• Minimize nulls to ensure that the meaning of each data entry is clear
and unambiguous.
• Data Integrity and Consistency:
• Having too many nulls in a database can lead to inconsistent data
states. For example, if a table recording sales transactions has several
null values in the "amount" column, it could lead to confusion about
whether the sale amounts were missed, not applicable, or if there
was a mistake in data entry.
• Using nulls sparingly, and with one consistent meaning per column,
helps maintain database integrity and makes the data easier to
understand and analyze.
• Complications in Queries:
• Null values can complicate SQL queries, particularly when performing
aggregations or comparisons. Aggregate functions such as SUM() and
AVG() ignore nulls, so their results cover only the rows that have a
value, and a comparison with a null evaluates to unknown rather than
true or false.
• Queries must be carefully written to handle nulls appropriately, often
requiring additional conditions (IS NULL, IS NOT NULL), which can
complicate query logic and reduce performance.
• Potential for Anomalies:
• Null values can create anomalies, especially in operations like
updates, inserts, and deletes. For example, a null in a foreign key
column is typically exempt from the referential check, so the row
points to no parent record, while a null in a primary key or NOT NULL
column is rejected outright. Either case can lead to integrity issues.
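The query complications above can be seen in a small sqlite3 sketch. The SALES table and its rows are hypothetical. It shows that SUM() and AVG() skip the null amount, that COUNT(amount) and COUNT(*) disagree, and that `= NULL` matches nothing while IS NULL finds the row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE SALES (sale_id INTEGER PRIMARY KEY, amount REAL)")
conn.executemany(
    "INSERT INTO SALES VALUES (?, ?)",
    [(1, 100.0), (2, None), (3, 200.0)],  # None is stored as NULL
)

# Aggregates ignore the NULL amount; COUNT(*) still counts the row.
total, average, counted, all_rows = conn.execute(
    "SELECT SUM(amount), AVG(amount), COUNT(amount), COUNT(*) FROM SALES"
).fetchone()

# "amount = NULL" evaluates to unknown for every row, so nothing matches.
equals_null = conn.execute(
    "SELECT COUNT(*) FROM SALES WHERE amount = NULL"
).fetchone()[0]
is_null = conn.execute(
    "SELECT COUNT(*) FROM SALES WHERE amount IS NULL"
).fetchone()[0]

print(total, average, counted, all_rows)  # 300.0 150.0 2 3
print(equals_null, is_null)               # 0 1
```

The average here is 150.0 over two rows, not 100.0 over three, which is correct only if the missing sale should be left out of the calculation.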
Strategies to Minimize Null Values
• Use Default Values: Instead of using nulls, consider using default
values that make sense within the context. For example, using
"Unknown" or "Not Applicable" as a default for textual fields can help
avoid nulls.
• Break Down Tables: Normalize the database to separate optional data
into different tables. For instance, optional fields like a second phone
number can be moved to a separate table that links back to the main
table.
• Use Proper Data Types and Constraints: Define columns with NOT
NULL constraints where possible to enforce that values must be
provided. This practice ensures that no nulls are entered accidentally.
• Evaluate the Necessity of Nulls: Sometimes nulls are necessary, such
as in cases where a value truly is unknown or inapplicable. However,
they should be used judiciously and only when there is a strong
justification.
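Two of the strategies above, sketched with sqlite3 under assumed table and column names (PERSON, PERSON_SECOND_PHONE): a NOT NULL constraint that rejects an accidental null, and an optional second phone number moved into its own table so that no null is stored for people who do not have one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE PERSON (
        person_id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        phone TEXT NOT NULL
    );
    -- Optional data lives in its own table: a row exists only when a
    -- second phone number exists, so no NULL is ever stored for it.
    CREATE TABLE PERSON_SECOND_PHONE (
        person_id INTEGER PRIMARY KEY REFERENCES PERSON(person_id),
        phone TEXT NOT NULL
    );
""")
conn.execute("INSERT INTO PERSON VALUES (1, 'Kavya', '555-0101')")
conn.execute("INSERT INTO PERSON VALUES (2, 'Arun', '555-0102')")
conn.execute("INSERT INTO PERSON_SECOND_PHONE VALUES (1, '555-0199')")

# The NOT NULL constraint rejects a row with a missing name.
try:
    conn.execute("INSERT INTO PERSON VALUES (3, NULL, '555-0103')")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

people_with_second_phone = conn.execute(
    "SELECT COUNT(*) FROM PERSON_SECOND_PHONE"
).fetchone()[0]
print(rejected, people_with_second_phone)  # True 1
```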
4. Generation of Spurious Tuples
• In the context of relational database design, spurious tuples are
unintended and incorrect rows that can appear when joining tables
that are not properly normalized or when the joins are based on
incorrect or inadequate conditions. These tuples do not represent
real-world entities or relationships and can lead to inaccurate query
results and misleading data interpretations.
Guideline 4:
• Understanding Spurious Tuples: Spurious tuples are the result of a
poor database schema design, especially when the schema allows
incorrect join operations to produce meaningless or incorrect data.
• This often happens when tables are joined on attributes that do not
uniquely identify rows in either table, so that each row matches
several unrelated rows and the result resembles a Cartesian product
rather than a meaningful combination of related data.
• Causes of Spurious Tuples:
• Incorrect Joins: Using non-key attributes (i.e., attributes that are not primary or
foreign keys) to join tables can lead to spurious tuples. For example, if two tables
are joined on a common attribute that does not uniquely identify records, extra
rows can appear in the result set.
• Redundant or Overlapping Data: When tables store overlapping or redundant data,
improper normalization can lead to situations where the join conditions
accidentally match unrelated rows, producing spurious tuples.
• Improper Decomposition: When a table is decomposed (split into two or more
tables) in such a way that it loses important information about how to correctly
reassemble (join) them, spurious tuples can be generated. This happens when the
attributes shared by the resulting tables do not functionally determine the
remaining attributes of at least one of them (a lossy join decomposition).
• Impact of Spurious Tuples:
• Data Integrity Issues: Spurious tuples compromise data integrity, as
they do not accurately represent the underlying real-world data or
relationships. This can lead to incorrect conclusions, faulty reports,
and errors in data analysis.
• Misleading Information: Spurious tuples can produce misleading
results in queries, especially in aggregations or when attempting to
retrieve meaningful patterns from the data.
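A small sketch of how an improper decomposition produces spurious tuples, using plain Python sets in place of tables. The relation and its rows are made up for illustration. The relation is split on a shared attribute (location) that is not a key of either part, and joining the parts back yields rows that were never in the original.

```python
# Original relation: (employee, project, location). Hypothetical data.
original = {
    ("Asha", "ProductX", "Chennai"),
    ("Ravi", "ProductY", "Chennai"),
}

# Decompose by projecting onto (employee, location) and (project, location).
emp_loc = {(e, loc) for (e, p, loc) in original}
proj_loc = {(p, loc) for (e, p, loc) in original}

# Natural join of the two projections on the shared attribute, location.
rejoined = {
    (e, p, loc1)
    for (e, loc1) in emp_loc
    for (p, loc2) in proj_loc
    if loc1 == loc2
}

# Rows that the join produced but the original never contained.
spurious = rejoined - original
print(sorted(spurious))
# [('Asha', 'ProductY', 'Chennai'), ('Ravi', 'ProductX', 'Chennai')]
```

The join returns every original row plus two extra ones, so the information about who works on which project can no longer be recovered.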
Strategies to Avoid Spurious Tuples
Normalize the Database:
• Proper normalization (up to at least the Third Normal Form or Boyce-
Codd Normal Form) helps ensure that each table contains only related
attributes and that each attribute depends on the primary key. This
process reduces redundancy and minimizes the chances of generating
spurious tuples during joins.
Use Proper Join Conditions:
• Always use primary keys and foreign keys as join conditions. These
keys uniquely identify records in their respective tables and ensure
that only related records are joined together, preventing spurious
tuples.
• Avoid using non-key attributes in join conditions unless there is a
specific, logical reason that these attributes are meant to join.
Maintain Referential Integrity:
• Ensure that foreign key constraints are properly defined and enforced.
Referential integrity guarantees that relationships between tables are
consistent and that every foreign key corresponds to a primary key in
another table.
Careful Decomposition:
• When decomposing tables, ensure that the decomposition is lossless:
joining the resulting tables must give back exactly the original
rows, with no spurious tuples. Where possible, it should also
preserve the functional dependencies of the original table, so that
the original data's integrity and relationships are retained.
Check Join Operations:
• Regularly review and test join operations to ensure that they produce
the expected results without generating spurious tuples. Use sample
data and validate results to identify and correct any issues early.
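The join and referential integrity strategies above can be tried out with sqlite3. This is a sketch under assumed table names, column names, and rows. Note that SQLite checks foreign keys only after `PRAGMA foreign_keys = ON`; other database systems may behave differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is enabled.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
    CREATE TABLE DEPARTMENT (
        dnumber INTEGER PRIMARY KEY,
        dname TEXT NOT NULL
    );
    CREATE TABLE EMPLOYEE (
        emp_id INTEGER PRIMARY KEY,
        ename TEXT NOT NULL,
        dnumber INTEGER NOT NULL REFERENCES DEPARTMENT(dnumber)
    );
    INSERT INTO DEPARTMENT VALUES (5, 'Research');
    INSERT INTO EMPLOYEE VALUES (1, 'Meena', 5);
""")

# An employee pointing at a department that does not exist is rejected.
try:
    conn.execute("INSERT INTO EMPLOYEE VALUES (2, 'Ravi', 9)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

# Join on the (foreign key, primary key) pair and check it on sample data.
joined = conn.execute("""
    SELECT e.ename, d.dname
    FROM EMPLOYEE AS e
    JOIN DEPARTMENT AS d ON e.dnumber = d.dnumber
""").fetchall()
print(orphan_rejected, joined)  # True [('Meena', 'Research')]
```

Because dnumber is the primary key of DEPARTMENT, each employee row matches at most one department row, so the join cannot multiply rows.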
