Data Cleaning in Excel
Pooja Pawar
Data cleaning is an essential step in preparing data for analysis. It
involves identifying and correcting errors, ensuring consistency, and
formatting data for usability. Excel offers numerous tools and
functions to make this process efficient. Here’s an in-depth guide to
cleaning data in Excel with detailed explanations and fresh examples.
1. Remove Duplicates
Duplicate data can lead to inaccurate analysis. Removing duplicates
ensures each record is unique.
4. Click OK, and Excel will retain the first occurrence and
remove duplicates.
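For readers who script the same cleanup outside Excel, the keep-the-first-occurrence behaviour can be sketched in Python. This is a minimal illustration, not Excel's implementation: the function name and sample rows are hypothetical, and values are compared exactly, which may differ from how Excel compares text.

```python
def remove_duplicates(rows):
    """Keep the first occurrence of each row and preserve the original order."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)  # a whole row is the duplicate key
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical sample data: the third row repeats the first.
rows = [("Phone", 500), ("Laptop", 800), ("Phone", 500)]
deduped = remove_duplicates(rows)
print(deduped)
```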
o Use formulas like =IF(A1="", "No Data", A1) to handle
blanks dynamically.
Example:
Before:
Phone 500
Laptop (blank)
Tablet 300
After:
Phone 500
Laptop 0
Tablet 300
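The blank-handling logic of =IF(A1="", "No Data", A1) can be sketched in Python as follows. This is an illustrative sketch only: the helper name is hypothetical, and treating both None and the empty string as "blank" is an assumption about how the data was loaded.

```python
def fill_blank(value, default="No Data"):
    """Return default when the cell is blank, otherwise the value itself."""
    # Assumption: a blank cell arrives as None or as an empty string.
    if value is None or value == "":
        return default
    return value


# The example rows above, with the blank Laptop value filled with 0.
sales = [("Phone", 500), ("Laptop", ""), ("Tablet", 300)]
filled = [(name, fill_blank(amount, default=0)) for name, amount in sales]
print(filled)
```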
Formula: =TRIM(A1)
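TRIM removes leading and trailing spaces and reduces runs of spaces between words to a single space. A rough Python equivalent, assuming only the ordinary space character needs handling (the function name is hypothetical), might look like this:

```python
def excel_trim(text):
    """Approximate TRIM: strip outer spaces and collapse inner runs of spaces."""
    # Splitting on a single space yields empty strings for extra spaces,
    # which are dropped before the words are rejoined.
    return " ".join(part for part in text.split(" ") if part)


print(excel_trim("  Alex   Carter "))
```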
o Select the range > Click on the error flag (⚠️) > Select
Convert to Number.
Example: Input: "01-15-2025" (as text)
Output: 01/15/2025 (as a date).
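The same text-to-date conversion can be sketched in Python with the standard datetime module. The function name is hypothetical, and the month-day-year order is an assumption taken from the example input.

```python
from datetime import datetime


def text_to_date(text, fmt="%m-%d-%Y"):
    """Parse a date stored as text; raises ValueError if it does not match fmt."""
    return datetime.strptime(text.strip(), fmt).date()


parsed = text_to_date("01-15-2025")
formatted = parsed.strftime("%m/%d/%Y")
print(formatted)
```

Parsing raises an error on text that does not match the expected pattern, which makes bad entries easy to spot.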
Steps:
1. Press Ctrl + H.
Name City
Michael Ross NY
Donna Paulsen LA
Output:
Name City
Donna Paulsen LA
Functions:
o Example: Split "Alice, New York, 25" into three columns:
Name, City, Age.
o Use =A1 & " " & B1 or =CONCATENATE(A1, " ", B1) to
combine data.
Example:
Alex Carter
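Splitting delimited text into columns and combining columns back into one value can both be sketched in a few lines of Python. The function names are hypothetical, and reading "Alex Carter" as a first name joined to a last name is an assumption.

```python
def split_record(text, delimiter=","):
    """Split delimited text into fields and trim the space around each one."""
    return [part.strip() for part in text.split(delimiter)]


def combine(first, second, separator=" "):
    """Join two values with a separator, like =A1 & " " & B1."""
    return f"{first}{separator}{second}"


name, city, age = split_record("Alice, New York, 25")
full_name = combine("Alex", "Carter")
print(name, city, age)
print(full_name)
```

The split fields are all text, so a value such as the age still needs converting to a number before it is used in calculations.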
8. Handle Errors
Errors in data can interrupt analysis. Excel offers functions to manage
errors effectively.
Steps:
Example: Combine monthly sales data from multiple sheets
into one consolidated table with duplicate removal.
Example: Ensure only valid country codes (e.g., US, UK, CA) are
entered in the "Country" column.
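A check against a list of allowed values can be sketched in Python as below. This illustrates the idea only and is not how Excel's Data Validation works internally; the function name is hypothetical and the sample column is made up.

```python
ALLOWED_CODES = {"US", "UK", "CA"}  # the example codes from the text


def invalid_entries(values, allowed=ALLOWED_CODES):
    """Return (position, value) pairs for entries not in the allowed set."""
    return [
        (index, value)
        for index, value in enumerate(values)
        if value.strip() not in allowed
    ]


# Hypothetical "Country" column with one bad entry.
countries = ["US", "UK", "XX", "CA"]
print(invalid_entries(countries))
```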
13. Check Spelling
Misspellings can lead to inconsistent data. Use Spell Check (F7) to
find and correct errors.
Practical Examples: Real-World Data Cleaning in Excel
Cleaning Steps:
2. Cleaning Patient Records in a Healthcare System
Patient Name | Age | Appointment Date | Gender | Diagnosis
Cleaning Steps:
1. Fill Missing Age Values: Replace "NA" in "Age" with the average
age or a default value. Compute the average in a cell outside the
column with =AVERAGE(B2:B100); AVERAGE skips text entries such as "NA".
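The fill-with-average step can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name and sample ages are hypothetical, and every entry other than "NA" is assumed to already be numeric.

```python
def fill_with_average(values, missing="NA"):
    """Replace the missing marker with the mean of the remaining values."""
    numeric = [v for v in values if v != missing]
    if not numeric:
        return list(values)  # nothing to average, leave the data unchanged
    average = sum(numeric) / len(numeric)
    return [average if v == missing else v for v in values]


# Hypothetical "Age" column with one missing value.
ages = [30, "NA", 40, 50]
print(fill_with_average(ages))
```

The average is computed from the known values only, so the missing marker does not distort the result.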
3. Cleaning Employee Data for HR Reports
Employee Name | Department | Salary | Date of Joining | Email
Cleaning Steps:
4. Cleaning Product Inventory Data for an E-Commerce Store
Product Name | Category | Price | Stock | SKU
Cleaning Steps:
5. Cleaning Customer Feedback Data for Marketing
Customer Name | Rating | Feedback Comments | Feedback Date
Cleaning Steps:
1. Fill Missing Ratings: Replace "NA" with the average rating, computed
in a separate cell with =AVERAGE(B2:B100); AVERAGE skips text entries
such as "NA".