Introduction To Data Quality
Introduction To Data Quality
2019
Morka Mercy
Outline
• Health Management Information Systems(HMIS)
• Basic Concepts/Terms
• Data Quality
• Ten Problems affecting Data Quality
• Resolving Data Quality errors
• Group Exercise
• Summary
Health Management Information Systems(HMIS)
• Definition: ‘Health Management Information Systems (HMIS)’ can be
described as a tool which helps in gathering, aggregating, analyzing
and using information for taking actions to improve performance of
the health systems.
Information
Knowledge
when information is analyzed, communicated and acted upon, it becomes knowledge.
Basic Concepts/Terms: Examples
Data
No. of pregnant women in an area who received skilled birth assistance
Knowledge Why are some pregnant women able to receive skilled birth assistance?
Why are some pregnant women left out? Who were left out? What are
the issues related to access to service?
Data element
• A data element is a record of health event or health related event.
• Data Elements are recorded in a primary register (recording formats)
by the service provider.
• Similar events for the month are aggregated and reported in
specified reporting formats.
• Example: Number of HIV+ pregnant women who Record of a service delivered
were delivered in the health facility by a
midwife in the reporting month.
Ambiguity
Data entry
about data
errors
element
Dishonesty
Systemic
in
errors
reporting
Example: Examine ANC data reported by all the
PHCs of District X and check for accuracy in data.
Data elements PHC 1 PHC 2 PHC 3 PHC 4 PHC 5 TOTAL
PMTCT HTS given to ANC 1 215 406 500 330 230 1681
attendees (Preg HIV CTR)
ANC 1 HTS coverage rate (Percent 93% 111% 139% 100% 90% 109%
of ANC1 CTR)
Observations
• PHC 1, 4 & 5 have reported correct figures and no problem was found
while processing/analyzing data.
• Data from PHC 2 & 3 is intriguing; probably the PHCs had high number
of actual beneficiaries of PMTCT HTS at ANC1 or pregnant women were
not given PMTCT HTS in past months because PHC was out of stock and
now back log was being cleared. Further probing is required to identify
the reason for the errors.
Data Entry Errors
• Typing errors: wrong numbers entered in computer or MSF
• Wrong box entry: data entered in wrong box e.g., ‘ANC 1 attendance’
data entered in ‘Total ANC attendance’.
• Calculation errors: during data entry basic computation happens if
formula are incorrect, then errors can happen.
Performing validation checks
• Validation is performed by comparing values of 2 (or more) data elements that
are comparable.
Validation rule Left side Operator Right side
Total Preg HIV tested Total Preg HIV ≤ (less than or Total Preg HIV
positive is less than or tested positve equal to ) tested
equal to Total Preg HIV
tested
• Deviations away from the range (can also be viewed on stem and leaf
plot) it is identified as an outlier.
3. Multiple registers.
• Ensure data entry for a new month starts on a new page; other wise,
over reporting of totals may set in.
• Example:
• a. ‘ANC 1 Attendees’, vs. ‘Number of pregnant women attending ANC
for the first time’
Problem 3: Problems in data aggregation
3.Data Aggregation problems
1. Data is difficult to add up across hundreds of facilities especially manually,
disaggregated….need applications
{Facility-Wise data entry in off line application computes a block and LGA
aggregation sheet which can then be uploaded in MS Excel sheets if nothing
else available. DHIS2 e.t.c}
3. Facilities reporting late, or not at all….. Needs rules to cope with this.
• Solutions
• Forms adequate for six monthly basis
• Attend to hardware/staff problems or relax schedules accordingly.
• Mobile communication to save travel and staff time- but requires more
applications management and funding.
Problem 6: Duplication
• Data duplication leads to false higher coverage of services and
inaccurate decision making. {It covers up for lack of private sector
data}.
• Multiple entries of same client data in the register/ “Achieving
program targets syndrome for a site or health facility”?
• Multipoint testing for pregnant women leading to repeated testing of
same client!
Problem 7: The Zero Problem: How to report
nonexistent vs Non utilized services
• Example: Test kit is ran out of stock; HF report says there are
‘pregnant women tested for HIV’ cases; HF reports test done in a
private laboratory.
• What problem you can face by this?
– it adversely affects data accuracy because HFs may
overestimate or underestimate Preg HIV tested.
• Solution: Follow data collection & reporting guidelines.- suggest these
are reported as zero- and no difference be made between zeros and
blanks. Agreed?
Problem 8: Wrong choice of indicators
/denominators
• This refers to a common problem where data element itself is correct but
denominator chosen is inappropriate.
• Need to identify and remove data elements that are not used.
• Some “rates”- need far too many data elements to compute- high
degree of inaccuracy results.
Problem 10: Death Reporting issues
• Facilities/ LGAs/ State which are under-reporting deaths (for
economic or political reasons) need to be identified and worked
upon.
• Get “Government Feedback Orders” issued from LGA level to resolve the
errors.
• Ensure that every HMIS manager/data entry operator keeps a file
containing all such orders.