Data Mining Concepts
Data Mining Concepts
Data mining is the process of discovering actionable information from large sets of data. Data
mining uses mathematical analysis to derive patterns and trends that exist in data. Typically, these patterns
cannot be discovered by traditional data exploration because the relationships are too complex or because
there is too much data.
These patterns and trends can be collected and defined as a data mining model. Mining models can be
applied to specific scenarios, such as:
Risk and probability: Choosing the best customers for targeted mailings, determining the
probable break-even point for risk scenarios, assigning probabilities to diagnoses or other
outcomes
Finding sequences: Analyzing customer selections in a shopping cart, predicting next likely
events
Grouping: Separating customers or events into cluster of related items, analyzing and predicting
affinities
Building a mining model is part of a larger process that includes everything from asking questions
about the data and creating a model to answer those questions, to deploying the model into a working
environment. This process can be defined by using the following six basic steps:
Data understandingcollect the initial data, become familiar with it, and look for any data
quality problems.
Data mining in health care has become increasingly popular because it offers benefits to care
providers, patients, healthcare organizations, researchers, and insurers.
Care providers can use data analysis to identify effective treatments and best practices. By
comparing causes, symptoms, treatments, and their adverse effects, data mining can analyze which
courses of action are most effective for specific patient groups. It can also identify clinical best practices
to help develop guidelines and standards of care.
Patients can receive better, more affordable healthcare services. This is especially true when
healthcare managers use data mining applications to identify and track chronic diseases and high-risk
patients, design appropriate interventions, and reduce the number of hospital admissions and claims.
Healthcare organizations can use data mining to make better patient-related decisions. For
instance, it provides information to guide patient interactions by determining patient preferences, usage
patterns, and current and future needsall of which helps to improve patient satisfaction. With healthcare
organizations under increasing financial pressure, data mining can also influence revenues, costs, and
operating efficiency while maintaining high-quality care.
Insurers can detect medical insurance fraud and abuse through data mining by establishing norms
and then identifying unusual claims patterns. For example, data mining can pinpoint inappropriate
prescriptions or referrals and fraudulent insurance and medical claims. Insurers can use this information
to
reduce
their
lossesand
the
costs
of
health
care.
Nurses have shouldered a large portion of the responsibility for recording patient data in the EHR, but
their efforts will contribute to a significant potential benefit to patients and to the health delivery system.
As more data becomes available to data miners, nurses will also benefit by having more opportunities to
provide appropriate, well planned, and cost-effective patient care.
Submitted By:
Geramei V. Tejada
BSN-II