0% found this document useful (0 votes)
22 views1 page

Data Mining: Outlier Analysis - Presentation Transcript

Outlier analysis is a data mining task that identifies outliers, which are data objects that do not comply with the general behavior or model of the data. Outliers may be of particular interest for detecting fraudulent activity. There are different techniques for outlier analysis, including statistical distribution-based approaches, distance-based algorithms, and using OLAP data cube techniques to identify anomalous regions in multidimensional data.

Uploaded by

annamyem
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views1 page

Data Mining: Outlier Analysis - Presentation Transcript

Outlier analysis is a data mining task that identifies outliers, which are data objects that do not comply with the general behavior or model of the data. Outliers may be of particular interest for detecting fraudulent activity. There are different techniques for outlier analysis, including statistical distribution-based approaches, distance-based algorithms, and using OLAP data cube techniques to identify anomalous regions in multidimensional data.

Uploaded by

annamyem
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

Data Mining: Outlier analysis - Presentation Transcript

1. Outlier Analysis
2. What are outliers?
Very often, there exist data objects that do not comply with the general behavior or model
of the data. Such data objects, which are grossly different from or inconsistent with the
remaining set of data, are called outliers.
3. What is Outlier Analysis?
The outliers may be of particular interest, such as in the case of fraud detection, where
outliers may indicate fraudulent activity. Thus, outlier detection and analysis is an
interesting data mining task, referred to as outlier mining or outlier analysis.
4. Statistical Distribution-Based Outlier Detection
Two basic types of procedures for detecting outliers:
Block procedures: In this case, either all of the suspect objects are treated as outliersor all
of them are accepted as consistent.
Consecutive (or sequential) procedures: An example of such a procedure is the
insideoutprocedure.
5. Distance-Based Outlier Detection
Some efficient algorithms for mining distance-based outliers are as follows:
Index-based algorithm
Nested-loop algorithm:
Cell-based algorithm
Density-Based Local Outlier Detection
Deviation-Based Outlier Detection with Sequential Exception Technique
6. OLAP Data Cube Technique
An OLAP approach to deviation detection uses data cubes to identify regions of
anomaliesin large multidimensional data

You might also like