SlideShare a Scribd company logo
Data Mining Fundamentals
Presented by
S.P. Siddique Ibrahim AP/CSE
Kumaraguru College of
Technology
09/06/18
1
09/06/18
2
DATA
• Fact, Numbers (or) text.
• Different format and different databases.
• Examples:
• College department,CoE,Hostal, Payroll,
• Police traffic, fraud data
• RTO
• Revenue
09/06/18
3
09/06/18
4
DATA Contd.,
• Operational or Transactional – sales, cost,
inventory, payroll
• Non operational- industry sales , forecasting
data, micro economic data
• Meta data – logical database design (or)
dictionary design
09/06/18
5
Information
• The pattern, relationships among all this data
can provide information.
• Example:
• Analysis retail point of sale transaction
• Product –selling and date
09/06/18
6
Knowledge
• Information can be converted into knowledge
about historical patterns and future trends.
• Example:
• Summary information of petrol/Diesel usage.
09/06/18
7
09/06/18
8
09/06/18
9
09/06/18
10
/
media/siddique/Files1/Kct/Siddique/Datami
ning Lab/id93_480x480.jpg
General Definition of Mining
• Mining is the process of extracting commercially
valuable minerals resources from earth’s surface.
• -Stones
• -Ores
• -Solid Fuels
• -Crude Oil
 Surface Mining
 Subsurface mining
09/06/18
11
09/06/18
12
09/06/18
13
Definition contd.,
• Analyzing data from different perspective and
summarize information.
• Information-revenue
• Technically, it is the process of finding
correlation (or) patterns among dozens of fields
in large relational databases.
• Example: Purchase pattern, credit card
09/06/18
14
Data Mining
• Data mining, “the extraction of hidden predictive
information from large databases”.
• It is a powerful new technology with great potential to
help companies focus on the most important information
in their data warehouses.
• The main focus of data mining process is to obtain
information from the data and converted it into an
knowledgeable and reasonable structure for further use.
09/04/18
15
09/06/18
15
Different Mining in Real Time
09/06/18
16
09/06/18
17
Data set
bank-data.csv
09/06/18
18
What is Machine Learning??
Machine learning is a field of computer science
that uses statistical techniques to give computer
systems the ability to "learn" with data, without
being explicitly programmed.
09/06/18
19
Why data mining for you?
• Research Opportunities.
• High Opening.
• Handling Big data.
• Analysis capability
• Future
09/06/18
20
What can data mining do?
• Company competition
• Internal and external factor
• Analyze real feelings of customer/Auditions
09/06/18
21
09/06/18
22
09/06/18
23
What is pattern? How it would be?
• Customer buying pattern
• Mobile pattern
• Bus ticket booking
• Cricket analysis
• Political parties election freebies
• Credit card
09/06/18
24
Amazon recommendation
Big Bazar
09/06/18
25
09/06/18
26
09/06/18
27
09/06/18
28
Password Hacking
ATM Machine-Fraud
09/06/18
29
09/06/18
30
09/06/18
31
What is Data Warehouse????
A data warehousing is a technique for collecting
and managing data from varied sources to
provide meaningful business insights.
Online Airline/Bus/Railway ticket booking
09/06/18
32
Data warehouse
• Centralized storage
• Free use of data
09/06/18
33
How does data mining work?
• Classes – stored data is used to locate data in
predetermined groups.
• Example:
• Restaurant chain
09/06/18
34
09/06/18
35
What is this? Classify it!!!!
09/06/18
36
09/06/18
37
Can u able to Identify Now!!??
09/06/18
38
Cluster
• Data items are grouped based on
similarity(logical relationship) (or) consumer
preferences.
•
• Example:
• Data used for market segments
09/06/18
39
Association
• Mined for associations.
Example:
Market basket analysis
09/06/18
40
Sequential patterns
• Data is mined to anticipate behavior patterns
and trends.
09/06/18
41
Regression Analysis
• A measure of the relation between the mean
value of one variable(E.g. Output) and
corresponding values of other variables(E.g
Time and cost)
09/06/18
42
09/06/18
43
09/06/18
44
09/06/18
45
09/06/18
46
09/06/18
47
09/06/18
48
09/06/18
49
09/06/18
50
09/06/18
51
09/06/18
52
09/06/18
53
09/06/18
54
09/06/18
55
09/06/18
56
09/06/18
57
09/06/18
58
https://create.kahoot.it/kahoots/my-kahoots
09/06/18
59

More Related Content

PPTX
Data Mining: Graph mining and social network analysis
PPTX
WEB BASED INFORMATION RETRIEVAL SYSTEM
PPTX
Boolean,vector space retrieval Models
PPTX
Ppt evaluation of information retrieval system
PPTX
Probabilistic retrieval model
PPTX
Text MIning
PDF
Mining Frequent Patterns And Association Rules
PPT
Compiler Design
Data Mining: Graph mining and social network analysis
WEB BASED INFORMATION RETRIEVAL SYSTEM
Boolean,vector space retrieval Models
Ppt evaluation of information retrieval system
Probabilistic retrieval model
Text MIning
Mining Frequent Patterns And Association Rules
Compiler Design

What's hot (20)

PPTX
Cryptography - Block cipher & stream cipher
PPTX
Web usage mining
PPT
3. mining frequent patterns
PPTX
Data science unit1
PDF
Tutorial on Web Scraping in Python
PPTX
Automatic indexing
PPT
Neural Networks in Data Mining - “An Overview”
PPTX
BIBLIOMETRICS LAWS
PPTX
Introduction to Information Retrieval
PPTX
Data warehouse and data mining
PPTX
Information retrieval s
PPTX
Model of information retrieval (3)
PPTX
Information retrieval introduction
PPTX
Probabilistic information retrieval models & systems
PPTX
Encryption
PDF
The fundamentals of Machine Learning
PPTX
Text mining
PPTX
Double DES & Triple DES
PPSX
Frequent itemset mining methods
PPTX
Informatio retrival evaluation
Cryptography - Block cipher & stream cipher
Web usage mining
3. mining frequent patterns
Data science unit1
Tutorial on Web Scraping in Python
Automatic indexing
Neural Networks in Data Mining - “An Overview”
BIBLIOMETRICS LAWS
Introduction to Information Retrieval
Data warehouse and data mining
Information retrieval s
Model of information retrieval (3)
Information retrieval introduction
Probabilistic information retrieval models & systems
Encryption
The fundamentals of Machine Learning
Text mining
Double DES & Triple DES
Frequent itemset mining methods
Informatio retrival evaluation
Ad

Similar to Data mining basic fundamentals (20)

PPT
Unit 1 (Chapter-1) on data mining concepts.ppt
PPT
Introduction of Data Mining - Concept and techniques
PPTX
Lect 1 introduction
PPT
3RD B.TECH-DATAMINING-INTRODUCTION-UNIT1 .ppt
PPT
Data Mining and Warehousing Concept and Techniques
PPT
introduction to datamining and warehousing
PPT
DATA MINING: INTRODUCTION TO DATA MINING
PPT
01Intro.ppt data mining dahauuehuwhuwrwhrurhuqhuahura
PPTX
DWDM 3rd EDITION TEXT BOOK SLIDES24.pptx
PPTX
dataminingintroductionpptpptpptptro.pptx
PPT
01Intro.ppt data analytics r language slide 1
PPT
Chapter 1. Introduction.ppt
PPT
hanjia chapter_1.ppt data mining chapter 1
PPTX
Business Intelligence and Analytics Unit-2 part-A .pptx
PPT
Data Mining
PPT
Data Mining
PPT
01Intro(1).ppt Introduction In computer science
PPT
01Intro.ppt
PPT
Data Mining: Concepts and Techniques for beginner
PPT
Data Mining Intro
Unit 1 (Chapter-1) on data mining concepts.ppt
Introduction of Data Mining - Concept and techniques
Lect 1 introduction
3RD B.TECH-DATAMINING-INTRODUCTION-UNIT1 .ppt
Data Mining and Warehousing Concept and Techniques
introduction to datamining and warehousing
DATA MINING: INTRODUCTION TO DATA MINING
01Intro.ppt data mining dahauuehuwhuwrwhrurhuqhuahura
DWDM 3rd EDITION TEXT BOOK SLIDES24.pptx
dataminingintroductionpptpptpptptro.pptx
01Intro.ppt data analytics r language slide 1
Chapter 1. Introduction.ppt
hanjia chapter_1.ppt data mining chapter 1
Business Intelligence and Analytics Unit-2 part-A .pptx
Data Mining
Data Mining
01Intro(1).ppt Introduction In computer science
01Intro.ppt
Data Mining: Concepts and Techniques for beginner
Data Mining Intro
Ad

More from Siddique Ibrahim (20)

PPTX
List in Python
PPT
Python Control structures
PPTX
Python programming introduction
PPT
Basic networking
PPT
Virtualization Concepts
PPT
Networking devices(siddique)
PPT
Osi model 7 Layers
PPT
Mysql grand
PPT
Getting started into mySQL
PPT
pipelining
PPT
Micro programmed control
PPTX
Hardwired control
PPT
interface
PPT
Interrupt
PPT
Interrupt
PPT
Io devies
PPT
Stack & queue
PPT
Metadata in data warehouse
PPTX
Data extraction, transformation, and loading
List in Python
Python Control structures
Python programming introduction
Basic networking
Virtualization Concepts
Networking devices(siddique)
Osi model 7 Layers
Mysql grand
Getting started into mySQL
pipelining
Micro programmed control
Hardwired control
interface
Interrupt
Interrupt
Io devies
Stack & queue
Metadata in data warehouse
Data extraction, transformation, and loading

Recently uploaded (20)

PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
project resource management chapter-09.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
A Presentation on Touch Screen Technology
PDF
Approach and Philosophy of On baking technology
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Getting Started with Data Integration: FME Form 101
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Tartificialntelligence_presentation.pptx
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
August Patch Tuesday
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
WOOl fibre morphology and structure.pdf for textiles
Web App vs Mobile App What Should You Build First.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A comparative study of natural language inference in Swahili using monolingua...
project resource management chapter-09.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Hindi spoken digit analysis for native and non-native speakers
A Presentation on Touch Screen Technology
Approach and Philosophy of On baking technology
Encapsulation_ Review paper, used for researhc scholars
DP Operators-handbook-extract for the Mautical Institute
Getting Started with Data Integration: FME Form 101
SOPHOS-XG Firewall Administrator PPT.pptx
Unlocking AI with Model Context Protocol (MCP)
Tartificialntelligence_presentation.pptx
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
August Patch Tuesday
Agricultural_Statistics_at_a_Glance_2022_0.pdf
WOOl fibre morphology and structure.pdf for textiles

Data mining basic fundamentals