0% found this document useful (0 votes)

75 views33 pages

Cosc6211 Advanced Concepts in Data Mining: Weekend

This document provides an overview of an advanced concepts in data mining course. It outlines the instructor details, grading breakdown, data sources, teaching materials, and course outline. The course will cover topics like data preprocessing, association rule mining, classification, clustering, complex data mining and text mining.

Uploaded by

jemal yahyaa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

75 views33 pages

Cosc6211 Advanced Concepts in Data Mining: Weekend

Uploaded by

jemal yahyaa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 33

Cosc6211 Advanced Concepts in

Data Mining
Weekend
General Information

• Instructor: A/fetah A.A

– Email: [email protected]
– Tel:
• Lecture time:
– 9:30am-11:00am, Saturday and Sunday
• Room:

Cosc6211 2
Grading: tentative

• Individual homework : 20-25%

• Project: 15-20%
• Test 10% -> modifiable (if you want..)
• Final Exam: 50%
• Homework and project will be done with
software:
– Weka and other if you have any preference

Cosc6211 3
Data sources
• Data sources from internet
· UCI KDD Archive
· UCI Machine Learning Library

Cosc6211 4
Where to Find the Set of Slides
for the Text Book

• Tutorial sections (MS PowerPoint files):

– http://www.cs.sfu.ca/~han/dmbook
• Other conference presentation slides (.ppt):
– http://db.cs.sfu.ca/ or http://www.cs.sfu.ca/~han

• Research papers, DBMiner system, and other related

information:
– http://db.cs.sfu.ca/ or http://www.cs.sfu.ca/~han

Cosc6211 5
Teaching materials
Textbooks
• Jiawei Han and Micheline Kamber, “Data Mining:
Concepts and Techniques”.
References
• Pang-Ning Tan, Michael Steinbach, and Vipin
Kumar, "Introduction to Data Mining", Pearson
Addison Wesley, 2008, ISBN: 0-32-134136-7
• Margaret H. Dunham, Data Mining: Introductory
and Advanced Topics, Prentice Hall, 2003.
Cosc6211 6
Course outline
1. Introduction , Data preprocessing and
Association Rules Mining
2. Classification and Predication
– Decision Trees , Bayesian Classifier, rule based,
Ensemble and SVM , k-nearest neighbor, Neural
Networks and other classifications
3. Clustering
4. Complex data mining and text mining

Cosc6211 7
Outline
• Motivation: Why data mining?
• What is data mining?
• Data Mining: On what kind of data?
• Data mining Task
• Are all the patterns interesting?
• Major issues in data mining
• Association Rule Mining (if time allows)

Cosc6211 8
1. Introduction and Data preprocessing
and Association rule mining
Why Data Mining?
• The Explosive Growth of Data: from terabytes
to petabytes(1 million gigabytes)
– Data collection and data availability
• Automated data collection tools, database systems,
Web, computerized society
– Major sources of abundant data
• Business: Web, e-commerce, transactions, stocks, …
• Science: Remote sensing, bioinformatics, scientific
simulation, …
• Society and everyone: news, digital cameras, YouTube

Cosc6211 10
Why Data Mining? Commercial Viewpoint
• Lots of data is being collected and warehoused
– Web data
• Google has Peta Bytes of web data
• Facebook has billions of active users
– purchases at department/ grocery stores, e-commerce
• Amazon handles millions of visits/day
– Bank/Credit Card transactions
• Computers have become cheaper and more powerful
• Competitive Pressure is Strong
– Provide better, customized services for an edge (e.g. in
Customer Relationship Management)

Cosc6211 11
Why Data Mining? Scientific Viewpoint
• Data collected and stored at enormous speeds
– remote sensors on a satellite
• NASA EOSDIS archives over petabytes of earth science data / year
– telescopes scanning the skies
• Sky survey data
– scientific simulations
• terabytes of data generated in a few hours
• Data mining helps scientists
– in automated analysis of massive datasets
– In hypothesis formation

Cosc6211 12
Why is data mining?

• Make use of your data assets

• There is a big gap from stored data to knowledge;
and the transition won’t occur automatically.
• Many interesting things that one wants to find
cannot be found using database queries
– “find people likely to buy my products”
– “Who are likely to respond to my promotion.
– “Which movies should be recommended to each
customer?”

Cosc6211 13
What Is Data Mining?

• Data mining is also called knowledge discovery and data

mining (KDD)
• Data mining is
– extraction of useful patterns from data sources, e.g.,
databases, texts, web, images, etc.
– Patterns must be:
• valid, novel, potentially useful, understandable
• Alternative names
– Knowledge discovery (mining) in databases (KDD), knowledge
extraction, data/pattern analysis, data archeology, data
dredging, information harvesting, business intelligence.

Cosc6211 14
What is (not) Data Mining?
• Is not data mining:
– Look up phone number in phone directory
– Query a Web search engine for information about “Amazon”
• Is data mining:
– Group together similar documents returned by search engine
according to their context (e.g. Amazon rainforest,
Amazon.com,)
• Identify the following:
– Sales analysis
• What are the sales by quarter and region?
• How do sales compare in two different stores in the same state?

Cosc6211 15
Knowledge Discovery (KDD) Process
– Data mining: the core of
knowledge discovery Knowledge Interpretation
process.
Data Mining

Task-relevant Data
Data transformations

Preprocessed Selection
Data
Data Cleaning

Data Integration

Databases Cosc6211 16
Steps of a KDD Process

• Learning the application domain

– relevant prior knowledge and goals of application
• Data cleaning: missing values, noisy data, and inconsistent data
• Data integration: merging data from multiple data stores
• Data selection: select the data relevant to the analysis
• Data transformation: aggregation (daily sales to weekly or monthly sales)
or generalisation (street to city; age to young, middle age and senior)
• Data mining: apply intelligent methods to extract patterns
• Pattern evaluation: interesting patterns should contradict the user’s
belief or confirm a hypothesis the user wished to validate
• Knowledge presentation: visualization and representation techniques to
present the mined knowledge to the users

Cosc6211 17
Why Data Preprocessing?

• Data in the real world is dirty

– incomplete: lacking attribute values, lacking certain
attributes of interest, or containing only aggregate data
• e.g., occupation=“ ”
– noisy: containing errors or outliers
• e.g., Salary=“-10”
– inconsistent: containing discrepancies in codes or names
• e.g., Age=“42” Birthdate=“03/07/1997”
• e.g., Was rating “1,2,3”, now rating “A, B, C”
• e.g., discrepancy between duplicate records

Cosc6211 18
Why Is Data Dirty?

• Incomplete data may come from

– “Not applicable” data value when collected
– Different considerations between the time when the data was collected and
when it is analyzed.
– Human/hardware/software problems
• Noisy data (incorrect values) may come from
– Faulty data collection instruments
– Human or computer error at data entry
– Errors in data transmission
• Inconsistent data may come from
– Different data sources
– Functional dependency violation (e.g., modify some linked data)
• Duplicate records also need data cleaning

Cosc6211 19
Why Is Data Preprocessing Important?

• No quality data, no quality mining results!

– Quality decisions must be based on quality data
• e.g., duplicate or missing data may cause incorrect or
even misleading statistics.
– Data warehouse needs consistent integration of
quality data
• Data extraction, cleaning, and transformation
comprises the majority of the work of building
a data warehouse
Cosc6211 20
Data Mining: on what kinds of data
In principle, data mining should be applicable to any data repository
• Database-oriented data sets and applications
– Relational database, data warehouse, transactional database
• Advanced data sets and advanced applications
– Object-relational databases
– Time-series data, temporal data, sequence data (incl. bio-sequences)
– Spatial data and spatiotemporal data
– Text databases and Multimedia databases
– The World-Wide Web
– Heterogeneous databases …

Cosc6211 21
Origins of Data Mining

• Data Mining combines

ideas from statistics,
Artificial
machine learning, intelligence

artificial intelligence, and

database systems
– Tries to overcome
shortcomings of Database Pattern
traditional techniques systems recognition

concerning
• large amount of data
• high dimensionality of data
• heterogeneous and
complex nature of data Statistics

Cosc6211 22
Data Mining Tasks
• Descriptive Tasks
– Goal: Find patterns in the data.
– Example: Which products are often bought together?
• Predictive Tasks
– Goal: Predict unknown values of a variable
• given observations (e.g., from the past)
• Machine Learning Terminology
– descriptive = unsupervised
– predictive = supervised

Cosc6211 23
Classic data mining tasks

• Classification:
mining patterns that can classify future (new) data into known
classes.
• Association rule mining
mining any rule of the form X  Y, where X and Y are sets of
data items. E.g.,
Cheese, Milk Bread [sup =5%, confid=80%]
Age(X, ”20..29”) and income(X, ”20k..29k”) -> buys(X, ”cd-
player”) [support=2%, confidence=60%]
• Clustering
identifying a set of similarity groups in the data

Cosc6211 24
Classic data mining tasks (contd)

• Sequential pattern mining:

– A sequential rule: A B, says that event A will be
immediately followed by event B with a certain
confidence
• Deviation, outlier, and novelty detection:
– Discovering the most significant changes in data
• Data visualization
– using graphic methods to show patterns in data.

Cosc6211 25
Data Mining Applications
Market analysis and management
• Target marketing
– Find clusters of “model” customers who share the
same characteristics: interest, income level,
spending habits, etc.
– Determine customer purchasing patterns over time
• Cross-market analysis—Find associations/co-
relations between product sales, & predict
based on such association
Cosc6211 26
Data Mining Applications
Market analysis and management(2)
• Customer profiling
– data mining can identify what types of customers
buy what products (clustering or classification)
• Identify customer requirements
– identify the “best” products for different
customers
– use prediction techniques to find what factors will
attract new customers
Cosc6211 27
Data Mining Applications
Fraud Detection & Mining Unusual Patterns
• Approaches: Clustering & model construction for frauds, outlier analysis
• Applications: Health care, retail, credit card service, telecomm.
– Auto insurance: ring of collisions
– Money laundering: suspicious monetary transactions
– Medical insurance
• Professional patients, ring of doctors, and ring of references
• Unnecessary or correlated screening tests
– Telecommunications: phone-call fraud
• Phone call model: destination of the call, duration, time of day or week. Analyze patterns
that deviate from an expected norm
– Retail industry
• Analysts estimate that 38% of retail shrink is due to dishonest employees
– Anti-terrorism

Cosc6211 28
Are All the “Discovered” Patterns Interesting?

• Data mining may generate thousands of patterns: Not all of

them are interesting
– Suggested approach: Human-centered, query-based, focused mining
• Interestingness measures
– A pattern is interesting if it is easily understood by humans, valid on
new or test data with some degree of certainty, potentially useful,
novel, or validates some hypothesis that a user seeks to confirm
• Objective vs. subjective interestingness measures
– Objective: based on statistics and structures of patterns, e.g.,
support, confidence, etc.
– Subjective: based on user’s belief in the data, e.g., unexpectedness,
novelty, actionability, etc

Cosc6211 29
Major Issues in Data Mining
• Mining methodology
– Mining different kinds of knowledge from diverse data types, e.g., bio, stream, Web
– Performance: efficiency, effectiveness, and scalability
– Pattern evaluation: the interestingness problem
– Incorporation of background knowledge
– Handling noise and incomplete data
– Parallel, distributed and incremental mining methods
– Integration of the discovered knowledge with existing one: knowledge fusion
• User interaction
– Data mining query languages and ad-hoc mining
– Expression and visualization of data mining results
– Interactive mining of knowledge at multiple levels of abstraction
• Applications and social impacts
– Domain-specific data mining & invisible data mining
– Protection of data security, integrity, and privacy

Cosc6211 30
Summary
• Data Mining is a process of extracting knowledge from data
• Data to be mined can be of any type
– Relational Databases, Advanced databases, etc.
• Knowledge to be discovered
– Frequent patterns, correlations, associations, classification, prediction,
clustering
• Data Mining is interdisciplinary
– Large amount of complex data and sophisticated applications
• Challenges of data Mining
– Efficiency, scalability, parallel and distributed mining, handling high
dimensionality, handling noisy data, mining heterogeneous data, etc.

Cosc6211 31
Where to Find References?

• More conferences on data mining

– PAKDD (1997), PKDD (1997), SIAM-Data Mining (2001), (IEEE) ICDM (2001), etc.
• Data mining and KDD
– Conferences: ACM-SIGKDD, IEEE-ICDM, SIAM-DM, PKDD, PAKDD, etc.
– Journal: Data Mining and Knowledge Discovery, KDD Explorations
• Database systems
– Conferences: ACM-SIGMOD, ACM-PODS, VLDB, IEEE-ICDE, EDBT, ICDT, DASFAA
– Journals: ACM-TODS, IEEE-TKDE, JIIS, J. ACM, etc.
• AI & Machine Learning
– Conferences: Machine learning (ML), AAAI, IJCAI, COLT (Learning Theory), etc.
– Journals: Machine Learning, Artificial Intelligence, etc.
• Statistics
– Conferences: Joint Stat. Meeting, etc.
– Journals: Annals of statistics, etc.
• Visualization
– Conference proceedings: CHI, ACM-SIGGraph, etc.
– Journals: IEEE Trans. visualization and computer graphics, etc.

Cosc6211 32
• Next: Association rule mining

Cosc6211 33

Data Mining Concepts and Techniques - Han, Kamber & Pei
No ratings yet
Data Mining Concepts and Techniques - Han, Kamber & Pei
953 pages
Family Skills Module
100% (2)
Family Skills Module
21 pages
Cosc411 M7.2 2023
No ratings yet
Cosc411 M7.2 2023
4 pages
Introduction To Data Mining & Business Intelligence
No ratings yet
Introduction To Data Mining & Business Intelligence
25 pages
BIS 541 Ch01 20-21 S
No ratings yet
BIS 541 Ch01 20-21 S
129 pages
Unit 3 DW
No ratings yet
Unit 3 DW
19 pages
001lecture - 1 Introduction-1
No ratings yet
001lecture - 1 Introduction-1
40 pages
DM-Unit-I Introduction To Association-1
No ratings yet
DM-Unit-I Introduction To Association-1
97 pages
Unit - I MLT
No ratings yet
Unit - I MLT
137 pages
CIS 467 - Topic 1 - Introduction - 2020
No ratings yet
CIS 467 - Topic 1 - Introduction - 2020
79 pages
DE Unit1 - Introdcution - DE - 8jul24
No ratings yet
DE Unit1 - Introdcution - DE - 8jul24
56 pages
Unit III
No ratings yet
Unit III
101 pages
Mekelle University-Mekelle Institute of Technology Department of Information Technology Data Mining and Knowledge Discovery
No ratings yet
Mekelle University-Mekelle Institute of Technology Department of Information Technology Data Mining and Knowledge Discovery
36 pages
Chapter 3
No ratings yet
Chapter 3
9 pages
02-Introduction To Data Mining
No ratings yet
02-Introduction To Data Mining
40 pages
Data Mining New Notes Unit 3 PDF
No ratings yet
Data Mining New Notes Unit 3 PDF
12 pages
CSE2021 - MODULE 1ppt
No ratings yet
CSE2021 - MODULE 1ppt
62 pages
Unit-1 A
No ratings yet
Unit-1 A
47 pages
Datamining 1
No ratings yet
Datamining 1
30 pages
Data Mining and Warehousing-1
No ratings yet
Data Mining and Warehousing-1
43 pages
DataMining S
No ratings yet
DataMining S
103 pages
1 Chapter One
No ratings yet
1 Chapter One
54 pages
01-Introduction To Data Mining
No ratings yet
01-Introduction To Data Mining
43 pages
RMM Unit-I Introdution To Data Mining
No ratings yet
RMM Unit-I Introdution To Data Mining
129 pages
Datawarehouse&Data Mining - ALL
No ratings yet
Datawarehouse&Data Mining - ALL
46 pages
Unit-4 Introduction To Data Mining
No ratings yet
Unit-4 Introduction To Data Mining
26 pages
Combine 056
No ratings yet
Combine 056
57 pages
Unit - 2 Data Minig Notes
No ratings yet
Unit - 2 Data Minig Notes
15 pages
Data Mining Written Notes 1
No ratings yet
Data Mining Written Notes 1
35 pages
Lecture - 1 02032023 095637am 1 29022024 124126pm
No ratings yet
Lecture - 1 02032023 095637am 1 29022024 124126pm
33 pages
Data Mining:: Concepts and Techniques
No ratings yet
Data Mining:: Concepts and Techniques
28 pages
Data Mining
No ratings yet
Data Mining
395 pages
Notes For DMDWH - Module1
No ratings yet
Notes For DMDWH - Module1
21 pages
01 Intro
No ratings yet
01 Intro
22 pages
2-Introduction To Data Mining, Steps in Data Mining Process-31-07-2024
No ratings yet
2-Introduction To Data Mining, Steps in Data Mining Process-31-07-2024
77 pages
UNIT-3 DATA MINING - Part1
No ratings yet
UNIT-3 DATA MINING - Part1
111 pages
Concepts and Techniques: - Chapter 1
No ratings yet
Concepts and Techniques: - Chapter 1
48 pages
Unit-1 Notes
No ratings yet
Unit-1 Notes
24 pages
July 16, 2009 1 Data Mining
No ratings yet
July 16, 2009 1 Data Mining
26 pages
MC5403 Adbdm Unit Ii Notes
No ratings yet
MC5403 Adbdm Unit Ii Notes
59 pages
Week 4 - Introduction To Data Mining and Data Mining Techniques
No ratings yet
Week 4 - Introduction To Data Mining and Data Mining Techniques
44 pages
01 Intro
No ratings yet
01 Intro
29 pages
Chapter 1
No ratings yet
Chapter 1
6 pages
02 DM BI Data Mining
No ratings yet
02 DM BI Data Mining
66 pages
DM 1
No ratings yet
DM 1
47 pages
Data Mining 1
No ratings yet
Data Mining 1
39 pages
Data Mining Chapter 1 Notes
No ratings yet
Data Mining Chapter 1 Notes
40 pages
ADET - Lesson 2
No ratings yet
ADET - Lesson 2
21 pages
What Is Data Mining: Effective Data Collection Warehousing
No ratings yet
What Is Data Mining: Effective Data Collection Warehousing
21 pages
Datamining&warehousing
No ratings yet
Datamining&warehousing
65 pages
01 Intro
No ratings yet
01 Intro
28 pages
DWDM 01 Introduction
No ratings yet
DWDM 01 Introduction
43 pages
Wk. 1. Introduction (08.10.2020)
No ratings yet
Wk. 1. Introduction (08.10.2020)
30 pages
01 Intro 1
No ratings yet
01 Intro 1
50 pages
Course Manual On Data Mining - CSC 425 - 015446
No ratings yet
Course Manual On Data Mining - CSC 425 - 015446
44 pages
Data Mining Merged PDF CS1 CS8
No ratings yet
Data Mining Merged PDF CS1 CS8
272 pages
DM Mod1
No ratings yet
DM Mod1
29 pages
Mastering Data Mining Techniques
From Everand
Mastering Data Mining Techniques
Dhaanyalakshmi Ahuja
No ratings yet
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Data Mining for Beginners: A Programmer’s Guide
From Everand
Data Mining for Beginners: A Programmer’s Guide
Agasti Khatri
No ratings yet
151604 GRF
No ratings yet
151604 GRF
1 page
Health Certificate For Driving License
No ratings yet
Health Certificate For Driving License
2 pages
151604 GRF
No ratings yet
151604 GRF
1 page
Goods Return Form (GRF)
No ratings yet
Goods Return Form (GRF)
1 page
Goods Return Form (GRF)
No ratings yet
Goods Return Form (GRF)
1 page
2023 EER RAN Rollout Half Year Report
No ratings yet
2023 EER RAN Rollout Half Year Report
12 pages
151404#LinkDesign20241012-195350
No ratings yet
151404#LinkDesign20241012-195350
5 pages
SSV-262 Reg2-09062024092131
No ratings yet
SSV-262 Reg2-09062024092131
13 pages
Link Information - 2024-01-31 16-34-48
No ratings yet
Link Information - 2024-01-31 16-34-48
35 pages
EER December 2023 Attendance
No ratings yet
EER December 2023 Attendance
4 pages
Cooperation Request
No ratings yet
Cooperation Request
1 page
Raso Town E2e & RF Plan
No ratings yet
Raso Town E2e & RF Plan
22 pages
CHQ2023060216305350
No ratings yet
CHQ2023060216305350
2 pages
Regional 219 LTE Expansion Project Nominal Planning - E2E V1.4
No ratings yet
Regional 219 LTE Expansion Project Nominal Planning - E2E V1.4
22 pages
Second Phase Regional 262 LTE Layering Project - 106 MW Link Upgrade - 8G ODU Resource Mapping Data
No ratings yet
Second Phase Regional 262 LTE Layering Project - 106 MW Link Upgrade - 8G ODU Resource Mapping Data
2 pages
Regional 219 LTE Expansion Project RFC Request
No ratings yet
Regional 219 LTE Expansion Project RFC Request
27 pages
Site Survey & TSSR Progresss Report V1.1
No ratings yet
Site Survey & TSSR Progresss Report V1.1
14 pages
SSV-262 Reg-09062024085551
No ratings yet
SSV-262 Reg-09062024085551
13 pages
Daily Staff Mapping
No ratings yet
Daily Staff Mapping
4 pages
Material Requisition: Region Code Works Order No
No ratings yet
Material Requisition: Region Code Works Order No
2 pages
Dec 19, 2022 Brifing
No ratings yet
Dec 19, 2022 Brifing
4 pages
Microwave Link 151263 - 151265-151200-151199-151414 GRF
No ratings yet
Microwave Link 151263 - 151265-151200-151199-151414 GRF
1 page
EER-Regional 262 LTE Layering-106 Microwave Link Swap E2E-V-1.5 - As - of - 17-11-2022
No ratings yet
EER-Regional 262 LTE Layering-106 Microwave Link Swap E2E-V-1.5 - As - of - 17-11-2022
14 pages
1204-1594-1088-1602-1608 On Site Recorded GRF
No ratings yet
1204-1594-1088-1602-1608 On Site Recorded GRF
1 page
(151092) - 151007
No ratings yet
(151092) - 151007
12 pages
EER 3G Expansion Staff-Route Mapping
No ratings yet
EER 3G Expansion Staff-Route Mapping
1 page
EER U2100 Layering MR
100% (1)
EER U2100 Layering MR
1 page
Dorn Method
No ratings yet
Dorn Method
2 pages
Colby Whites 5e Lesson Plan
No ratings yet
Colby Whites 5e Lesson Plan
3 pages
College,' by Andrew Delbanco - NYTimes PDF
No ratings yet
College,' by Andrew Delbanco - NYTimes PDF
3 pages
Mathematics Grade 4 Controlled Test Term 2 2023
No ratings yet
Mathematics Grade 4 Controlled Test Term 2 2023
6 pages
Learning From Modern Heritage Methodolog
No ratings yet
Learning From Modern Heritage Methodolog
19 pages
Mi 1
No ratings yet
Mi 1
2 pages
15 Conversation Topic With Guide
No ratings yet
15 Conversation Topic With Guide
2 pages
Nominal Roll Edited New New
No ratings yet
Nominal Roll Edited New New
10 pages
Lesson Plan-Croc-New
No ratings yet
Lesson Plan-Croc-New
3 pages
01 PGM Reg r2
No ratings yet
01 PGM Reg r2
29 pages
1688329698-01 STUDENT Answer Booklet C3L6 2023
No ratings yet
1688329698-01 STUDENT Answer Booklet C3L6 2023
7 pages
Australian Biblical Review Aramaic Gramm
No ratings yet
Australian Biblical Review Aramaic Gramm
2 pages
Engaged Listening Worksheet 3 - 7
No ratings yet
Engaged Listening Worksheet 3 - 7
3 pages
Activity 1-Informational Listening Deped
No ratings yet
Activity 1-Informational Listening Deped
2 pages
Middleton and Spanias - Motivation For Achievement in Mathematics
No ratings yet
Middleton and Spanias - Motivation For Achievement in Mathematics
25 pages
Quiz01 - Nco113 - Jan24 - t02 - Teamwork in The Modern Workplace
No ratings yet
Quiz01 - Nco113 - Jan24 - t02 - Teamwork in The Modern Workplace
5 pages
ET5697 ET5694 Dissertation Submit November 2024
No ratings yet
ET5697 ET5694 Dissertation Submit November 2024
13 pages
Waiver For Graduating Students: Subject Code Subject Description
No ratings yet
Waiver For Graduating Students: Subject Code Subject Description
2 pages
0500 Scheme of Work Unit 5
No ratings yet
0500 Scheme of Work Unit 5
4 pages
Job Analysis and Job Design
No ratings yet
Job Analysis and Job Design
55 pages
IIT (BHU) /Acad./UG/Admissions/2020 October 23, 2020
No ratings yet
IIT (BHU) /Acad./UG/Admissions/2020 October 23, 2020
2 pages
The Master and His Emissary The Divided Brain and The Making of The Western World Iain Mcgilchrist Download
No ratings yet
The Master and His Emissary The Divided Brain and The Making of The Western World Iain Mcgilchrist Download
87 pages
Critical Review of Quantitative and Qualitative Research: Xueting Xiong
No ratings yet
Critical Review of Quantitative and Qualitative Research: Xueting Xiong
4 pages
Expt. No.7 - Study of Simulation Software Proteus &...
No ratings yet
Expt. No.7 - Study of Simulation Software Proteus &...
2 pages
Amad
No ratings yet
Amad
1 page
CRITERIA
No ratings yet
CRITERIA
1 page
Philhis Handouts Week 1
No ratings yet
Philhis Handouts Week 1
5 pages
Advising List Summer 2024
No ratings yet
Advising List Summer 2024
19 pages
Dr. Susanta Kumar Nayak PDF
No ratings yet
Dr. Susanta Kumar Nayak PDF
11 pages

Cosc6211 Advanced Concepts in Data Mining: Weekend

Uploaded by

Cosc6211 Advanced Concepts in Data Mining: Weekend

Uploaded by

Cosc6211 Advanced Concepts in

• Instructor: A/fetah A.A

• Individual homework : 20-25%

• Tutorial sections (MS PowerPoint files):

• Research papers, DBMiner system, and other related

• Make use of your data assets

• Data mining is also called knowledge discovery and data

• Learning the application domain

• Data in the real world is dirty

• Incomplete data may come from

• No quality data, no quality mining results!

• Data Mining combines

artificial intelligence, and

• Sequential pattern mining:

• Data mining may generate thousands of patterns: Not all of

• More conferences on data mining

You might also like