0% found this document useful (0 votes)
34 views3 pages

DW&M Syllabus

The document outlines the course 'Data Warehousing and Mining' offered by the Department of Computer Science and Engineering at GITAM University, detailing its objectives, structure, and content. It covers data mining techniques, data warehousing, and various algorithms for data analysis, aiming to equip students with practical skills in data handling and decision-making. The course also aligns with Sustainable Development Goal 8, emphasizing the importance of data in economic growth and business decision-making.

Uploaded by

varcommando
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views3 pages

DW&M Syllabus

The document outlines the course 'Data Warehousing and Mining' offered by the Department of Computer Science and Engineering at GITAM University, detailing its objectives, structure, and content. It covers data mining techniques, data warehousing, and various algorithms for data analysis, aiming to equip students with practical skills in data handling and decision-making. The course also aligns with Sustainable Development Goal 8, emphasizing the importance of data in economic growth and business decision-making.

Uploaded by

varcommando
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Department of Computer Science and Engineering, GITAM Deemed to be University

L T P S J C
CSEN3201 DATA WAREHOUSING AND MINING
2 1 0 0 0 3
Pre-requisite
Co-requisite None
Preferable
None
exposure

Course Description:
Due to the advent of technology, the internet, and advanced applications like social media, a
huge amount of digital data has been accumulated in data centres or in Cloud storage devices,
which has led to a situation “we are drowning in data but starving for knowledge”. Various
data mining techniques like Association Analysis, Classification, Clustering, Outlier Analysis and
Web mining are applied to the data to extract golden nuggets useful for the decision-making
process.Data warehousing (DW) is an integral part of the knowledge discovery process, where
DW plays a vital role. DW is an integration of multiple heterogeneous data repositories under
a unified schema at a single site. The students will acquire knowledge in Data modelling, design,
architecture, data warehouse implementation and further development of data cube
technology.
Course Educational Objectives:
Illustrate the importance of Data Mining and its applications
Explain various types of data, pre-processing techniques and OLAP operations
Examine the characteristics of various data mining models
Experiment with various data mining algorithms
Illustrate the performance of data mining algorithms

UNIT 1 Introduction to Data Mining 8 hours, p 4 hours


Introduction to Data mining: Motivation for Data Mining, Importance of Data Mining,
Definition, kinds of data, Data mining functionalities, kinds of patterns to be mined, pattern
interestingness, Classification of data mining systems. Four views of data mining, key
components in the data mining.

Data understanding: Identifying key data properties and characterize different datasets,
Objects and Attributes, Statistics, Visualization, and Data Similarity.

UNIT 2 Data Pre-processing and Data Warehousing 8 hours, p 2 hours


Data Pre-processing: Need for pre-processing and various data pre-processing techniques.
Data Cleaning, Data Integration, Data Transformation, Data Reduction.

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch


Department of Computer Science and Engineering, GITAM Deemed to be University

Data Warehousing: Key characteristics of data warehousing and the techniques to support
data warehousing. Data Warehouse, Data Cube and OLAP Data Cube Computation, Data
Warehouse Architecture.

UNIT 3 Frequent Pattern Analysis 6 hours, p 6 hours


Introduction to frequent pattern analysis, Apriori Algorithm, FP-growth Algorithm,
Association and Correlation analysis.

UNIT 4 Classification 6 hours, p 6 hours


Classification: Decision Tree Induction, Bayesian Classification, Support Vector Machines,
Neural Network, Ensemble methods, Model Evaluation.

UNIT 5 Clustering and Outlier Detection Methods 8 hours, p 6 hours


Clustering: Partitioning, Hierarchical, Grid-based, and Density-based Clustering algorithms;
Probabilistic, High- dimensional, Bi-clustering, Graph, Constraint-based Cluster algorithms;
Outlier Detection Methods: Types of Outliers, Outlier Detection Methods, Mining Complex
Data, and Research Frontiers of Data Mining.

TextBooks:
1. Jiawei Han, Micheline Kamber, Jian Pei, Data Mining: Concepts and Techniques, Morgan
Kaufmann publishers, 3/e, 2011. (Modules 2 – 5)
2. Jiawei Han, Micheline Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann
publishers, 2/e, 2006. (Module 1)
References:
1. Michael Steinbach, Vipin Kumar, Pang-Ning Tan, Introduction to Data Mining,
AddisonWesley, 1/e, 2006.
2. Margaret H. Dunham, Data Mining: Introductory and Advanced Topics, Pearson publishers,
1/e, 2006
3. 1.https://www.coursera.org/programs/gitam-coursera-program-for-faculty-
p4k5n/browse?authProvider=gitam&productId=y-IghDp5Eeus1Q4-7xV-
4. Ww&productType=s12n&query=Data+Mining+Methods&showMiniModal=true&source=se
arch=
5. https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=69
6. https://www.sciencedirect.com/book/9780123814791/data-mining-concepts-and-
techniques

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch


Department of Computer Science and Engineering, GITAM Deemed to be University

Course Outcomes:
After successful completion of the course the student will be able to:
1. Explain the functionality of various data mining components(L2)
2. Apply data pre-processing techniques and OLAP operations(L2)
3. Compare and contrast the strengths and limitations of various data mining models(L2)
4. Apply the data mining algorithms on real world datasets(L3)
5. Evaluate the performance of data mining algorithms(L4)

CO-PO Mapping:
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3
CO1 3 2 2 0 0 0 0 0 0 0 0 0 3 2 2
CO2 2 2 2 0 0 0 0 0 0 0 0 0 2 2 2
CO3 1 1 3 3 3 0 0 0 0 2 0 0 2 2 2
CO4 2 0 0 3 3 2 0 0 0 2 0 0 0 2 2
CO5 2 0 0 3 3 2 0 0 0 2 0 0 0 2 2

Note: 1 - Low Correlation 2 - Medium Correlation 3 - High Correlation

APPROVED IN:
BOS : 06-09-2021 ACADEMIC COUNCIL: 01-04-2022

SDG No. & Statement:

SDG 8 : Decent Work and Economic Growth


The Data Warehouse is a huge repository, contains archived information about an
organization(s). In the Data Mining process, the Data Warehouse play a key role. Interesting
patterns are discovered using Data Mining techniques for business decision making process.

SDG Justification:

B Tech. Computer Science and Engineering w.e.f. 2021-22 admitted batch

You might also like