
ADS Syllabus

The document outlines the course details for 'Applied Data Science' (CSDC8013), including prerequisites, objectives, outcomes, and a detailed module breakdown covering topics such as data exploration, visualization, anomaly detection, and time series forecasting. It also includes information about the associated lab course (CSL8023), assessment methods, suggested experiments, and textbooks. The course aims to equip students with practical skills and theoretical knowledge in data science applications.

Uploaded by

siblu khan

Course Code Course Name Credit

CSDC8013 Applied Data Science 03

Prerequisite: Machine Learning, Data Structures & Algorithms


Course Objectives:

1 To introduce students to the basic concepts of data science.

2 To acquire an in-depth understanding of data exploration and data visualization.

3 To be familiar with various anomaly detection techniques.

4 To understand the data science techniques for different applications.

Course Outcomes:

1 To gain fundamental knowledge of the data science process.

2 To apply data exploration and visualization techniques.

3 To apply anomaly detection techniques.

4 To gain an in-depth understanding of time-series forecasting.

5 To apply different methodologies and evaluation strategies.

6 To apply data science techniques to real-world applications.

Module Detailed Content Hours

1 Introduction to Data Science 02

1.1 Introduction to Data Science, Data Science Process

1.2 Motivation to use Data Science Techniques: Volume, Dimensions and Complexity, Data Science Tasks and Examples

1.3 Overview of Data Preparation, Modeling, Difference between data science and data analytics

2 Data Exploration 08

2.1 Types of data, Properties of data
Descriptive Statistics:
Univariate Exploration: Measure of Central Tendency, Measure of Spread, Symmetry, Skewness: Karl Pearson Coefficient of skewness, Bowley's Coefficient, Kurtosis
Multivariate Exploration: Central Data Point, Correlation, Different forms of correlation, Karl Pearson Correlation Coefficient for bivariate distribution

2.2 Inferential Statistics:
Overview of Various forms of distributions: Normal, Poisson, Test Hypothesis, Central limit theorem, Confidence Interval, Z-test, t-test, Type-I, Type-II Errors, ANOVA

3 Methodology and Data Visualization 06

3.1 Methodology: Overview of model building, Cross Validation, K-fold cross validation, leave-one-out, Bootstrapping

3.2 Data Visualization
Univariate Visualization: Histogram, Quartile, Distribution Chart
Multivariate Visualization: Scatter Plot, Scatter Matrix, Bubble chart, Density Chart
Roadmap for Data Exploration

3.3 Self-Learning Topics: Visualizing high dimensional data: Parallel chart, Deviation chart, Andrews Curves.

4 Anomaly Detection 06

4.1 Outliers, Causes of Outliers, Anomaly detection techniques, Outlier Detection using Statistics

4.2 Outlier Detection using Distance based method, Outlier detection using density-based methods, SMOTE

5 Time Series Forecasting 04

5.1 Taxonomy of Time Series Forecasting methods, Time Series Decomposition

5.2 Smoothening Methods: Average method, Moving Average smoothing, Time series analysis using linear regression, ARIMA Model
Performance Evaluation: Mean Absolute Error, Root Mean Square Error, Mean Absolute Percentage Error, Mean Absolute Scaled Error

5.3 Self-Learning Topics: Evaluation parameters for Classification, regression and clustering.

6 Applications of Data Science 04

Predictive Modeling: House price prediction, Fraud Detection
Clustering: Customer Segmentation
Time series forecasting: Weather Forecasting
Recommendation engines: Product recommendation
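
The four forecast-error measures named under Performance Evaluation in Module 5.2 can be written out directly. What follows is a minimal sketch in plain Python and is not part of the prescribed syllabus. The function names and sample numbers are illustrative assumptions. MASE is scaled here by the in-sample one-step naive forecast error, which is one common convention, and MAPE assumes that no actual value is zero.

```python
import math


def mae(actual, forecast):
    """Mean Absolute Error: average of |actual - forecast|."""
    return sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)


def rmse(actual, forecast):
    """Root Mean Square Error: square root of the mean squared error."""
    squared = sum((a - f) ** 2 for a, f in zip(actual, forecast))
    return math.sqrt(squared / len(actual))


def mape(actual, forecast):
    """Mean Absolute Percentage Error, in percent (assumes no zero actuals)."""
    ratios = sum(abs((a - f) / a) for a, f in zip(actual, forecast))
    return 100.0 * ratios / len(actual)


def mase(actual, forecast, train):
    """Mean Absolute Scaled Error.

    Assumption: the scale is the MAE of a one-step naive forecast
    (previous value predicts the next) on the training series.
    """
    naive_errors = [abs(train[i] - train[i - 1]) for i in range(1, len(train))]
    naive_mae = sum(naive_errors) / len(naive_errors)
    return mae(actual, forecast) / naive_mae


# Hypothetical data for illustration only.
train = [2, 4, 6, 8, 10]
actual = [10, 20, 30]
forecast = [12, 18, 33]

print("MAE :", mae(actual, forecast))
print("RMSE:", rmse(actual, forecast))
print("MAPE:", mape(actual, forecast))
print("MASE:", mase(actual, forecast, train))
```

A MASE below 1 indicates that the forecast beats the naive baseline on average, which makes it easier to compare across series with different scales than MAE or RMSE.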

Textbooks:
1 Vijay Kotu, Bala Deshpande. "Data Science Concepts and Practice", Elsevier, M.K. Publishers.

2 Steven Skiena. "Data Science Design Manual", Springer International Publishing AG.

3 Samir Madhavan. "Mastering Python for Data Science", PACKT Publishing.

4 Dr. P. N. Arora, Sumeet Arora, S. Arora, Ameet Arora. "Comprehensive Statistical Methods", S. Chand Publications, New Delhi.

References:
1 Jake VanderPlas. "Python Data Science Handbook", O'Reilly Publications.

2 Francesco Ricci, Lior Rokach, Bracha Shapira, Paul B. Kantor. "Recommender Systems Handbook", Springer.

3 S. C. Gupta, V. K. Kapoor. "Fundamentals of Mathematical Statistics", S. Chand and Sons, New Delhi.

4 B. L. Agrawal. "Basic Statistics", New Age Publications, Delhi.

Useful Links
1 https://onlinecourses.nptel.ac.in/noc22_cs32/preview

2 https://onlinecourses.nptel.ac.in/noc21_cs69/preview

Assessment:

Internal Assessment:
Assessment consists of two class tests of 20 marks each. The first class test is to be conducted
when approximately 40% of the syllabus is completed, and the second class test when an additional 40% of the
syllabus is completed. Duration of each test shall be one hour.

End Semester Theory Examination:


1 Question paper will comprise a total of six questions.

2 All questions carry equal marks.

3 Questions will be mixed in nature (for example, if Q.2 has part (a) from module 3,
then part (b) will be from any module other than module 3).

4 Only four questions need to be solved.

Lab Code Lab Name Credit


CSL8023 Applied Data Science Lab 1

Prerequisite: Engineering Mathematics, Machine Learning, Programming fundamentals


Lab Objectives:
1 To explore various stages in the data science lifecycle.
2 To understand data preparation, exploration and visualization techniques.
3 To model and evaluate different supervised/unsupervised learning techniques.
Lab Outcomes: At the end of the course, the students will be able to
1 Apply various stages of the data science lifecycle for the selected case study.
2 Demonstrate data preparation, exploration and visualization techniques.
3 Implement and evaluate different supervised and unsupervised techniques.

Suggested List of Experiments


(Select a case study and perform the experiments 1 to 8.)
Star (*) marked experiments are compulsory.
Name of the Experiment
1. Explore the descriptive and inferential statistics on the given dataset.
2. Apply data cleaning techniques (e.g. Data Imputation).
3. Explore data visualization techniques.
4. Implement and explore performance evaluation metrics for Data Models (Supervised/Unsupervised Learning).
5. Use SMOTE technique to generate synthetic data (to solve the problem of class imbalance).
6. Outlier detection using distance based/density based method.
7. Implement time series forecasting.
8. Illustrate data science lifecycle for selected case study. (Prepare case study document for the selected case study.)
Suggested Case Studies:
• Customer Segmentation
• Fraud Detection
• House Price prediction
• Product Recommendation
• Stock price prediction
• Weather prediction
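
As a starting point for Experiment 1, the two skewness coefficients named in Module 2.1 can be computed with the Python standard library. This is a minimal sketch under stated assumptions, not a prescribed solution. The function names and sample data are hypothetical, the Karl Pearson coefficient is taken in its median-based form 3 × (mean − median) / standard deviation using the sample standard deviation, and the quartiles come from the default method of `statistics.quantiles`, so other quartile conventions may give slightly different Bowley values.

```python
import statistics


def pearson_skewness(data):
    """Karl Pearson coefficient of skewness (median-based form)."""
    mean = statistics.mean(data)
    median = statistics.median(data)
    stdev = statistics.stdev(data)  # assumption: sample standard deviation
    return 3 * (mean - median) / stdev


def bowley_skewness(data):
    """Bowley's coefficient: (Q3 + Q1 - 2*Q2) / (Q3 - Q1)."""
    # quantiles(..., n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, q2, q3 = statistics.quantiles(data, n=4)
    return (q3 + q1 - 2 * q2) / (q3 - q1)


# Hypothetical samples for illustration only.
symmetric = [1, 2, 3, 4, 5, 6, 7, 8, 9]
right_tailed = [1, 2, 2, 3, 3, 3, 4, 10]

for name, sample in [("symmetric", symmetric), ("right_tailed", right_tailed)]:
    print(name, pearson_skewness(sample), bowley_skewness(sample))
```

The two coefficients need not agree in sign on small samples: Pearson's form reacts to extreme values through the mean, while Bowley's form depends only on the quartiles.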

Suggested Assignment List


Assignments can be given on self-learning topics or data deployment tools.

Term Work:
1 Term work should consist of 8 experiments.
2 The final certification and acceptance of term work ensures satisfactory performance of
laboratory work and minimum passing marks in term work.
3 Total 25 Marks (Experiments: 15 marks, Attendance (Theory & Practical): 05 marks, Assignment: 05 marks)
Practical and Oral exam
