0% found this document useful (0 votes)
8 views17 pages

Applied Data Science Orientation

The document outlines an orientation program for Applied Data Science (CSDO8013) and its corresponding lab (CSL8023) for BE Computer students. It includes course objectives, prerequisites, syllabus contents, assessment schemes, and suggested experiments and case studies. The program aims to equip students with fundamental knowledge and practical skills in data science techniques, data exploration, visualization, and anomaly detection.

Uploaded by

dayanand
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views17 pages

Applied Data Science Orientation

The document outlines an orientation program for Applied Data Science (CSDO8013) and its corresponding lab (CSL8023) for BE Computer students. It includes course objectives, prerequisites, syllabus contents, assessment schemes, and suggested experiments and case studies. The program aims to equip students with fundamental knowledge and practical skills in data science techniques, data exploration, visualization, and anomaly detection.

Uploaded by

dayanand
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

Orientation Program

on
Applied Data Science(CSDO8013) and
Applied Data Science Lab (CSL8023)
BE(Computer/VIII)
Content
• Course Objectives and outcomes
• Syllabus Contents
• Text Books and References books
• Assessment/Examination Scheme

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 2


pplied Data Science Lab(CSL8023)
Prerequisites and Credits
• Applied Data Science(CSDO8013)
• Prerequisite: Engineering Mathematics, Machine Learning, Data Structures &
Algorithms
• Credits: 3
• Applied Data Science Lab (CSL8023)
• Prerequisite: Python Programming, Engineering Mathematics
• Credits:1

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 3


pplied Data Science Lab(CSL8023)
Course Objectives and Outcomes
• Course Objectives:
1. To introduce students to the basic concepts of data science.
2. To acquire an in-depth understanding of data exploration and data
visualization.
3. To be familiar with various anomaly detection techniques.
4. To understand the data science techniques for different
applications.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 4


pplied Data Science Lab(CSL8023)
Course Objectives and Outcomes
• Course Outcomes
1. To gain fundamental knowledge of the data science process.
2. To apply data exploration and visualization techniques.
3. To apply anomaly detection techniques.
4. To gain an in-depth understanding of time-series forecasting.
5. Apply different methodologies and evaluation strategies.
6. To apply data science techniques to real world applications.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 5


pplied Data Science Lab(CSL8023)
Syllabus Contents
Modul Content 39Hrs
e
1 Introduction to Data Science 05
1.1 Introduction to Data Science, Data Science Process (T1-ch2) General Topic -All
Books
1.2 Motivation to use Data Science Techniques: Volume, Dimensions and General Topic -All
Complexity, Data Science Tasks and Examples Books
1.3 Overview of Data Preparation, Modeling, Difference between data science T1-ch2, ch3
and data analytics
2 Data Exploration 09
2.1 Types of data, Properties of data -T1 T2-ch5
R3-ch2, ch3,
Descriptive Statistics: T4-ch4, ch5, ch6
Univariate Exploration: Measure of Central Tendency, Measure of Spread, Theory +
Symmetry, Skewness: Karl Pearson Coeff. of skewness, Bowley’s Coefficient, Numerical
Kurtosis
Multivariate Exploration: Central Data Point, Correlation, Different forms of
correlation, Karl Pearson Correlation Coefficient for bivariate distribution

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 6


pplied Data Science Lab(CSL8023)
Syllabus
Module
Contents
Content 39Hrs
2.2 Inferential Statistics: T2-ch5,R3-ch 14,
Overview of Various forms of distributions: Normal, Poisson, Test T4-ch17, ch19 Theory
Hypothesis, Central limit theorem, Confidence Interval, Z-test, t-test, + Numerical
Type-I, Type-II Errors, ANOVA
3 Methodology and Data Visualization 06
3.1 Methodology: Overview of model building, Cross Validation, K-fold T1(Ch2, Ch4), T2-Ch7
cross validation, leave-1 out, Bootstrapping
3.2 Data Visualization T1-scattered across
Chapters, T2-Ch6
Univariate Visualization: Histogram, Quartile, Distribution Chart
Multivariate Visualization: Scatter Plot, Scatter Matrix, Bubble chart,
Density Chart, Roadmap for Data Exploration
Self-Learning Topics: Visualizing high dimensional data: Parallel chart, T1,T2(Ch6) Avoid
Deviation chart, Andrews Curves. numericals /case
study during theory
QP setting. Short
notes may be
included.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 7


pplied Data Science Lab(CSL8023)
Syllabus Contents
Module Content Ref. 39Hrs
4 Anomaly Detection 06
4.1 Outliers, Causes of Outliers, Anomaly detection techniques, Outlier T1- ch13
Detection using Statistics
4.2 Outlier Detection using Distance based method, Outlier detection T1- ch13
using density-based methods, SMOTE
5 Time Series Forecasting 08
5.1 Taxonomy of Time Series Forecasting methods, Time Series T1- ch12
Decomposition
5.2 Smoothening Methods: Average method, Moving Average T1- ch12 Theory +
smoothing, Time series analysis using linear regression, ARIMA Numericals on
Model, Performance Evaluation: Mean Absolute Error, Root performance evaluation
Mean Square Error, Mean Absolute Percentage Error, Mean
Absolute Scaled Error
Self Learning Topics: Evaluation parameters for Classification, Avoid numericals during
regression and clustering. Th. QP setting. General
topics may be included
for vivas.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 8


pplied Data Science Lab(CSL8023)
Syllabus Contents
Module Content Ref. 39Hrs
6 Applications of Data Science 05
6.1 Predictive Modeling: House price prediction, Fraud Detection General Case studies --
Clustering: Customer Segmentation Depends on entire data
science cycle
Time series forecasting: Weather Forecasting
T1
Recommendation engines: Product recommendation
Paper setting- Confined
to these case studies or
similar case studies in
those respective areas.
Total 39

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 9


pplied Data Science Lab(CSL8023)
Text Books and References
Textbooks:
1 Vijay Kotu, Bala Deshpande. “Data Science Concepts and Practice”, Elsevier, M.K. Publishers.

2 Steven Skiena, “Data Science Design Manual”, Springer International Publishing AG

3 Samir Madhavan. “Mastering Python for Data Science”, PACKT Publishing


4 Dr. P. N. Arora, Sumeet Arora, S. Arora, Ameet Arora, “Comprehensive Statistical Methods”,
S.Chand Publications, New Delhi.
References:
1 Jake VanderPlas. “Python Data Science Handbook”, O’reilly Publications.

2 Francesco Ricci, Lior Rokach, Bracha Shapira, Paul B. Kantor, “Recommender Systems
Handbook”, Springer.
3 S.C. Gupta, V. K. Kapoor “Fundamentals of Mathematical Statistics”, S. Chand and Sons, New
Delhi.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 10


pplied Data Science Lab(CSL8023)
Assessment
Assessment:
Internal Assessment:
The assessment consists of two class tests of 20 marks each. The first class test is to be
conducted when approx. 40% syllabus is completed and second class test when additional 40%
syllabus is completed. Duration of each test shall be one hour.

End Semester Theory Examination:


1 Question paper will comprise a total of six questions.
2 All questions carry equal marks.
3 Questions will be mixed in nature (for example supposed Q.2 has part (a) from module 3
then part (b) will be from any module other than module 3).

4 Only Four questions need to be solved.


5 In question paper weightage of each module will be proportional to the number of
respective lecture hours as mentioned in the syllabus.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 11


pplied Data Science Lab(CSL8023)
Useful Links
Useful Links
1 https://onlinecourses.nptel.ac.in/noc22_cs32/preview
2 https://onlinecourses.nptel.ac.in/noc21_cs69/preview
3 https://www.coursera.org/specializations/applied-data-science
4 www.IntroDataScience.com.
6 https://rapidminer.com/
7 https://towardsdatascience.com/machine-learning/home

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 12


pplied Data Science Lab(CSL8023)
Applied Data Science Lab(CSL8023)
Prerequisite: Engineering Mathematics, Machine Learning, Programming fundamentals
Lab Objectives:

1 To explore various stages in the data science lifecycle.

2 To understand data preparation, exploration and visualization techniques.

3 To model and evaluate different supervised/unsupervised learning techniques.

Lab Outcomes: At the end of the course, the students will be able to

1 Apply various stages of the data science lifecycle for the selected case study.
2 Demonstrate data preparation, exploration and visualization techniques.

3 Implement and evaluate different supervised and unsupervised techniques.

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 13


pplied Data Science Lab(CSL8023)
Suggested List of Experiments
Suggested List of Experiments
1.

(Select a case study and perform the experiments 1 to 8.)


1. Explore the descriptive and inferential statistics on the given dataset.
2. Apply data cleaning techniques (e.g. Data Imputation).
3. Explore data visualization techniques.
4. Implement and explore performance evaluation metrics for Data Models
(Supervised/Unsupervised Learning)
5. Use SMOTE technique to generate synthetic data. (to solve the problem of class imbalance)
6. Outlier detection using distance based/density based method.
7. Implement time series forecasting.
8. Illustrate data science lifecycle for selected case study. (Prepare case study document for the
selected case study)

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 14


pplied Data Science Lab(CSL8023)
Suggested List of Experiments
Suggested Case Studies:
4.

1.Customer Segmentation
2.Fraud Detection
3.House Price prediction
4.Product Recommendation
5.Stock price prediction
6.Weather prediction
Suggested Assignment List
Assignments can be given on self-learning topics or data deployment tools.
Python Libraries such ‘dtale’ (for Exploratory analysis), ‘pandas-profiling’ (for statistics, charts,
correlations etc.)

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 15


pplied Data Science Lab(CSL8023)
Assessment
Term Work:
1 Term work should consist of 8 experiments.
2 The final certification and acceptance of term work ensures satisfactory performance of laboratory
work and minimum passing marks in term work.

3 The final certification and acceptance of term work ensures satisfactory performance of laboratory
work and minimum passing marks in term work. Total 25 Marks (Experiments: 15-marks,
Attendance Theory & Practical: 05-marks, Assignment: 05-marks)

Practical and Oral exam

Based on the entire syllabus of CSDO 8013: Applied Data Science

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 16


pplied Data Science Lab(CSL8023)
Thank You!!

23/05/2025 Orientation Program on Applied Data Science (CSDLO8013) & A 17


pplied Data Science Lab(CSL8023)

You might also like