0% found this document useful (0 votes)
7 views

AI3104 Foundation of Data Science (handout) 2024 (1)

The document outlines the course AI-3104 Foundations of Data Science offered at Manipal University Jaipur, targeting B.Tech-CSE (AIML) students. It details course objectives, outcomes, assessment plans, and a comprehensive syllabus covering various data science techniques and tools. The course aims to equip students with the necessary skills for a career in data science and includes both theoretical and practical components.

Uploaded by

iaminstinctgamer
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

AI3104 Foundation of Data Science (handout) 2024 (1)

The document outlines the course AI-3104 Foundations of Data Science offered at Manipal University Jaipur, targeting B.Tech-CSE (AIML) students. It details course objectives, outcomes, assessment plans, and a comprehensive syllabus covering various data science techniques and tools. The course aims to equip students with the necessary skills for a career in data science and includes both theoretical and practical components.

Uploaded by

iaminstinctgamer
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

MANIPAL UNIVERSITY JAIPUR

Faculty of Engineering || School of Computer Science & Engineering


DEPARTMENT OF ARTIFICIAL INTELLIGENCE & MACHINE LEARNING

Course Hand-out
AI-3104 || Foundations of Data Science || L T P C [3 1 0 4] || Core Course
Session: July 2024 – Nov 2024 || B.Tech-CSE (AIML) || Semester -5th

Course Instructors: Dr. Priya Goyal, Ms. Shubh Lakshmi Agrwal, Ms. Rishika Singh,
Mr. Harish Sharma, Dr. Ajay Kumar
Course Coordinator: Mr. Harish Sharma

A. INTRODUCTION:
This course is offered by the Department of AIML, mainly targeting students who wish to pursue a
career in Data Science or higher studies in Engineering discipline with AIML specialization. This course
objectives to discuss techniques to explain how advanced data analytics can be leveraged to create
data with statistical environment and how the data scientist role and skills differ from those of a
traditional business intelligence analyst. This course also supports the design, implementation, and
inference of advanced technologies in data science.

B. COURSE OUTCOMES:
At the end of the course, students will be able to

CO Statement Cognitive Level


AI-3104.1 Identify Data types and sampling approached for data science and its Understand
statistical applications.
AI-3104.2 Apply Data Science analytic testing techniques and tools, create statistical Apply
models, and identify insights that can lead to statistical results.
AI-3104.3 Illustrate the multiple aspects of Data Science and Classify various data Analyze, Evaluate
with their applications in industry, research, and entrepreneurship.
AI-3104.4 Apply visualization techniques to clearly communicate data analytic Apply, Analyze
insights for business stakeholder others.

C. PROGRAM OUTCOMES AND PROGRAM SPECIFIC OUTCOMES


[PO.1] Engineering knowledge: Apply the knowledge of mathematics, science, engineering fundamentals,
and an engineering specialization to the solution of complex engineering problems.
[PO.2] Problem analysis: Identify, formulate, research literature, and analyze complex engineering
problems reaching substantiated conclusions using first principles of mathematics, natural sciences, and
engineering sciences.
[PO.3] Design/development of solutions: Design solutions for complex engineering problems and
design system components or processes that meet the specified needs with appropriate consideration for the
public health and safety, and the cultural, societal, and environmental considerations.
[PO.4] Conduct investigations of complex problems: Use research-based knowledge and research
methods including design of experiments, analysis and interpretation of data, and synthesis of the information
to provide valid conclusions.
[PO.5] Modern tool usage: Create, select, and apply appropriate techniques, resources, and modern
engineering and IT tools including prediction and modeling to complex engineering activities with an
understanding of the limitations.
[PO.6] The engineer and society: Apply reasoning informed by the contextual knowledge to assess
societal, health, safety, legal, and cultural issues and the consequent responsibilities relevant to the professional
engineering practice.
[PO.7] Environment and sustainability: Understand the impact of the professional engineering solutions
in societal and environmental contexts, and demonstrate the knowledge of, and need for sustainable
development.
[PO.8] Ethics: Apply ethical principles and commit to professional ethics and responsibilities and norms of
the engineering practices.
[PO.9] Individual and teamwork: Function effectively as an individual, and as a member or leader in
diverse teams, and in multidisciplinary settings.
[PO.10] Communication: Communicate effectively on complex engineering activities with the engineering
community and with society at large, such as, being able to comprehend and write effective reports and design
documentation, make effective presentations, and give and receive clear instructions.
[PO.11] Project management and finance: Demonstrate knowledge and understanding of the
engineering and management principles and apply these to one’s own work, as a member and leader in a team,
to manage projects and in multidisciplinary environments.
[PO.12] Life-long learning: Recognize the need for and have the preparation and ability to engage in
independent and life-long learning in the broadest context of technological change.

[PSO1]: Graduates will be able to examine the applications of Artificial Intelligence and Machine Learning in
real-life problems.
[PSO2]: Graduates will be able to design and implement intelligent systems for multidisciplinary problems.

D. ASSESSMENT PLAN:

Criteria Description Maximum Marks


Internal Assessment Mid Term Exam (Close Book) 30

(Summative) Project / Case Study (10) 30

Physical Quiz (10)

Attendance (05)

Assignment (05)

End Term Exam (Summative) End Term Exam (Close Book) 40

Total 100

Attendance A minimum of 75% Attendance is required to be maintained by a


(Formative) student to be qualified for taking up the End Semester examination.
The allowance of 25% includes all types of leaves including medical
leaves.
Homework/ There are situations where a student may have to work in home,
Home Assignment/ especially before a flipped classroom. Although these works are not
Activity Assignment (Formative) graded with marks. However, a student is expected to participate and
perform these assignments with full zeal since the activity/ flipped
classroom participation by a student will be assessed and marks will be
awarded.
E. SYLLABUS
Introduction: Elements of Structured Data, Estimates of Location, Estimates of Variability,
Exploring the Data Distribution, Types of Data, Sampling Distributions: Random Sampling and
Sample Bias, Selection Bias, Sampling Distribution, Bootstrap, Confidence Intervals; Data and
Sampling Distributions: Random Sampling and Sample Bias, Selection Bias, Sampling Distribution,
Bootstrap, Confidence Intervals; Statistical Experiments and Significance Testing: A/B
Testing, Hypothesis Tests, Resampling, Statistical Significance and p-Values, ANOVA, Chi-Square
Test, Classification: Discriminant Analysis, Covariance & correlation, Fisher’s Linear Discriminant,
Model Evaluation - Confusion Matrix, Rare Class Problem, Precision, Recall, and Specificity, ROC
Curve, AUC. Clustering and Association: Descriptive, inferential statistics, univariate, and
multivariate analysis. Cluster Analysis- distance measures, partitioning, hierarchical, density-based
methods. Association Analysis, Market Basket Analysis. Data Visualization: Data Visualization:
The matplotlib and seaborn library, Scatter plot, Bar Graph, Histogram, Pie Chart, Factor plot,
Boxplot, Heatmap, Using Tableau for analysis and visualization.

Reference Books
1. P. Bruce et al., Practical Statistics for Data Scientists, (2e), O'Reilly Media, Inc., May 2020.
2. G.J. Myatt, et al., Making Sense of Data II: A Practical Guide to Data Visualization, Advanced
Data Mining Methods, and Applications, John Wiley & Sons Publication, 2019.
3. C. Douglas and C. George, Applied Statistics and Probability for Engineers, John Wiley and
Sons, 2010.
4. Gorge Peck, Tableau 9 the Official guide, McGraw-Hill Education publication (Ind. Edition),
2020.
F. LECTURE PLAN:

S. Topics Session Outcome Mode of Delivery Correspondin Mode of Assessing the


No. g CO Outcome
1 Introduction: Elements of Understand basic elements of structured Lecture, Discussion AI-3104.1 Quiz, Participation
Structured Data data
2 Estimates of Location, Estimates of Learn measures of central tendency and Lecture, Hands-on AI-3104.1 Assignment, Quiz
Variability variability Activity
3 Exploring the Data Distribution Explore data distributions Lecture, Case Study AI-3104.1 Quiz, Case Study
Analysis
4 Types of Data Identify different types of data Lecture, Practical AI-3104.1 Practical Test,
Exercise Assignment
5 Random Sampling and Sample Understand random sampling and Lecture, Lab Session AI-3104.1 Lab Report, Quiz
Bias sample bias
6 Selection Bias Learn about selection bias Lecture, Discussion AI-3104.1 Quiz, Participation
7 Sampling Distribution Understand sampling distribution Lecture, Practical AI-3104.1 Practical Test,
Exercise Assignment
8 Bootstrap Apply bootstrap methods Lecture, Hands-on AI-3104.2 Assignment, Quiz
Activity
9 Confidence Intervals Calculate confidence intervals Lecture, Lab Session AI-3104.2 Lab Report, Quiz
10 A/B Testing Conduct A/B testing Lecture, Practical AI-3104.2 Practical Test,
Exercise Assignment
11 Hypothesis Tests Perform hypothesis tests Lecture, Hands-on AI-3104.2 Assignment, Quiz
Activity
12 Resampling Understand resampling techniques Lecture, Lab Session AI-3104.2 Lab Report, Quiz
13 Statistical Significance and p- Learn about statistical significance and Lecture, Practical AI-3104.2 Practical Test,
Values p-values Exercise Assignment
14 ANOVA Perform ANOVA tests Lecture, Hands-on AI-3104.2 Assignment, Quiz
Activity
15 Chi-Square Test Conduct Chi-Square tests Lecture, Lab Session AI-3104.2 Lab Report, Quiz
16 Discriminant Analysis Apply discriminant analysis Lecture, Practical AI-3104.3 Practical Test,
Exercise Assignment
17 Covariance & Correlation Understand covariance and correlation Lecture, Hands-on AI-3104.3 Assignment, Quiz
Activity
18 Fisher’s Linear Discriminant Use Fisher’s Linear Discriminant Lecture, Lab Session AI-3104.3 Lab Report, Quiz
19 Confusion Matrix Evaluate models using confusion matrix Lecture, Practical AI-3104.3 Practical Test,
Exercise Assignment
20 Rare Class Problem Address rare class problems Lecture, Hands-on AI-3104.3 Assignment, Quiz
Activity
21 Precision, Recall, and Specificity Learn about precision, recall, and Lecture, Lab Session AI-3104.3 Lab Report, Quiz
specificity
22 ROC Curve, AUC Evaluate models using ROC curve and Lecture, Practical AI-3104.3 Practical Test,
AUC Exercise Assignment
23 Descriptive Statistics Understand descriptive statistics Lecture, Hands-on AI-3104.3 Assignment, Quiz
Activity
24 Inferential Statistics Learn about inferential statistics Lecture, Lab Session AI-3104.3 Lab Report, Quiz
25 Univariate Analysis Perform univariate analysis Lecture, Practical AI-3104.3 Practical Test,
Exercise Assignment
26 Multivariate Analysis Conduct multivariate analysis Lecture, Hands-on AI-3104.3 Assignment, Quiz
Activity
27 Cluster Analysis: Distance Apply distance measures in clustering Lecture, Lab Session AI-3104.3 Lab Report, Quiz
Measures
28 Partitioning Methods Learn about partitioning methods Lecture, Practical AI-3104.3 Practical Test,
Exercise Assignment
29 Hierarchical Clustering Understand hierarchical clustering Lecture, Hands-on AI-3104.3 Assignment, Quiz
methods Activity
30 Density-Based Methods Apply density-based clustering methods Lecture, Lab Session AI-3104.3 Lab Report, Quiz
31 Association Analysis Perform association analysis Lecture, Practical AI-3104.3 Practical Test,
Exercise Assignment
32 Market Basket Analysis Conduct market basket analysis Lecture, Hands-on AI-3104.3 Assignment, Quiz
Activity
MID-TERM Exams
33 Introduction to Data Visualization Understand the basics of data Lecture, Lab Session AI-3104.4 Lab Report, Quiz
visualization
34 Matplotlib Library Use Matplotlib for data visualization Lecture, Practical AI-3104.4 Practical Test,
Exercise Assignment
35 Seaborn Library Apply Seaborn for data visualization Lecture, Hands-on AI-3104.4 Assignment, Quiz
Activity
36 Scatter Plot Create scatter plots Lecture, Lab Session AI-3104.4 Lab Report, Quiz
37 Bar Graph Create bar graphs Lecture, Practical AI-3104.4 Practical Test,
Exercise Assignment
38 Histogram Create histograms Lecture, Hands-on AI-3104.4 Assignment, Quiz
Activity
39 Pie Chart Create pie charts Lecture, Lab Session AI-3104.4 Lab Report, Quiz
40 Factor Plot Use factor plots for data visualization Lecture, Practical AI-3104.4 Practical Test,
Exercise Assignment
41 Boxplot Create boxplots Lecture, Hands-on AI-3104.4 Assignment, Quiz
Activity
42 Heatmap Create heatmaps Lecture, Lab Session AI-3104.4 Lab Report, Quiz
43 Using Tableau for Analysis Apply Tableau for data analysis Lecture, Practical AI-3104.4 Practical Test,
Exercise Assignment
44 Using Tableau for Visualization Use Tableau for data visualization Lecture, Hands-on AI-3104.4 Assignment, Quiz
Activity
45 Advanced Tableau Techniques Learn advanced Tableau techniques Lecture, Lab Session AI-3104.4 Lab Report, Quiz
46 Review of All Topics Review all covered topics Discussion All Participation
47 Final Assessment Preparation Prepare for final assessment Discussion, Q&A All Participation
48 Final Assessment Assess overall understanding Exam All Final Exam

G. Target attainment (%) for course outcomes:

CO Target attainment (%)


AI-3104.1 90%
AI-3104.2 90%
AI-3104.3 80%
AI-3104.4 80%
H. Course Articulation Matrix: (Mapping of COs with POs and PSOs)

CO STATEMENT CORRELATION WITH PROGRAM OUTCOMES CORRELATION


WITH PROGRAM
SPECIFIC
OUTCOMES
PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO
1 2 3 4 5 6 7 8 9 10 11 12 1 2
[AI 3104.1] Identify Data types and
sampling approached for data
3 3 2 2 2 1 1 1 2 2 1 2 2 2
science and its statistical
applications.
[AI 3104.2] Apply Data Science analytic
testing techniques and tools,
create statistical models, and 3 3 3 3 3 2 2 1 2 2 2 3 3 3
identify insights that can lead
to statistical results.
[AI 3104.3] Illustrate the multiple aspect
of Data Science and Classify
various data with their 3 3 3 3 3 2 2 1 2 2 2 3 3 3
applications in industry,
research & entrepreneurship.
[AI 3104.4] Apply visualization techniques
to clearly communicate data
analytic insights for business 2 2 2 2 2 2 2 1 2 3 2 3 2 2
stakeholder others.
1- Low Correlation; 2- Moderate Correlation; 3- Substantial Correlation

Course Coordinator Head of the Department Student Representative


Name: Mr. Harish Sharma Dr. Puneet Mittal
Name:
MUJ ID.: MUJ0736
Registration No.

You might also like