Foundations of Data Science

The document outlines a course on the Foundations of Data Science, detailing objectives, units of study, practical exercises, software requirements, and course outcomes. Key topics include data analysis concepts, statistical methods, Python tools like NumPy and Pandas, data visualization techniques, and recent trends in data science applications. The course aims to equip students with essential skills for data inspection, cleansing, and interpretation using various data science tools.

Uploaded by

vjay2003

FOUNDATIONS OF DATA SCIENCE          L T P C
                                     3 0 2 4

COURSE OBJECTIVES:

• To understand the basic concepts of data analysis
• To acquire skills in data preparation and preprocessing steps
• To understand the mathematical skills in statistics
• To learn the tools and packages in Python for data science
• To acquire knowledge in data interpretation and visualization techniques
• To gain knowledge about recent trends in data science

UNIT- I INTRODUCTION 8

Applications: Search engines, Image recognition

Need for data science – benefits and uses – facets of data – data science process – setting the research
goal – retrieving data – cleansing, integrating, and transforming data – exploratory data analysis – building
the models – presenting findings and building applications

UNIT- II DESCRIBING DATA 9

Applications: Speech recognition, Recommendation systems

Frequency distributions – outliers – relative frequency distributions – cumulative frequency distributions –
frequency distributions for nominal data – interpreting distributions – graphs – averages – normal
distributions – z scores – normal curve problems – finding proportions – finding scores – more about z scores –
interpretation of r² – multiple regression equations – regression toward the mean – statistical metrics with
Python.
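
As one possible illustration of the closing topic, statistical metrics with Python, the sketch below computes z scores and r² using only the standard library. The exam scores and study hours are made-up values, and `r_squared` is a hypothetical helper name, not something prescribed by the syllabus.

```python
from statistics import mean, pstdev

# Hypothetical data: hours studied and exam scores for five students.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 71, 78, 89]

mu = mean(scores)
sigma = pstdev(scores)  # population standard deviation

# A z score expresses each value as a distance from the mean in standard deviations.
z_scores = [(x - mu) / sigma for x in scores]


def r_squared(xs, ys):
    """Squared Pearson correlation between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy ** 2 / (sxx * syy)


# r2 is the proportion of variance in scores predictable from hours.
r2 = r_squared(hours, scores)
print(z_scores, r2)
```

By construction, the z scores have mean 0 and standard deviation 1, which is a quick sanity check on any standardization step.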
UNIT- III INTRODUCTION TO NUMPY 8

Applications: Machine Learning, Scientific Computing

Data types in Python – basics of NumPy arrays – computations on NumPy arrays – universal functions –
aggregations: min, max, and everything in between – computation on arrays: broadcasting – comparisons,
masks, and Boolean logic – fancy indexing – sorting values in NumPy arrays – fast sorting – sorting along
rows or columns – partial sorts – K nearest neighbors – NumPy’s structured arrays
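
Several of the topics in this unit can be sketched in a few lines. The example below is a minimal illustration, assuming NumPy is installed; the array contents and variable names are arbitrary and chosen only for demonstration.

```python
import numpy as np

data = np.array([[1.0, 2.0, 3.0],
                 [4.0, 5.0, 6.0],
                 [7.0, 8.0, 9.0]])

# Aggregation along an axis: one mean per column, shape (3,).
col_means = data.mean(axis=0)

# Broadcasting: the (3,) vector is stretched across each row of the (3, 3) array.
centered = data - col_means

# Comparison produces a Boolean mask; indexing with it selects matching values.
mask = data > 5
large = data[mask]

# Fancy indexing: pick the elements at positions (0, 1) and (2, 0).
picked = data[[0, 2], [1, 0]]

# Partial sort: after partitioning at index 3, the first three entries are
# the three smallest values, in no guaranteed order.
values = np.array([7, 2, 9, 1, 5])
smallest3 = np.partition(values, 3)[:3]

print(centered, large, picked, smallest3)
```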
UNIT- IV DATA MANIPULATION WITH PANDAS 8

Applications: Financial Analysis, Data Visualization

Pandas objects – data indexing and selection – operating on data in pandas – handling missing data –
hierarchical indexing – combining datasets: concat and append – combining datasets: merge and join –
aggregation and grouping – pivot tables – vectorized string operations – working with time series –
high-performance pandas: eval() and query().
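
A short sketch of how some of these operations fit together, assuming pandas is installed. The store and region data are invented for illustration. Note that `DataFrame.append` was removed in pandas 2.0, so the sketch combines frames with `pd.concat` instead.

```python
import pandas as pd

# Hypothetical quarterly sales records; one value is missing.
q1 = pd.DataFrame({"store": ["A", "A", "B"], "units": [10, None, 7]})
q2 = pd.DataFrame({"store": ["B", "C"], "units": [3, 5]})
regions = pd.DataFrame({"store": ["A", "B", "C"],
                        "region": ["North", "South", "North"]})

# Combining datasets: stack the two frames row-wise.
sales = pd.concat([q1, q2], ignore_index=True)

# Handling missing data: treat a missing count as zero units.
sales["units"] = sales["units"].fillna(0)

# Merge: attach each store's region (a left join on the shared key).
merged = sales.merge(regions, on="store", how="left")

# Aggregation and grouping: total units per region.
totals = merged.groupby("region")["units"].sum()

# query(): filter rows with a string expression.
big = merged.query("units > 4")

print(totals)
print(big)
```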
UNIT- V PYTHON FOR DATA VISUALIZATION 7

Applications: Climate Change Analysis, Sports Data Analysis

Visualization with Matplotlib – line plots – scatter plots – visualizing errors – density and contour plots –
histograms, binnings, and density – three-dimensional plotting – geographic data – data analysis using
statsmodels and seaborn – graph plotting using Plotly – interactive data visualization using Bokeh

UNIT- VI RECENT TRENDS IN DATA SCIENCE 5

Healthcare: drug development, virtual healthcare assistance – Finance: fraud detection – Marketing:
targeted advertising, customer interactions – Transportation: driverless cars, airline routing.
TOTAL: 45 PERIODS
PRACTICAL EXERCISES:
1. Download, install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and Pandas packages.
2. Working with NumPy arrays
3. Working with Pandas data frames
4. Reading data from text files, Excel and the web and exploring various commands for doing
   descriptive analytics on the Iris data set.
5. Use the diabetes data set from UCI and Pima Indians Diabetes data set for performing the following:
   a. Univariate analysis: Frequency, Mean, Median, Mode, Variance, Standard Deviation,
      Skewness and Kurtosis.
   b. Bivariate analysis: Linear and logistic regression modeling
   c. Multiple Regression analysis
   d. Also compare the results of the above analysis for the two data sets.
6. Apply and explore various plotting functions on UCI data sets.
   a. Normal curves
   b. Density and contour plots
   c. Correlation and scatter plots
   d. Histograms
   e. Three-dimensional plotting
7. Visualizing Geographic Data with Basemap
8. Importing Data from External Source Using Python
SOFTWARE REQUIREMENTS
Python, NumPy, SciPy, Matplotlib, Pandas, statsmodels, seaborn, Plotly, Bokeh
TOTAL: 30 PERIODS (practical)
TOTAL: 75 PERIODS (theory and practical)
COURSE OUTCOMES:

At the end of this course, the students will be able to:

CO1: Apply the skills of data inspection and cleansing.
CO2: Determine the relationship between data dependencies using statistics.
CO3: Represent the useful information using mathematical skills.
CO4: Handle data using the primary tools used for data science in Python.
CO5: Apply the knowledge to describe and visualize data using tools.
CO6: Be aware of the current scope and limitations of data science and its societal implications.

TEXT BOOKS
1. David Cielen, Arno D. B. Meysman, and Mohamed Ali, “Introducing Data Science”, Manning
   Publications, 2016. (First two chapters for Unit I)
2. Robert S. Witte and John S. Witte, “Statistics”, Eleventh Edition, Wiley Publications, 2017.
   (Chapters 1–7 for Unit II)
3. Jake VanderPlas, “Python Data Science Handbook”, O’Reilly, 2016. (Parts of chapters 2–4 for
   Units III, IV and V)

REFERENCES
1. Allen B. Downey, “Think Stats: Exploratory Data Analysis in Python”, Green Tea Press, 2014.
2. Sanjeev J. Wagh, Manisha S. Bhende, Anuradha D. Thakare, “Fundamentals of Data
   Science”, CRC Press, 2022.
3. Chirag Shah, “A Hands-On Introduction to Data Science”, Cambridge University Press.

CO’s & PO’s MAPPING

CO    PO1  PO2  PO3  PO4  PO5  PO6  PO7  PO8  PO9  PO10  PO11
CO1    2    2    1    2    2    -    -    -    1    1     1
CO2    2    1    -    1    1    -    -    -    2    1     1
CO3    2    2    1    2    2    1    1    -    1    2     1
CO4    3    2    2    1    2    -    -    -    1    1     2
CO5    2    2    1    2    2    -    -    -    1    1     1
CO6    2    2    1    2    2    -    -    -    1    1     1
