PROFESSIONAL
TRAINING
DATA SCIENCE
MOHIT PAL
22001001904
CSE
3RD YEAR
WHAT IS DATA SCIENCE?
Data science combines math and statistics,
specialized programming, advanced
analytics, artificial intelligence (AI), and
machine learning with specific subject matter
expertise to uncover actionable insights
hidden in an organization’s data. These
insights can be used to guide decision
making and strategic planning.
COMPONENTS OF DATA
SCIENCE
PURPOSE OF DATA SCIENCE
ROLE OF DATA SCIENTIST
Collect data and identify data sources.
Analyze huge amounts of data, both structured and
unstructured.
Create solutions and strategies for business problems.
Work with team members and leaders to develop data
strategy.
Combine various algorithms and modules to discover
trends and patterns.
Present data using various data visualization techniques
and tools.
Investigate additional technologies and tools for
developing innovative data strategies.
WHAT IS PYTHON?
Python is a dynamic, high-level, free and
open-source, interpreted programming
language. It supports object-oriented
programming as well as procedural
programming. In Python, we don’t need to
declare the type of a variable because it is a
dynamically typed language.
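As a minimal sketch of dynamic typing, the same variable name can be rebound to values of different types without any declaration. The variable names here are chosen only for illustration.

```python
# No type declaration is needed; the type belongs to the value, not the name.
value = 42
first_type = type(value).__name__    # name of the current type: int

value = "forty-two"                  # the same name now refers to a string
second_type = type(value).__name__

value = [4, 2]                       # and now to a list
third_type = type(value).__name__

print(first_type, second_type, third_type)
```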
FEATURES OF PYTHON
Free and open source
Easy to code
Easy to read
Object-oriented programming (OOP) concepts
Large community support
Portable language
Large standard library
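Two of the features above can be sketched together: the same calculation written in procedural style and in object-oriented style, using the `statistics` module from the standard library. The `Student` class, the function name, and the sample scores are hypothetical, chosen only for illustration.

```python
from statistics import mean  # ships with Python, no installation needed

# Procedural style: a plain function that operates on data.
def average_score(scores):
    return mean(scores)

# Object-oriented style: data and behaviour bundled together in a class.
class Student:
    def __init__(self, name, scores):
        self.name = name
        self.scores = scores

    def average(self):
        return mean(self.scores)

scores = [70, 80, 90]              # hypothetical sample data
student = Student("Asha", scores)  # hypothetical student
print(average_score(scores), student.average())
```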
Basic libraries for data science
TensorFlow
NumPy
Pandas
Matplotlib
scikit-learn
Keras
CONCEPTS OF STATISTICS
MACHINE LEARNING
Machine learning is a branch of artificial intelligence (AI)
and computer science that focuses on the use of data
and algorithms to imitate the way that humans learn,
gradually improving its accuracy.
Machine learning also performs manual tasks that are
beyond our ability to execute at scale, for example
processing the huge quantities of data generated today by
digital devices. Machine learning's ability to extract patterns
and insights from vast data sets has become a competitive
differentiator in fields ranging from finance and retail to
healthcare and scientific discovery. Many of today's leading
companies, including Facebook, Google and Uber, make
machine learning a central part of their operations.
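The idea of "gradually improving accuracy" can be illustrated with a small sketch. It assumes toy data that follows y = 2x and uses gradient descent, one common training technique among many, to learn the slope. The data, learning rate, and step count are assumptions chosen for illustration.

```python
# Toy data following y = 2x (hypothetical values for illustration).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

def mean_squared_error(w):
    """Average squared gap between the prediction w * x and the true y."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

w = 0.0               # initial guess for the slope
learning_rate = 0.01  # assumed step size
errors = []           # error recorded after every update

for step in range(100):
    # Gradient of the mean squared error with respect to w.
    gradient = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
    w -= learning_rate * gradient
    errors.append(mean_squared_error(w))

print(f"learned slope: {w:.4f}")
print(f"error after first step: {errors[0]:.4f}")
print(f"error after last step:  {errors[-1]:.10f}")
```

Each pass over the data nudges the slope toward the value that fits best, so the recorded error shrinks as training proceeds.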
MACHINE LEARNING
MODELS
What Is Predictive Modeling?
Predictive modeling is a statistical technique
using machine learning and data mining to
predict and forecast likely future outcomes
with the aid of historical and existing data. It
works by analyzing current and historical
data and projecting what it learns onto a model
generated to forecast likely outcomes.
Predictive modeling can be used to predict
just about anything, from TV ratings and a
customer’s next purchase to credit risks and
corporate earnings.
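A minimal sketch of this idea: fit a straight line to historical data with ordinary least squares, then project it forward. The monthly sales figures and the helper names `fit_line` and `predict` are hypothetical; real predictive models are usually richer than a single trend line.

```python
# Hypothetical historical data: sales for months 1 to 6.
months = [1, 2, 3, 4, 5, 6]
sales = [100.0, 110.0, 120.0, 130.0, 140.0, 150.0]

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

# "Learn" from the historical data.
slope, intercept = fit_line(months, sales)

def predict(month):
    """Project the fitted trend onto a month not seen in the data."""
    return slope * month + intercept

forecast = predict(7)
print(f"forecast for month 7: {forecast:.1f}")
```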
Types of Predictive Models
Classification model
Clustering model
Forecast model
Outliers model
Time series model
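As one illustration from the list above, an outliers model can be sketched with a simple z-score rule: flag any value that lies more than a chosen number of standard deviations from the mean. The readings and the threshold of 2.0 are assumptions for illustration, and this is only one of several ways to detect outliers.

```python
from statistics import mean, pstdev

# Hypothetical readings; one value is far away from the rest.
readings = [10, 11, 9, 10, 12, 10, 11, 50]

def find_outliers(values, threshold=2.0):
    """Return values whose z-score magnitude exceeds the threshold."""
    centre = mean(values)
    spread = pstdev(values)  # population standard deviation
    if spread == 0:
        return []            # all values identical, nothing to flag
    return [v for v in values if abs(v - centre) / spread > threshold]

outliers = find_outliers(readings)
print(outliers)
```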
THANK YOU