The Colorful World of
Data Science
Sreejith C
Data Scientist
Calpine Labs
UVJ Technologies
Kochi
Overview
- Presentation:
Introduction to Data Science
- Demonstration:
Loan Prediction Problem
- Exploratory data analysis in Python
- Data Munging in Python
- Building a Predictive Model in Python
Logistic Regression
Decision Tree
Random Forest
What is Data Science?
The Science of
- Discovering what we don’t know from data
- Obtaining predictive, actionable insight from data
- Creating Data Products that have business impact now
- Communicating relevant business stories from data
- Building confidence in decisions that drive business value
“Data science is clearly a blend of the hackers’ arts,
statistics and machine learning...
and the expertise in mathematics and the domain of
the data for the analysis to be interpretable...
It requires creative decisions and open-mindedness in
a scientific context”
Hilary Mason and Chris Wiggins
Hilary Mason is an American data scientist and the founder of technology startup Fast Forward Labs as well as Data Scientist in Residence at Accel Partners. She
was the Chief Scientist at bitly.
Christopher H. Wiggins is an associate professor of applied mathematics at Columbia University, the first Chief Data Scientist at The New York Times, and co-founder and co-organizer of hackNY (hackNY.org).
THE DATA SCIENCE VENN DIAGRAM
Who is a Data Scientist?
“We realized that as our organizations grew, we both had to figure
out what to call the people on our teams.
Business analyst and Data analyst seemed too limiting.
The focus of our teams was to work on data applications that would
have an immediate and massive impact on the business.
The term that seemed to fit best was data scientist:
those who use both data and science to create something new”
DJ Patil
Chief Data Scientist at the United States Office of Science and Technology Policy. Patil is credited with coining the term "data scientist".
Data science
What Does a Data Scientist
Do?
“... on any given day, a team member could author a multistage
processing pipeline in Python,
design a hypothesis test, perform a regression analysis over data
samples with R,
design and implement an algorithm for some data-intensive product
or service in Hadoop,
communicate the results of our analyses to other members of the
organization”
Jeff Hammerbacher
Data scientist, chief scientist, and cofounder at Cloudera. Along with DJ Patil, Jeff Hammerbacher is credited with coining the term "data scientist".
Machine Learning
- Regression
- Classification
- Clustering
Big Data Analytics
How to become a data scientist?
Data scientists need to know how to code
Python
R
Julia
Java
Scala
SQL / NoSQL
Spark / Hadoop
Data scientists need to be comfortable with
mathematics & statistics.
Data scientists need to know machine learning &
software engineering.
Putting the pieces together .....
SIMPLE (Students' Innovations in Morphology Phonology and
Language Engineering) groups
CLEAR (Computational Linguistics in Engineering And
Research) magazine
- Blog / Write about your experience
- Build sample projects
- Share ideas
Puzzle
A huntsman can hit a target with a probability of 0.8.
He sees a flock of birds (150 birds) atop a banyan tree.
He takes aim and fires 5 consecutive shots.
Question: How many birds remain on the tree?
Don't lose the big picture!
Answer: 0. The gunshots scare away every bird that is not hit.
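The arithmetic the puzzle invites can be sketched as follows. This is only an illustration, and it assumes the five shots are independent trials, which the slide does not state. It shows how a tidy calculation can still miss the real answer.

```python
# Naive reading of the puzzle: treat each shot as an independent trial.
# Independence is an assumption; the slide does not state it.
p_hit = 0.8    # probability of hitting the target with one shot
shots = 5
birds = 150

expected_hits = shots * p_hit                    # mean of a Binomial(5, 0.8)
p_at_least_one_hit = 1 - (1 - p_hit) ** shots    # complement of five misses in a row
naive_remaining = birds - expected_hits          # ignores how birds react to gunfire

print(f"expected hits: {expected_hits:.1f}")
print(f"P(at least one hit): {p_at_least_one_hit:.5f}")
print(f"naive answer: about {naive_remaining:.0f} birds remain")
print("big-picture answer: 0 birds remain")
```

The numbers are correct for the model, but the model leaves out the one fact that decides the outcome.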
Loan Prediction Problem
The challenge is to predict the approval status of a loan
(Approved / Rejected).
Link:
https://github.com/sreejithc321/ML_Regression/tree/master/loan_prediction
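As a rough illustration of the data munging step listed in the overview, the sketch below fills missing values before modeling. The records, the column names (income, loan_amount, credit_history) and the fill_missing helper are hypothetical and are not taken from the linked repository or its dataset. It uses only the Python standard library.

```python
from statistics import median, mode

# Hypothetical loan records; None marks a missing value.
# Column names and values are illustrative, not from the actual dataset.
records = [
    {"income": 5000, "loan_amount": 120, "credit_history": 1},
    {"income": 3000, "loan_amount": None, "credit_history": 1},
    {"income": 4000, "loan_amount": 100, "credit_history": None},
    {"income": 6000, "loan_amount": 200, "credit_history": 0},
]

def fill_missing(rows, column, strategy):
    """Return copies of rows with None in `column` replaced by strategy(observed values)."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill_value = strategy(observed)
    return [{**r, column: fill_value if r[column] is None else r[column]} for r in rows]

# Numeric column: fill with the median of the observed values.
cleaned = fill_missing(records, "loan_amount", median)
# Categorical flag: fill with the most common observed value.
cleaned = fill_missing(cleaned, "credit_history", mode)

for row in cleaned:
    print(row)
```

Median and mode are only one possible choice of fill strategy; which one suits a given column depends on the data.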
Demonstration
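Of the three models named in the overview, logistic regression is the simplest to sketch from scratch. The example below is a minimal illustration in plain Python, not the code used in the demonstration. The synthetic data, the two features (credit_history, income) and the labeling rule are assumptions made up for this sketch.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(weights, bias, x):
    """Estimated probability that the loan is approved."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)

def log_loss(weights, bias, X, y):
    """Mean negative log-likelihood; eps guards against log(0)."""
    eps = 1e-12
    total = 0.0
    for xi, yi in zip(X, y):
        p = predict_proba(weights, bias, xi)
        total -= yi * math.log(p + eps) + (1 - yi) * math.log(1 - p + eps)
    return total / len(X)

def fit(X, y, lr=0.5, epochs=2000):
    """Full-batch gradient descent on the mean log loss."""
    n, d = len(X), len(X[0])
    weights, bias = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(weights, bias, xi) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias

# Synthetic applicants (hypothetical): credit_history is 0 or 1, income is scaled to [0, 1].
# Made-up rule: a loan is approved when credit history is good and income exceeds 0.3.
rng = random.Random(0)
X, y = [], []
for _ in range(200):
    credit_history = 1 if rng.random() < 0.7 else 0
    income = rng.random()
    X.append([credit_history, income])
    y.append(1 if credit_history == 1 and income > 0.3 else 0)

initial_loss = log_loss([0.0, 0.0], 0.0, X, y)
weights, bias = fit(X, y)
final_loss = log_loss(weights, bias, X, y)

# Accuracy is measured on the training data, so it is optimistic.
accuracy = sum(
    (predict_proba(weights, bias, xi) >= 0.5) == (yi == 1) for xi, yi in zip(X, y)
) / len(X)

print(f"log loss: {initial_loss:.3f} -> {final_loss:.3f}")
print(f"training accuracy: {accuracy:.2f}")
```

In practice a held-out test split and a library implementation would be the usual choice; the hand-written loop is here only to make the fitting step visible.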
References
http://www.slideshare.net/ryanorban/how-to-become-a-data-scientist
http://www.slideshare.net/datasciencelondon/big-data-sorry-data-science-what-does-a-data-scientist-do
https://speakerdeck.com/bargava/introduction-to-machine-learning
https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-learn-data-science-python-scratch-2/
Connect with me at: http://in.linkedin.com/in/sreejithc321
Follow me at: https://twitter.com/sreejithc321
