SlideShare a Scribd company logo
Hadoop for beginners   free course ppt
Hadoop for beginners   free course ppt
 Facebook,Twitter, Google generating petabytes of data everyday.
 Hadron Collider project discarding large amount of data as they won’t be able to
analyse. Hoping that they haven’t thrown anything valuable.
Interesting facts but ….Why is Big Data important?
Lets understand via an example
Bank
Optimal
Price?
Maximise Profit
Insurance
3rd Party Survey Expert Debates
Optimal Price
Bank
Optimal
Price?
Maximise Profit
Insurance
Optimal Price
Data Warehousing
Repository
WebActivity
Transaction
Competitors Pricing
MarketTrends
Statistics
Data
WarehouseRun Statistical
Algorithms
Decision Support
System
Hadoop for beginners   free course ppt
Volume VelocityVariety
Bank
Optimal
Price?
Maximise Profit
Insurance
Optimal Price
Data Warehousing
Repository
WebActivity
Transaction
Competitors Pricing
MarketTrends
Statistics
Data
WarehouseRun Statistical
Algorithms
Decision Support
System
Hadoop for beginners   free course ppt
Decision Support
System
Digital
Nervous
System
Data
Fundamental
block to
Data
Fundamental
Block to
Business @
speed of
thought
Sense
Interpret
Decide
Act
Organisations behaving like
Biological nervous system
AvatarSkynet
Bank
Repository
WebActivity
Transaction
Competitors Pricing
MarketTrends
Statistics
Optimal Price
Mobile Alert with
Travel insurance
International DataCorporation’s (IDC) 6th annual study:
 From 2005 to 2020, the digital universe will grow by a factor of 300, from 130
exabytes to 40,000 exabytes, or 40 trillion gigabytes
 More than 5,200 gigabytes for every man, woman, and child in 2020.
 From now until 2020, the digital universe will about double every two years.
 33% of the digital data might be valuable if analysed, compared with 25%
today.
From Gartner:
 4.4 Million IT Jobs Globally to Support Big Data By 2015.
Hadoop for beginners   free course ppt
2003-041996-2000 2005-06 2010 2013
Google File System
And MapReduce Papers
YARN/MapReduce 2/
Next Generation Hadoop
Hadoop spawns off
Nutch
Big Data problem faced by
All Search engines
and Mike
Dreadnaught
Doug Joins
Cloudera
0.xx Releases of
hadoop
Hadoop for beginners   free course ppt
Hadoop for beginners   free course ppt
PriceAdvantage:
1. Clusters use commodity
hardware, cheaper than
one expensive server.
2. Software License is free.
HDFS
MapReduce
Google File System
Google MapReduce
file1
Name node
Data nodes
map map map map map Reduce
User
Hadoop for beginners   free course ppt
Hadoop for beginners   free course ppt
HDFS
MapReduce HBase
Pig Hive
Sqoop/Flume
Log collection
Yahoo Facebook
Storm
Chukwa
Kafka
Structured Stores
Message broker
Oozie
Hadoop for beginners   free course ppt
Complex Algorithm
on a small dataset
SimpleAlgorithm
on a large dataset
1. Complex Algorithms needs to be
correctly sensitive to week
correlations.
2. Complex Algorithms are thus
difficult to code and design.
Data Engineer Data Scientist
Role
Skills
To solve business problems
using data.
To engineer software solutions.
More of programing and
technical skills and ability to
architect technical solutions.
Strong of Mathematical Skills
and understanding of statistical
Models.
-> SkeletonVersion
->All the ecosystems need
to be additionally installed.
-> Important ecosystem
members included.
-> Few Proprietary tools
like Enterprise Manager.
-> Proprietary Hadoop code
written in C.
-> Integrated with Hadoop
ecosystem members.
-> Based out of Apache
hadoop.
-> Supports .NET framework
-> Launches Hadoop
Distribution: Pivotal HD
ThankYou!!!
Superstar-Doug!!!
A small fan :- Me
And the real Hadoop

More Related Content

PPTX
Big data and Hadoop
PPTX
Introduction to Apache Hadoop
PPTX
Hadoop and Big Data
PPTX
Intro to Big Data Hadoop
PDF
Introduction to Bigdata and HADOOP
PPTX
Big Data Analytics for Non-Programmers
DOCX
Hadoop Seminar Report
DOCX
Hadoop Report
Big data and Hadoop
Introduction to Apache Hadoop
Hadoop and Big Data
Intro to Big Data Hadoop
Introduction to Bigdata and HADOOP
Big Data Analytics for Non-Programmers
Hadoop Seminar Report
Hadoop Report

What's hot (20)

PDF
Introduction to Big Data & Hadoop
DOCX
10 Popular Hadoop Technical Interview Questions
PDF
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
PDF
Report Hadoop Map Reduce
PPTX
Big data and hadoop
PPTX
Whatisbigdataandwhylearnhadoop
PPTX
Big Data & Hadoop Tutorial
PPTX
Hadoop Tutorial For Beginners | Apache Hadoop Tutorial For Beginners | Hadoop...
DOCX
Hadoop technology doc
PPTX
Big Data and Hadoop Introduction
PDF
Introduction To Big Data Analytics On Hadoop - SpringPeople
PPTX
Introduction to Apache Hadoop Eco-System
DOCX
Big data abstract
PPTX
Big Data Hadoop Tutorial by Easylearning Guru
PPTX
Big data ppt
PPTX
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
PDF
Big data technologies and Hadoop infrastructure
PPTX
Large Scale Data With Hadoop
PDF
Introduction to Big data & Hadoop -I
PPTX
Big data ppt
Introduction to Big Data & Hadoop
10 Popular Hadoop Technical Interview Questions
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
Report Hadoop Map Reduce
Big data and hadoop
Whatisbigdataandwhylearnhadoop
Big Data & Hadoop Tutorial
Hadoop Tutorial For Beginners | Apache Hadoop Tutorial For Beginners | Hadoop...
Hadoop technology doc
Big Data and Hadoop Introduction
Introduction To Big Data Analytics On Hadoop - SpringPeople
Introduction to Apache Hadoop Eco-System
Big data abstract
Big Data Hadoop Tutorial by Easylearning Guru
Big data ppt
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Big data technologies and Hadoop infrastructure
Large Scale Data With Hadoop
Introduction to Big data & Hadoop -I
Big data ppt
Ad

Similar to Hadoop for beginners free course ppt (20)

PDF
The book of elephant tattoo
PDF
ANALYTICS OF DATA USING HADOOP-A REVIEW
PDF
Big Data-Survey
PPTX
How to tackle big data from a security
PPT
Introduction to Big Data An analogy between Sugar Cane & Big Data
PDF
PDF
Problem Definition muAoPS | Analytics Problem Solving | Mu Sigma
PDF
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
PDF
Future of Big Data
PPTX
Big data Analytics
PDF
An Encyclopedic Overview Of Big Data Analytics
PPT
Big data with hadoop
PPSX
10-Hot-Data-Analytics-Tre-8904178.ppsx
PDF
Big Data Analytics
PPTX
A Big Data Concept
PDF
R180305120123
PDF
Big data data lake and beyond
PPTX
Big data
PPTX
Big data business case
PPTX
Big Data
The book of elephant tattoo
ANALYTICS OF DATA USING HADOOP-A REVIEW
Big Data-Survey
How to tackle big data from a security
Introduction to Big Data An analogy between Sugar Cane & Big Data
Problem Definition muAoPS | Analytics Problem Solving | Mu Sigma
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Future of Big Data
Big data Analytics
An Encyclopedic Overview Of Big Data Analytics
Big data with hadoop
10-Hot-Data-Analytics-Tre-8904178.ppsx
Big Data Analytics
A Big Data Concept
R180305120123
Big data data lake and beyond
Big data
Big data business case
Big Data
Ad

Recently uploaded (20)

PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
KodekX | Application Modernization Development
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
A Presentation on Artificial Intelligence
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
“AI and Expert System Decision Support & Business Intelligence Systems”
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Reach Out and Touch Someone: Haptics and Empathic Computing
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Modernizing your data center with Dell and AMD
Network Security Unit 5.pdf for BCA BBA.
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
cuic standard and advanced reporting.pdf
KodekX | Application Modernization Development
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
A Presentation on Artificial Intelligence
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Per capita expenditure prediction using model stacking based on satellite ima...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows

Hadoop for beginners free course ppt