
Recover from Namenode failure

For online Hadoop training, send mail to [email protected]


Agenda

What is Namenode
Responsibility of Namenode
Single point of failure
Causes of Namenode failure
Namenode recovery
Role of Secondary Namenode
FsImage & Edits files
Checkpoints in Hadoop
Creating checkpoints
Recovery with the help of checkpoint
What is Namenode

Namenode is a process which runs on the master machine of a Hadoop cluster.
We need to contact the Namenode for any read/write operation in HDFS.
Namenode keeps the metadata of the data stored in HDFS.
Namenode coordinates with datanodes to read/write data in HDFS.
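
For example, even a simple listing goes through the Namenode, which serves the metadata before any datanode is contacted (the path below is only an example):

# Ask the Namenode for directory metadata
hadoop fs -ls /user/vishnu

# Read a file: the Namenode returns block locations,
# the data itself streams from the datanodes
hadoop fs -cat /user/vishnu/sample.txt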
Responsibility of a Namenode

Namenode keeps a block map of all the files in HDFS.
It contacts each datanode and asks for a block report.
It merges all the datanode block reports into one cluster-wide block report.
It keeps a list of live nodes and dead nodes.
It balances the storage of the Hadoop cluster.
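
That live/dead node list can be inspected from the command line (Hadoop 1.x syntax; newer releases spell it hdfs dfsadmin -report):

# Print cluster capacity and the list of live and dead datanodes
hadoop dfsadmin -report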


Single point of failure

Namenode is a single point of failure in a Hadoop cluster.
The Hadoop cluster is not accessible if the Namenode is down.
We can't do any read/write operation, even though the datanodes still have all the data.
Hot backup is not yet supported in Hadoop.


Causes of Namenode failure

The master machine can stop working due to a hardware problem.
The Namenode metadata can get corrupted.
Without metadata, the Namenode is not capable of finding the data in HDFS.
We can't contact the datanodes directly for data.
A checkpoint can be used to recover the metadata.


Namenode recovery

The Namenode must be recovered in order to access HDFS.
The Hadoop cluster will remain offline until we recover the Namenode.
The Secondary Namenode can help the Namenode to recover.
We can only recover up to the last saved checkpoint.
Stale data is far better than no data.
Role of Secondary Namenode

The Secondary Namenode must be on a separate machine in a Hadoop production cluster.
Add the following entry in hdfs-site.xml to run the Secondary Namenode on another machine:

<property>
  <name>dfs.secondary.http.address</name>
  <value>192.168.1.2:50090</value>
  <description>The Secondary Namenode HTTP address and port</description>
</property>

Checkpoints, which are stored on the Secondary Namenode, help in Namenode recovery.
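
To confirm the process is actually running on that machine, jps should list it (daemon name as in Hadoop 1.x):

# Run on the host named in dfs.secondary.http.address
jps
# The output should include a SecondaryNameNode entry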
FsImage & Edits files

FsImage contains a snapshot of the HDFS metadata.
The Namenode loads the FsImage at its startup.
The FsImage is not updated after every read/write operation.
Instead, all the changes are recorded in the edits file.
Later, a new FsImage can be created by merging the old FsImage and the edits file.
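
Both files live in the Namenode's metadata directory, set by dfs.name.dir (the path below is only an example):

# Inspect the Namenode metadata directory (Hadoop 1.x layout, example path)
ls /data/dfs/name/current
# Typical contents: fsimage  edits  fstime  VERSION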
Creating checkpoints

Checkpoints are taken every hour by default.
Checkpoints are useful for recovering from failure.
We can also create a checkpoint manually, by running the command shown below.
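
On Hadoop 1.x, a checkpoint can be forced from the Secondary Namenode machine (a sketch; it assumes the hadoop script is on the PATH):

# Force an immediate checkpoint instead of waiting for
# fs.checkpoint.period (3600 seconds by default) to elapse
hadoop secondarynamenode -checkpoint force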


Checkpoints in Hadoop

Checkpoints are stored in the directory shown below.
The fsimage from the latest checkpoint is used to recover the Namenode.
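
In Hadoop 1.x that location is controlled by fs.checkpoint.dir, which defaults to ${hadoop.tmp.dir}/dfs/namesecondary (the hadoop.tmp.dir value below is only an example):

# Checkpoint directory on the Secondary Namenode (example path)
ls /tmp/hadoop-hdfs/dfs/namesecondary/current
# Typical contents: fsimage  edits  fstime  VERSION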


Recovery with the help of checkpoint

Follow the steps below to recover the Namenode:

1. Stop Hadoop by running the ./stop-all.sh command.
2. Copy the latest fsimage file from the checkpoint directory to the Namenode's current directory.
3. Start Hadoop by running the ./start-all.sh command.
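
Put together, a minimal recovery sketch (the paths are examples; substitute your own dfs.name.dir and fs.checkpoint.dir values, and the Secondary Namenode host from hdfs-site.xml):

# 1. Stop all Hadoop daemons
./stop-all.sh

# 2. Pull the latest checkpointed fsimage from the Secondary Namenode
#    into the primary Namenode's metadata directory (example paths)
scp 192.168.1.2:/tmp/hadoop-hdfs/dfs/namesecondary/current/fsimage \
    /data/dfs/name/current/fsimage

# 3. Restart Hadoop; the Namenode loads the restored fsimage at startup
./start-all.sh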


…Thanks…

For online Hadoop training, send mail to [email protected]
