
HDFS (Hadoop Distributed File System)

HDFS Architecture
Components of the Architecture:

NameNode: The master server; it mainly stores the metadata of the file system and information about all the DataNodes.

Secondary NameNode: Provides a checkpoint in HDFS by periodically merging the fsImage and EditLogs.
DataNode: i) DataNodes perform read-write operations on the file system, as per client requests. ii) They also perform operations such as block creation, deletion, and replication as instructed by the NameNode.

Block: The file segments where user data is actually stored are called blocks. The default block size is 64 MB in Hadoop 1.x (128 MB in Hadoop 2.x). A file's block layout can be inspected as shown below.
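As a quick check, the fsck tool can list the blocks and replica locations of a file; a minimal sketch, assuming a file /user/hadoop/file1 already exists in HDFS:

$> ~/hadoop-1.0.3/bin/hadoop fsck /user/hadoop/file1 -files -blocks -locations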

Working with Hadoop – Starting HDFS

Introduction:

HDFS (Hadoop Distributed File System) is a distributed, fault-tolerant file system that can hold and process data which is really Big for us. We discussed the concept of Big as it appears to us earlier.

We will be starting the Hadoop system and HDFS. The commands will start the NameNode, Secondary NameNode, DataNode, JobTracker, and TaskTracker; the jps tool is then used to verify them. We have already discussed the basic architecture of HDFS.

Pre-Requisites:

Hadoop should be properly installed on the system, either as a SingleNode Cluster or a MultiNode Cluster.

Sequence of Operations to Start the Hadoop Distributed File System (HDFS)

Step 1: To start HDFS, we use:

$> ~/hadoop-1.0.3/bin/start-all.sh

[Use the appropriate Hadoop version as per your installation. Hadoop 2.x users should instead use: i) start-dfs.sh ii) start-yarn.sh]

Step 2: To check the status of the started services, the command used is:

$> jps

It lists six Java processes (Jps is the monitoring tool itself, not a Hadoop service):

NameNode, JobTracker, TaskTracker, SecondaryNameNode, DataNode, Jps.
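For reference, a typical jps listing looks like the following; the process IDs are illustrative and will differ on every machine:

$> jps
2114 NameNode
2243 DataNode
2375 SecondaryNameNode
2459 JobTracker
2598 TaskTracker
2687 Jps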
Step 3: Ensure that the NameNode is NOT in safemode for proper operations to be performed on HDFS. We use the following command to turn safemode off for HDFS:

$> ~/hadoop-1.0.3/bin/hadoop dfsadmin -safemode leave
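To simply check the current safemode status without changing it, dfsadmin also offers a get option:

$> ~/hadoop-1.0.3/bin/hadoop dfsadmin -safemode get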

Step 4: To check the HDFS file system from the browser, open the following URL:

http://localhost:50070/dfshealth.jsp
The browser then displays the NameNode's HDFS health page.

Working with Hadoop Distributed File System – Using FS Shell Commands
Introduction:

The FileSystem (FS) shell provides all the basic commands needed to operate on files and data between HDFS and the local file system. It is invoked via bin/hadoop fs. All FS shell commands take path URIs as arguments.

These shell commands require Hadoop to have been started normally, and the NameNode's safemode to be turned off.

File System (FS) Shell Commands:

The following presents the syntax of the most important file system commands.

1. cat command:

Usage: hadoop fs -cat URI [URI …]

Copies source paths to stdout.

Example:

hadoop fs -cat /user/hadoop/file1 /user/hadoop/file2

Exit Code:
Returns 0 on success and -1 on error.

2. chgrp command:

Usage: hadoop fs -chgrp [-R] GROUP URI [URI …]

Change group association of files. With -R, make the change recursively
through the directory structure. The user must be the owner of files, or
else a super-user.
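A brief example; the group name hadoopgroup is only an assumed placeholder for a group that exists on your cluster:

 hadoop fs -chgrp -R hadoopgroup /user/hadoop/dir1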

3. chmod command:

Usage: hadoop fs -chmod [-R] <MODE[,MODE]... | OCTALMODE> URI [URI …]

Change the permissions of files. With -R, make the change recursively
through the directory structure. The user must be the owner of the file,
or else a super-user.
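For instance, granting the owner full access and everyone else read/execute permissions on a directory tree (the path is illustrative):

 hadoop fs -chmod -R 755 /user/hadoop/dir1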

4. chown command:

Usage: hadoop fs -chown [-R] [OWNER][:[GROUP]] URI [URI …]


Change the owner of files. With -R, make the change recursively through the directory structure. The user must be a super-user.
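A sketch, assuming a user hduser and a group hadoop exist on the cluster:

 hadoop fs -chown -R hduser:hadoop /user/hadoop/dir1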

5. copyFromLocal command:

Usage: hadoop fs -copyFromLocal <localsrc> URI

Similar to the put command, except that the source is restricted to a local file reference. This copies file(s) from the local file system to HDFS.
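For example, copying localfile from the current local directory into HDFS (names are illustrative):

 hadoop fs -copyFromLocal localfile /user/hadoop/hadoopfile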

6. copyToLocal

Usage: hadoop fs -copyToLocal [-ignorecrc] [-crc] URI <localdst>

Similar to the get command, except that the destination is restricted to a local file reference. Copies file(s) from HDFS to the local file system.
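For example (names are again illustrative):

 hadoop fs -copyToLocal /user/hadoop/hadoopfile localfile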

7. cp

Usage: hadoop fs -cp URI [URI …] <dest>

Copy files from source to destination. This command allows multiple sources as well, in which case the destination must be a directory.
Example:

1. hadoop fs -cp /user/hadoop/file1 /user/hadoop/file2
2. hadoop fs -cp /user/hadoop/file1 /user/hadoop/file2 /user/hadoop/dir

Exit Code: Returns 0 on success and -1 on error.

8. get

Usage: hadoop fs -get [-ignorecrc] [-crc] <src> <localdst>


Copy files to the local file system. Files that fail the CRC check may be
copied with the -ignorecrc option. Files and CRCs may be copied using
the -crc option.

Example:

 hadoop fs -get /user/hadoop/file localfile

Exit Code: Returns 0 on success and -1 on error.

9. mkdir

Usage: hadoop fs -mkdir <paths>

Takes path URIs as arguments and creates directories. The behavior is much like Unix mkdir -p, creating parent directories along the path.

Example:

 hadoop fs -mkdir /user/hadoop/dir1 /user/hadoop/dir2

Exit Code:

Returns 0 on success and -1 on error.

10. mv

Usage: hadoop fs -mv URI [URI …] <dest>

Moves files from source to destination. This command allows multiple sources as well, in which case the destination needs to be a directory. Moving files across file systems is not permitted.
Example:

 hadoop fs -mv /user/hadoop/file1 /user/hadoop/file2


Exit Code:

Returns 0 on success and -1 on error.

11. put

Usage: hadoop fs -put <localsrc> ... <dst>

Copy single src, or multiple srcs, from the local file system to the destination file system. Also reads input from stdin and writes to the destination file system.

 hadoop fs -put localfile /user/hadoop/hadoopfile


 hadoop fs -put localfile1 localfile2 /user/hadoop/hadoopdir
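Since put also reads from stdin, a dash can be used as the source; the target file name here is just an illustration:

 echo "sample data" | hadoop fs -put - /user/hadoop/stdinfile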

Exit Code:

Returns 0 on success and -1 on error.

12. rm

Usage: hadoop fs -rm URI [URI …]

Delete files specified as args. Only deletes files and empty directories; refer to rmr for recursive deletes.
Example:

 hadoop fs -rm hdfs://nn.example.com/file /user/hadoop/emptydir
Exit Code:

Returns 0 on success and -1 on error.

13. rmr

Usage: hadoop fs -rmr URI [URI …]

Recursive version of delete.

Example:

 hadoop fs -rmr /user/hadoop/dir

Exit Code:

Returns 0 on success and -1 on error.

Conclusion:

The above provides a list of the most important commands to be used from the HDFS shell to work with files and directories.
