Apache HBase

Uploaded by

pprogram2314

Apache HBase
• Apache HBase is a non-relational (NoSQL) database.
• HBase was created for hosting very large tables with billions of rows and millions of columns.
• Provides random, real-time data access.
• Allows table inserts, updates and deletes.
• Runs on top of the Hadoop Distributed File System (HDFS).
• HBase data is automatically replicated by HDFS for higher availability.
HBase Architecture
• An HBase table is automatically distributed across a set of cluster
nodes to increase scalability and performance. HBase can scale
out to thousands of nodes. Each cluster node contains a portion of
a table called a region. Each region contains some number of
table rows.
• Each region is managed by a RegionServer service. RegionServers
typically run on the same machines that run the Hadoop
Distributed File System DataNode service.
• RegionServers are managed by the HMaster master service.
HMaster functions include:
  - Coordinating database metadata changes.
  - Monitoring the RegionServer nodes.
  - Orchestrating load balancing across RegionServer nodes.
  - Orchestrating recovery from failed RegionServer nodes.
• A ZooKeeper cluster handles configuration management and cluster
coordination. HBase client programs communicate with ZooKeeper first
to find the RegionServer node that manages the data to be read.
• Clients access HBase through a Java API, a REST interface, a Thrift
gateway, or the HBase shell command-line interface.
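The lookup step described above can be pictured with a small Python sketch. It is only a toy model, not the HBase client: it assumes each region covers a contiguous, sorted range of row keys, and the start keys and server names (REGION_START_KEYS, REGION_SERVERS, rs1.example.com and so on) are made up for illustration.

```python
import bisect

# Hypothetical region map: each region covers row keys from its start key
# (inclusive) up to the next region's start key (exclusive).
# The first region starts at the empty string, so it covers the lowest keys.
REGION_START_KEYS = ["", "g", "n", "t"]
REGION_SERVERS = ["rs1.example.com", "rs2.example.com",
                  "rs3.example.com", "rs1.example.com"]


def locate_region_server(row_id: str) -> str:
    """Return the server hosting the region whose key range contains row_id."""
    # bisect_right counts the start keys <= row_id; the last of those owns the row.
    index = bisect.bisect_right(REGION_START_KEYS, row_id) - 1
    return REGION_SERVERS[index]


print(locate_region_server("apple"))   # rs1.example.com
print(locate_region_server("grape"))   # rs2.example.com
print(locate_region_server("zebra"))   # rs1.example.com
```

One server can appear more than once in the list because a RegionServer may manage several regions. In a real cluster the client library performs this lookup itself after contacting ZooKeeper.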
HBase Architecture
Interaction between Daemons
Key-Value Mappings
• HBase contains maps of keys and their values.
Key --> Value
If we know the key, we can retrieve the value.
• Keys are multi-part: (column family name, rowID, column
qualifier, timestamp) --> value
• Column family name: determines storage properties.
  - All data in the same column family is stored together on
    disk.
• rowID: used to access data and divide table data into
regions.
  - Regions are maintained on separate RegionServer nodes.
• Column qualifier: the column name, which is just a label in
the multi-part key.
  - In any given row, one or more columns might or might not
    exist.
• Timestamp: used to version the data and support data
updates.
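As a rough illustration of the multi-part key, the following Python sketch stores each value under a (column family, rowID, column qualifier, timestamp) tuple in an ordinary dictionary. The put and get helpers and the sample data are hypothetical names chosen for this sketch; they mimic the idea, not the actual HBase API or its storage format.

```python
import time

# Toy in-memory model: one dict entry per cell, keyed by the multi-part key.
cells = {}


def put(family, row_id, qualifier, value, timestamp=None):
    """Store a value under (family, row_id, qualifier, timestamp)."""
    if timestamp is None:
        timestamp = time.time_ns() // 1_000_000  # current time in milliseconds
    cells[(family, row_id, qualifier, timestamp)] = value


def get(family, row_id, qualifier):
    """Return the value with the newest timestamp, or None if no cell exists."""
    versions = [(key[3], value) for key, value in cells.items()
                if key[:3] == (family, row_id, qualifier)]
    if not versions:
        return None
    return max(versions, key=lambda version: version[0])[1]


# An "update" is just a new cell with a later timestamp; the old version remains.
put("info", "user42", "email", "old@example.com", timestamp=1)
put("info", "user42", "email", "new@example.com", timestamp=2)
print(get("info", "user42", "email"))   # new@example.com
print(get("info", "user42", "phone"))   # None
```

The second lookup returns None because a column that was never written simply has no key in the map.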
Rows and Columns
• Rows and Columns are implemented differently than in most
relational databases.
• A multi-part key identifies a cell with a value.
• Because a table is just a set of key --> value mappings, a row is
nothing more than a logical collection of values.
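To make the "logical collection" idea concrete, here is a minimal Python sketch that assembles rows by grouping cells that share a row ID. The sample cells and the rows_from_cells helper are assumptions made for this example, and timestamps are left out of the key to keep it short.

```python
from collections import defaultdict

# Hypothetical cells: (row ID, "family:qualifier") -> value.
sample_cells = {
    ("row1", "info:name"): "Ada",
    ("row1", "info:email"): "ada@example.com",
    ("row2", "info:name"): "Linus",
}


def rows_from_cells(cell_map):
    """Group cells by row ID; a 'row' is just the cells that share one."""
    rows = defaultdict(dict)
    for (row_id, column), value in cell_map.items():
        rows[row_id][column] = value
    return dict(rows)


rows = rows_from_cells(sample_cells)
print(rows["row1"])                  # {'info:name': 'Ada', 'info:email': 'ada@example.com'}
print("info:email" in rows["row2"])  # False
```

Nothing is stored for the missing email in row2, which is how a row can lack columns that other rows have.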
HBase is a Column-Oriented Database
• A column-oriented database stores column items together
on disk.
• Column-oriented databases are well suited for fast column
operations. For example:
  - Calculating the sum or aggregate of an entire column of data.
  - Finding the 50 largest items in a column of 2 billion records.
• They are also well suited for sparse datasets, which are common
in big data use cases.
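The difference between the two layouts can be sketched in a few lines of Python. The records and field names below are invented for illustration, and in-memory lists only stand in for on-disk storage.

```python
import heapq

# Hypothetical records, row-oriented: each record keeps all its fields together.
row_layout = [
    {"id": 1, "name": "a", "amount": 30},
    {"id": 2, "name": "b", "amount": 75},
    {"id": 3, "name": "c", "amount": 12},
    {"id": 4, "name": "d", "amount": 98},
]

# Column-oriented: the values of each column are stored together.
column_layout = {
    field: [record[field] for record in row_layout]
    for field in ("id", "name", "amount")
}

# A column operation touches only the one list it needs.
total = sum(column_layout["amount"])
top_two = heapq.nlargest(2, column_layout["amount"])
print(total)    # 215
print(top_two)  # [98, 75]
```

With the row layout, the same sum would have to walk every record and skip over the id and name fields along the way.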
HBase Operations Overview
• HBase operations include put, get, delete and scan.
• HBase has no native structured query language (SQL).
• Writes initially go to the in-memory MemStore.
• Writes are immediately logged to disk (the write-ahead log) for durability.
• Writes are regularly flushed from the MemStore to a StoreFile on
disk.
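The write path above can be modelled with a small Python class. This is a simplified sketch under stated assumptions: ToyRegionStore and its attributes are hypothetical names, Python lists stand in for files on disk, and the flush is triggered by a cell count here purely to keep the example short.

```python
class ToyRegionStore:
    """Toy model of the write path: log first, buffer in memory, flush later."""

    def __init__(self, flush_threshold=3):
        self.wal = []          # stand-in for the on-disk log
        self.memstore = {}     # in-memory write buffer
        self.storefiles = []   # stand-in for storefiles on disk
        self.flush_threshold = flush_threshold  # assumed count-based trigger

    def put(self, key, value):
        self.wal.append((key, value))   # logged immediately for durability
        self.memstore[key] = value      # then held in memory
        if len(self.memstore) >= self.flush_threshold:
            self.flush()

    def flush(self):
        """Write the buffered cells out as one sorted, immutable storefile."""
        if self.memstore:
            self.storefiles.append(dict(sorted(self.memstore.items())))
            self.memstore = {}

    def get(self, key):
        """Check memory first, then storefiles from newest to oldest."""
        if key in self.memstore:
            return self.memstore[key]
        for storefile in reversed(self.storefiles):
            if key in storefile:
                return storefile[key]
        return None


store = ToyRegionStore(flush_threshold=2)
store.put("r1", "a")
store.put("r2", "b")      # reaches the threshold and triggers a flush
store.put("r1", "c")      # newer value stays in memory for now
print(store.get("r1"))    # c
print(store.get("r2"))    # b
print(len(store.storefiles), len(store.wal))  # 1 3
```

Deletes, scans, and log cleanup after a flush are left out of this sketch.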
HBase vs RDBMS
