Exp 9 - Merged

Uploaded by Abhishek Tiwari

EXPERIMENT NUMBER – 06

AIM: To develop a MapReduce program to find the grades of students.

import java.util.Scanner;

public class Main {
    public static void main(String[] args) {
        int[] marks = new int[6];
        float total = 0, avg;
        Scanner scanner = new Scanner(System.in);
        for (int i = 0; i < 6; i++) {
            System.out.print("Enter Marks of Subject " + (i + 1) + ": ");
            marks[i] = scanner.nextInt();
            total = total + marks[i];
        }
        scanner.close();
        avg = total / 6;
        System.out.print("The student grade is: ");
        if (avg >= 80)
            System.out.println("A");
        else if (avg >= 60)
            System.out.println("B");
        else if (avg >= 40)
            System.out.println("C");
        else
            System.out.println("D");
    }
}
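The grading thresholds above can be isolated into a small helper so they are easy to check in one place (a sketch; the class and method names `Grade` and `gradeOf` are ours, not from the lab sheet):

```java
// Grade.java - the grading thresholds from the program above,
// pulled into a helper method so they can be tested directly.
public class Grade {
    // Returns the letter grade for an average mark.
    public static String gradeOf(float avg) {
        if (avg >= 80) return "A";
        else if (avg >= 60) return "B";   // 60 <= avg < 80
        else if (avg >= 40) return "C";   // 40 <= avg < 60
        else return "D";                  // avg < 40
    }

    public static void main(String[] args) {
        int[] marks = {75, 82, 68, 90, 71, 66};  // sample marks for 6 subjects
        float total = 0;
        for (int m : marks) total += m;
        System.out.println("The student grade is: " + gradeOf(total / 6));
    }
}
```

Note that the boundary values fall into the higher band: an average of exactly 80 is an A, exactly 60 a B, and exactly 40 a C.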
Loading the dataset and initiating Pig:
Filtering & Sorting operation:
Grouping & Splitting operation:
EXPERIMENT NUMBER – 09

AIM: To perform matrix multiplication using a MapReduce program.

Prerequisites:
1. Hadoop setup in Cloudera: Ensure that Hadoop is installed and configured in
Cloudera.

2. Eclipse IDE installed: Eclipse should be installed with the appropriate plugins for
Java development.

3. Hadoop Libraries: Ensure that the Hadoop libraries are added to your project build
path in Eclipse.

Step 1: Set Up Your Eclipse Project:


1. Open Eclipse IDE and create a new Java project.

o Go to File > New > Java Project.

o Name your project, e.g., MatrixMultiplication.

o Click Finish.

2. Add Hadoop Libraries to the Build Path:


o Right-click on the project in the Project Explorer and select Build Path >
Configure Build Path.
o Under the Libraries tab, click Add External JARs.

o Navigate to your Hadoop installation directory, usually located at


/usr/lib/hadoop/ in Cloudera.
o Add all the required JAR files from hadoop-common, hadoop-hdfs, hadoop-mapreduce-client-core, and hadoop-yarn.

Step 2: Create Input Matrices:


Matrix multiplication requires two matrices as input. Store them in HDFS in a sparse format, one element per line, as matrixName,row,column,value:

matrixA:
A,0,0,3
A,0,1,4
A,1,0,2
….

matrixB:
B,0,0,1
B,1,0,2
B,0,1,5
…..
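As a reference for what the job should ultimately compute, here is the same multiplication done with a plain (non-distributed) triple loop. The sample values reuse the entries listed above; the elided entries (A(1,1) and B(1,1)) are filled with made-up values purely for illustration:

```java
// LocalMultiply.java - plain in-memory matrix multiplication, as a
// reference for the result the MapReduce job should produce.
public class LocalMultiply {
    // c[i][k] = sum over j of a[i][j] * b[j][k]
    public static int[][] multiply(int[][] a, int[][] b) {
        int m = a.length, n = b.length, p = b[0].length;
        int[][] c = new int[m][p];
        for (int i = 0; i < m; i++)
            for (int k = 0; k < p; k++)
                for (int j = 0; j < n; j++)
                    c[i][k] += a[i][j] * b[j][k];
        return c;
    }

    public static void main(String[] args) {
        int[][] a = {{3, 4}, {2, 1}};   // matrixA; A(1,1)=1 is a made-up filler
        int[][] b = {{1, 5}, {2, 6}};   // matrixB; B(1,1)=6 is a made-up filler
        int[][] c = multiply(a, b);
        for (int[] row : c)
            System.out.println(row[0] + " " + row[1]);
    }
}
```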

Step 3: Write the MapReduce Code


1. Create a Mapper Class:

o Right-click on your project, select New > Class, and name it MatrixMapper.

o Write the following code:


The code is as follows:
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import java.io.IOException;

public class MatrixMapper extends Mapper<Object, Text, Text, Text> {
    private Text outputKey = new Text();
    private Text outputValue = new Text();

    @Override
    protected void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
        // Split the input line into tokens
        String[] tokens = value.toString().split(",");
        // Expect 4 tokens: matrixName, i, j, value_ij
        if (tokens.length == 4) {
            String matrixName = tokens[0];
            int i = Integer.parseInt(tokens[1]);
            int j = Integer.parseInt(tokens[2]);
            int value_ij = Integer.parseInt(tokens[3]);
            if (matrixName.equals("A")) {
                // A(i,j) contributes to every output cell (i,k), k = 0..p-1
                for (int k = 0; k < context.getConfiguration().getInt("p", 0); k++) {
                    outputKey.set(i + "," + k);
                    outputValue.set("A," + j + "," + value_ij);
                    context.write(outputKey, outputValue);
                }
            } else if (matrixName.equals("B")) {
                // B(i,j) contributes to every output cell (k,j), k = 0..m-1
                for (int k = 0; k < context.getConfiguration().getInt("m", 0); k++) {
                    outputKey.set(k + "," + j);
                    outputValue.set("B," + i + "," + value_ij);
                    context.write(outputKey, outputValue);
                }
            }
        } else {
            // Log or handle the case of an incorrect input format
            System.err.println("Invalid input format: " + value.toString());
        }
    }
}
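The mapper's key-replication scheme can be checked outside Hadoop with a small plain-Java simulation. Here `emitFor`, `m`, and `p` are illustrative stand-ins for the mapper's Context and Configuration, not Hadoop API:

```java
import java.util.ArrayList;
import java.util.List;

// MapperSim.java - simulates what MatrixMapper emits for one input line,
// so the key replication can be inspected without running a Hadoop job.
public class MapperSim {
    // An A(i,j) element is sent to every output cell (i,k) for k in [0,p);
    // a B(i,j) element to every output cell (k,j) for k in [0,m).
    public static List<String> emitFor(String line, int m, int p) {
        List<String> out = new ArrayList<>();
        String[] t = line.split(",");
        String name = t[0];
        int i = Integer.parseInt(t[1]);
        int j = Integer.parseInt(t[2]);
        int v = Integer.parseInt(t[3]);
        if (name.equals("A")) {
            for (int k = 0; k < p; k++)
                out.add(i + "," + k + " -> A," + j + "," + v);
        } else if (name.equals("B")) {
            for (int k = 0; k < m; k++)
                out.add(k + "," + j + " -> B," + i + "," + v);
        }
        return out;
    }

    public static void main(String[] args) {
        // With p = 2, A(0,1)=4 is replicated to output cells (0,0) and (0,1).
        System.out.println(emitFor("A,0,1,4", 2, 2));
    }
}
```

This makes the shuffle visible: every record tagged with key "i,k" ends up at the reducer responsible for output cell C(i,k).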
2. Create a Reducer class:

o Similarly, create a new class named MatrixReducer with the following code:

The code is as follows:


import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

public class MatrixReducer extends Reducer<Text, Text, Text, IntWritable> {
    private IntWritable result = new IntWritable();

    @Override
    protected void reduce(Text key, Iterable<Text> values, Context context)
            throws IOException, InterruptedException {
        Map<Integer, Integer> aMap = new HashMap<>();
        Map<Integer, Integer> bMap = new HashMap<>();
        // Populate aMap and bMap with values from matrices A and B respectively
        for (Text val : values) {
            String[] tokens = val.toString().split(",");
            String matrixName = tokens[0];
            int index = Integer.parseInt(tokens[1]);
            int matrixValue = Integer.parseInt(tokens[2]);
            if (matrixName.equals("A")) {
                aMap.put(index, matrixValue);
            } else if (matrixName.equals("B")) {
                bMap.put(index, matrixValue);
            }
        }
        // Multiply matching entries and sum the products
        int sum = 0;
        for (Map.Entry<Integer, Integer> entry : aMap.entrySet()) {
            int index = entry.getKey();
            int aVal = entry.getValue();
            if (bMap.containsKey(index)) {
                int bVal = bMap.get(index);
                sum += aVal * bVal;
            }
        }
        result.set(sum);
        context.write(key, result);
    }
}
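The reducer's core computation is a dot product: for one output cell, the values tagged A are paired with the values tagged B by their shared index and the products are summed. That logic can be exercised in plain Java (the class and method names `ReducerSim` and `dotProduct` are ours; like the reducer above, this assumes at most one A value and one B value per index for a given cell):

```java
import java.util.HashMap;
import java.util.Map;

// ReducerSim.java - the reducer's dot-product logic on plain strings,
// so it can be checked without a Hadoop cluster.
public class ReducerSim {
    public static int dotProduct(String[] values) {
        Map<Integer, Integer> aMap = new HashMap<>();
        Map<Integer, Integer> bMap = new HashMap<>();
        // Each value is "A,index,element" or "B,index,element"
        for (String val : values) {
            String[] t = val.split(",");
            int index = Integer.parseInt(t[1]);
            int v = Integer.parseInt(t[2]);
            if (t[0].equals("A")) aMap.put(index, v);
            else if (t[0].equals("B")) bMap.put(index, v);
        }
        // Pair A and B entries by index and sum the products
        int sum = 0;
        for (Map.Entry<Integer, Integer> e : aMap.entrySet()) {
            Integer b = bMap.get(e.getKey());
            if (b != null) sum += e.getValue() * b;
        }
        return sum;
    }

    public static void main(String[] args) {
        // Row [3, 4] of A times column [1, 2] of B: 3*1 + 4*2 = 11
        System.out.println(dotProduct(new String[]{"A,0,3", "A,1,4", "B,0,1", "B,1,2"}));
    }
}
```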

3. Create the Driver Class:

• Finally, create the main driver class, named MatrixMultiplication:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class MatrixMultiplication {
    public static void main(String[] args) throws Exception {
        // Ensure the correct number of arguments are passed
        if (args.length < 5) {
            System.err.println("Usage: MatrixMultiplication <input path> <output path> <m> <n> <p>");
            System.exit(-1);
        }
        Configuration conf = new Configuration();
        conf.setInt("m", Integer.parseInt(args[2])); // rows of A
        conf.setInt("n", Integer.parseInt(args[3])); // columns of A (and rows of B)
        conf.setInt("p", Integer.parseInt(args[4])); // columns of B
        // Validate matrix dimensions
        if (conf.getInt("m", 0) <= 0 || conf.getInt("n", 0) <= 0 || conf.getInt("p", 0) <= 0) {
            System.err.println("Invalid matrix dimensions: m, n, and p must be positive integers.");
            System.exit(-1);
        }
        Job job = Job.getInstance(conf, "Matrix Multiplication");
        job.setJarByClass(MatrixMultiplication.class);
        job.setMapperClass(MatrixMapper.class);
        job.setReducerClass(MatrixReducer.class);
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(Text.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        job.setInputFormatClass(TextInputFormat.class);
        job.setOutputFormatClass(TextOutputFormat.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}

Step 4: Compile and Run the Program


1. Export the JAR file:

o Right-click on the project and select Export > JAR file.

o Choose a location to save the JAR file and click Finish.

2. Copy the Input Files to HDFS:

o Use the Cloudera terminal to create directories in HDFS and copy the input
matrices:
hadoop fs -mkdir -p /user/matrix/input

hadoop fs -put /path/to/matrixA.txt /user/matrix/input

hadoop fs -put /path/to/matrixB.txt /user/matrix/input

3. Run the MapReduce job:


• In the terminal, run the following command. The driver expects the matrix dimensions m, n, and p as its last three arguments (pass the main class name, MatrixMultiplication, if it is not set in the JAR manifest):

hadoop jar /path/to/MatrixMultiplication.jar MatrixMultiplication /user/matrix/input /user/matrix/output <m> <n> <p>

4. Check the Output:

• After the job completes, check the output in HDFS:

hadoop fs -cat /user/matrix/output/part-r-00000

Step 5: Debugging and Logs


• If the job fails, check the logs in Cloudera Manager or use the yarn logs command to
troubleshoot errors.

Step 6: Visualize the Results


• Use any tool or script to visualize the output matrix, depending on your needs.
Outputs:
