0% found this document useful (0 votes)
912 views

Introduction To Hadoop and Mapreduce - VM Setup

The document provides instructions for downloading and setting up a virtual machine (VM) with included data sets for training. It outlines two options: 1) downloading the preconfigured VM file, or 2) downloading just the data sets and setting up a blank VM. Setup instructions are given for both Oracle VirtualBox and VMWare, including downloading and installing the software, creating a new VM, selecting the preconfigured image file or setting up a blank VM, and starting the VM.

Uploaded by

David Llanes
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
912 views

Introduction To Hadoop and Mapreduce - VM Setup

The document provides instructions for downloading and setting up a virtual machine (VM) with included data sets for training. It outlines two options: 1) downloading the preconfigured VM file, or 2) downloading just the data sets and setting up a blank VM. Setup instructions are given for both Oracle VirtualBox and VMWare, including downloading and installing the software, creating a new VM, selecting the preconfigured image file or setting up a blank VM, and starting the VM.

Uploaded by

David Llanes
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Downloading the VM (includes data sets)

Alternative - download data sets


Virtual machine setup
Using Oracle VirtualBox
Using VMWare

Downloading the VM (includes data sets)


1. Download it from http://content.udacity-data.com/courses/ud617/Cloudera-Udacity-
Training-VM-4.1.1.c.zip. Warning - the zipped file size is 1.7 GB. If you are on a
Windows machine you will likely need to use WinRAR to open this .zip file because other
methods fail to open the unzipped file (which exceeds the maximum specified 4GB for a
.zip file).
2. MD5sum file can be found here http://content.udacity-
data.com/courses/ud617/Cloudera-Udacity-Training-VM-4.1.1.c.zip.md5
3. Unzip it. Warning - the unzipped size is 4.2GB
4. MD5 hashes for files:
8a610c151d4b1ebdce11542d13dd2a53 Cloudera-Training-VM-4.1.1.c.log
6b44c965c1c6062554bf4cc12d11e87e Cloudera-Training-VM-4.1.1.c.plist
46dedeba3e0affd8311431d7e370705e Cloudera-Training-VM-4.1.1.c.vmdk
d41d8cd98f00b204e9800998ecf8427e Cloudera-Training-VM-4.1.1.c.vmsd
096956c1cbabeaa652ca63a2d5e14612 Cloudera-Training-VM-4.1.1.c.vmx
c9f8a375e82ef1e9d96097850e237df9 Cloudera-Training-VM-4.1.1.c.vmxf
0d7c8becb5a515068e81bb303c794e4f nvram

Alternative - download data sets


Download and unzip data sets from:
http://content.udacity-data.com/courses/ud617/access_log.gz
http://content.udacity-data.com/courses/ud617/purchases.txt.gz

Virtual machine setup


Using Oracle VirtualBox
1. Download and install VirtualBox from https://www.virtualbox.org/wiki/Downloads
2. Create a new Virtual machine:
a. Create a new virtual machine by pressing the New button:

b. Choose a name, use Type: Linux:


c. Press Next
d. Select memory size for the VM.

e. Press Next
f. Select Use an existing virtual hard drive file, click the button to browse to the
directory you unzipped the provided VM image and press Create.

g. Start the VM!


Using VMWare
1. Download and install from
https://my.vmware.com/web/vmware/free#desktop_end_user_computing/vmware_player
/6_0
2. Create the Virtual Machine:
a. Click on Open a Virtual Machine and, when prompted, navigate to the folder you
unzipped the VM, choose the file and click Open.
b. Select the machine and click Play virtual machine

You might also like