HADOOP COURSE
Course Guide
Course Guide
Copy paste this guide and keep it somewhere safe, you will need these URLs later in this course. It's
easier to copy paste it from this document than type it over from the videos
URLs
• Course Documents: https://github.com/wardviaene/hadoop-ops-course/
• Course Documents zip: https://github.com/wardviaene/hadoop-ops-course/archive/master.zip
• Presentation: https://www.slideshare.net/secret/gm8u9XU8AE0tWN
• Facebook group: https://www.facebook.com/groups/947611682012330/
Hadoop Cluster Demos
To setup your own cluster (see the Hadoop installation), you need to install software.
This page gives you all the links to the material. You don't need to install it yet, I'll show you the demo in
one of the demo lectures in the first section of the course.
Multi-node cluster
If you have enough memory in your machine (8GB+) you can use the multi-node cluster using vagrant.
This is the preferred setup and this will closely align to the demos I will show you.
• VirtualBox: http://www.virtualbox.org
• Vagrant: http://www.vagrantup.com
• My git repository: https://github.com/wardviaene/hadoop-ops-course/archive/master.zip
Single-node cluster
If you don't have a powerful machine, you can also use the single-node sandbox from the Hortonworks
website. This is not the preferred way. Try the multi-node cluster first. This is an alternative if you don't
have enough memory in your machine. The cluster will already be installed for you. You can use this to
test the code examples, but you'll not be able to do the installation yourself.
Hadoop Single node sandbox: http://hortonworks.com/products/hortonworks-sandbox/
Direct link to VirtualBox
sandbox image: http://hortonassets.s3.amazonaws.com/2.4/HDP_2.4_virtualbox_v3.ova
Direct link to VMWare sandbox
image: http://hortonassets.s3.amazonaws.com/2.4/HDP_2.4_vmware_v3.ova
Resources for this lecture