Big Data
Live Lab 1: Starting the Virtual Machine
Copyright © 2020, Oracle and/or its affiliates. All rights reserved.
Big Data: Starting the Virtual Machine
• In this lab you will be introduced to the following:
−Identifying System resources
−Downloading and installing a virtual machine
−Configuring a virtual machine
−Configuring access to external files
−Updating settings
−Adding additional JDBC drivers
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 3
Resources required
• To run the labs for this course you will require the
following:
−Sufficient system resources to run the virtual image
−Oracle Virtual Box
−Oracle Big Data Lite VM
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 4
Confirming system resources
• Prepare your host system
−Minimum 16 GB of real memory
−Turn on Virtual Assist features in the BIOS
−95GB disk space needed to download and install VM
−Download and install 7Zip (free)
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 5
Getting the resources
• Download the Oracle Big Data Lite VM
−http://www.oracle.com/technetwork/database/bigdata-
appliance/oracle-bigdatalite-2104726.html
−Extract the zipped files into a directory on your local machine
• This requires Virtual Box
−http://www.oracle.com/technetwork/server-
storage/virtualbox/downloads/index.html
−Install virtual box on your machine, install extension pack
when prompted
• Download the lesson files
−Extract the BigData_Lesson_files.7z onto a USB flash drive or
into a local folder on your machine
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 6
Getting Started with the VM image
• Open virtual box:
−click on file
−preferences
−In the general tab set the default machine folder to a location
big enough to store the virtual image.
−You will need 65GB free space.
−Click ok.
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 7
Getting Started with the VM image
• In virtual box:
−click on file
−Click on Import Appliance
−Browse to where you extracted the BigDataLite411 file, and
then choose import
−Click Next
−Choose the default appliance settings and choose import
−Agree to the license and click Ok
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 8
Setting up the VM image
• In virtual box:
−In Virtual box select the BigDataLite411.ova image and click
on settings
−Under System Setting within Virtual box increase the RAM to
the maximum allocation allowed without affecting the host
system performance. Needs to be a minimum of 11000MB
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 9
Working with External files
• If you extracted the lesson files to a USB flash drive do
the following:
−Insert the USB flash drive into the local machine
−In Virtual box select the BigDataLite411.ova image and click
on settings
−Choose USB, enable USB Controller and then add a USB filter
so that the settings are as follows (the name of your USB
device may be different):
−Click Ok.
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 10
Working with External files
• If you extracted the lesson files to a local folder do the
following:
−In Virtual box select the BigDataLite411.ova image and click
on settings
−Choose Shared Folders and click Add Folder
−Specify the folder you would like to share and select
Auto‐mount, and then click OK
−Click Ok
−To access the shared folder from Big DataLite, navigate to
/media/sf_<folder_name>
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 11
Working with the VM image
• Click start to load the image in virtual box
• All username/passwords for the VM are:
−Username: oracle
−Password: welcome1
• unless stated otherwise
• This will bring you to a Linux desktop
• Update Guest Additions when prompted to.
−Double click the Start Here document on the desktop that
provides details about the installed software, ids, passwords,
and more
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 12
Working with the VM image
• Click on start/stop services
• Ensure that your services match the diagram
• Use spacebar to enable/disable services
• Click Ok
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 13
Working with the VM image
• Click on Refresh Samples to update your files
• When asked to proceed type “y”
• When asked to configure a proxy you should be able to
type “n”
• The new samples are downloaded to /home/oracle/src
• Click Enter to exit the terminal
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 14
Working with the VM image
• You need to restart the VM to allow the changes.
−System
−Shut down
−Restart
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 15
Setting up Hive in the VM image
• You need to install JDBC drivers to allow Hive to work
on your system
• If you copied your lesson files to a USB flash drive:
−Double click Computer on the desktop
−Double click the USB drive icon
−Right click and copy the Cloudera_HiveJDBC4_2.5.15.1040
directory and paste it in the oracle’s Home directory on the
desktop
−Right click the copied file and choose extract here
−Go into the new 2.5.15.1040 GA directory choose
Cloudera_HiveJDBC4_2.5.15.1040 and again extract here
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 16
Setting up Hive in the VM image
• You need to install JDBC drivers to allow Hive to work
on your system
• If you copied your lesson files to a shared folder:
−Double click Computer on the desktop then filesystem
−Scroll down and access the media directory then your share
−Right click and copy the Cloudera_HiveJDBC4_2.5.15.1040
directory and paste it in the oracle’s Home directory on the
desktop
−Right click the copied file and choose extract here
−Go into the new 2.5.15.1040 GA directory choose
Cloudera_HiveJDBC4_2.5.15.1040 and again extract here
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 17
Setting up Hive in the VM image
• Update SQL Developer to use the Cloudera Hive JDBC
Drivers.
−Start SQL Developer Go to Tools -> Preferences
−Navigate to Database -> Third Party JDBC Drivers
−Add all of the jar files individually that are contained in the
Cloudera_HiveJDBC4_2.5.15.1040 directory to the Third-party
JDBC Driver Path. It should look like the picture below:
−Exit SQL developer
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 18
System setup complete
• Your Virtual machine is now ready
• Work through the following live labs to gain practical
experience in working with Big Data
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 19
Live lab 1 – Starting the Virtual Machine
• This lab has shown you how to work with a virtual
machine
• Identifying System resources
• Downloading and installing a virtual machine
• Configuring a virtual machine
• Configuring access to external files
• Updating settings
• Adding additional JDBC drivers
Copyright © 2020, Oracle and/or its affiliates. All rights reserved. 20