Big Data Analytics Syllabus

This document outlines the course objectives, outcomes, skills, and activities for a course on Big Data Analytics. The course aims to provide an overview of big data storage, retrieval, and processing technologies. Students will learn to use frameworks like Hadoop, Hive, and Spark to efficiently store, process, and analyze big data. They will develop MapReduce applications and learn to solve data intensive problems using Pig and Spark. Upon completing the course, students will be able to build scalable distributed systems with Hadoop, write MapReduce applications, and design applications using Hive, Pig and Spark for big data use cases.

Uploaded by

Saiyed Faiayaz Waris

16IT445 BIG DATA ANALYTICS

Course Description and Objectives:

This course gives an overview of Big Data, i.e., the storage, retrieval, and processing of big data.
The focus will be on the “technologies”, i.e., the tools and algorithms available for
storing and processing Big Data, and on a variety of “analytics”.
Course Outcomes:
1. Understand Big Data and its analytics in the real world.
2. Use Big Data frameworks like Hadoop and NoSQL to efficiently store and
process Big Data to generate analytics.
3. Design algorithms to solve data-intensive problems using the MapReduce paradigm.
4. Design and implement Big Data analytics using Pig and Spark to solve
data-intensive problems and to generate analytics.
5. Analyse Big Data using Hive.

The student will be able to:

• Understand the theoretical issues involved in Big Data system design, such as the
curse of dimensionality.
• Become familiar with the major approaches in Big Data Analytics.

Skills:
Upon completion of this course, students will be able to do the following:
o Build and maintain reliable, scalable, distributed systems with
Apache Hadoop.
o Write MapReduce-based applications.
o Design and build Big Data applications using Hive and Pig.
o Apply tips and tricks for Big Data use cases and solutions.
Activities:
• Install Hadoop and develop applications on Hadoop
• Develop MapReduce applications
• Develop applications using Hive/Pig/Spark
Unit-I

Introduction to big data: Data, Characteristics of data and Types of digital data, Sources
of data, Working with unstructured data, Evolution and Definition of big data,
Characteristics and Need of big data, Challenges of big data

Big data analytics: Overview of business intelligence, Data science and Analytics, Meaning
and Characteristics of big data analytics, Need of big data analytics, Classification of
analytics, Challenges to big data analytics, Importance of big data analytics, Basic
terminologies in big data environment

Unit-II
Introduction to Hadoop: Introducing Hadoop, Need of Hadoop, Limitations of RDBMS,
RDBMS versus Hadoop, Distributed Computing Challenges, History of Hadoop, Hadoop
Overview, Use Case of Hadoop, Hadoop Distributors, HDFS (Hadoop Distributed File
System), Processing Data with Hadoop, Managing Resources and Applications with Hadoop
YARN (Yet Another Resource Negotiator), Interacting with Hadoop Ecosystem.

Unit-III

Introduction to MapReduce Programming: Introduction, Mapper, Reducer, Combiner,
Partitioner, Searching, Sorting, Compression, Real-time applications using MapReduce.
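As a rough illustration of how the Mapper, Combiner, Partitioner, and Reducer roles listed in this unit fit together, the following is a minimal single-process sketch in plain Python. It is not Hadoop code and is not taken from the course material: the function names, the CRC32-based partitioning, and the sample input are all assumptions chosen for illustration.

```python
from collections import defaultdict
import zlib


def mapper(line):
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def combiner(pairs):
    """Pre-aggregate counts from one mapper's output to reduce shuffle volume."""
    local = defaultdict(int)
    for word, count in pairs:
        local[word] += count
    return list(local.items())


def partitioner(key, num_reducers):
    """Choose a reducer for a key; CRC32 is an assumed, repeatable hash."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def reducer(key, values):
    """Sum all counts that arrived for one key."""
    return key, sum(values)


def run_word_count(lines, num_reducers=2):
    """Simulate map -> combine -> partition/shuffle -> reduce in one process."""
    # Shuffle: one bucket per reducer, grouping values by key.
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for line in lines:  # each line stands in for one input split
        for word, count in combiner(mapper(line)):
            buckets[partitioner(word, num_reducers)][word].append(count)
    # Reduce: process keys in sorted order within each partition.
    result = {}
    for bucket in buckets:
        for word in sorted(bucket):
            key, total = reducer(word, bucket[word])
            result[key] = total
    return result


if __name__ == "__main__":
    sample = ["big data big analytics", "data analytics with hadoop"]
    print(run_word_count(sample))
```

In a real Hadoop job these roles run on different machines and the framework performs the shuffle and sort; the sketch only mirrors the data flow so the responsibilities of each component are easier to see.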

Unit-IV

Introduction to Hive: Introduction to Hive, Hive Architecture, Hive Data Types, Hive File
Format, Hive Query Language (HQL), User-Defined Function (UDF) in Hive.

Introduction to Pig: Introduction to Pig, The Anatomy of Pig, Pig on Hadoop, Pig
Philosophy, Use Case for Pig: ETL Processing, Pig Latin Overview, Data Types in Pig,
Running Pig, Execution Modes of Pig, HDFS Commands, Relational Operators, Piggy
Bank, Word Count Example using Pig, Pig at Yahoo!, Pig versus Hive

Unit-V

Spark: Introduction to data analytics with Spark, Programming with RDDs, Working with
key/value pairs.

Text Books:
1. Seema Acharya, Subhashini Chellappan, “Big Data Analytics”, Wiley.
2. Holden Karau, Andy Konwinski, Patrick Wendell, Matei Zaharia, “Learning Spark:
Lightning-Fast Big Data Analysis”, O'Reilly Media, Inc.
Reference Books:
1. Boris Lublinsky, Kevin T. Smith, Alexey Yakubovich, “Professional Hadoop
Solutions”, Wiley, ISBN: 9788126551071, 2015.
2. Chris Eaton, Dirk deRoos et al., “Understanding Big Data”, McGraw Hill, 2012.
3. Tom White, “Hadoop: The Definitive Guide”, O'Reilly, 2012.
4. Vignesh Prajapati, “Big Data Analytics with R and Hadoop”, Packt Publishing, 2013.
