Data_Analytics_Syllabus_Created
Data_Analytics_Syllabus_Created
---------------------------------------------------
CO5: Describe the concept of R programming and implement analytics on Big data using R - K2, K3
Sources and nature of data, classification of data (structured, semi-structured, unstructured), characteristics
of data, introduction to Big Data platform, need of data analytics, evolution of analytic scalability, analytic
process and tools, analysis vs reporting, modern data analytic tools, applications of data analytics.
Need, key roles for successful analytic projects, various phases of data analytics lifecycle - discovery, data
(08 Lectures)
Regression modeling, multivariate analysis, Bayesian modeling, inference and Bayesian networks, support
vector and kernel methods, analysis of time series: linear systems analysis & nonlinear dynamics, rule
induction, neural networks: learning and generalization, competitive learning, principal component analysis
and neural networks, fuzzy logic: extracting fuzzy models from data, fuzzy decision trees, stochastic search
methods.
(08 Lectures)
Introduction to streams concepts, stream data model and architecture, stream computing, sampling data in a
stream, filtering streams, counting distinct elements in a stream, estimating moments, counting oneness in a
window, decaying window, Real-time Analytics Platform (RTAP) applications, Case studies - real time
(08 Lectures)
Mining frequent itemsets, market based modelling, Apriori algorithm, handling large data sets in main
memory, limited pass algorithm, counting frequent itemsets in a stream, clustering techniques: hierarchical,
K-means, clustering high dimensional data, CLIQUE and ProCLUS, frequent pattern based clustering
(08 Lectures)
MapReduce, Hadoop, Pig, Hive, HBase, MapR, Sharding, NoSQL Databases, S3, Hadoop Distributed File
Systems.
Visualization:
R graphical user interfaces, data import and export, attribute and data types, descriptive statistics, exploratory
(08 Lectures)
1. Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman, Mining of Massive Data Sets, Cambridge University
Press.
3. Bill Franks, Taming the Big Data Tidal Wave, Wiley & Sons.
5. David Dietrich et al., Data Science and Big Data Analytics, EMC Education.