0% found this document useful (0 votes)
8 views17 pages

B.tech Minor Syllabus-CSE (Data Science) - Final

The document outlines the Bachelor of Technology (B.Tech.) program in Computer Science and Engineering (Data Science) at B V Raju Institute of Technology, detailing the curriculum, course structure, and objectives for students admitted from 2020-2021. It includes information on various courses such as Data Science for Engineers, Data Visualization Techniques, and Big Data Engineering, along with their respective outcomes and practical components. The institute's vision, mission, and quality policy emphasize the commitment to providing quality education and developing competent engineers and technology innovators.

Uploaded by

23211a0444
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views17 pages

B.tech Minor Syllabus-CSE (Data Science) - Final

The document outlines the Bachelor of Technology (B.Tech.) program in Computer Science and Engineering (Data Science) at B V Raju Institute of Technology, detailing the curriculum, course structure, and objectives for students admitted from 2020-2021. It includes information on various courses such as Data Science for Engineers, Data Visualization Techniques, and Big Data Engineering, along with their respective outcomes and practical components. The institute's vision, mission, and quality policy emphasize the commitment to providing quality education and developing competent engineers and technology innovators.

Uploaded by

23211a0444
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

Estd: 1997 EAMCET Code: BVRI

BACHELOR OF TECHNOLOGY (B.Tech.)


MINOR DEGREE PROGRAMME
[Choice Based Credit System (CBCS)]
R20 Curriculum

(Applicable for the batches admitted from 2020-2021)

CSE (DATA SCIENCE)

Department of CSE (DATA SCIENCE)


B V RAJU INSTITUTE OF TECHNOLOGY
(UGC-Autonomous)
Approved by AICTE, Accredited by NAAC & NBA
& Affiliated to JNTUH, Hyderabad

Orchard Park Campus, Vishnupur


Narsapur, Medak Dist. – 502 313, Telangana, India
(Sri Vishnu Educational Society)

Email: [email protected] Website: www.bvrit.ac.in


INSTITUTE

VISION
To create and nurture competent engineers and managers who would be enterprise
leaders throughout the world with a sound background in ethics and societal
responsibilities.

MISSION

We are committed to providing a positive and professional learning environment


where all students are inspired to strive for excellence in becoming competent
engineers, technology innovators and leaders in a global society through a cohesive
network of parents, students, college staff and industry.

QUALITY POLICY

We aim at excellence in Technical Education through continual improvements and are


committed to provide responsible technocrats for effective nation building through
Imparting quality Education & Training

Developing students with a disciplined and Integrated personality

Facilitating faculty and supporting staff to update their knowledge


and skills to match the individual and technological developments
Proposed Course Structure and Tentative List of Courses

Year / Sem Course Name Credit

III Year, I Sem Data Science for Engineers 3

III Year, I Sem Data Science Laboratory 1.5

III Year, II Sem Data Visualization 3

III Year, II Sem Data Visualization Laboratory 1.5

IV Year, I Sem Big Data Engineering 3


(Conventional / MooC)

IV Year, I Sem Web and Social Media Analytics 3

IV Year, II Sem Data Science Tools 3


(Conventional / MooC)
Total 18

Proposed Outcomes: The students will be able to,

1. Analyze the industry trends for data driven technologies.

2. Build data driven solutions for various industry verticals.

3. Evaluate the risk associated with real world problems.

4. Create modern research solutions for interdisciplinary domains.


III Year B.Tech. CSE (DS) I Sem L T P C
3 - - 3
DATA SCIENCE FOR ENGINEERS

Pre-requisites: Probability and Statistics, Python for Problem Solving

Course objectives: The main objective of the course is to make students learn about

1. The basic concepts of Python in data manipulation and analysis.


2. Students will be able to write and execute Python code.
3. To write short scripts to perform data preprocessing, exploratory data analysis and data
visualization.

UNIT-1: Introduction to data science [08 Lectures]

What is data, types of data, What is data science, Fundamentals of data science, Data science
life cycle, Why data science is important, Applications of data science, Why Python is necessary
for data science

UNIT-2: Getting started with Python: [10 Lectures]

Jupyter/pycharm/spyder or any other python tool set up and installation. Basics of Python
including data types, operators, variables, expressions, control structures using sample dataset,
objects and functions. Python sequence data structures including String, Array, List, Tuple, Set
and Dictionary. Introduction to various python libraries for data science

UNIT-3: Basic data Processing: [10 Lectures]

Introduction to DataFrames in Pandas, learning to access elements with indexes, Re-indexing


in pandas Series and Dataframes, Data preparation. Numerical Computing with NumPy, going
from Python list to Numpy arrays, working with multi-dimensional array, array operations.
Several scientific numerical routines through SciPy.

UNIT-4: Exploratory data analysis and Data preprocessing: [10 Lectures]

Data loading, dealing with missing values and outliers, data wrangling, filtering data, Data
Normalization, Data Formatting, data cleaning, Web scraping with beautiful soup.

UNIT-5: Data Visualization: [10 Lectures]

Basic visualizations with Matplotlib, Advanced visualizations with Seaborn, Plotting images,
graphs and grids of charts.

Course Outcomes (CO):

1. Apply various Python data structures to effectively manage various types of data.
2. Learn the fundamentals of some of the widely used python packages and apply them
into data analytics.
3. Design applications applying various operations for data cleansing and transformation.
4. Describe the various areas where data science is applied
5. Design visualizations using various Python Libraries.
Text Book:

1. Python for data science for dummies 2nd Edition, John Paul Mueller, Luca Massaron, and
Wiley 2. Programming through Python, M. T. Savaliya, R. K. Maurya, G. M. Magar, STAREDU
Solutions.

Reference Books:

1. Pandas for everyone: Python Data Analysis, Daniel Y. Chen, Pearson

2. Introducing Data Science: Big Data, Machine Learning, and More, Using Python Tools, Davy
Cielen, Arno D.B. Meysman, et al., Minning

3. Applied Data Science with Python and Jupyter: Use powerful industry-standard tools to
unlock new, actionable insights from your data.
III Year B.Tech. CSE (DS) I Sem L T P C
- - 3 1.5
DATA SCEINCE LABARATORY

COURSE OBJECTIVES: The students will learn the following

1. How to program in Python and how to use its packages for effective data analysis.
2. Install and configure software necessary for a statistical programming environment.
3. Carry out basic data pre-processing and wrangling in to explore practical issues
4. Read in data into the python environment from different sources
5. Implement supervised/unsupervised machine learning techniques

List of Practicals/Tutorials:

WEEK-1. Write a program to create a list, insert elements into the list and sort it in ascending
order.
WEEK-2. Write a program to create a dictionary of 10 elements, change/delete the values off
ewkeys and display the dictionary before and after the updates.
WEEK-3. Write a program to create a tuple and a list. Convert the list to tuple and display
the elements of both. Write the program to remove the duplicate element of the list.
WEEK-4. Write a program to perform all basic data pre processing steps on the given data
set.
WEEK-5. Write a program to perform exploratory data analysis on the given dataset.
WEEK-6. Develop programs to learn the concept of Modules and packages.
WEEK-7. Develop a program to learn concept of array and numpy module.
WEEK-8. Write a NumPy program to convert a list of numeric value into a one dimensional
NumPy array. And perform all operations on that array.
WEEK-9. Write a NumPy program to find the union of two arrays. Union will return the unique,
sorted array of values that are in either of the two input arrays.
WEEK-10. Write a Pandas program to convert a NumPy array to a Pandas series. Also write a Pandas
program to calculate the frequency counts of each unique value of a given series.
WEEK-11. Write a Pandas program to read a dataset from diamonds DataFrame and modify the
default columns values and print the first 6 rows. Also find the number of rows and
columns and data type of each column of diamonds Data frame.
WEEK-12. Consider dataset with student name, gender, Enrollment no, 4 semester result with
marks of each subject, his mobile number, and city. Implement following in Python.
Plot various graphs and chart to visualize students ‘data.

COURSE OUTCOMES: After completion of the course, the students would be able to:

1. Choose appropriate data pre-processing techniques


2. Select appropriate algorithms for solving of real-world problems.
3. Use machine learning techniques in high-performance computing environment to solvereal-
world problems.
4. Evaluate the performance of machine learning models.
5. Identify the outliers if any in the data set.
TEXT BOOKS:

1. Shai Shalev-Shwartz, Shai Ben-David, “Understanding Machine Learning: From Theory to


Algorithms”, Cambridge University Press, 2014.

2. John Mueller and Luca Massaron, “Machine Learning for Dummies”, John Wiley & Sons,
2016.
III Year B.Tech. CSE (DS) II Sem L T P C
3 - - 3
DATA VISUALIZATION TECHNIQUES

Course Objective: To understand various data visualization techniques.

1. Understand the importance of data visualization and learns fundamentals of data


visualization
2. Employ best practices in data visualization to develop charts, maps, tables, and other
visual representations of data.
3. Describe data analysis and visualization by type of data, organizing by number and type
of variables.
4. Discuss various design issues that arise when assembling data visualizations.
5. Build data visualizations, dashboards and stories to support relevant communication for
diverse audiences.

UNIT – I : Introduction and Data Foundation: [08 Lectures]

Basics - Relationship between Visualization and Other Fields - The Visualization Process -
Pseudo code Conventions - The Scatter plot. Data Foundation - Types of Data - Structure
within and between Records - Data Preprocessing - Data Sets

UNIT – II: Foundations for Visualization: [08 Lectures]

Visualization stages - Semiology of Graphical Symbols - The Eight Visual Variables - Historical
Perspective - Taxonomies - Experimental Semiotics based on Perception. Gibson‘s Affordance
theory – A Model of Perceptual Processing.

UNIT – III: Visualization Techniques: [12 Lectures]

Spatial Data: One-Dimensional Data - Two-Dimensional Data – Three Dimensional Data -


Dynamic Data - Combining Techniques. Geospatial Data: Visualizing Spatial Data- Visualization
of Point Data -Visualization of Line Data - Visualization of Area Data - Other Issues in Geospatial
Data Visualization Multivariate Data: Point-Based Techniques - Line- Based Techniques -Region-
Based Techniques - Combinations of Techniques – Trees Displaying Hierarchical Structures –
Graphics and Networks- Displaying Arbitrary Graphs/Networks.

UNIT – IV: Interaction Concepts and Techniques: [12 Lectures]

Text and Document Visualization: Introduction - Levels of Text Representations - The Vector
Space Model - Single Document Visualizations –Document Collection Visualizations - Extended
Text Visualizations Interaction Concepts: Interaction Operators - Interaction Operands and
Spaces - A Unified Framework. Interaction Techniques: Screen Space - Object-Space -Data Space
-Attribute Space- Data Structure Space - Visualization Structure – Animating Transformations -
Interaction Control

UNIT – V: Research Directions in Virtualizations: [08 Lectures]

Steps in designing Visualizations – Problems in designing effective Visualizations- Issues of Data.


Issues of Cognition, Perception, and reasoning. Issues of System Design Evaluation, Hardware
and Applications.
Course Outcomes:

1. Visualize the objects in different dimensions.

2. Design and process the data for Virtualization.

3. Apply the visualization techniques in physical sciences, computer science, applied


mathematics and medical science.

4. Apply the virtualization techniques for research projects.

5. Analyze the research outcomes on real-world datasets.

TEXT BOOKS:

1. Matthew Ward, Georges Grinstein and Daniel Keim, “Interactive Data Visualization
Foundations, Techniques, Applications”, 2010.

2. Colin Ware, “Information Visualization Perception for Design”, 2nd edition, Margon
Kaufmann Publishers, 2004.

REFERENCE BOOKS:

1. Robert Spence “Information visualization – Design for interaction”, Pearson Education, 2nd

Edition, 2007.

2. Alexandru C. Telea, “Data Visualization: Principles and Practice,” A. K. Peters Ltd, 2008
III Year B.Tech. CSE (DS) II Sem L T P C
- - 3 1.5
DATA VISUALIZATION TECHNIQUES LAB

Course Objectives:

1. Understand the various types of data, apply and evaluate the principles of data
visualization.
2. Acquire skills to apply visualization techniques to a problem and its associated dataset.
3. Discuss various design issues that arise when assembling data visualizations.
4. Build data visualizations, dashboards and stories to support relevant communication
for diverse audiences.

List of Experiments:

WEEK-1. Acquiring and plotting data.


WEEK-2. Statistical Analysis – such as Multivariate Analysis, PCA, LDA, Correlation
regression and analysis of variance.
WEEK-3. Financial analysis using Clustering, Histogram and HeatMap.
WEEK-4. Time-series analysis – stock market.
WEEK-5. Visualization of various massive dataset - Finance - Healthcare - Census –
Geospatial.
WEEK-6. Visualization on Streaming dataset (Stock market dataset, weather forecasting).
WEEK-7. Market-Basket Data analysis-visualization.
WEEK-8. Text visualization using web analytics.
WEEK-9. Google API with maps
WEEK-10. Network Visualization using Gephi
WEEK-11. Visualization of reconstruction network using Qlickview
WEEK-12. Dash Board Creation using Tableau

Course Outcomes:

1. Identify the different data types, visualization types to bring out the insight.

2. Relate the visualization towards the problem based on the dataset to analyze and bring out
valuable insight on a large dataset.

3. Demonstrate the analysis of a large dataset using various visualization techniques and tools.

4. Identify the different attributes and showcasing them in plots. Identify and create various
visualizations for geospatial and table data.

5. Ability to create and interpret plots using Python and various data visualization tools as well.
TEXT BOOKS:

1. Matthew Ward, Georges Grinstein and Daniel Keim, “Interactive Data Visualization
Foundations, Techniques, Applications”, 2010.

2. Colin Ware, “Information Visualization Perception for Design”, 2nd edition, Margon
Kaufmann Publishers, 2004.

REFERENCE BOOKS:

1. Robert Spence “Information visualization – Design for interaction”, Pearson Education, 2 nd


Edition, 2007.

2. Alexandru C. Telea, “Data Visualization: Principles and Practice,” A. K. Peters Ltd, 2008.
IV Year B.Tech. CSE (DS) I Sem L T P C
3 - - 3
BIG DATA ENGINEERING

Course Objectives:

1. Learn the fundamental components of big data storage and processing techniques.
2. The purpose of this course is to provide the students with the knowledge of Big Data
Analytics principles and techniques.
3. This course is also designed to give an exposure of the frontiers of Big Data Analytics
4. Learn the fundamental components of big data storage and processing techniques.
5. Explore HADOOP Distributed File System and YARN resource manager

UNIT – I Introduction to Database System: [08 Lectures]

Introduction-Database System Applications, Purpose of Database Systems, View of Data - Data


Abstraction, Instances and Schemas, Data Models, Database Languages - DDL, DML, Database
System Structure, History of Data Base Systems.

UNIT – II Big Data and its Importance: [08 Lectures]

Four V’s of Big Data – Drivers for Big Data –Introduction to Big Data Analytics – Big Data Analytics
applications.

UNIT – III Big Data Technologies: [10 Lectures]

Hadoop’s Parallel World – Data discovery – Open source technology for Big Data Analytics – cloud
and Big Data –Predictive Analytics – Mobile Business Intelligence and Big Data

UNIT – IV Introduction Hadoop: [10 Lectures]

Big Data – Apache Hadoop & Hadoop Eco System – Moving Data in and out of Hadoop –
Understanding inputs and outputs of MapReduce - Data Serialization.

UNIT – V Hadoop Architecture: [12 Lectures]

Hadoop: RDBMS Vs Hadoop, Hadoop Overview, Hadoop distributors, HDFS, HDFS Daemons,
Anatomy of File Write and Read., Name Node, Secondary Name Node, and Data Node, HDFS
Architecture, Hadoop Configuration, Map Reduce Framework, Role of HBase in Big Data
processing, HIVE, PIG.

Courses Outcomes: After completion of this course students could able to

1. Explain the foundations, definitions, and challenges of Big Data and various Analytical
tools.
2. Understand the differentiations of HADOOP and Map reduce, NOSQL
3. Understand the fundamentals of various big data analytics techniques.
4. Describe distributed data storage and management in HDFS
5. Ability to understand the importance of Big Data in Social Media and Mining.
TEXT BOOKS:

1. Big Data Analytics, Seema Acharya, Subhasini Chellappan, Wiley 2015.


2. Big Data, Big Analytics: Emerging Business Intelligence and Analytic Trends for Today’s
Business, Michael Minelli, Michehe Chambers, 1st Edition, Ambiga Dhiraj, Wiely CIO Series, 2013.

REFERENCE BOOKS:

1. Big Data and Business Analytics, Jay Liebowitz, Auerbach Publications, CRC press (2013)
2. Using R to Unlock the Value of Big Data: Big Data Analytics with Oracle R Enterprise and
Oracle R Connector for Hadoop, Tom Plunkett, Mark Hornick, McGraw-Hill/Osborne Media
(2013), Oracle press.
3. Professional Hadoop Solutions, Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, Wiley,
ISBN: 9788126551071, 2015.
4. Understanding Big data, Chris Eaton, Dirk deroos et al. McGraw Hill, 2012.
IV Year B.Tech. CSE (DS) I Sem L T P C
3 - - 3
WEB AND SOCIAL MEDIA ANALYTICS

COURSE OBJECTIVES: The students will learn the following:

1. Introduction to Responsive web design and media queries


2. Learn the Fluid layouts and responsive images in the web page.
3. HTML 5 for responsive web design
4. CSS selectors, and pseudo codes for responsive web design
5. Create a web page which is having responsive content in it.

Unit-1: HTML: [08 Lectures]

HTML5 Web Workers Client Side Scripting: Introduction to java script: Java script language declaring
variables, scope of variables, functions, event handlers (on click, on submit etc.), Document Object
Model, Form validation.

Unit-2: XML: [08 Lectures]

Introduction to XML, Defining XML tags, their attributes and values, Document Type Definition,
Displaying XML documents with CSS, XML Schemas, DOM and SAX Parser

Unit-3: Social Media Analytics (SMA): [12 Lectures]

Social media landscape, Need for SMA; SMA in Small organizations; SMA in large organizations;
Application of SMA in different areas

Web analytics tools and techniques:


Clickstream analysis, A/B testing, online surveys, Use of Google Analytics; Web crawling and
Indexing; Natural Language Processing Techniques for Micro Text Analysis

Unit-4: Facebook Analytics: [12 Lectures]

Introduction, parameters, demographics. Analyzing page audience. Reach and Engagement analysis.
Post-performance on FB, Use of Facebook Business Manager; Social campaigns. Measuring and
Analyzing social campaigns, defining goals and evaluating outcomes, Network Analysis.
(LinkedIn, Instagram, YouTube, Twitter etc.)

Unit-5: Processing and Visualizing Web Data: [10 Lectures]

Influence Maximization, Link Prediction, Collective Classification. Applications in Advertising


and Game Analytics (Use of tools like Unity30 / PyCharm). Introduction to Python Web
Programming, collecting and analyzing social media data; visualization and exploration.

COURSE OUTCOMES: After completion of this course students could able to


1. To provide overview and the need for web analytics.
2. To give the measuring strategy and effects of web analytics
3. To understand segmentation and conversions
4. Identify the various types of testing and reporting methods.
5. To provide exposure to usage of web analytic tools.
Textbook:
1. Mathew Ganis, AvinashKoihrkar,Social Media Analytics,IBM Press,2015
2. Hansen, Derek, Ben Sheiderman, Marc Smith, Analyzing Social Media Networks with
NodeXL: Insights from a Connected World. Morgan Kaufmann, 2010.
References:
1. Avinash Kaushik, Web Analytics 2.0: The Art of Online Accountability, Wiley Publishers,
2009.
2. Easley, D., Kleinberg, J., Networks, Crowds, and Markets: Reasoning About a Highly
Connected World. New York: Cambridge University Press, 2010.
http://www.cs.cornell.edu/home/kleinber/networks-book/
IV Year B.Tech. CSE (DS) II Sem L T P C
3 - - 3
DATA SCIENCE TOOLS

Course objectives:

Learning various data science tools to equip with the technologies that fulfil the requirement of
the current industries.

Unit-1: MS Excel [10 Lectures]

Conditional Formatting, Sparkline and Number Formats, macros, drop down lists, Mastering
charting techniques, Create an Interactive Dashboard

Unit-2: MySQL [10 Lectures]

SQL and MySQL, Basic DDL and DML statements, joins and views) and MongoDB database
(Importing-exporting and querying data, creating and manipulating documents, CRUD
operation, indexing and aggregation pipeline

Unit-3: R Studio [10 Lectures]

Importing data, data inputting, data visualization, manipulating and managing data, statistical
modeling, R and database.

Unit-4: Power BI [10 Lectures]

Introduction to Power BI, Power BI Desktop, Data Analysis Expressions, Data Visualization,
QlikView Workbench

Unit-5: Tableau [10 Lectures]

Intro to Tableau, Common charts, Transformations and calculations, Interactions, Data


storytelling

Course Outcomes (CO): After completion of this course students could able to

1. Make use of Excel for Data Analysis


2. Compare and work with SQL and NoSQL databases.
3. Analyze data with the various packages and libraries supported by R.
4. To develop dashboard for understanding and handling data through different data
science tools.
5. To learn the strategies adopted to analyze the data in various companies for better
decision making.

Text Books:

1. Analyzing Data with Microsoft Power BI and Power Pivot for Excel, Marco Russo, Alberto
Ferrari, PHI.

2. Learn Power BI: A beginner guide to developing interactive business intelligence solutions
using Microsoft Power BI, Greg Deckler.
Reference Books:

1. Communicating Data with Tableau: Designing, Developing, and Delivering Data


Visualizations, Ben Jones, O′Reilly publisher
2. Data Analytics with R, Bharti Motwani, Wiley Publication
3. Essentials of R for Data Analytics, Saroj Dahiya Ratnoo, Himmat Singh Ratnoo, Wiley
Publication

You might also like