0% found this document useful (0 votes)

193 views4 pages

Lahore School of Economics Data Analysis and Statistical Methods Winter 2020

This document contains an assignment from Lahore School of Economics for a course on Data Analysis and Statistical Methods. It includes questions about sources of big data, categories of business analytics, sources of big data analytics, and exploring two datasets related to Pakistan elections and yellow pages businesses. The assignment provides context, variable information, and suggestions for further analysis of the election and business datasets.

Uploaded by

Zohraiz Malik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

193 views4 pages

Lahore School of Economics Data Analysis and Statistical Methods Winter 2020

Uploaded by

Zohraiz Malik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Lahore School of Economics

Data Analysis and Statistical Methods

Winter 2020

Name: Zohraiz Mubarik Section:D_____________

Date: 02/09/2020 Score:___________________

Assignment 2
Q1: Explain different sources of Big Data.
 Transactional data : The main purpose of Transaction Processing System is to capture the
information and update the data for the operational decisions in an organization. There
are two ways to process transactions namely Batch processing which processes the data
as a single unit over a period of time and Real Time Processing System where data are
processed immediately.
 Social media data: People almost at every possible location in the world share their
information through social media which helps customers to make purchasing decisions
by having a glance at the feedback, customer complaints and miscellaneous services
provided with a product. Sentiments of the consumers are also expressed on social media
which help companies to make production decisions
 Internet Applications: There are numerous online ecommerce websites (such as Amazon,
Flipkart, Alibaba, eBay, Paytm, bookmyshow.com etc.) search engines (Google, Yahoo,
Bing, etc.) or online banking applications where millions of users are logging in daily and
using them. During their searches or transactions various click streams and logs get
generated which could be of value.
 Data from electronic instruments: There are numerous electronic media such as smart
phones, RFID tags, GPS Sensors, machines connected to networks, scanners, cameras
which generate high volumes of datasets. These are other sources of big data.

Q2: Explain the categories of Business Analytics.

Business analytics can be classified into 3 categories based on the purpose of use – descriptive,
predictive and prescriptive.

 Descriptive analytics explains a phenomenon from past data through reports, dashboards,
which helps in understanding what has happened.
 Predictive analytics helps us to understand what can happen. It supports predictions based
on past data, correlations between variables and patterns.
 Prescriptive analytics helps to understand different outcomes under different scenarios. It
consists of various tools such as optimization, simulations, what-if analysis scenarios
with change in input set of parameters.

Q3: Explain sources of Big Data analytics.

 Text Analytics consists Document representation, enterprise search system, search

engines, relevance of feedback, query processing, billions of searches of customer for a
particular product on google, searches on Amazon’s website provide indicator of
intention to purchase the product by customer.
 Audio and Video Analytics Audio analytics takes seconds to process audio through
technology mainly for safety purpose in any organization and can track a wide range of
sound in the environment. Video analytics is used to process and analyze videos from
variety of fields and industries. This helps in extracting events helpful for taking
operational decisions.
 Web Analytics Online retailer Amazon uses data mining techniques to mine the big data
such as click streams, web searches, order history, online etc. to derive intelligence. This
intelligence is used to make decisions about product promotions and it is working
successfully for companies such as Amazon.
 Network Analytics provides information about devices which are connected to network
and how they are interacting with each other. This information helps in designing
network policies, to make actionable decisions that help in improving business
performance and reducing costs.

Q4: Explore the “Predict Pakistan Elections 2018” dataset retrieved from
(https://www.kaggle.com/zusmani/predict-pakistan-elections-2018/kernels). Explain the
context, datatype, time regime, variable information, metadata etc. Discuss few questions
that are already answered (Hint: Kernel activity “Voter Behavior and Voting Reasons”
and “2002/2008/2013 Elections Visualizations”) and what further can be explored from it.

We predict the historic voters’ turn out in this election of 57-61%. Historically the average turn
out is 45% since 1977 (lowest 35% in 1997, highest 55% in 1977 and 53% in last elections).
Pakistan ranked 164th out of 169 nations in voters’ turn out; Australia being the first with 94.5%
turn out.

Voters’ participation in the country is very diverse, historically Musakhel and Kohlu yield less
than 25% whereas Layyah and Khanewal yield more than 60% and everything else is in between.
Punjab has the highest and Balochistan has the lowest voters’ turnout.
The contest will bring 3,675 candidates for 272 national assembly seats, that is 13 candidates on
average per seat. PTI has unleashed 244 candidates (highest in number by any political party).
Islamabad will see 76 candidates just for 3 seats fighting to rule the capital that guarantees the
psychological edge.

There a quite few interesting facts about these elections, for example we will see the highest
number of Lotas (candidates who often change their party affiliation) ever. PTI believes to win
the election no matter what may come while the survey pundits predicts the PML(N) lead of at
least 13% over PTI.

The history of elections and the charges of corruption, voters’ fraud, ghost votes, interferences
by deep state or violence go hand by hand. There is (almost) no country in the world without the
fear or accusations of such incidents in their elections.

We are releasing the complete National Assembly Elections’ Results dataset for 2002, 2008 and
2013 elections in CSV files for public and calling all data scientists, international observers and
journalists out there to help us achieve our inspirations.

Time Regime-Data collected is in a panel format which holds information from the timeline
2013 to 2018. The data set scrutinizes election results for the national assembly of Pakistan for
2002, 2008 and 2013.

Variable-The file contains Seat, Constituency, Candidates Name, Party Affiliation, Votes, Total
Valid Votes, Total Rejected Votes, Total Votes, Total Registered Voters and Turnout variables
for each seat.

Metadata-this data analyses different aspects of Pakistan’s election schedule. Canada, United
States, Pakistan and India are contributors of this data.

Q5: Explore the dataset “Yellow Pages of Pakistan”

Retrieved from (https://www.kaggle.com/mpasha96/yellow-pages-of-pakistan). Explain the
context, datatype, time regime, variable information, metadata etc. What analysis would
you suggest to obtain good insights from data.

Dataset to enable people to explore local businesses of Pakistan. This dataset might help the local
community in gathering information of local businesses. This also contributes in local economic
development of Pakistan by bridging traders and manufacturers.

Geography: Pakistan

Time period: 1990-2017

Dataset: The dataset contains information of approx 67000 businesses in Pakistan (~5000 in each
csv file)
Features: The dataset has total 7 columns

• Business Name

• Contact Name

• Telephone

• Website

• Services (Description of types of products/services provided by the business)

• Address

• City

Datatype-Cross Sectional data

Data Analytics for Beginners: Introduction to Data Analytics
From Everand
Data Analytics for Beginners: Introduction to Data Analytics
Anthony S. Williams
4/5 (19)
MANUAL Sierra Circular de Mano Black & Decker 7390, 7391, 7392
No ratings yet
MANUAL Sierra Circular de Mano Black & Decker 7390, 7391, 7392
8 pages
Management Information System
From Everand
Management Information System
IntroBooks Team
No ratings yet
Analytics and Big Data for Accountants
From Everand
Analytics and Big Data for Accountants
Jim Lindell
No ratings yet
Introduction to Statistical and Machine Learning Methods for Data Science
From Everand
Introduction to Statistical and Machine Learning Methods for Data Science
Carlos Andre Reis Pinheiro
No ratings yet
Geonav 4c PDF
No ratings yet
Geonav 4c PDF
136 pages
Essentials of Data Analysis
From Everand
Essentials of Data Analysis
Agasti Khatri
No ratings yet
Data Analytics and Data Processing Essentials
From Everand
Data Analytics and Data Processing Essentials
gareth thomas
No ratings yet
Data Analytics with Python: Data Analytics in Python Using Pandas
From Everand
Data Analytics with Python: Data Analytics in Python Using Pandas
Frank Millstein
3/5 (1)
Mastering Data Mining Techniques
From Everand
Mastering Data Mining Techniques
Dhaanyalakshmi Ahuja
No ratings yet
Business Analytics: Leveraging Data for Insights and Competitive Advantage
From Everand
Business Analytics: Leveraging Data for Insights and Competitive Advantage
Ronald BLaha
No ratings yet
"Big Data Science" Basic Concepts and Applications
From Everand
"Big Data Science" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
The Analyst's Atlas: Navigating the Financial Data Sphere
From Everand
The Analyst's Atlas: Navigating the Financial Data Sphere
Manish Tomar
No ratings yet
Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked
From Everand
Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
From Everand
Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
Mathangi Sri Ramachandran
No ratings yet
Data-Driven Business Strategies: Understanding and Harnessing the Power of Big Data
From Everand
Data-Driven Business Strategies: Understanding and Harnessing the Power of Big Data
Steven Vollmer
No ratings yet
Making Big Data Work for Your Business: A guide to effective Big Data analytics
From Everand
Making Big Data Work for Your Business: A guide to effective Big Data analytics
Sudhi Sinha
No ratings yet
Comprehensive Guide to Implementing Data Science and Analytics: Tips, Recommendations, and Strategies for Success
From Everand
Comprehensive Guide to Implementing Data Science and Analytics: Tips, Recommendations, and Strategies for Success
Rick Spair
No ratings yet
Synthetic Data Generation: A Beginner’s Guide
From Everand
Synthetic Data Generation: A Beginner’s Guide
Robert Johnson
No ratings yet
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
From Everand
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
Zemelak Goraga
No ratings yet
DATA ANALYSIS AND DATA SCIENCE: Unlock Insights and Drive Innovation with Advanced Analytical Techniques (2024 Guide)
From Everand
DATA ANALYSIS AND DATA SCIENCE: Unlock Insights and Drive Innovation with Advanced Analytical Techniques (2024 Guide)
WINTON CLEM
No ratings yet
Big Data: Revolutionizing the Future
From Everand
Big Data: Revolutionizing the Future
Parvati Mishra
No ratings yet
Data Analytics for Businesses 2019: Master Data Science with Optimised Marketing Strategies using Data Mining Algorithms (Artificial Intelligence, Machine Learning, Predictive Modelling and more)
From Everand
Data Analytics for Businesses 2019: Master Data Science with Optimised Marketing Strategies using Data Mining Algorithms (Artificial Intelligence, Machine Learning, Predictive Modelling and more)
Riley Adams
5/5 (1)
Business Analytics and Big Data
From Everand
Business Analytics and Big Data
Sachin Naha
No ratings yet
Becoming a Data Analyst: Skills, Tools, and Real-World Strategies
From Everand
Becoming a Data Analyst: Skills, Tools, and Real-World Strategies
Othman Khalifa
No ratings yet
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
From Everand
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
Janet Laane Effron
No ratings yet
PYTHON FOR DATA ANALYTICS: Mastering Python for Comprehensive Data Analysis and Insights (2023 Guide for Beginners)
From Everand
PYTHON FOR DATA ANALYTICS: Mastering Python for Comprehensive Data Analysis and Insights (2023 Guide for Beginners)
Waldo Todd
No ratings yet
Introduction to Data Analytics
From Everand
Introduction to Data Analytics
Dan Martin
No ratings yet
"Data Analysis" Basic Concepts and Applications
From Everand
"Data Analysis" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
What Is Data Analytics? A Complete Guide For Beginners
From Everand
What Is Data Analytics? A Complete Guide For Beginners
Piyush Kumar Jain
No ratings yet
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
From Everand
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
Steven Taylor
No ratings yet
PYTHON DATA SCIENCE: A Practical Guide to Mastering Python for Data Science and Artificial Intelligence (2023 Beginner Crash Course)
From Everand
PYTHON DATA SCIENCE: A Practical Guide to Mastering Python for Data Science and Artificial Intelligence (2023 Beginner Crash Course)
Calvert Long
No ratings yet
The Data Whisperer - Making Sense of Big Data
From Everand
The Data Whisperer - Making Sense of Big Data
Keaton Rivers
No ratings yet
Principles of Data Mining
From Everand
Principles of Data Mining
Subodh Keshari
No ratings yet
Data Analytics
From Everand
Data Analytics
Jeffery Short
1/5 (1)
Data Governance for Tax Administrations: A Practical Guide
From Everand
Data Governance for Tax Administrations: A Practical Guide
Inter-American Center of Tax Administrations – CIAT
No ratings yet
Zero To Mastery In Cybersecurity- Become Zero To Hero In Cybersecurity, This Cybersecurity Book Covers A-Z Cybersecurity Concepts, 2022 Latest Edition
From Everand
Zero To Mastery In Cybersecurity- Become Zero To Hero In Cybersecurity, This Cybersecurity Book Covers A-Z Cybersecurity Concepts, 2022 Latest Edition
RAJIV JAIN
No ratings yet
Illuminating Pathways: Navigating the Enterprise Ecosystem Through Decision Intelligence and Generative AI Focused on Prioritization
From Everand
Illuminating Pathways: Navigating the Enterprise Ecosystem Through Decision Intelligence and Generative AI Focused on Prioritization
Skip Vanderburg
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
From Data To Decisions: Driving Performance in the Age of Analytics
From Everand
From Data To Decisions: Driving Performance in the Age of Analytics
Babatunde Yusuf
No ratings yet
Big Data Ethics in Research
From Everand
Big Data Ethics in Research
Nicolae Sfetcu
No ratings yet
Business Data Analytics with Microsoft Excel
From Everand
Business Data Analytics with Microsoft Excel
Pasquale De Marco
No ratings yet
Data Mining For Business Analytics & Data Analysis In Python
From Everand
Data Mining For Business Analytics & Data Analysis In Python
Book Option
No ratings yet
Data Science Career Guide Interview Preparation
From Everand
Data Science Career Guide Interview Preparation
Gradient Publication
No ratings yet
Business Analytics
From Everand
Business Analytics
Hiriyappa .B
4/5 (1)
Unveiling Insights: Mastering Data Mining and Knowledge Discovery in the Digital Age: O6.0 TRANSFORM DATA
From Everand
Unveiling Insights: Mastering Data Mining and Knowledge Discovery in the Digital Age: O6.0 TRANSFORM DATA
Elizabeth Mogopodi
No ratings yet
Managing Big Data Effectively
From Everand
Managing Big Data Effectively
Bhima Asan
No ratings yet
Business Analytics
From Everand
Business Analytics
Hiriyappa .B, Ph.D.
5/5 (1)
Data Analysis: An In-depth Insight
From Everand
Data Analysis: An In-depth Insight
Pasquale De Marco
No ratings yet
Data Science
From Everand
Data Science
Chloe Martin
No ratings yet
Data-Driven Decision Making
From Everand
Data-Driven Decision Making
Aadinath Pothuvaal
No ratings yet
AI Alchemy: Transforming Business Models into Gold in the Digital Age
From Everand
AI Alchemy: Transforming Business Models into Gold in the Digital Age
Malisa R. Moses
No ratings yet
Free Antivirus and its Market Implimentation: a Case Study of Qihoo 360 And Baidu
From Everand
Free Antivirus and its Market Implimentation: a Case Study of Qihoo 360 And Baidu
Yang Yiming
No ratings yet
The Role of Data Management in Building Sustainable AI Systems
From Everand
The Role of Data Management in Building Sustainable AI Systems
Alberto De Miranda
No ratings yet
How AI is Enhancing Business Performance
From Everand
How AI is Enhancing Business Performance
akosnemeth
No ratings yet
Data Science Project Ideas for Thesis, Term Paper, and Portfolio
From Everand
Data Science Project Ideas for Thesis, Term Paper, and Portfolio
Zemelak Goraga
No ratings yet
Business Intelligence and Data Mining Techniques
From Everand
Business Intelligence and Data Mining Techniques
Dwaipayan Sethi
No ratings yet
Introduction to Business Analytics
From Everand
Introduction to Business Analytics
Dwaipayan Sethi
No ratings yet
Artificial Intelligence and Machine Learning in Market Research: Smart Project Ideas
From Everand
Artificial Intelligence and Machine Learning in Market Research: Smart Project Ideas
Zemelak Goraga
No ratings yet
Big Data Assignment Revised
No ratings yet
Big Data Assignment Revised
4 pages
ABF Webinar - Episode 5
No ratings yet
ABF Webinar - Episode 5
46 pages
CRM Data Collection and Storage
No ratings yet
CRM Data Collection and Storage
22 pages
Ahmed Said Ali - Bill of Costs
100% (1)
Ahmed Said Ali - Bill of Costs
6 pages
Grey Minimalist Business Project Presentation
No ratings yet
Grey Minimalist Business Project Presentation
19 pages
Chapter 6 - Leading
No ratings yet
Chapter 6 - Leading
3 pages
JIS School Fees IDR 2021 22 Semester1
No ratings yet
JIS School Fees IDR 2021 22 Semester1
3 pages
Slaven Bilic On Why He Loves Leeds United Head Coach Marcelo Bielsa The Most
No ratings yet
Slaven Bilic On Why He Loves Leeds United Head Coach Marcelo Bielsa The Most
7 pages
CDP Assessment Tool
No ratings yet
CDP Assessment Tool
79 pages
Jets Piping Guide
No ratings yet
Jets Piping Guide
24 pages
Market Study
100% (1)
Market Study
32 pages
LDO Test Report 060520
No ratings yet
LDO Test Report 060520
1 page
Nuclear Power Plants Thesis
100% (3)
Nuclear Power Plants Thesis
8 pages
Home Improvement Resume Examples
100% (2)
Home Improvement Resume Examples
6 pages
D. E. McCabe_ J. G. Merkle_ K. Wallin - An Introduction to the Development and Use of the Master Curve Method (ASTM Manual) (Astm Manual Series, Mnl 52) (2005)
No ratings yet
D. E. McCabe_ J. G. Merkle_ K. Wallin - An Introduction to the Development and Use of the Master Curve Method (ASTM Manual) (Astm Manual Series, Mnl 52) (2005)
73 pages
12 Design Patterns
No ratings yet
12 Design Patterns
6 pages
Pantheon: Quiz 1 Answers
No ratings yet
Pantheon: Quiz 1 Answers
8 pages
Exw Fca CPT Cip Dpu Dap DDP: Fas Fob CFR Cif
No ratings yet
Exw Fca CPT Cip Dpu Dap DDP: Fas Fob CFR Cif
1 page
Sample Wealth Plan
No ratings yet
Sample Wealth Plan
15 pages
Gui 4 Cli
No ratings yet
Gui 4 Cli
3 pages
Health Career Orientation Program
No ratings yet
Health Career Orientation Program
18 pages
Google Earth Engine Applications
100% (1)
Google Earth Engine Applications
422 pages
600 MCQ - CPC-11
No ratings yet
600 MCQ - CPC-11
2 pages
Very Large Floating Structures
78% (9)
Very Large Floating Structures
28 pages
Cleartrip Flight Domestic E-Ticket
No ratings yet
Cleartrip Flight Domestic E-Ticket
2 pages
Iot Based Smart Parking System
No ratings yet
Iot Based Smart Parking System
8 pages
Daftar Pustaka
No ratings yet
Daftar Pustaka
2 pages
3a. Dairy Cattle Production
No ratings yet
3a. Dairy Cattle Production
57 pages
Windows Phone
No ratings yet
Windows Phone
29 pages
Assignment
No ratings yet
Assignment
6 pages
CT 128302
No ratings yet
CT 128302
11 pages

Lahore School of Economics Data Analysis and Statistical Methods Winter 2020

Uploaded by

Lahore School of Economics Data Analysis and Statistical Methods Winter 2020

Uploaded by

Lahore School of Economics

Data Analysis and Statistical Methods

Name: Zohraiz Mubarik Section:__D_______________

Date: 02/09/2020 Score:___________________

Q2: Explain the categories of Business Analytics.

Q3: Explain sources of Big Data analytics.

 Text Analytics consists Document representation, enterprise search system, search

Q5: Explore the dataset “Yellow Pages of Pakistan”

Time period: 1990-2017

• Services (Description of types of products/services provided by the business)

Datatype-Cross Sectional data

You might also like

Name: Zohraiz Mubarik Section:D_____________