BDA Unit - V
BDA Unit - V
School of Computing
Vel Tech Rangarajan Dr. Sagunthala R&D Institute of
Science and Technology
Unit 5 Big Data Cloud Concepts and Optimization
Real-time/Fast Data
Mobile devices
(tracking all objects all the time)
Product
Recommendations Learning why Customers
Influence
that are Relevant Behavior Switch to competitors
& Compelling and their offers; in
time to Counter
Friend Invitations
Improving the Customer to join a
Marketing Game or Activity
Effectiveness of a that expands
Promotion while it business
is still in Play
Preventing Fraud
as it is Occurring
& preventing more
proactively
Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming
data
Interactive Business
Speed
Intelligence & Big Data:
In-memory RDBMS Scale
Real Time &
Single View
BI Reporting QliqView, Tableau, HANA
OLAP &
Graph Databases
Dataware house
Business Objects, SAS, Big Data:
Scale Speed
Informatica, Cognos other SQL Batch Processing &
Reporting Tools
Distributed Data Store
Hadoop/Spark; HBase/Cassandra
• Storage-as-a-service
• Database-as-a-service
• Information-as-a-service
• Process-as-a-service
• Application-as-a-service
• Platform-as-a-service
• Integration-as-a-service
• Security-as-a-service
• Management/
Governance-as-a-service
• Testing-as-a-service
• Infrastructure-as-a-service
Enabling Technology:
Virtualization
Hardware Hardware
Everything as a Service
• Utility computing = Infrastructure as a Service (IaaS)
• Why buy machines when you can rent cycles?
• Examples: Amazon’s EC2, Rackspace
• Platform as a Service (PaaS)
• Give me nice API and take care of the maintenance,
upgrades, …
• Example: Google App Engine
• Software as a Service (SaaS)
• Just run it for me!
• Example: Gmail, Salesforce
COBOL, Amazon.com
Edsel ARPANET Internet
AWS
• Elastic Compute Cloud – EC2 (IaaS)
• Simple Storage Service – S3 (IaaS)
• Elastic Block Storage – EBS (IaaS)
• SimpleDB (SDB) (PaaS)
• Simple Queue Service – SQS (PaaS)
• CloudFront (S3 based Content Delivery Network –
PaaS)
• Consistent AWS Web Services API
• Positive/Negative Sentiments
• Sentiment analysis features
1. NEGATION 2. POSITIVE SMILEY
3. NEGATIVE SMILEY
4. DONT— YOU, OH, SO, AS
FAR AS, 5. LAUGH
𝑓{ } = “Vijayanagara
Style”
Department of Computer Science and Engineering 39
Sparsity Learning
• Deep Compression
• Pruning low-weight values, retrain to recover accuracy.
• Less storage and also speedup reported in Fully Connected layers.
.
.
.
OFM: 55x55x96
• Deep Compression and SSL trained networks were used for analysis.
Both showed similar trends.
• Memory savings obtained with sparse representation is given below
• 2
Speedup
1.5
0.5
0
0.945 0.95 0.955 0.96 0.965 0.97 0.975 0.98 0.985 0.99 0.995
Sparsity
0.5
0.4
Speedup
0.3
0.2
0.1
0
0.945 0.95 0.955 0.96 0.965 0.97 0.975 0.98 0.985 0.99 0.995
Sparsity
Department of Computer Science and Engineering 52
Sparsity Learning
• Fully Connected Layers showed around 3x Speedup for layers having a high
sparsity.
• For a large dataset & GPU Memory, we might see drop in convolutional runtime
for Sparse representation.