Lesson 1: Overview of Big Data
What is Big Data?
• Big data refers to extremely large and
complex datasets that cannot be easily
managed, processed, or analyzed using
traditional data processing tools.
• 100,000 tweets
• 204 million emails sent
• 6 million views and 277,000 Facebook logins
• 61,000 hours of music listened to on Spotify
• 2+ million Google searches
• 3 million uploads on Flickr
• 20 million photo views
Three V’s of Big Data
• 1. Variety: Big data comes in diverse
formats. It includes structured data (e.g.,
relational databases), semi-structured
data (e.g., JSON, XML), and
unstructured data (e.g., text, images,
videos). Managing and analyzing such
diverse data types is a key challenge in
big data processing.
Sample Relational Databases
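To make the three categories under Variety concrete, here is a minimal Python sketch. The sample rows, field names, and text are hypothetical values made up for illustration; they do not come from the lesson. It only shows how each kind of data is typically read, not a full processing pipeline.

```python
import csv
import io
import json

# Structured: fixed columns, as in a relational table (hypothetical rows).
structured_csv = "id,name,age\n1,Ana,30\n2,Ben,25\n"
rows = list(csv.DictReader(io.StringIO(structured_csv)))

# Semi-structured: self-describing JSON whose fields can differ per record.
semi_structured = '[{"id": 1, "tags": ["a", "b"]}, {"id": 2, "note": "no tags"}]'
records = json.loads(semi_structured)

# Unstructured: free text with no schema; here we simply count words.
unstructured = "Big data comes in diverse formats."
word_count = len(unstructured.split())

print(rows[0]["name"])         # every row has the same columns
print(records[1].get("tags"))  # a field may be missing, so use .get()
print(word_count)
```

Note how the structured rows can be accessed by column name directly, while the semi-structured records need a guard for missing fields and the unstructured text has no fields at all.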
• 2. Velocity: Big data is generated at
high speed. The data is produced rapidly
and continuously, often in real time.
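One way to picture Velocity is processing values as they arrive instead of waiting for a complete dataset. The sketch below is an illustration only: the function name `rolling_average`, the window size, and the sample readings are assumptions, and a plain Python list stands in for a real-time source.

```python
from collections import deque
from typing import Iterable, Iterator


def rolling_average(stream: Iterable[float], window: int) -> Iterator[float]:
    """Yield the mean of the most recent `window` values as each one arrives."""
    recent = deque(maxlen=window)  # old values drop out automatically
    for value in stream:
        recent.append(value)
        yield sum(recent) / len(recent)


# Hypothetical sensor readings standing in for a continuous stream.
readings = [10.0, 20.0, 30.0, 40.0]
averages = list(rolling_average(readings, window=2))
print(averages)  # [10.0, 15.0, 25.0, 35.0]
```

Because the function is a generator and keeps only the last `window` values, memory use stays constant no matter how long the stream runs.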
• 3. Volume: Big data involves a vast
amount of data. This could be
terabytes, petabytes, or even exabytes
of information, far beyond what
traditional databases can handle.
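The units named under Volume differ by factors of 1,000 each. As a rough sketch, the hypothetical helper `human_readable` below converts a byte count into the largest fitting unit. It assumes decimal (SI) units, where 1 TB = 10^12 bytes; binary units (powers of 1,024) would give slightly different numbers.

```python
# Decimal (SI) units, each 1,000 times larger than the previous one.
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB"]


def human_readable(num_bytes: int) -> str:
    """Format a byte count using the largest unit that keeps the value below 1,000."""
    value = float(num_bytes)
    for unit in UNITS:
        # Stop at the first unit that fits, or at the largest unit available.
        if value < 1000 or unit == UNITS[-1]:
            return f"{value:.1f} {unit}"
        value /= 1000
    return f"{value:.1f} {UNITS[-1]}"  # not reached; kept for completeness


print(human_readable(10**12))  # 1.0 TB
print(human_readable(10**15))  # 1.0 PB
print(human_readable(10**18))  # 1.0 EB
```

Put differently, one exabyte is a million terabytes.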
Big Data Opportunities
• Making better-informed decisions
(e.g., strategies, recommendations)
• Discovering hidden insights
(e.g., anomalies, patterns, trends)
• Automating business processes
(e.g., complex events, translation,
subscribing to SaaS)
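As a small illustration of discovering anomalies, the sketch below flags values that lie far from the mean, using a simple z-score rule. This is one basic technique chosen for illustration, not a method prescribed by the lesson; the function name `find_anomalies`, the threshold of 2, and the sample values are all assumptions.

```python
import statistics


def find_anomalies(values: list[float], threshold: float = 2.0) -> list[float]:
    """Return values whose distance from the mean exceeds `threshold` standard deviations."""
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - mean) / stdev > threshold]


# Hypothetical daily order counts with one unusual day.
daily_orders = [10, 11, 9, 10, 12, 50]
anomalies = find_anomalies(daily_orders)
print(anomalies)  # [50]
```

On real data the threshold would need tuning, and a single extreme value can inflate the standard deviation enough to hide other outliers.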
Q&A and Discussion