big data 4
big data 4
Submitted to:
Lecturer
University of Dhaka
Submitted by:
Ibrahim
ID: 2209
University of Dhaka
1. Amazon elastic compute cloud: Amazon Elastic Compute Cloud (Amazon EC2) is a cloud
computing service that provides scalable virtual servers, known as instances. It allows users to
run applications on-demand without needing to invest in physical hardware. EC2 offers various
instance types, pricing models, and security options to meet different computing needs. It
integrates with other AWS services for flexibility, scalability, and reliability.
2. Amazon Simple Storage Service (S3): S3 is a reliable and scalable object storage solution
that allows organizations to efficiently store and access substantial volumes of data. It provides
various storage classes, ensures high durability, and integrates smoothly with other AWS services
for effective backup and data management.
3.Amazon Elastic MapReduce (EMR) is a cloud-based big data processing solution that
streamlines large-scale data analysis with Apache Hadoop and other open-source technologies.
It simplifies cluster administration, works with AWS storage services, and is suitable for data
transformations and machine learning.
4. Amazon Athena is a serverless query service for analysing structured and unstructured data
stored in S3 using normal SQL. It reduces the need for sophisticated infrastructure management
by using a pay-per-query paradigm for cost savings.
6. Amazon Glue: Amazon Glue is a fully managed ETL (Extract, Transform, Load) service. It
helps automate data preparation for analytics and machine learning. It supports various data
sources and formats.
7.Amazon Kinesis: Amazon Kinesis is a real-time data streaming service. It processes and
analyzes large volumes of data as it arrives. It is useful for applications like real-time analytics and
event tracking.
Microsoft Azure
Azure is Microsoft’s cloud computing platform, launched in 2010. It provides services similar to
AWS but is particularly popular with businesses that already use Microsoft products.
Key Azure Cloud Computing Services:
1.Azure Virtual Machines: Lets users create and manage virtual computers in the cloud. It
supports different operating systems, offers flexible storage, and includes tools for scaling and
monitoring performance.
2.Azure App Service: A cloud platform for building and running web and mobile apps. It supports
various programming languages, connects with databases and AI tools, and automatically scales
based on traffic.
3.Azure Storage: A secure and flexible cloud storage solution for saving different types of data. It
includes options for file storage, backups, and secure data management with built-in security
features.
4.Azure Cosmos DB: A fast, globally distributed database that supports different data formats like
documents and graphs. It provides quick access, automatic backups, and strong security for
applications needing real-time data.
5.Azure AI Services: Offers ready-to-use AI tools for tasks like language translation, image
recognition, and speech processing. Businesses can easily integrate AI into their apps without
needing deep expertise.
Google Cloud Platform (GCP)
Key Google Cloud Platform (GCP) Services
Compute Engine is a virtual machine (VM) service on Google Cloud. It allows users to run
applications on customizable VMs. It provides scalability, security, and flexibility for workloads.
Kubernetes Engine
Kubernetes Engine is a managed container orchestration service. It helps deploy, manage, and
scale containerized applications. It automates tasks like load balancing, scaling, and security.
Cloud Functions
Cloud Functions is a serverless compute service. It runs code in response to events without
managing servers. It supports real-time processing, automation, and backend logic.
Cloud Storage
Cloud Storage provides scalable and secure object storage. It supports unstructured data like
images, videos, and backups. It offers multiple storage classes for cost efficiency.
Cloud Bigtable
Cloud Bigtable is a NoSQL database for large-scale applications. It handles high-throughput and
low-latency workloads. It is ideal for analytics, IoT, and time-series data.
BigQuery
BigQuery is a fully managed data warehouse. It enables fast SQL queries on large datasets. It is
optimized for real-time analytics and machine learning.
Cloud Dataproc
Cloud Dataproc is a managed Hadoop and Spark service. It processes big data efficiently on
Google Cloud. It supports analytics, ETL, and machine learning workloads.
TensorFlow
TensorFlow is an open-source machine learning framework. It helps build and train AI models
efficiently. It supports deep learning, neural networks, and automation.