0% found this document useful (0 votes)
46 views4 pages

Week18 Quiz Solution

Uploaded by

Arnab Dey
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0% found this document useful (0 votes)
46 views4 pages

Week18 Quiz Solution

Uploaded by

Arnab Dey
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 4
18 QUIZ SOLUTION 1. Amazon S3 is ? A. Adistributed processing framework B. A monolithic file system C. ‘*Adistributed cloud based file system/object store 2. A. Data locality principle is not followed in S3.True/False? “TRUE B. FALSE Explanation: Filesystems like S3 do not need the namenonde to run, thereby the data locality optimization is not available 3. A Virtual Machine is characterized by? Memory and Storage Storage and Compute Memory and Compute *Memory,storage and compute Which is true about Amazon S3?Multiple can be chosen. “If cluster is down, S3 data remains intact If cluster is down, S3 data is also lost “Supports fault tolerance Does not support default replication factor of 3 Explanation: The S3 storage is decoupled from processing in EMR making the data intact. FOBPs DOM> 5. For which of these instances Amazon can revoke the access from user before user shuts down the instance by himself. A. On-demand B. ‘*SpotReserved C. Both A and C 6. Arrange the instances from cheapest to costliest. A. On-Demand,Reserved,Spot B. _ Reserved,Spot, On-Demand C. — Spot,On-Demand,Reserved D. *Spot,Reserved,On-Demand TRENDYTEC 9108179578 WEEK 18 QUIZ SOLUTION Explanation: AWS spot instances represent AWS's excess capacity, spare capacity available for any surge in customer demand.Therefore it is available at cheaper costs. 7. Which of these nodes can be used for compute-heavy applications but cannot host the data? A. Master Node B. Core Node Cc. *Task Node D. Both B and C Explanation: Task nodes are meant for running parallel computation tasks such as MapReduce and Spark executors but don’t run Data node Daemon. 8. Which type of instance is ideal choice for a task-node? A. *Spot B. Reserved C. On-demand Explanation: Spot instances are available are relatively cheaper costs and EMR services provide tooling to leverage spot instances easily. 9. To run a scheduled reporting job daily, at a predefined time on a cluster ‘Which cluster type is preferable? A. *Step-Execution Cluster B. Long Running Cluster 10 Which of these nodes in AWS runs a datanode ? A. Master Node B. *Core Node C. Task Node 11. In AWS, Spark History Server is configured on port- A. 22 B. 4040 Cc. *18080 D. 8088 TRENDY 9108179578 WEEK 18 QUIZ SOLUTION 12. Which of these instances can be a good choice for spark cluster set up and for machine learning algorithms respectively? A. Ctype-R type B. Mtype-R type C. Ctype-M type D. *Rtype-C type 13. Traffic for an EC2 instance in AWS,can be controlled by A. Traditional Firewalls B. *Security Group C. Secured Key-Pair D. Allof the above 14 key is used to connect to the EC2 instance, while. key is stored by the instance? A. Public, Private B. *Private, Public 15. For which of these storages , data will be compulsorily lost if we terminate the EMR cluster? A S3 B. “Instance Store Cc EBS 16. Hdfs will be a part of which of these nodes in AWS? A Master Node B. *Core Node C. Task Node 17 The table created in Athena is ? A. Managed B. *External C. Temporary 18. Charges in Athena are based on: A. *Amount of data Scanned B Run-Time of Query Cc. BothA and B xi oo. x TRENDY 91081795 ZK 18 QUIZ SOLUTION 19. Which of these is true statement about Athena? A. Cluster is needed for running queries using Athena B. Recommended when we have very frequent queries to execute C. Running Alter Table command is chargeable D. *None of the above 20 In Athena,we can save on per-query costs and get better performance by using which of these techniques.Multiple can be chosen A. *Compressing B. ‘Partition Pruning C. Converting data to row based formats D. *Converting data to column based formats 21. AWS Glue can be used to A. __ Infer Schema for data stored in S3 B. — Store the metadata C. Asan ETL tool D. *All of the above B. 22. Athena and AWS Glue do not require configuring a server A. ‘TRUE FALSE 91081 x xi TRENDY 95

You might also like