Data Analytics Important Questions
Data Analytics Important Questions
1. What are the benefits of Big Data? Discuss challenger under Big Data.
2. Explain CRUD operations in Mongo DB.
3. How Big Data analytics can be useful in development of smart cities? (Discuss one
application).
4. What do you mean by inter and trans fire wall analytics.
5. Explain the process of data storage in Hadoop Distributed File System (HDFS) with the help
of a suitable example.
6. Explain the 4V’s of big data.
7. Explain the architecture and features of Hive.
8. Justify: SPARK is faster than Map reduce.
9. What is Apache pig and why we need it?
10. Discuss the applications of big data analytics in weather fore casting.
11. How to create collection in Mongo DB? Explain with its syntax.
12. Explain the term Pig Latin in detail
13. Explain working of Hive with proper steps and diagram.
14. Explain following for Mongo DB.
i) Indexing
ii) Aggregation
15. Write short note on any three of the following.
i) Capacity scheduler in Map Reduce
ii) 5P’s of Big Data
iii) Metastore in Hive
iv) Regression ANOVA
v) Term frequency
16. What is Big Data? Explain characteristics of Big Data.
17. Differentiate between Apache Pig and Map Reduce.
18. Explain replication and scaling features of Mongo DB.
19. Write down the goals of HDFS.
20. Define term frequency and inverse document frequency.
21. Explain 5 P’s of Big data in brief.
22. What is H Base? Explain storage mechanism of H Base with an example
23. Explain the concept of metastore in Hive.
24. How can we extract data for data storage?
25. What is Map Reduce programming model? Explain.
26. Explain Hadoop architecture and its components with proper diagram.
27. What is Zoo keeper? List the benefits of it.
28. Explain any three Hive QL DDL command with its syntax and example.
29. Write down the process of installing and running Hive.
30. Write short note on any three of the following.
i) Pig latin
ii) Information management
iii) Sharding process
iv) SPARK
v) 4 V’s of Big data
31. Suppose the weights of 800 male students are normally distributed with 28.8 kg and SD of
2.06 kg. Find the number of students whose weights are
i) Between 28.4 kg and 30.4 kg
ii) More than 31.3kg
32. A random variable has the following probability function:
x 0 1 2 3 4 5 6 7
P (x) 0 k 2k 2k 3k K2 2k2 7k2+k
Determine:
i) k
ii) mean
iii) variance
33. A sales tax officer has reported that the average sales of the 500 businesses that he has to deal
with during a year is Rs. 36,000 with a standard deviation of Rs. 10,000. Assuming that the
sales in these businesses are normally distributed, find
(i). The number of businesses as the sales of which are greater than Rs. 40,000.
(ii). The percentage of business the sales of which are likely to range between Rs. 30,000 and
Rs. 40,000.