Theory Questions for the End-Semester Exam in Business Analytics
Unit 1: Introduction
1. Define Data Science and explain its importance in today's world.
2. Differentiate between data analytics and data analysis.
3. What are the main classifications of analytics? Provide examples for each.
4. Explain the applications of analytics in business decision-making.
5. What are the characteristics of Big Data? Provide real-world examples.
6. Discuss the various applications of Big Data in different industries.
7. Highlight the major challenges faced in data analytics.
8. Explain the role of data analytics in predictive modeling.
9. How does Big Data help in improving customer experience?
10. What are the ethical considerations in data analytics?
Unit 2: Data Preparation, Summarization and Visualization using Spreadsheets
11. What is data preparation, and why is it important?
12. Describe the steps involved in data cleaning.
13. Explain the functionality of sorting and filtering in spreadsheets.
14. How is conditional formatting useful in data analysis?
15. What is the "Text to Columns" feature, and where is it applied?
16. Outline the process of removing duplicates from data in spreadsheets.
17. Define data validation and its purpose.
18. What is an outlier, and how can you identify it in a dataset?
19. Explain the importance of covariance in data analysis.
20. Describe the role of a correlation matrix in understanding data relationships.
21. How can missing values be found and handled in data preparation?
22. Define data summarization with examples.
23. Compare scatter plots, line charts, and histograms. When should each be used?
24. Explain the concept of pivot tables and their applications.
25. What are pivot charts, and how do they differ from interactive dashboards?
26. Discuss the importance of visualization in presenting data insights.
Unit 3: Getting Started with R
27. What are the main advantages of using R for data analysis?
28. Describe the steps for installing R and its packages.
29. Explain the process of importing data from spreadsheet files into R.
30. What is the syntax for creating variables in R? Provide examples.
31. Differentiate between packages and libraries in R.
32. Briefly describe the following data structures in R:
- Vectors
- Matrices
- Arrays
- Lists
- Factors
- Data Frames
33. How can data be exported from R? Provide an example.
34. Why is R considered suitable for statistical computing and graphics?
35. Explain the difference between scalar and vector data structures in R.
36. Discuss the importance of data frames in data manipulation with an example.
37. What are factors in R, and how are they used for categorical data representation?
38. Define matrices in R. How do they differ from arrays?
39. Describe the process of combining multiple data frames in R.
40. How can missing values in datasets be handled in R?
41. Provide examples of commonly used R packages for data analysis.
42. Discuss the key components of R syntax and their significance.
43. What are the advantages of using R over other programming languages for statistical analysis?
44. How can lists in R be manipulated? Provide an example.
45. Describe the process of installing and updating R libraries.
46. What is the significance of exporting data in R? Explain with a practical example.
Unit 4: Descriptive Statistics Using R
47. Compare histograms and bar charts. Provide examples.
48. What is a box plot, and what insights can it provide?
49. Explain the use of line graphs and scatter plots in visualizing data.
50. Define measures of central tendency. Give examples of their application.
51. What are measures of dispersion? Explain their significance.
52. Discuss covariance and its role in data analysis.
53. What is correlation, and how is it different from covariance?
54. Explain the coefficient of determination with an example.
55. How can descriptive statistics be visualized effectively using R?
56. Compare the use of mean, median, and mode as measures of central tendency.
57. What is the interquartile range, and how is it represented visually using box plots?
58. Explain the process of calculating standard deviation in R.
59. How can covariance be interpreted in the context of variable relationships?
60. Discuss the steps to create a scatter plot in R for visualizing relationships between variables.
61. Describe the significance of the correlation coefficient in understanding variable dependencies.
62. How is the coefficient of determination used in statistical modeling?
63. Explain the process of customizing bar charts in R with an example.
64. What are the steps involved in creating histograms in R for frequency distribution analysis?
65. Discuss the advantages of using R for visualizing descriptive statistics over traditional methods.
Unit 5: Predictive and Textual Analytics
66. What is a simple linear regression model? Illustrate with an example.
67. Define confidence intervals and prediction intervals in regression analysis.
68. Explain multiple linear regression and its applications.
69. What is the significance of regression coefficients in predictive modeling?
70. Discuss the concepts of heteroscedasticity and multicollinearity.
71. Provide an overview of textual data analysis and its importance.
72. What are the common challenges faced in textual analytics?
73. How is R used for textual analysis?
74. Define text mining and its role in analytics.
75. Discuss the methods of categorization in textual analytics.
76. What is sentiment analysis, and how is it applied in business?
77. What are the assumptions underlying simple linear regression models?
78. Explain the importance of prediction intervals in forecasting.
79. How can multiple linear regression be used for decision-making in business analytics?
80. Discuss the interpretation of heteroscedasticity in regression analysis.
81. What are the steps to address multicollinearity in multiple regression models?
82. Provide examples of industries where textual data analysis is extensively applied.
83. How is sentiment analysis performed on textual data? Discuss its significance.
84. Explain the process of text mining using R.
85. What are the common techniques for categorizing textual data?
86. How can the results of textual analysis be visualized effectively?
87. Discuss the challenges of analyzing unstructured textual data.
88. Provide examples of R packages used for textual analytics.
89. Compare text categorization and sentiment analysis techniques in R.
90. Discuss how predictive analytics can be integrated with textual analytics for actionable insights.