AI ML
AI ML
Data Analysis
Decision Trees
When the model performs well on training data but poorly on test data
c(1, 2, 3)
read.csv()
data.frame(column1, column2)
install.packages("package_name")
11
Inferential statistics
Mean
selection
conversion
duplicates
True
False
True
2. What challenges might the company face when handling text data?
✅ Ambiguity in language
✅ High dimensionality of text data
5. Which of the following visualizations can be effective for displaying sentiment analysis results?
✅ Bar charts
✅ Line graphs
Online_MBA_ML(OMBAIML 302)_
1. Which of the following machine learning algorithm can be used for imputing missing values of
both categorical and continuous variables?
✅ KNN
2. Which of the following distance measure do we use in case of categorical variables in KNN?
✅ Hamming distance
9. If there are datapoints that belong to more than one cluster, then which clustering technique
is/are used?
✅ Fuzzy
10. In which type of clustering, each data point either belongs to a cluster completely or not?
✅ Hard clustering
13. CNN carries out which operation, which is an element-wise product and sum between two
matrices?
✅ Convolution
14. Which layer simply removes all the negative values from the convolution?
✅ ReLU
19. ______ are best recognized for their ability to find patterns present in images
✅ CNN
20. _______ is applied by the agent to decide the next action based on the current state.
✅ Policy/Strategy
4. Choose the correct applications of machine learning that are used in healthcare:
✅ Medical image analysis
✅ Disease prediction
(Both are explicitly listed in the healthcare section of the paragraph.)
5. Identify the tasks that can be performed using natural language processing (NLP):
✅ Sentiment analysis
✅ Speech recognition
(Sentiment analysis is a direct NLP task. Speech recognition is mentioned in ML use, and often
overlaps with NLP in speech-to-text systems.)
6. Select all the applications of machine learning in recommendation systems:
✅ Music recommendation
✅ Movie recommendation
Online_MBA_PAP(OMBAIML 303)
2. Identify the correct way to access a value in a dictionary by its key in Python:
✅ dictionary[key]
4. Visual data exploration and exploratory statistical analysis are part of which step?
✅ Data Analysis
10. For multiple trees to be effective, how should the trees be?
✅ Independent
14. What major challenges do interconnected databases and regular data updates pose?
✅ Negative impact on data quality
15. Decision Trees can handle both numerical and categorical data.
✅ True
16. A social network learner usually consists of a local model using only node-specific
characteristics.
✅ True
17. Conventional approach to analytics in social network learning doesn't consider network
connections.
✅ True
18. Growing a decision tree involves recursively splitting data based on:
✅ Information Gain
19. A decision tree model with too many levels or branches suffers from:
✅ Overfitting
4. Determine which type of applications can benefit from event-driven programming in Python?
✅ Networking servers
✅ GUI interfaces (implied from the paragraph; choose Networking servers + GUI interfaces if asked)
(Note: Command-line utilities and simulations are not event-driven in nature)
Online_MBA_SQT(OMBAIML 304)
1- A&B
2- B&D
3- B&C
4- A&B
5- C&D
Online_MBA_BI(OMBAIML 305)_
3. Expansion of OLAP:
✅ Online Analytical Processing
4. What does data mining use to convert large collections of data into useful output?
✅ Algorithms and various techniques
7. Processing of data about customers and their relationship with the enterprise:
✅ CRM analytics
13. How to design effective tables and charts for data visualization:
✅ Data-ink ratio
19. A data source may not be the initial location where data is born:
✅ True
These are explicitly mentioned as data management solutions used to collect and analyze data.
The paragraph says: "The data used for analysis can be organised or unstructured."
It says BI uses statistics "to analyze raw information, and deliver insights for making better future
decisions."
15. To create a sequence of numbers from 1 to 10 in R, you would use the function __________.
✅ seq(1,10)
17. In R, the __________ function is used to calculate the mean of a numeric vector.
✅ mean()
18. R packages are collections of R functions, data, and documentation bundled together
✅ True
20. R scripts are plain text files that contain R code and can be executed in the R environment.
✅ True