0% found this document useful (0 votes)
4 views12 pages

AI ML

The document outlines various topics related to online MBA courses in AI, machine learning, data analysis, and programming, covering concepts like decision trees, sentiment analysis, and event-driven programming. It includes questions and answers on data handling, statistical methods, and programming techniques in R and SQL. The content is structured in a quiz format, testing knowledge on data science and analytics principles.

Uploaded by

Mahesh Karle
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views12 pages

AI ML

The document outlines various topics related to online MBA courses in AI, machine learning, data analysis, and programming, covering concepts like decision trees, sentiment analysis, and event-driven programming. It includes questions and answers on data handling, statistical methods, and programming techniques in R and SQL. The content is structured in a quiz format, testing knowledge on data science and analytics principles.

Uploaded by

Mahesh Karle
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

Online_MBA_AI(OMBAIML 301)

Extract insights from data

Data Analysis

Learning from labeled data

Decision Trees

When the model performs well on training data but poorly on test data

c(1, 2, 3)

read.csv()

data.frame(column1, column2)

install.packages("package_name")

11

To clean the data

Inferential statistics

Values that are not recorded

Mean

selection

conversion

duplicates

True

False

True

2. What challenges might the company face when handling text data?
✅ Ambiguity in language
✅ High dimensionality of text data

3. Which preprocessing methods help improve the accuracy of sentiment analysis?


✅ Lemmatization
✅ Stop word removal
4. What types of insights can be gained from sentiment analysis of customer feedback?
✅ Identification of customer preferences
✅ Understanding product issues

5. Which of the following visualizations can be effective for displaying sentiment analysis results?
✅ Bar charts
✅ Line graphs

6. What is the primary goal of topic modeling in text analysis?


✅ To identify common themes in feedback
✅ To classify text into information

Online_MBA_ML(OMBAIML 302)_

1. Which of the following machine learning algorithm can be used for imputing missing values of
both categorical and continuous variables?
✅ KNN

2. Which of the following distance measure do we use in case of categorical variables in KNN?
✅ Hamming distance

3. Machine Learning is used to find what?


✅ Hidden patterns in dataset

4. What is Standard Deviation?


✅ A quantity expressing by how much the members of a group differ from the mean value for the
group

5. Which algorithm is commonly used for classification tasks in machine learning?


✅ Decision trees
6. What is Regression?
✅ A technique to determine the statistical relationship between two or more variables

7. What is the full form of SVR?


✅ Support vector regression

8. Clustering is a technique of which Machine Learning?


✅ Unsupervised

9. If there are datapoints that belong to more than one cluster, then which clustering technique
is/are used?
✅ Fuzzy

10. In which type of clustering, each data point either belongs to a cluster completely or not?
✅ Hard clustering

11. What do you mean by support(A)?


✅ Number of transactions containing A / Total number of transactions

12. What is the full form of MDP?


✅ Markov Decision Process

13. CNN carries out which operation, which is an element-wise product and sum between two
matrices?
✅ Convolution

14. Which layer simply removes all the negative values from the convolution?
✅ ReLU

15. Mobile phone face ID is an example of Deep Learning


✅ True

16. PCA is dimensionality reduction technique


✅ True
17. Random Forest is an example of bagging
✅ True

18. The fundamental unit of Neural Network is ___________


✅ Neuron

19. ______ are best recognized for their ability to find patterns present in images
✅ CNN

20. _______ is applied by the agent to decide the next action based on the current state.
✅ Policy/Strategy

2. Choose applications that utilize machine learning:


✅ Weather forecasting
✅ Language translation
(These are explicitly mentioned in the paragraph as ML applications. Simple calculator apps and
alarm clocks are not.)

3. Identify the fields that are used in machine learning:


✅ Autonomous vehicles
✅ Music composition
(The paragraph discusses autonomous vehicles. Music composition can involve ML in generative
models, although not explicitly mentioned in the paragraph, it's still a valid application compared to
the others.)

4. Choose the correct applications of machine learning that are used in healthcare:
✅ Medical image analysis
✅ Disease prediction
(Both are explicitly listed in the healthcare section of the paragraph.)

5. Identify the tasks that can be performed using natural language processing (NLP):
✅ Sentiment analysis
✅ Speech recognition
(Sentiment analysis is a direct NLP task. Speech recognition is mentioned in ML use, and often
overlaps with NLP in speech-to-text systems.)
6. Select all the applications of machine learning in recommendation systems:
✅ Music recommendation
✅ Movie recommendation

Online_MBA_PAP(OMBAIML 303)

1. Identify the correct way to define a set in Python:


✅ set = {1, 2, 3}

2. Identify the correct way to access a value in a dictionary by its key in Python:
✅ dictionary[key]

3. Determine what statistics provided by df.describe() function in Pandas:


✅ Descriptive

4. Visual data exploration and exploratory statistical analysis are part of which step?
✅ Data Analysis

5. Which regression model is used when the dependent variable is categorical?


✅ Logistic Regression

6. Which of the following is not a method to prevent overfitting in decision trees?


✅ Regularization

7. What task is SVM algorithm used for?


✅ Classification

8. What does Ensemble methods do?


✅ combine multiple classifier to improve the performance of a model.

9. Major voting process is considered to be from the following:


✅ Bagging

10. For multiple trees to be effective, how should the trees be?
✅ Independent

11. What does k-NN stand for?


✅ k-Nearest Neighbor

12. Association Rules are used to determine:


✅ Discovering interesting relationships between variables

13. The support of an itemset in Association Rules relates to:


✅ The proportion of transactions containing the itemset out of all transactions

14. What major challenges do interconnected databases and regular data updates pose?
✅ Negative impact on data quality
15. Decision Trees can handle both numerical and categorical data.
✅ True

16. A social network learner usually consists of a local model using only node-specific
characteristics.
✅ True

17. Conventional approach to analytics in social network learning doesn't consider network
connections.
✅ True

18. Growing a decision tree involves recursively splitting data based on:
✅ Information Gain

19. A decision tree model with too many levels or branches suffers from:
✅ Overfitting

20. In SVM, the points closest to the hyperplane are called:


✅ Support Vectors

2. Determine the libraries or frameworks in Python that facilitate event-driven programming


✅ Tkinter
✅ asyncio
(❌ Requests and NumPy are not event-driven libraries)

3. Dissect the advantages of event-driven programming in Python?


✅ Responsive and interactive applications
✅ Modular and reusable code
(❌ Sequential execution and database management are not core advantages mentioned)

4. Determine which type of applications can benefit from event-driven programming in Python?
✅ Networking servers
✅ GUI interfaces (implied from the paragraph; choose Networking servers + GUI interfaces if asked)
(Note: Command-line utilities and simulations are not event-driven in nature)

5. Determine which features are associated with event-driven programming in Python?


✅ Asynchronous execution
✅ Callback functions
(❌ Object-oriented programming and exception handling are general concepts, not specific to
event-driven)
6. Evaluate how are events typically triggered in event-driven programming?
✅ Through user actions
✅ Automatically by the operating system

Online_MBA_SQT(OMBAIML 304)

1. Unit of measurement for variance of heights in cm


✅ cm²

2. Type of data for heights of the trees


✅ Continuous

3. Purpose of multiple regression


✅ To predict scores on a dependent variable from scores on multiple independent variables

4. Attribute with two classes is called


✅ Dichotomy

5. Data required for calculation of measures of association


✅ Attributes

6. Type of data for gender of students


✅ Attribute

7. Probability of selecting a king from a deck of 52 cards


✅ 0.076 (4 kings / 52 cards = 0.0769)

8. Why saying 10°C is twice as hot as 5°C is incorrect


✅ For interval data, differences make sense, but ratios do not

9. Maximax criterion decision-making environment


✅ Conditions of uncertainty, the decision maker being perfectly optimistic

10. Probability of getting a heart from 52 cards


✅ 0.25 (13 hearts / 52 cards = 0.25)

11. Mean and median for: 1, 7, 3, 3, 6, 4


✅ 4 and 3.5
(Sorted: 1, 3, 3, 4, 6, 7 → mean = 24/6 = 4; median = (3+4)/2 = 3.5)

12. Measure that is most frequent number in dataset


✅ Mode

13. Mean of 6 numbers is 17 → Sum = ?


✅ 102 (Mean × count = 17 × 6 = 102)
14. Arrival follows
✅ Poisson distribution

15. Programming that allows more than one objective


✅ Goal

16. Transportation problem requires equal


✅ Units supplied; units demanded

17. Standard normal distribution has


✅ Mean equal 0 and variance equal 1

18. Median divides into four equal parts


✅ False
(Median divides into two equal parts; quartiles divide into four)

19. Hungarian Method is faster and efficient


✅ True

20. Pure strategy in two-person, zero-sum game


✅ True

1- A&B
2- B&D
3- B&C
4- A&B
5- C&D

Online_MBA_BI(OMBAIML 305)_

1. Where is data stored, retrieved & updated in a data warehouse?


✅ OLAP

2. Which metadata maps core warehouse metadata to business concepts?


✅ Application-level metadata

3. Expansion of OLAP:
✅ Online Analytical Processing

4. What does data mining use to convert large collections of data into useful output?
✅ Algorithms and various techniques

5. Data mining tool based on GUI and uses R stats:


✅ Rattle
6. Primary means of creating a dimension:
✅ Dimension Wizard

7. Processing of data about customers and their relationship with the enterprise:
✅ CRM analytics

8. Performance management tool:


✅ Balanced Scorecard

9. How to remove noise and inconsistency in data:


✅ Data cleaning

10. Style of network communication where client initiates request:


✅ Pull coding

11. Device used to transfer an image in visualization:


✅ Scanner

12. Visualization technique showing customer perception of brands:


✅ Perceptual mapping

13. How to design effective tables and charts for data visualization:
✅ Data-ink ratio

14. Use of BI service:


✅ Flexible self-service analysis

15. What processes data through use of nodes:


✅ Neural networks

16. A calculation in ________ is a bunch of heuristics and estimations:


✅ Data mining

17. The class under study in Data Characterization is known as:


✅ Target

18. To harness a successful BI project, companies must plan:


✅ True

19. A data source may not be the initial location where data is born:
✅ True

20. Successful information mining supports business strategy and operations:


✅ True

2. What is the requirement to use BI softwares?


✅ for transparent analytics
✅ reporting process
3. Which are data management solutions?
✅ Business intelligence
✅ Business Analytics

These are explicitly mentioned as data management solutions used to collect and analyze data.

4. Which kind of data could be used for analysis?


✅ organized
✅ unstructured

The paragraph says: "The data used for analysis can be organised or unstructured."

5. What is the use of statistics in BI?


✅ to analyze raw information
✅ to deliver insights for future

It says BI uses statistics "to analyze raw information, and deliver insights for making better future
decisions."

6. In which format business intelligence present the data?


✅ charts
✅ dashboard

Online_MBA_R programming(OMBAIML 306)

1. What does R stand for in the context of the R programming language?


✅ NA (R does not officially stand for anything; it was partly named after the first letters of its
authors Robert and Ross)
2. Which of the following is a primary use of R?
✅ Statistical analysis

3. What is the command to install a package in R?


✅ install.packages("package_name")

4. Which of the following data structures is NOT available in R?


✅ tree

5. How do you create a vector in R?


✅ c(1,2,3)
6. What function is used to view the structure of a dataset in R?
✅ str()

7. Which of the following is a way to read a CSV file into R?


✅ read.csv("file.csv")

8. In R, what does the function summary() do?


✅ It provides a summary of statistical measures for an object.

9. What is the primary function of the ggplot2 package in R?


✅ data visualization

10. Which of the following is a logical operator in R?


✅ &&

11. What is a matrix in R?


✅ A collection of elements of the same type arranged in a rectangular format

12. How do you create a matrix in R?


✅ matrix(c(1, 2, 3, 4), nrow=2, ncol=2)

13. What function is used to access a specific element in a matrix in R?


✅ [row, column] notation

14. Which function is used to transpose a matrix in R?


✅ t(matrix)

15. To create a sequence of numbers from 1 to 10 in R, you would use the function __________.
✅ seq(1,10)

16. The function __________ is used to read a CSV file into R.


✅ read.csv()

17. In R, the __________ function is used to calculate the mean of a numeric vector.
✅ mean()

18. R packages are collections of R functions, data, and documentation bundled together
✅ True

19. The function install.packages("package_name") is used to load an already installed package


into the R environment.
✅ False
*(Correct function: library(package_name))

20. R scripts are plain text files that contain R code and can be executed in the R environment.
✅ True

2. What is the purpose of the JOIN clause in SQL?


✅ To combine rows from two or more tables based on a related column
✅ (None of the other listed options are correct; only one correct option applies here)
3. Which of the following SQL statements correctly groups data to calculate the total quantity sold
for each product?
✅ SELECT Product, SUM(Quantity) AS TotalQuantity FROM Orders GROUP BY Product;
✅ SELECT Product, SUM(Quantity) AS TotalQuantity FROM Orders ORDER BY TotalQuantity;
(Though missing GROUP BY, this one could still be a possible second choice if GROUP BY is implied.
However, the first is the most accurate.)

4. Which R function is used to read CSV data into a data frame?


✅ read.csv()
✅ read.csv2() (Another variant that uses semicolon separators common in European data formats)

5. What is the output of the following SQL query?


✅ Total revenue generated per city
✅ (Only one correct answer; others do not match the query's intent.)

6. Which dplyr functions did Mr. Sinha use in R to summarize data?


✅ group_by()
✅ summarise()

You might also like