Machine Learning Answers
Part-A
Part-B
Q5. Discuss how features impact the performance of machine learning models.
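Features are the input variables a model learns from, so their quality largely determines the
accuracy and efficiency of machine learning models. Relevant, well-scaled features make
patterns easier to learn, while irrelevant or redundant features add noise and increase
computation.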
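The effect of a poor feature can be illustrated with a minimal sketch. It uses a nearest-neighbor rule (see Q8) on made-up data: the names `relevant_only`, `with_noise`, and `nearest_label`, and all values, are hypothetical and chosen only for illustration. Adding an unrelated feature on a much larger scale changes which training point is closest, and with it the prediction.

```python
import math

def euclidean(a, b):
    """Straight-line distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_label(rows, labels, query):
    """Return the label of the single training row closest to the query."""
    best_index = min(range(len(rows)), key=lambda i: euclidean(rows[i], query))
    return labels[best_index]

labels = ["low", "low", "high", "high"]

# One relevant feature: small values are "low", large values are "high".
relevant_only = [(1.0,), (2.0,), (8.0,), (9.0,)]

# Same rows plus a hypothetical second feature that is unrelated to the
# label and measured on a much larger scale.
with_noise = [(1.0, 100.0), (2.0, 480.0), (8.0, 120.0), (9.0, 500.0)]

print(nearest_label(relevant_only, labels, (1.5,)))     # low
print(nearest_label(with_noise, labels, (1.5, 130.0)))  # high
```

In the second call the unrelated feature dominates the distance, so the query is matched to a "high" row even though its relevant feature is clearly in the "low" range.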
Part-C
Q8. How does the k-Nearest Neighbors (k-NN) algorithm work? Describe its advantages and
disadvantages.
k-Nearest Neighbors (k-NN) is a simple, non-parametric algorithm used for classification
and regression.
Working of k-NN:
1. Choose the number of neighbors (k).
2. Find the ‘k’ closest data points to the query point using a distance measure (e.g., Euclidean or Manhattan).
3. For classification, assign the most common class among the ‘k’ neighbors; for regression, average their values.
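The steps above can be sketched in a few lines of Python. This is a minimal illustration for classification only, not an optimized implementation; the function names (`knn_predict`, `euclidean`, `manhattan`) and the toy data are assumptions made for the example.

```python
import math
from collections import Counter

def euclidean(a, b):
    """Euclidean (straight-line) distance."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    """Manhattan (sum of absolute differences) distance."""
    return sum(abs(x - y) for x, y in zip(a, b))

def knn_predict(train_points, train_labels, query, k=3, distance=euclidean):
    # Step 1: k is chosen by the caller.
    # Step 2: rank every training point by its distance to the query.
    ranked = sorted(
        zip(train_points, train_labels),
        key=lambda pair: distance(pair[0], query),
    )
    nearest_labels = [label for _, label in ranked[:k]]
    # Step 3: majority vote among the k nearest neighbors.
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical toy data: two well-separated groups.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5), (6.0, 6.0), (6.5, 7.0), (7.0, 6.5)]
labels = ["A", "A", "A", "B", "B", "B"]

print(knn_predict(points, labels, (2.0, 2.0), k=3))                      # A
print(knn_predict(points, labels, (6.0, 6.5), k=3, distance=manhattan))  # B
```

Note that every prediction sorts the whole training set, which is why the cost grows with the amount of stored data.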
Advantages:
- Simple to understand and implement.
- Works well with small datasets.
- No explicit training phase; the training data is simply stored.
Disadvantages:
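- Slow at prediction time for large datasets, since the distance to every stored point must be computed.
- Sensitive to irrelevant features and to differences in feature scale.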
- Requires proper selection of ‘k’ for best performance.
Q9. Explain supervised and unsupervised learning in detail with examples and applications.
Supervised Learning:
- Uses labeled data to predict outcomes.
- Example: Spam detection – training data contains emails labeled as spam or not spam.
- Applications: Speech recognition, medical diagnosis, fraud detection.
Unsupervised Learning:
- Works with unlabeled data and finds hidden patterns.
- Example: Customer segmentation – grouping customers based on purchasing behavior.
- Applications: Anomaly detection, recommendation systems, market analysis.
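The contrast between the two settings can be shown with a small sketch. It is only an illustration under simplifying assumptions: the supervised part classifies by the nearest class mean, the unsupervised part is a one-dimensional two-cluster k-means, and all names and numbers (`suspicious_word_counts`, `monthly_spend`, and so on) are hypothetical.

```python
def fit_class_means(values, labels):
    """Supervised: use the labels to compute one mean per class."""
    sums, counts = {}, {}
    for value, label in zip(values, labels):
        sums[label] = sums.get(label, 0.0) + value
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}

def predict(class_means, value):
    """Assign the class whose mean is closest to the value."""
    return min(class_means, key=lambda label: abs(class_means[label] - value))

def two_means(values, iterations=10):
    """Unsupervised: split unlabeled values into two groups (1-D k-means, k=2)."""
    low_center, high_center = min(values), max(values)
    for _ in range(iterations):
        low_group = [v for v in values if abs(v - low_center) <= abs(v - high_center)]
        high_group = [v for v in values if abs(v - low_center) > abs(v - high_center)]
        if not high_group:  # all values identical: nothing to separate
            break
        low_center = sum(low_group) / len(low_group)
        high_center = sum(high_group) / len(high_group)
    return low_group, high_group

# Supervised: each email has a label (spam / not spam).
suspicious_word_counts = [0, 1, 0, 7, 9, 8]
email_labels = ["not spam", "not spam", "not spam", "spam", "spam", "spam"]
means = fit_class_means(suspicious_word_counts, email_labels)
print(predict(means, 6))  # spam
print(predict(means, 2))  # not spam

# Unsupervised: customers have no labels; groups emerge from the data.
monthly_spend = [20, 25, 30, 200, 220, 210]
print(two_means(monthly_spend))  # ([20, 25, 30], [200, 220, 210])
```

The supervised functions need `email_labels` to learn anything, whereas `two_means` receives only the raw values and discovers the grouping itself.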
OR
Q9. What do you mean by data pre-processing? Write down its advantages and
disadvantages.
Data preprocessing is the process of cleaning, transforming, and organizing raw data before
training in order to improve model accuracy and performance.
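As a minimal sketch of two common preprocessing steps, the example below fills missing values with the column mean (cleaning) and then rescales the column to the range 0 to 1 (transforming). The helper names `impute_missing` and `min_max_scale` and the `ages` data are assumptions for illustration; other strategies, such as dropping rows or standardizing, are equally valid choices.

```python
def impute_missing(column):
    """Cleaning: replace None entries with the mean of the known values."""
    known = [v for v in column if v is not None]
    mean = sum(known) / len(known)
    return [mean if v is None else v for v in column]

def min_max_scale(column):
    """Transforming: rescale values linearly so they lie between 0 and 1."""
    low, high = min(column), max(column)
    if high == low:  # constant column: avoid division by zero
        return [0.0 for _ in column]
    return [(v - low) / (high - low) for v in column]

ages = [20, None, 40, 60]         # hypothetical raw column with a missing value
cleaned = impute_missing(ages)    # [20, 40.0, 40, 60]
scaled = min_max_scale(cleaned)   # [0.0, 0.5, 0.5, 1.0]
print(cleaned)
print(scaled)
```

Mean imputation is one example of how preprocessing can lose information: the filled-in value hides the fact that the original entry was unknown.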
Advantages:
- Improves model accuracy.
- Reduces training time.
- Enhances generalization.
Disadvantages:
- Time-consuming.
- May cause loss of useful information.
- Requires domain expertise for proper feature selection.