ML Assignment KV2

Logistic Regression is a widely used supervised machine learning algorithm for predicting binary outcomes based on input variables. It employs the sigmoid function to map predictions to probabilities and has several variants, including binary, multinomial, and ordinal logistic regression. This document also discusses its mathematical foundation, evaluation metrics, advantages, limitations, and real-world applications.

1. Introduction

Logistic Regression is one of the most popular algorithms in supervised machine learning. It is used to predict the probability of a binary outcome, such as yes/no, true/false, or success/failure, based on input variables.

Despite its simplicity, logistic regression is widely used in domains such as medical diagnosis, marketing, and the social sciences. This assignment explores logistic regression in depth: its theory, use cases, evaluation, and coding.

2. What is Logistic Regression?

Logistic Regression is a statistical method used for binary classification problems. It estimates the probability that an instance belongs to a certain class.

While linear regression predicts continuous outcomes, logistic regression maps predictions to probabilities using the sigmoid function and classifies the result as 0 or 1 based on a decision boundary (usually 0.5).
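As a minimal illustration of this classification rule, the sketch below fits a binary logistic regression with scikit-learn; the data (hours studied vs. pass/fail) is made up for demonstration only.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# One feature (e.g. hours studied), binary label (fail = 0, pass = 1)
X = np.array([[1], [2], [3], [4], [5], [6], [7], [8]])
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])

model = LogisticRegression()
model.fit(X, y)

# predict() applies the default 0.5 decision boundary to the
# probability returned by predict_proba()
print(model.predict([[2]]))          # low hours -> class 0 on this data
print(model.predict([[7]]))          # high hours -> class 1 on this data
print(model.predict_proba([[4.5]]))  # probabilities near the boundary
```

Here `predict_proba` exposes the underlying probability, while `predict` simply thresholds it at 0.5.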

3. Mathematical Foundation

In logistic regression, we compute a weighted sum of the input features:

z = b_0 + b_1 x_1 + b_2 x_2 + \dots + b_n x_n

We then apply the sigmoid function:

\sigma(z) = \frac{1}{1 + e^{-z}}

This converts the output into a probability between 0 and 1. If the result is ≥ 0.5, we classify the instance as 1 (the positive class); otherwise as 0.
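The two equations above can be traced step by step in NumPy; the weights b_0, b_1, b_2 and the inputs x_1, x_2 below are made-up illustrative values.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

b0 = -1.0                      # intercept b_0
b = np.array([0.5, 2.0])       # weights b_1, b_2
x = np.array([1.0, 0.75])      # input features x_1, x_2

z = b0 + np.dot(b, x)          # weighted sum: z = b_0 + b_1*x_1 + b_2*x_2
p = sigmoid(z)                 # probability between 0 and 1

label = 1 if p >= 0.5 else 0   # classify with the 0.5 threshold
print(z, p, label)             # z = 1.0, p ≈ 0.731, label = 1
```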

4. Sigmoid Function & Decision Boundary

The sigmoid function is:

\sigma(z) = \frac{1}{1 + e^{-z}}

At z = 0, output is 0.5

As z → ∞, output → 1

As z → -∞, output → 0

The output probability is used to classify inputs. The decision boundary is the set of inputs where the predicted probability equals the threshold (usually 0.5); the model assigns class 1 on one side of it and class 0 on the other.
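The three limiting properties listed above can be checked directly:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0))      # exactly 0.5 at z = 0
print(sigmoid(20))     # approaches 1 as z grows large
print(sigmoid(-20))    # approaches 0 as z -> -infinity
```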

5. Types of Logistic Regression

1. Binary Logistic Regression:

Used when the output has two categories.

Example: Is a customer likely to buy? Yes or No.

2. Multinomial Logistic Regression:

Used when the outcome has more than two unordered categories.

Example: Classifying animals as Dog, Cat, or Rabbit.

3. Ordinal Logistic Regression:

Used when the outcome has ordered categories.

Example: Rating a product as Poor, Average, Good, Excellent.
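The multinomial case can be sketched with scikit-learn, which handles more than two classes out of the box; the dataset below is synthetic, generated only to show the shape of the fitted model.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Synthetic three-class problem with four features
X, y = make_classification(n_samples=300, n_features=4, n_informative=3,
                           n_redundant=0, n_classes=3, random_state=0)

clf = LogisticRegression(max_iter=1000)
clf.fit(X, y)

# Multi-class fits learn one coefficient row per class
print(clf.coef_.shape)     # (3, 4): 3 classes x 4 features
print(clf.predict(X[:5]))  # predicted class labels 0, 1, or 2
```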

6. Assumptions of Logistic Regression

1. The dependent variable is binary (or ordinal, for ordinal logistic regression).

2. There is a linear relationship between the independent variables and the log-odds of the outcome.

3. Observations are independent of each other.

4. There is little or no multicollinearity among the independent variables.

5. A reasonably large sample size is needed for stable, accurate estimates.

7. Applications in Real Life

Medical: Predicting if a patient has a disease.

Finance: Classifying if a transaction is fraudulent.

Marketing: Customer segmentation (likely to purchase or not).

Social Media: Spam detection.

Education: Predicting student dropout.


8. Logistic Regression vs Linear Regression

Feature        | Linear Regression        | Logistic Regression
Output Type    | Continuous               | Probability (0–1)
Used For       | Regression problems      | Classification problems
Function Used  | Linear function          | Sigmoid function
Output Range   | -∞ to +∞                 | 0 to 1
Cost Function  | Mean Squared Error (MSE) | Log Loss / Cross-Entropy

9. Evaluation Metrics

1. Accuracy:

\frac{TP + TN}{TP + TN + FP + FN}

2. Precision:

\frac{TP}{TP + FP}

3. Recall (Sensitivity):

\frac{TP}{TP + FN}

4. F1-Score:

The harmonic mean of precision and recall: \frac{2 \cdot Precision \cdot Recall}{Precision + Recall}

5. Confusion Matrix:

A matrix showing the counts of True Positives, False Positives, True Negatives, and False Negatives.
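These metrics are available in scikit-learn; the sketch below applies them to a small hand-made example (six labels, chosen so the counts TP = 2, FN = 1, TN = 2, FP = 1 are easy to verify by hand).

```python
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score, confusion_matrix)

y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 1]   # TP=2, FN=1, TN=2, FP=1

print(confusion_matrix(y_true, y_pred))   # [[TN, FP], [FN, TP]]
print(accuracy_score(y_true, y_pred))     # (2+2)/6 ≈ 0.667
print(precision_score(y_true, y_pred))    # 2/(2+1) ≈ 0.667
print(recall_score(y_true, y_pred))       # 2/(2+1) ≈ 0.667
print(f1_score(y_true, y_pred))           # harmonic mean ≈ 0.667
```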

10. Regularization in Logistic Regression

Regularization helps prevent overfitting:

L1 Regularization (Lasso):

Can shrink some weights to exactly zero, performing implicit feature selection.

L2 Regularization (Ridge):

Shrinks all weights toward zero but keeps all features.

The cost function with L2 regularization becomes:

Loss = -\sum [y \log(p) + (1 - y) \log(1 - p)] + \lambda \sum w^2

where λ is the regularization strength.
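A quick sketch of both penalties in scikit-learn; note that scikit-learn's `C` parameter is the *inverse* of λ (smaller C means stronger regularization), and the dataset below is synthetic.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# 10 features, only a few informative; the rest are noise
X, y = make_classification(n_samples=200, n_features=10, n_informative=3,
                           random_state=0)

# A strong L1 penalty (small C) can drive some weights to exactly zero
l1 = LogisticRegression(penalty='l1', solver='liblinear', C=0.1).fit(X, y)
# An L2 penalty shrinks weights but typically keeps all of them nonzero
l2 = LogisticRegression(penalty='l2', C=0.1).fit(X, y)

print(np.sum(l1.coef_ != 0), "nonzero weights with L1")
print(np.sum(l2.coef_ != 0), "nonzero weights with L2")
```

On data like this, L1 usually retains fewer nonzero weights than L2, which is why it is described as performing feature selection.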

11. Real-World Projects

Customer Churn Prediction:

Predict whether a user will stop using a telecom service.

Loan Approval:

Classify if a loan application should be approved.

Email Spam Classifier:

Identify if an email is spam or not.

Heart Disease Prediction:

Predict if a person has heart disease based on medical data.


12. Advantages & Limitations

Advantages:

Simple and easy to implement.

Efficient for binary classification.

Interpretable output.

Works well with linearly separable data.

Limitations:

Poor performance on non-linear data.

Assumes linear relationship with log-odds.

Sensitive to irrelevant features and outliers.

13. Conclusion

Logistic regression is a foundational classification technique in machine learning. It is widely used due to its simplicity, efficiency, and ability to handle binary outcomes effectively. With proper evaluation and preprocessing, logistic regression can produce strong results in real-world applications across industries.

14. References

1. Alpaydin, Ethem. Introduction to Machine Learning. MIT Press.

2. Scikit-learn official documentation: https://scikit-learn.org

3. Ng, Andrew. Machine Learning course, Coursera.

4. Analytics Vidhya.

5. GeeksforGeeks, Logistic Regression tutorial.

