FUTURE INSTITUTE OF TECHNOLOGY
240, Garia Boral Main Road, Kolkata - 700154 West Bengal
(Affiliated to MAKAUT)
Detailed Report on
Logistic Regression and Maximum Likelihood
Estimation
Submitted as CA2 in
Machine Learning Applications
(PCCAIML601)
for
the partial fulfilment of
B. Tech in
Computer Science and Engineering (AI & ML)
Submitted by:
Eshika Giri
(34230822009)
Submitted on: 12th of March, 2025
Table of Contents
1. Introduction
2. Logistic Regression
o Definition and Importance
o Mathematical Formulation
o Sigmoid Function
o Decision Boundary
3. Maximum Likelihood Estimation (MLE)
o Concept of Likelihood
o Derivation of MLE for Logistic Regression
o Log-Likelihood Function
o Optimization Using Gradient Descent
4. Cost Function for Logistic Regression
5. Regularization in Logistic Regression
o L1 (Lasso) Regularization
o L2 (Ridge) Regularization
6. Advantages of Logistic Regression
7. Limitations of Logistic Regression
8. Applications of Logistic Regression
9. Conclusion
10. References
Abstract
Logistic Regression is a fundamental machine learning algorithm used for binary
classification problems. It models the probability of an event occurring by applying the
logistic (sigmoid) function to a linear combination of input features. The parameters of
logistic regression are optimized using Maximum Likelihood Estimation (MLE), which
ensures the best fit to the data by maximizing the probability of observed outcomes. This
report provides a comprehensive discussion on the mathematical foundations of logistic
regression, the derivation of MLE, optimization techniques, cost functions, regularization
methods, and real-world applications. Additionally, the report highlights the advantages and
limitations of logistic regression in practical scenarios.
Introduction
Logistic Regression is a widely used statistical method for binary classification tasks. It is
particularly effective when the target variable has two possible outcomes, such as 'yes' or 'no',
'spam' or 'not spam', and 'fraudulent' or 'non-fraudulent'. Unlike linear regression, which
predicts continuous values, logistic regression estimates the probability of a particular class
label.
The key idea behind logistic regression is to model the relationship between a set of
independent variables and the probability of a dependent variable belonging to a particular
class. The model uses the sigmoid function to ensure that the output values are constrained
between 0 and 1, making them interpretable as probabilities.
To estimate the parameters of the logistic regression model, Maximum Likelihood
Estimation (MLE) is used. MLE finds the parameter values that maximize the likelihood of
the observed data. Since logistic regression does not have a closed-form solution like linear
regression, optimization techniques such as Gradient Descent are used to find the best
parameter estimates.
Logistic Regression
Definition and Importance
Logistic regression is a supervised learning algorithm used for classification problems where
the dependent variable is categorical. The model predicts the probability of an event
occurring, making it useful in numerous domains, including medical diagnosis, fraud
detection, and marketing.
Mathematical Formulation
For a given set of input features X = (X1, ..., Xn), the logistic regression model is expressed as:

P(Y=1∣X) = 1 / (1 + e^−(β0 + β1X1 + ... + βnXn))

where:
β0, β1, ..., βn are the model parameters (bias and weights),
X1, ..., Xn are the input features,
e is Euler's number (approximately 2.718); the exponential form guarantees that the output lies between 0 and 1.
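As a concrete illustration, the probability above can be computed directly with NumPy. The weights and feature values below are hypothetical, chosen only to show the calculation:

```python
import numpy as np

# Hypothetical parameters and one feature vector, for illustration only.
beta = np.array([-1.5, 0.8, 2.0])   # beta_0 (bias), beta_1, beta_2
x = np.array([1.0, 0.5, 1.2])       # leading 1.0 multiplies the bias term

z = beta @ x                        # linear combination beta_0 + beta_1*X_1 + beta_2*X_2
p = 1.0 / (1.0 + np.exp(-z))        # logistic transform of z
print(p)                            # a value strictly between 0 and 1
```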
Sigmoid Function
The sigmoid function maps the linear combination of input features to a probability:

σ(z) = 1 / (1 + e^−z)

where z = β0 + β1X1 + ... + βnXn is the linear combination of input features. The sigmoid function ensures that output values range between 0 and 1.
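A minimal sketch of the sigmoid and its key properties: the midpoint σ(0) = 0.5, and saturation towards 0 and 1 at the extremes:

```python
import numpy as np

def sigmoid(z):
    """Map any real number to the open interval (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0.0))    # 0.5, the midpoint of the curve
print(sigmoid(10.0))   # close to 1
print(sigmoid(-10.0))  # close to 0
```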
Decision Boundary
The model classifies an instance as 1 if P(Y=1∣X) ≥ 0.5, and as 0 otherwise. The decision boundary is linear in simple logistic regression, but for nonlinear problems, feature transformations can be applied.
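The 0.5 threshold rule can be sketched as follows; the predicted probabilities below are made up for illustration:

```python
import numpy as np

# Hypothetical predicted probabilities P(Y=1|X) for five instances.
probs = np.array([0.12, 0.48, 0.50, 0.73, 0.91])

# Classify as 1 when the probability is at least 0.5, otherwise 0.
labels = (probs >= 0.5).astype(int)
print(labels)   # [0 0 1 1 1]
```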
Maximum Likelihood Estimation (MLE)
Concept of Likelihood
MLE is a statistical technique that estimates parameters by maximizing the likelihood
function, which represents the probability of observed data given the parameters.
Derivation of MLE for Logistic Regression
For logistic regression with training examples (xi, yi), where yi ∈ {0, 1} and pi = P(Y=1∣xi), the likelihood function is given by:

L(β) = ∏i pi^yi (1 − pi)^(1 − yi)

Taking the logarithm gives the log-likelihood function:

ℓ(β) = Σi [ yi log(pi) + (1 − yi) log(1 − pi) ]
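The log-likelihood can be evaluated numerically; the labels and model probabilities below are hypothetical:

```python
import numpy as np

def log_likelihood(y, p):
    """Bernoulli log-likelihood for true labels y and predicted probabilities p."""
    return np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))

# Hypothetical labels and predicted probabilities.
y = np.array([1.0, 0.0, 1.0, 1.0])
p = np.array([0.9, 0.2, 0.8, 0.6])
ll = log_likelihood(y, p)
print(ll)   # negative, since each per-example log-probability is negative
```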
Optimization Using Gradient Descent
1. Compute the gradient of the log-likelihood with respect to the parameters β. For logistic regression, ∂ℓ/∂βj = Σi (yi − pi) xij.
2. Use the update rule:
βj := βj + α · ∂ℓ/∂βj
where α is the learning rate (equivalently, gradient descent on the negative log-likelihood).
3. Iterate until convergence.
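Putting these steps together, a minimal gradient-ascent fit on a tiny synthetic dataset (a sketch, not a production implementation) might look like:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Tiny synthetic dataset: an intercept column plus one feature.
X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 1.0], [1.0, 2.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])

beta = np.zeros(X.shape[1])
alpha = 0.1                            # learning rate

for _ in range(5000):                  # iterate until (approximate) convergence
    p = sigmoid(X @ beta)              # predicted probabilities
    gradient = X.T @ (y - p) / len(y)  # gradient of the mean log-likelihood
    beta += alpha * gradient           # step uphill on the log-likelihood

preds = (sigmoid(X @ beta) >= 0.5).astype(int)
print(preds)   # [0 0 1 1]: the fitted model recovers the training labels
```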
Cost Function for Logistic Regression
The cost function is derived from the negative log-likelihood; averaged over n training examples it is the binary cross-entropy:

J(β) = −(1/n) Σi [ yi log(pi) + (1 − yi) log(1 − pi) ]

Minimizing this function yields the optimal parameter values.
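A sketch of this cost, showing that confident correct predictions yield a lower cost than poor ones; all labels and probabilities here are illustrative:

```python
import numpy as np

def cost(y, p):
    """Binary cross-entropy: the negative mean Bernoulli log-likelihood."""
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

# Hypothetical labels with two candidate sets of predicted probabilities.
y = np.array([1.0, 0.0, 1.0])
good = cost(y, np.array([0.9, 0.1, 0.8]))   # confident, correct predictions
bad = cost(y, np.array([0.4, 0.6, 0.3]))    # poor predictions
print(good < bad)                           # the better fit has the lower cost
```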
Regularization in Logistic Regression
To prevent overfitting, regularization techniques are used:
L1 (Lasso) Regularization: adds an absolute-value penalty, λ Σ |βj|, to the cost, shrinking some coefficients exactly to zero and thus performing implicit feature selection.
L2 (Ridge) Regularization: adds a squared penalty, λ Σ βj², to the cost, discouraging large coefficient values.
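A sketch of how an L2 penalty enters the cost; the strength lam (λ) and the weights are illustrative, and the intercept is conventionally left unpenalized:

```python
import numpy as np

def regularized_cost(y, p, beta, lam):
    """Cross-entropy plus an L2 (ridge) penalty on the non-intercept weights."""
    ce = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
    return ce + lam * np.sum(beta[1:] ** 2)

# L1 (lasso) would instead add lam * np.sum(np.abs(beta[1:])),
# which can drive some weights exactly to zero.

y = np.array([1.0, 0.0, 1.0])
p = np.array([0.8, 0.3, 0.7])
beta = np.array([0.5, 2.0, -1.0])
print(regularized_cost(y, p, beta, lam=0.0))   # plain cross-entropy
print(regularized_cost(y, p, beta, lam=0.1))   # penalized cost is strictly larger
```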
Advantages of Logistic Regression
1. Simple and Interpretable: Provides clear probability estimates.
2. Efficient for Binary Classification: Performs well when classes are linearly
separable.
3. Less Prone to Overfitting: Simpler than deep learning models.
4. Probability Outputs: Unlike SVM, it provides probability scores.
Limitations of Logistic Regression
1. Limited to Linear Decision Boundaries: Cannot handle complex decision
boundaries without transformations.
2. Sensitive to Outliers: Outliers can impact model performance.
3. Struggles with High-Dimensional Data: Feature selection is necessary.
Applications of Logistic Regression
1. Medical Diagnosis: Predicting diseases (e.g., diabetes detection).
2. Spam Detection: Classifying emails as spam or not spam.
3. Credit Scoring: Assessing loan eligibility.
4. Marketing: Predicting customer purchase behavior.
5. Fraud Detection: Identifying fraudulent transactions.
Conclusion
Logistic regression is a powerful classification algorithm, particularly for binary
classification problems. Maximum Likelihood Estimation plays a crucial role in optimizing
model parameters. While logistic regression is simple and effective, it has limitations that
must be addressed using techniques such as feature engineering and regularization.
References
1. ScienceDirect. (n.d.). Logistic Regression Analysis. Retrieved from:
https://www.sciencedirect.com/topics/medicine-and-dentistry/logistic-regression-analysis
2. GeeksforGeeks. (n.d.). Understanding Logistic Regression. Retrieved from:
https://www.geeksforgeeks.org/understanding-logistic-regression/