
Modeling Techniques, Regression
Mathematical Modeling
● To explore how mathematical models represent real-world phenomena and
contribute to solving problems in data science.
● Mathematical modeling is a fundamental approach to representing real-world
phenomena through mathematical expressions, enabling the prediction,
analysis, and optimization of complex systems.
● In data science and machine learning, it serves as the backbone for
designing algorithms and generating insights from data.
● It transforms complex phenomena into a simplified, understandable, and
computable form.
● Models allow for prediction, optimization, and understanding of systems across various disciplines. They help identify relationships and patterns in data, enabling informed decision-making and problem-solving.
Cont.

1. Problem Formulation and Data Collection


○ Problem Formulation: Clearly define the problem or phenomenon to model.
i. Example: Predicting sales based on seasonal demand.
○ Data Collection: Gather relevant, accurate, and sufficient data to inform the model.
i. Example: Historical sales data, market trends, and economic indicators.
2. Model Selection and Assumptions
○ Model Selection: Choose the most appropriate model type for the problem.
i. Example: Linear regression for relationships, differential equations for dynamic systems.
○ Assumptions: Identify and document assumptions to simplify the model while maintaining its
relevance.
i. Example: Assuming a constant rate of growth in a population model.
Cont.

3. Model Implementation, Validation, and Evaluation


○ Implementation: Translate the mathematical structure into a computational framework or
algorithm.
i. Tools: Python, MATLAB, or R.
○ Validation: Compare model predictions with real-world data to assess its accuracy.
i. Techniques: Use test datasets or cross-validation methods.
○ Evaluation: Analyze the model's performance and refine it if necessary.
i. Metrics: Mean Squared Error (MSE), R-squared, or likelihood functions.
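A minimal sketch of the validation and evaluation step in Python, assuming scikit-learn is available; the dataset and model below are illustrative, not taken from the text.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.metrics import mean_squared_error, r2_score

# Hypothetical dataset: 200 samples, 3 features, linear signal plus noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(scale=0.3, size=200)

# Validation: hold out a test set and compare predictions with unseen observations.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
model = LinearRegression().fit(X_train, y_train)
pred = model.predict(X_test)

# Evaluation: quantify performance with MSE and R-squared.
print("MSE:", mean_squared_error(y_test, pred))
print("R-squared:", r2_score(y_test, pred))

# Cross-validation: average score over 5 folds of the data.
print("5-fold R-squared:", cross_val_score(LinearRegression(), X, y, cv=5).mean())
```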
Types of Models
Deterministic Models
● Deterministic models are systems where the outcomes are completely determined
by the input data and parameters, without any randomness or probabilistic
elements.
● These models operate under strict rules or equations, ensuring that the same
inputs always produce the same outputs. This characteristic makes deterministic
models highly predictable and reliable.
● In machine learning preprocessing, deterministic transformations such as scaling, encoding, or normalizing data ensure consistent input for algorithms.
● Deterministic models also form the foundation for simple AI systems, such as rule-based chatbots that provide predefined responses.
Cont.
● Deterministic models rely on fixed relationships between variables, often
derived from established rules or mathematical equations.
● They assume that all relevant factors can be fully captured in the model,
leaving no room for uncertainty.
Example: Weather Prediction
● Estimating temperature trends based on historical seasonal data.
● Inputs: Time of year, geographical location, historical averages.
● Rule: Temperature follows a sinusoidal pattern based on the Earth's orbit.
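A minimal sketch of this deterministic rule in Python; the mean temperature, amplitude, and phase are illustrative assumptions, not values from the text.

```python
import math

def predict_temperature(day_of_year, mean_temp=15.0, amplitude=10.0):
    """Deterministic rule: temperature follows a sinusoidal pattern over the year.
    The inputs fully determine the output; there is no random component."""
    # Phase chosen so the peak falls around mid-July (illustrative, Northern Hemisphere).
    return mean_temp + amplitude * math.sin(2 * math.pi * (day_of_year - 105) / 365)

# The same inputs always produce the same output -- the defining property of a deterministic model.
print(predict_temperature(196))
print(predict_temperature(196))  # identical result
```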
Cont.
The strengths of this modeling approach are:
● Predictability: Outcomes are consistent and repeatable for the same inputs.
● Simplicity: Easy to design and interpret due to their rule-based nature.
● Transparency: Clearly defined rules make it easy to audit and justify
decisions.
Limitations of Deterministic Models
● Lack of Flexibility: Cannot handle randomness or uncertainty effectively.
● Dependence on Complete Data: Assumes all influencing factors are known
and measurable.
Stochastic Models

● Stochastic models are systems that incorporate elements of randomness or probability to represent uncertainty and variability in data or outcomes.
● Unlike deterministic models, stochastic models recognize that real-world
processes often involve inherent randomness, which is reflected in their
predictions.
● Outputs are not fixed and may vary even with identical inputs due to
probabilistic components.
○ Example: Predicting customer churn based on historical behavior and external factors
introduces uncertainty.
● Stochastic models use probability distributions to describe outcomes.
○ Example: Using a Gaussian distribution to model the variability in product demand.
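A minimal sketch of the Gaussian demand example; the mean and standard deviation are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)

def simulate_weekly_demand(n_weeks, mean=500.0, std=50.0):
    """Stochastic model: weekly demand drawn from a Gaussian distribution."""
    return rng.normal(loc=mean, scale=std, size=n_weeks)

# Identical inputs, different outputs: the probabilistic component drives the variability.
print(simulate_weekly_demand(5))
print(simulate_weekly_demand(5))
```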
Cont.
● One of the best-known stochastic models is the Markov chain. Models of this type capture uncertain real-world conditions that can affect a prediction, making the output random even for the same inputs.
Strengths of Stochastic Models
● Handling Uncertainty: Ideal for systems with inherent randomness, like weather or
customer behavior.
● Realism: Reflects the variability and unpredictability of real-world processes.
● Flexibility: Can adapt to dynamic systems where inputs and conditions change
frequently.
Cont.

Limitations of Stochastic Models

● Complexity: Requires advanced mathematical understanding and computation.
● Data Dependency: Heavily reliant on high-quality data to estimate
probabilities accurately.
● Uncertainty in Predictions: Provides ranges or probabilities instead of fixed
outputs, which might be less actionable in some contexts.
Empirical Models
● Empirical models are data-driven approaches that rely on observed data to derive
patterns, relationships, and predictions.
● The model is constructed from historical or real-time data. Patterns and relationships are inferred directly from the data rather than from predefined equations or rules.
● Empirical models can generalize to new, unseen data when trained effectively.
● Often lack interpretability, as the focus is on accurate predictions rather than
understanding the process.
○ Example: Neural networks can predict outcomes but may not explain why or how a decision was
made.
● Algorithms that follow the empirical modeling approach include Support Vector Machines (SVMs) and Random Forests.
Cont.
Strengths of Empirical Models

● Versatility: Can be applied to various domains, such as finance, healthcare, and e-commerce.
● Data Adaptability: Models improve as more data becomes available, leading to better accuracy.
● Automation: Empirical models can automate complex tasks like anomaly detection or
recommendation systems.

Challenges of Empirical Models

● Data Dependence: Requires extensive and high-quality data for training. Poor data quality can lead
to inaccurate predictions.
● Overfitting: Empirical models may perform well on training data but fail to generalize to unseen data
if not properly regularized.
● Interpretability: Complex models like neural networks are often "black boxes," making it difficult to
explain decisions.
Linear regression

● Linear regression is a statistical method that models the relationship between a dependent variable and one or more independent variables using a linear equation.
● It is one of the simplest and most widely used techniques in predictive
modeling. The goal is to fit a linear equation to the observed data and use this
equation to predict values of the dependent variable based on new
observations of the independent variables.
● Predicting continuous outcomes like house prices, stock prices, etc.
Cont.
Equation: y = β0 + β1x + ϵ

y: Dependent variable

x: Independent variable

β0: Intercept

β1: Slope

ϵ: Error term

Explanation: The equation represents a straight line that best fits the data points. The intercept (β0) is the value of y when x is 0, and the slope (β1) represents the change in y for a one-unit change in x.
Cont.
Slope (β1): Change in the dependent variable for a one-unit change in the
independent variable.
Intercept (β0): Value of the dependent variable when the independent variable is
zero.
Understanding the coefficients helps interpret the model and the relationship
between variables. The slope indicates the strength and direction of the
relationship.
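A minimal sketch of fitting the equation above in Python, assuming scikit-learn; the data are synthetic, generated from y = 3 + 2x plus noise, so the recovered coefficients should sit near those values.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data generated from y = 3 + 2x + noise (illustrative values).
rng = np.random.default_rng(1)
x = rng.uniform(0, 10, size=(100, 1))
y = 3.0 + 2.0 * x[:, 0] + rng.normal(scale=1.0, size=100)

model = LinearRegression().fit(x, y)
print("Intercept (beta0):", model.intercept_)  # value of y when x = 0
print("Slope (beta1):", model.coef_[0])        # change in y per one-unit change in x
```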
Regularization
● Regularization is a technique used to address overfitting by penalizing overly
complex models. It helps in enhancing the generalization ability of models,
ensuring they perform well on unseen data.
● A method to constrain or shrink model coefficients to prevent overfitting.
Introduces a penalty for large coefficients, encouraging simpler, more
interpretable models.
● Prevents Overfitting: Reduces the model's ability to capture noise in the
training data.
● Improves Generalization: Ensures the model performs well on test or real-
world data.
● Stabilizes Models: Helps avoid extreme fluctuations in predictions for small
changes in input.
Types of Regularization Techniques

Lasso Regression (Least Absolute Shrinkage and Selection Operator)

● Penalty Added: L1-norm penalty, represented as λ∑|wi|, where the wi are the model coefficients.
● Forces some coefficients to become exactly zero, effectively performing
feature selection.
● Simplifies models by eliminating irrelevant features.
● This regularization is used when there are many irrelevant or redundant
features in the dataset.
Cont.

Ridge Regression

● Penalty Added: L2-norm penalty, represented as λ∑wi², where the wi are the model coefficients.
● Shrinks all coefficients toward zero but does not eliminate them entirely.
● Useful for managing multicollinearity and stabilizing predictions.
Cont.

Elastic Net

● Penalty Added: Combines the L1-norm and L2-norm penalties:

λ1∑|wi| + λ2∑wi²

● Balances the benefits of Lasso (feature selection) and Ridge (shrinkage).


● Useful when features are highly correlated or when there are many irrelevant
features.
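A minimal sketch contrasting the three penalties with scikit-learn; the data and the alpha values (the λ penalty strengths) are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge, ElasticNet

# Synthetic data: only the first two of ten features actually matter.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = 4 * X[:, 0] - 3 * X[:, 1] + rng.normal(scale=0.5, size=200)

lasso = Lasso(alpha=0.1).fit(X, y)                    # L1: drives irrelevant coefficients to exactly zero
ridge = Ridge(alpha=1.0).fit(X, y)                    # L2: shrinks coefficients but keeps them non-zero
enet = ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y)  # mix of L1 and L2 penalties

print("Lasso non-zero coefficients:", int(np.sum(lasso.coef_ != 0)))
print("Ridge coefficients:", np.round(ridge.coef_, 2))
print("Elastic Net non-zero coefficients:", int(np.sum(enet.coef_ != 0)))
```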
Logistic regression

● Logistic regression is a statistical method used for binary classification problems, where the outcome is a binary variable (e.g., yes/no, true/false).
● Unlike linear regression, logistic regression predicts the probability that a
given input point belongs to a certain class.
● Predicting outcomes like spam vs. not spam, disease presence, etc.
Cont.

● Logistic function (sigmoid function)


● The logistic function maps any real-valued number into the range (0, 1),
making it suitable for probability estimation. The model predicts the probability
that the dependent variable Y equals 1 (i.e., belongs to the positive class).
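● In symbols, the sigmoid is σ(z) = 1 / (1 + e^(−z)); with a single predictor the model estimates P(Y=1 | x) = 1 / (1 + e^(−(β0 + β1x))).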
Cont.

● Binary dependent variable: The dependent variable has only two possible
outcomes, typically coded as 0 and 1.
● Independent observations: Observations in the dataset must be
independent, meaning the outcome of one observation does not influence
another.
● No multicollinearity: Independent variables should not be highly correlated
with each other, as this can make the model's coefficients unstable.
● Large sample size: Logistic regression requires a relatively large sample
size to provide reliable and stable estimates of the model parameters.
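A minimal sketch of binary classification with scikit-learn; the data are synthetic, generated so that the probability of the positive class rises with x.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Synthetic binary data: P(Y=1) increases with x (illustrative).
rng = np.random.default_rng(2)
x = rng.uniform(0, 10, size=(300, 1))
p_true = 1 / (1 + np.exp(-(x[:, 0] - 5)))
y = rng.binomial(1, p_true)

model = LogisticRegression().fit(x, y)
# predict_proba returns [P(Y=0), P(Y=1)] per sample; column 1 is the positive class.
print("P(Y=1 | x=7):", model.predict_proba([[7.0]])[0, 1])
print("Predicted class at x=7:", model.predict([[7.0]])[0])
```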
Building Decision Models with Multi-Criteria Decision Making (MCDM)

● Multi-Criteria Decision Making (MCDM) refers to a set of techniques or frameworks used to evaluate, prioritize, and choose between multiple alternatives, especially when decisions involve trade-offs among conflicting criteria.
● It helps decision-makers arrive at optimal or near-optimal solutions in complex
scenarios by structuring the decision process and incorporating multiple
viewpoints.
● In real-world scenarios, decisions are rarely based on a single factor. For
example, selecting a supplier may involve considering cost, quality, delivery
speed, and sustainability, which often conflict. MCDM provides systematic
methods to evaluate such trade-offs.
Cont.
Decision Space
● The decision space refers to the collection of all possible solutions or alternatives
available for evaluation.
● Helps define the boundaries and scope of the decision-making process.
Criteria Weighting
● Criteria weighting involves assigning relative importance to each criterion based
on its significance to the overall decision.
● Ensures that more critical factors have a greater influence on the final decision.
○ Expert Judgment: Decision-makers rank or rate criteria based on experience or preferences.
○ Pairwise Comparisons: Methods like the Analytic Hierarchy Process (AHP) systematically compare
criteria to assign weights.
Cont.
Scoring and Ranking
● Each alternative is scored based on its performance for each criterion, and
the scores are aggregated to rank the alternatives.
Steps:
1. Evaluate each alternative against the criteria.
2. Multiply the criterion score by its weight.
3. Sum up the weighted scores for each alternative.
● Provides a clear ranking of alternatives, allowing decision-makers to identify
the most suitable choice.
Popular MCDM Methods

1. Analytic Hierarchy Process (AHP):

◦ Breaks a decision problem into a hierarchy of sub-problems.

◦ Steps:

▪ Define the criteria and alternatives.

▪ Compare criteria pairwise to establish weights.

▪ Score alternatives and calculate an overall ranking.


Cont.
2. Weighted Sum Model (WSM):
◦ Calculates a weighted score for each alternative:

Si = ∑j wj · xij

◦ Where wj is the weight of criterion j, and xij is the performance of alternative i on criterion j.
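A minimal sketch of the Weighted Sum Model; the supplier-selection criteria, weights, and scores below are illustrative assumptions.

```python
import numpy as np

# Criteria: cost, quality, delivery speed; weights w_j sum to 1 (illustrative).
weights = np.array([0.5, 0.3, 0.2])

# Performance x_ij of each alternative i on each criterion j (illustrative 1-10 scores).
scores = np.array([[7, 9, 6],   # Supplier A
                   [8, 6, 9],   # Supplier B
                   [6, 8, 8]])  # Supplier C

# S_i = sum_j w_j * x_ij
wsm_scores = scores @ weights
best = int(np.argmax(wsm_scores))
print("WSM scores:", wsm_scores)
print("Best alternative:", ["Supplier A", "Supplier B", "Supplier C"][best])
```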
Taylor Polynomials

● A Taylor polynomial is an approximation of a function 𝑓(𝑥) using a finite number of terms from its Taylor series expansion.
● It is a way to approximate a function near a specific point, typically around
𝑥=a, by using the values of the function and its derivatives at that point.
● Given a function 𝑓(𝑥) that is sufficiently differentiable at a point 𝑎, the 𝑛-th
degree Taylor polynomial 𝑃𝑛(𝑥) of 𝑓(𝑥) around 𝑥=𝑎 is:
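Pn(x) = f(a) + f′(a)(x − a) + (f′′(a)/2!)(x − a)² + ⋯ + (f⁽ⁿ⁾(a)/n!)(x − a)ⁿ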
Cont.

● f(a): Function value at 𝑎.


● 𝑓′(𝑎): First derivative of the function evaluated at 𝑎
● 𝑓′′(𝑎): Second derivative of the function evaluated at 𝑎
● 𝑓(𝑛)(𝑎):𝑛-th derivative of the function evaluated at 𝑎
● (𝑥−𝑎): The difference between the point of approximation 𝑥 and the base point
𝑎.
● 𝑛!: Factorial of 𝑛.
Cont.
Properties and Significance:

Convergence:
The Taylor polynomial provides a good approximation of the function near the point 𝑎. The more terms
you include in the expansion, the more accurate the approximation, especially for smooth functions.
Degree of Approximation:

The degree 𝑛 of the Taylor polynomial determines how many derivatives of the function are taken into
account, with higher-degree polynomials offering better approximations for a larger range of 𝑥.

Error Estimate (Lagrange Remainder):

The error between the Taylor polynomial and the actual function can be bounded by the Lagrange
remainder term, which provides an estimate of how much the Taylor polynomial deviates from the actual
function for a given 𝑛.
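A minimal sketch approximating e^x with Taylor polynomials of increasing degree around a = 0; the choice of function is illustrative.

```python
import math

def taylor_exp(x, n, a=0.0):
    """n-th degree Taylor polynomial of e^x around a.
    Every derivative of e^x equals e^a at the base point, so term k is e^a * (x - a)^k / k!."""
    return sum(math.exp(a) * (x - a) ** k / math.factorial(k) for k in range(n + 1))

# Higher degree n -> smaller error near the expansion point, as the properties above describe.
for n in (1, 2, 4, 8):
    approx = taylor_exp(1.0, n)
    print(f"n={n}: approximation={approx:.6f}, error={abs(math.e - approx):.2e}")
```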
Dividing and Conquering with Bisection Methods

● The bisection method is a numerical technique used to find the roots of a continuous function. It works by repeatedly halving an interval and selecting the subinterval that contains the root. This method is based on the intermediate value theorem, which ensures that a root exists between two points if the function values at those points have opposite signs.
● Also called the interval halving method, the binary search method, or the dichotomy method, it is based on Bolzano's theorem for continuous functions.
Cont.

● The bisection method looks for the value c at which the plot of the function f crosses the x-axis. This value of c is an approximation of the root of the function f(x). How close c gets to the real root depends on the tolerance we set for the algorithm.
Cont.

Algorithm Steps:

1. Identify Interval:
a. Choose an interval [𝑎,𝑏] such that 𝑓(𝑎)⋅𝑓(𝑏)<0. This condition ensures that the function has a
root in the interval, as the function changes sign between 𝑎 and 𝑏.
2. Compute Midpoint:
a. Calculate the midpoint 𝑚 of the interval [𝑎,𝑏]
Cont.

3. Evaluate:
a. Evaluate the function at the midpoint 𝑓(𝑚).
b. If 𝑓(𝑚)=0, then 𝑚 is the root.
c. If 𝑓(𝑎)⋅𝑓(𝑚)<0, the root lies in the left subinterval [𝑎,𝑚].
d. If 𝑓(𝑚)⋅𝑓(𝑏)<0, the root lies in the right subinterval [𝑚,𝑏].
4. Repeat:
a. Narrow the interval by selecting the subinterval where the root lies. Repeat the process until
the desired level of precision is achieved, i.e., until the difference between 𝑎 and 𝑏 is
sufficiently small.
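A minimal sketch of the algorithm just described; the tolerance, iteration cap, and example function are illustrative choices.

```python
def bisection(f, a, b, tol=1e-6, max_iter=100):
    """Approximate a root of the continuous function f on [a, b], assuming f(a)*f(b) < 0."""
    if f(a) * f(b) >= 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    for _ in range(max_iter):
        m = (a + b) / 2.0                     # step 2: midpoint of the current interval
        if f(m) == 0 or (b - a) / 2.0 < tol:  # step 3b / stopping criterion
            return m
        if f(a) * f(m) < 0:
            b = m                             # root lies in the left subinterval [a, m]
        else:
            a = m                             # root lies in the right subinterval [m, b]
    return (a + b) / 2.0

# Example: the root of x^2 - 2 on [1, 2] is sqrt(2) ≈ 1.414214
print(bisection(lambda x: x * x - 2, 1.0, 2.0))
```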
Predicting the future with Markov chains

● A Markov Chain is a mathematical model that describes a system where the future state depends only on the current state, and not on the sequence of events that preceded it. This property is known as the Markov Property, which can be summarized as:
● Markov Assumption: current unobservable state depends on a finite number
of past states
● The Markov property states that the transition probability depends only on the
current state and not on the sequence of events that led to it.
Cont.

● i.e.: Xt depends on some previous Xis


● First-order Markov process: current state depends only on the previous state,
● i.e.: P(Xt|X0:t-1) = P(Xt|Xt-1)
● kth order: depends on previous k time steps
● Sensor Markov assumption:
○ Observable variables depend only on the current state (by definition, essentially), these are
the “sensors”.
○ The current state causes the sensor values.

P(Et|X0:t, E0:t-1) = P(Et|Xt)


Cont.

● Assume stationary process: transition model P(Xt|Xt-1) and sensor model P(Et | Xt) are the same for all t
● In a stationary process, the changes in the world state are governed by laws
that do not themselves change over time
● The laws of probability don’t change over time
Cont.
Steps for Using Markov Chains to Predict the Future:

● Define the States:


○ Identify all possible states in the system. For example, in a weather prediction model, the states could be "Sunny,"
"Cloudy," and "Rainy."
● Construct the Transition Matrix:
○ Create a transition matrix that defines the probabilities of transitioning from one state to another. Each row represents
the current state, and each column represents the possible next states.
● Initial State Distribution:
○ Define the initial distribution of the states, i.e., the probability of starting in each state. This can be represented as a
vector 𝜋0.
● Predict Future States:
○ To predict the future state, multiply the initial state distribution by the transition matrix. This gives the probability
distribution of the next state.

πt+1 = πt · P
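A minimal sketch of these steps for the weather example; the transition probabilities and initial distribution are illustrative assumptions.

```python
import numpy as np

states = ["Sunny", "Cloudy", "Rainy"]

# Transition matrix P: row = current state, column = next state (each row sums to 1).
P = np.array([[0.7, 0.2, 0.1],
              [0.3, 0.4, 0.3],
              [0.2, 0.4, 0.4]])

# Initial distribution pi_0: start from a sunny day with certainty.
pi = np.array([1.0, 0.0, 0.0])

# Apply pi_{t+1} = pi_t . P repeatedly to look several steps ahead.
for t in range(1, 4):
    pi = pi @ P
    print(f"Day {t}:", dict(zip(states, np.round(pi, 3))))
```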
Dimensionality Reduction: PCA and SVD

● Dimensionality reduction is the process of reducing the number of variables (features) in a dataset while retaining as much relevant information as possible.
● It is crucial for:
○ Simplifying data visualization.
○ Enhancing computational efficiency.
○ Reducing noise and redundancy in data.
● Two popular techniques for dimensionality reduction are Principal Component
Analysis (PCA) and Singular Value Decomposition (SVD). Both are grounded
in linear algebra and are widely used in machine learning, data science, and
statistics.
Principal Component Analysis (PCA):

● PCA transforms a dataset into a new coordinate system by finding directions (principal components) that capture the maximum variance in the data.
● The principal components are linear combinations of the original features. The first principal component captures the largest variance, the second captures the next largest variance orthogonal to the first, and so on.
● PCA seeks to maximize the variance captured in lower dimensions to
preserve as much information as possible.
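A minimal sketch of PCA with scikit-learn; the synthetic dataset below (five correlated features built from two underlying factors) is an illustrative assumption.

```python
import numpy as np
from sklearn.decomposition import PCA

# Synthetic data: 5 features that are noisy mixtures of 2 underlying factors.
rng = np.random.default_rng(0)
factors = rng.normal(size=(100, 2))
X = factors @ rng.normal(size=(2, 5)) + 0.05 * rng.normal(size=(100, 5))

# Project onto the 2 directions of maximum variance.
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)
print("Reduced shape:", X_reduced.shape)
print("Variance explained per component:", np.round(pca.explained_variance_ratio_, 3))
```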
Singular Value Decomposition (SVD):
● SVD splits 𝐴 into its constituent parts, revealing its structure.
● The singular values provide the magnitude of variance captured by each component.
● By retaining only the top 𝑘 singular values and corresponding vectors, we approximate 𝐴 in a lower-dimensional space.
● SVD decomposes a matrix 𝐴 into three matrices, 𝐴 = U Σ VT, where:

● U: Orthogonal matrix containing left singular vectors (column space of 𝐴).


● Σ: Diagonal matrix of singular values (capturing importance of components).
● VT : Orthogonal matrix containing right singular vectors (row space of 𝐴).
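A minimal sketch of SVD and a rank-k approximation with NumPy; the matrix and the choice k = 2 are illustrative.

```python
import numpy as np

# Illustrative 6x4 data matrix.
rng = np.random.default_rng(0)
A = rng.normal(size=(6, 4))

# A = U @ diag(s) @ VT
U, s, VT = np.linalg.svd(A, full_matrices=False)

# Rank-k approximation: keep only the top k singular values and vectors.
k = 2
A_k = U[:, :k] @ np.diag(s[:k]) @ VT[:k, :]
print("Singular values:", np.round(s, 3))
print("Rank-2 approximation error (Frobenius):", round(float(np.linalg.norm(A - A_k)), 4))
```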
