GRP.7 Probability and Statistics
GRP.7 Probability and Statistics
ADVANCED ENGINEERING
MATHEMATICS
Event (E): A subset of the sample space, representing one or more outcomes.
Understanding these concepts is crucial for modeling engineering problems where outcomes are
uncertain.
Probabilities can be found using estimates from empirical studies or common sense
estimates based on equally likely events.
For equally likely outcomes, the probability of an event “E” is calculated as:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑓𝑎𝑣𝑜𝑟𝑎𝑏𝑙𝑒 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
P(E)=
𝑇𝑜𝑡𝑎𝑙 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑜𝑠𝑠𝑖𝑏𝑙𝑒 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
1
Example: In rolling a fair six-sided die, the probability of rolling a 3 is P(3)=
6
The sum of the probabilities for all simple events in S are always equal to 1.
Example: Toss a fair coin twice. What is the probability of observing at least one
head?
Example: A bowl contains three M&Ms, one red, one blue & one green. A child selects two
M&Ms at random. What is the probability that at least one is red?
Example:
A locker has 5 different digits (1, 2, 3, 4, 5). If a 3-digit code is needed, where no digit
repeats, how many different codes can be formed?
Example:
A lottery draws 6 numbers from a set of 49. How many possible different lottery
combinations exist?
Example: If the probability of a machine failing is 0.1, then the probability of it not failing is 0.9.
Example: In quality control, if the probability of finding a defect of type A is 0.05 and type B is 0.02, and
they cannot occur simultaneously, then the probability of finding either defect is 0.07.
These rules are instrumental in analyzing complex systems with multiple components or failure modes.
Binomial Distribution: Evaluates the probability of an event occurring several times over a given number of
trials. It is commonly used in scenarios like coin flips or free throws in basketball.
• Description: Represents the number of successes in 𝑛 independent trials, where each trial has two possible
outcomes (success or failure)
Suppose a batch of steel rods is produced, and each rod has a 95% probability of meeting the required tensile
strength. If an engineer randomly tests 10 rods, the binomial distribution can calculate the probability that
exactly 8 rods pass the test
Final Answer
Variance:
Var(X)=p(1−p)
College of Engineering and
Information Technology
DISCRETE PROBABILITY DISTRIBUTION
Example of Bernoulli Distribution:
A machine on a factory assembly line produces widgets. Each widget has a 10% chance of being defective
(𝑝=0.1) and a 90% chance of being non-defective (1−𝑝=0.9).
Characteristics:
Mean (Expected Value):
E(X)=λ
Variance:
Var(X)=λ
College of Engineering and
Information Technology
DISCRETE PROBABILITY DISTRIBUTION
Example of Poisson Distribution: :
A civil engineer monitors the occurrence of cracks on a bridge's surface. Historical data shows that cracks
occur at an average rate of 3 per year. Determine the probability of observing exactly 5 cracks in a year:
Final Answer
Characteristics:
Mean (Expected Value):
E(X)=1/p
Variance:
1−𝑝
Var(X)=
𝑝^2
College of Engineering and
Information Technology
DISCRETE PROBABILITY DISTRIBUTION
Example of Geometric Distribution: :
An engineer inspects components for defects. If the probability of finding a defective part is 0.02, the geometric
distribution can calculate the probability that the first defect is found on the 10th inspection:
Final Answer
Applications in Engineering:
engineers can use the uniform distribution to estimate the likelihood of a part falling within a specified tolerance
range. Understanding how likely parts are to meet quality standards (e.g., being within the tolerance limits) is
crucial for ensuring the reliability and performance of products.
College of Engineering and
Information Technology
CONTINUOUS PROBABILITY
DISTRIBUTION
Example of Uniform Distribution: :
Imagine a manufacturer produces steel rods that are supposed to be 10 cm long, but due to the manufacturing
process, there is a tolerance of ±0.2 cm, meaning the actual length of any rod can vary between 9.8 cm and
10.2 cm.
If the length of each rod is equally likely to fall anywhere in this range, then the length of the rod can be
modeled as a uniform distribution.
Range: The rod’s length, L, is uniformly distributed between 9.8 cm and 10.2 cm. So, we have:
• a=9.8 cm (minimum length),
• b=10.2 cm (maximum length).
The probability density function (PDF) of this uniform distribution would be:
Solution:
The probability of the rod’s length being between 9.9 cm and 10.1 cm can be found by calculating the area
under the PDF curve between these two values.
So, the probability that a randomly selected rod will have a length between 9.9 cm and 10.1 cm is 50%.
Applications in Engineering:
Modeling material strength, measurement errors, and variations in manufacturing processes
College of Engineering and
Information Technology
CONTINUOUS PROBABILITY
DISTRIBUTION
Example of Normal Distribution: :
The compressive strength of concrete follows a normal distribution with a mean of 4000 psi and a standard
deviation of 250 psi. Calculate the probability that a randomly selected concrete sample has a strength less
than 3800 psi:
Formula:
Use the z-score formula to convert to standard normal distribution and then use a z-table or calculator.
Calculation:
Requires calculating z-scores for less than 3800psi using a z-table or calculator to find probabilities.
Result:
You will get the cumulative probability, which is approximately 0.2118, or 21.18%, meaning that the probability
ofCollege
the compressive strength
of Engineering being less than 3800 psi is around 21.18%.
and
Information Technology
CONTINUOUS PROBABILITY
DISTRIBUTION
Normal Distribution :
Characteristics:
Mean:
1
μ=
λ
Variance:
1
σ ^2 =
λ^2
Applications in Engineering:
College of Engineering and
Modeling the time
Information to failure of components in reliability engineering.
Technology
CONTINUOUS PROBABILITY
DISTRIBUTION
Example of Exponential Distribution: :
A power supply unit has a mean time to failure of 5,000 hours. Determine the probability of the unit failing within
the first 1,000 hours using the exponential distribution::
Formula:
Result:
Calculation:
β =1
β =3
β =2
β =1
Characteristics:
Mean:
Dependent on β and η
Variance:
Dependent on β and η
Applications in Engineering:
College of Engineering and
Modeling material
Information fatigue, failure rates, and product lifetimes.
Technology
CONTINUOUS PROBABILITY
DISTRIBUTION
Example of Weibull Distribution:
A wind turbine blade follows a Weibull distribution with shape parameter β=2 and scale parameter η=15 years.
What is the probability that the blade lasts more than 20 years?
Formula:
Calculation:
The full probability density function (PDF) for the Weibull distribution has a more complex form:
The CDF (Cumulative distribution function), on the other hand, is the cumulative probability up to a
certain point x and is derived from the PDF. The CDF gives us the probability that a variable X takes a
value less than or equal to x:
Probability that the lifespan will be
equal or less than 20 years
We're using the CDF because it directly gives us the cumulative probability, and subtracting it from 1 gives us
the probability of the complement (lifespan greater than 20 years). The PDF, while important for other types of
calculations, isn't directly necessary
College of Engineering and for this specific probability.
Information Technology
CONTINUOUS PROBABILITY
DISTRIBUTION
Explanation:
• Use the PDF when you need the density at a particular point, to calculate probabilities over a
range (via integration), or when working with more complex statistical quantities.
• Use the CDF when you want to calculate the probability that the random variable is less than or
equal to a particular value, or when you’re interested in cumulative probabilities (like the one in
your original question).
• Component Testing: Engineers can use the geometric distribution when testing
mechanical components that are subject to wear and tear. For example, how
many machines will fail before the first one is repaired.
• Reliability Testing and Failure Analysis: Engineers use the Weibull distribution
to model component lifetimes and analyze failure patterns in complex systems,
such as turbines, engines, or electrical systems.
Definition:
DESCRIPTIVE INFERENTIAL
STATISTICS STATISTICS
DESCRIPTIVE
STATISTICS
INFERENTIAL
STATISTICS
INFERENTIAL
STATISTICS
DESCRIPTIVE
STATISTICS
DESCRIPTIVE INFERENTIAL
STATISTICS STATISTICS
Definition:
NOMINAL SCALE
ORDINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
ORDINAL SCALE
NOMINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
ORDINAL SCALE
Nominal data divides variables
INTERVAL SCALE into mutually exclusive and
labeled categories
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
Nominal data divides variables into mutually
ORDINAL SCALE exclusive and labeled categories
RATIO SCALE Blue Brown Black Iphone Samsung Huawei Bus Train Car
NOMINAL SCALE
ORDINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
ORDINAL SCALE
NOMINAL SCALE
Ordinal data classifies variables
INTERVAL SCALE into categories which have a
natural order or rank.
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
ORDINAL SCALE
Ordinal data classifies variables into
NOMINAL SCALE categories which have a natural order or rank.
NOMINAL SCALE
INTERVAL SCALE
ORDINAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
INTERVAL SCALE
NOMINAL SCALE
Interval data is measured along
ORDINAL SCALE numerical scale that has equal
intervals between adjacent values.
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
INTERVAL SCALE
Interval data is measured along numerical
scale that has equal intervals between
NOMINAL SCALE
adjacent values.
ORDINAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
RATIO SCALE
ORDINAL SCALE
INTERVAL SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
RATIO SCALE
NOMINAL SCALE
Ratio data is measured along a
numerical scale that has equal
ORDINAL SCALE distances between adjacent
values and a true zero.
INTERVAL SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
RATIO SCALE
Ratio data is measured along a numerical
scale that has equal distances between
NOMINAL SCALE
adjacent values and a true zero.
ORDINAL SCALE
INTERVAL SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
ORDINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
ORDINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
ORDINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
4 Scales of Measurement
NOMINAL SCALE
ORDINAL SCALE
INTERVAL SCALE
RATIO SCALE
College of Engineering and
Information Technology
Some major measures that are commonly used to
described descriptive statistics:
FREQUENCY
DISTRIBUTION
MEASURES OF
CENTRAL TENDENCY
MEASURES OF
VARIABILITY
FREQUENCY
MEASURES OF DISTRIBUTION
CENTRAL TENDENCY
MEASURES OF
VARIABILITY
FREQUENCY
DISTRIBUTION
8, 9, 8, 7, 10, 9, 6, 4, 9, 8, 7, 8, 10, 9, 8, 10 2
9 5
6, 9, 7, 8, 8 8 7
7 3
Solution: 6 2
10, 10, 9, 9, 9, 9, 9, 8, 8, 8, 8, 8, 8, 8, 7, 7, 5 0
7, 6,of6,
College 4
Engineering and 4 1
Information Technology
Frequency Distribution Graph:
HISTOGRAM
POLYGON
BAR GRAPH
HISTOGRAM
POLYGON
BAR GRAPH
HISTOGRAM
A histogram is a graphical
POLYGON representation of data
distribution, typically used to
BAR GRAPH visualize frequency distributions.
POLYGON
BAR GRAPH
POLYGON
HISTOGRAM
BAR GRAPH
POLYGON
HISTOGRAM
BAR GRAPH
BAR GRAPH
HISTOGRAM
POLYGON
BAR GRAPH
A bar graph in frequency
HISTOGRAM distribution is a visual
representation of categorical or
numerical data using bars of
POLYGON
different heights.
HISTOGRAM
POLYGON
SYMMETRICAL SKEWED
DISTRIBUTION DISTRIBUTION
SYMMETRICAL
DISTRIBUTION
SKEWED
DISTRIBUTION
SKEWED
DISTRIBUTION
SYMMETRICAL
DISTRIBUTION
FREQUENCY
DISTRIBUTION
MEASURES OF
CENTRAL TENDENCY
MEASURES OF
VARIABILITY
MEASURES OF CENTRAL
FREQUENCY TENDENCY
DISTRIBUTION
MEASURES OF
VARIABILITY
MEASURES OF CENTRAL
TENDENCY
Definition:
• The mean is commonly known as athematic average, is computed by
adding all the scores in the distribution and dividing by the number of
scores. Two Types of Mean: Sample mean & Population Mean
Sample Mean (ഥ 𝒙): The average of a subset (sample) of observations drawn
from a larger population.
σ𝑛𝑖=1 𝑥𝑖
𝑥ҧ =
𝑛
Where:
𝑥𝑖 are the individual data points in the sample, and 𝑛 is the sample size.
College of Engineering and
Information Technology
Mean
MEAN
MEDIAN MODE
Definition:
• The median is the score that divides a distribution exactly in
half.
Definition:
• In a frequency distribution the mode is the score or category
that has the greatest frequency.
FREQUENCY
DISTRIBUTION
MEASURES OF
CENTRAL TENDENCY
MEASURES OF
VARIABILITY
MEASURES OF
FREQUENCY VARIABILITY
DISTRIBUTION
MEASURES OF
CENTRAL TENDENCY
RANGE
VARIANCE
STANDARD DEVIATION
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
VARIANCE
RANGE
STANDARD DEVIATION
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
RANGE
VARIANCE RANGE is the difference between
the highest and lowest values in a
STANDARD DEVIATION dataset.
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
RANGE is the difference between the highest
RANGE and lowest values in a dataset.
EXAMPLE:
In civil engineering, the compressive strength of concrete is a critical factor in determining
its structural integrity. Suppose a batch of concrete samples is tested for compressive
strength (measured in megapascals, MPa), and the following results are obtained:
VARIANCE
RANGE
STANDARD DEVIATION
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
RANGE
VARIANCE
STANDARD DEVIATION
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
VARIANCE
VARIANCE is the average of the squared
differences from the mean, indicating
how data points deviate from the mean.
VARIANCE
VARIANCE is the average of the squared
differences from the mean, indicating
how data points deviate from the mean.
EXAMPLE:
In mechanical and civil engineering, ensuring consistent manufacturing quality is crucial.
Suppose a factory produces steel rods with a nominal diameter of 50 mm, but slight
variations occur during production. To analyze these variations, an engineer collects the
diameters (in mm) of all steel rods produced in a day, making it a population dataset (not
a sample).
RANGE
VARIANCE
STANDARD DEVIATION
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
RANGE
STANDARD DEVIATION
VARIANCE
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
STANDARD DEVIATION
STANDARD DEVIATION the square root of variance, representing
the average deviation from the mean in the same units as the data.
RANGE
STANDARD DEVIATION
VARIANCE
INTERQUARTILE RANGE
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
RANGE
INTERQUARTILE RANGE
VARIANCE
STANDARD DEVIATION
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
INTERQUARTILE RANGE
The Interquartile Range (IQR) is a measure of statistical dispersion,
representing the range within which the middle 50% of a dataset falls. It is
calculated as:
EXAMPLE:
A civil engineer tests compressive strength (MPa) of concrete samples and obtains the
following results:
22, 24, 25, 26, 27, 29, 30, 32, 33, 35
RANGE
INTERQUARTILE RANGE
VARIANCE
STANDARD DEVIATION
COEFFICIENT OF VARIATION
College of Engineering and
Information Technology
Five Common Measures of Variability:
RANGE
COEFFICIENT OF VARIATION
VARIANCE
STANDARD DEVIATION
INTERQUARTILE RANGE
College of Engineering and
Information Technology
Five Common Measures of Variability:
COEFFICIENT VARIATIION
The Coefficient of Variation (CV) is a measure of relative variability,
expressed as a percentage. It helps compare the spread of data regardless of
the unit of measurement. It is calculated as:
Design Optimization
• Identifying Trends and Patterns:
- Using descriptive statistics to evaluate past designs and identify
patterns that lead to improved performance or cost-efficiency.
• Example: Analyzing historical data on column dimensions to optimize
designs for different load scenarios.
• Cost and Resource Analysis:
- Summarizing data on material usage and construction costs to
minimize waste and reduce expenses.