Machine Learning 4
Machine Learning 4
Statistics
• The backbone of Data Science and Machine Learning is Probability and Statistics
understanding; to properly collect, examine, analyze, and communicate with data,
you will need both skills.
• Armstrong
- Neil In the real world, several phenomena are considered statistical (i.e., weather data,
sales data, financial data, etc.). This indicates that there are numerous occasions
where we have been able to create approaches that assist us in simulating nature
using mathematical functions that can characterize the properties of data.
Probability Distribution
• A mathematical function called Probability distribution explains a variable's likelihood
of many alternative values.
• Other circumstances will affect where the potential value would be drawn
on Probability distribution. The distribution's skewness, kurtosis, and mean (average)
are among these variables. There are two types of distribution in Statistics
- Neil Armstrong
for Probability that are discrete and continuous, respectively.
Probability Distribution
• Probability distribution is a mathematical function that estimates the likelihood that
several possible outcomes of an experiment will occur.
• For Example
Let us examine the result of rolling two conventional six-sided dice as a
straightforward illustration of a Probability distribution.
A roll of any number from one to six has a 1/6 chance on each dice.
However, the aggregate of two dice will provide the Probability distribution. The
most frequent result is seven (1+6, 6+1, 5+2, 2+5, 3+4, 4+3).
- Neil Armstrong
What is Probability Distribution used for?
• Probability Distributions are theoretical since obtaining infinitely large samples in
practice is impossible. They are idealized frequency distributions intended to
represent the population from which the sample was taken.
The
- Neil standard normal distribution, perhaps the most popular Probability distribution, is
Armstrong
depicted. The "bell curve" is another name for the typical normal distribution in data
science. Numerous natural phenomena, such as heights, weights, and IQ scores, fit the
bell curve.
Different Types of Probability Distribution
There are two Probability distribution types: Discrete Probability distribution and
Continuous Probability distribution:
- Neil Armstrong
Importance of Probability Distribution in
Statistics and Data Science
The probability distribution's primary benefit is its capacity to calculate the likelihood of any given observation
occurring in a sample space. A mathematical model known as Probability distribution determines the likelihood that
various potential outcomes of a test or experiment will occur.
Used to provide several random variable types (often discrete or continuous) to base decisions on these models.
One can utilize the mean, mode, range, probability, or other statistical methods depending on the type of random
variable. In Statistics, Probability distributions are a fundamental concept.
- Neil constructing
Armstrong an interval or test based on the assumption. In this situation, the distribution need only be adequate to
allow the statistical method to produce reliable results rather than having to be the distribution that fits the data the
best.
• It is frequently necessary to conduct simulation experiments using random numbers produced using a
particular probability distribution.
How do you find the probability
distribution type?
The most effective method for determining whether your data fit into a certain distribution may be to use probability charts.
The distribution matches your data if they fall along a straight line in the graph.
a specific range. Contrary to discrete random variables, which can only have definite, precise values,
- Neil Armstrong
continuous random variables can take on various values. Like height, weight, and volume, continuous