Data Analysis_Probability Distribution_26.03.2014
Data Analysis_Probability Distribution_26.03.2014
Problem Solving
Topics Covered
❖ Probability Distributions
Random Experiment:
Random experiment is one whose results depend on chance, that is the result cannot be predicted.
Eg: Tossing of coins, throwing of dice are some examples of random experiments.
Random Variable:
A variable whose value is a number determined by the outcome of a random experiment is called a random variable.
A random variable is denoted by capital letters like X, Y, Z….., whereas the values of the random variable are
denoted by the corresponding small letters like x, y, z …….
Notation:
Notations:
● X represents the random variable X.
● P(X) represents the probability of X.
● P(X = x) refers to the probability that the random variable X is equal to a particular value, denoted by x. As an
example, P(X = 1) refers to the probability that the random variable X is equal to 1.
Example:
Suppose that two coins are tossed so that the sample space is S = {HH, HT, TH, TT}
Suppose X represent the number of heads which can come up, with each sample point we can associate a number for
X as shown in the table below:
Sample Point HH HT TH TT
X 2 1 1 0
Examples of Random Variables
Thus the random variable X takes the values 0, 1,2 for this random experiment. X= 0, 1, 2. For each random variable
xi, the probability of respective random variable is denoted by p(xi) or simply pi .
The sum of the probabilities of all the random variable is equal to one. ie p(x1) + p(x2) + p(x3) = ¼+2/4+¼ = 1
Example: when 3 coins are tossed, the number of heads obtained is the random variable X assumes the values
0,1,2,3 which form a countable set. Such a variable is a discrete random variable.
Continuous random variable:
A random variable X which can take any value between certain interval is called a continuous random variable.
Note that the probability of any single value at x, value of X is zero. i.e P(X = x) = 0 Thus continuous random variable
takes value only between two given limits.
For example:
The height of students in a particular class lies between 4 feet to 6 feet.
We write this as X = {x|4 ≤ x ≤ 6}
The maximum life of electric bulbs is 2000 hours. For this the continuous random variable will be X = {x | 0 ≤ x ≤ 2000}
Try this question:
Find the probability distribution of the number of sixes in throwing two dice once.
Explanation:
When two dice are thrown the total number of sample points are 36.
Let X denote the number of sixes obtained in throwing two dice once.
Then X is the random variable, which can take the values 0,1,2.
Let A denote the success of getting a six in throwing a die and A’ denote not getting a six.
No sixes:
P(x = 0) = P(A’ and A’ )
= P(A ) . P(A’) = 5/6 x ⅚
=25/36
P(x = 1) = P(A and A’ ) or P(A’ and A)
= P(A) . P(A’ ) + P(A’ ) .P(A)
=⅙ x ⅚ +⅚x ⅙
=10/36
=5/18
P(x = 2) = P( A and A)
= P(A) .P(A)
=⅙ x ⅙
=1/36
X=x 0 1 2
If a random variable is a discrete variable, its probability distribution is called a Discrete Probability Distribution of
Random variables.
Let X be a discrete random variable which assumes the values x1, x2, ...xn with each of these values, we associate a
number called the probability Pi= P(X=xi), i = 1,2,3…n
This function pi or p(xi) is called the probability mass function of the discrete random variable X.
Try this question:
A random variable has the following probability distribution
Values of 0 1 2 3 4 5 6 7 8
X
a = 1/81
Values of 0 1 2 3 4 5 6 7 8
X
F(X) 1/81 4/81 9/81 16/81 25/81 36/81 49/81 64/81 81/81
=1
(ii)F(5) = 36/81
Binomial Distribution
The binomial distribution gives the discrete probability distribution.
The binomial distribution describes the behavior of a count variable X if the following conditions apply:
1: The number of observations n is fixed.
2: Each observation is independent.
3: Each observation represents one of two outcomes ("success" or "failure").
4: The probability of "success" p is the same for each outcome.
If these conditions are met, then X has a binomial distribution with parameters n and p, abbreviated B(n,p).
Binomial distribution
A random variable X is said to follow binomial distribution, if its probability mass function is given by
P (X = x) = P(x) = nCx px (1-p)n-x ; x = 0, 1,2, …,n
0 ; otherwise
Here, the two independent constants n and p are known as the ‘ parameters’ of the distribution. The distribution is
completely determined if n and p are known. x refers the number of successes.
Example:
Toss a coin for 12 times. What is the probability of getting exactly 5 heads.
Explanation
Number of trials n = 12
Number of success x = 5 since we define getting a head as success
Probability of success on any single trial p = ½, then 1-p = ½
Example
P(X=5) = 12C5 (½)5 (1/2)12-5
P(X=5) = [11 x 9 x 8] (1/2)12
=729 (1/2)12
For a discrete random variable X having the possible values x1, x 2, x3, ..., xn, the expectation of X is defined as
E(x) = x1 × P(X = x1) + x2 × P(X = x2) + x3 × P(X = x3) + …. + xn × P(X = xn)
Example:
Find the expected value of x, where x represents the outcome when a die is thrown.
Explanation:
Here each of the outcome (ie., number) 1, 2, 3, 4, 5 and 6 occurs with probability 1/6. Thus the probability distribution of
X will be
X 1 2 3 4 5 6
An urn contains 7 white and 3 red balls. Two balls are drawn together at random from the urn. Find the
expected number of white balls drawn
Explanation:
From the urn containing 7 white and 3 red balls, two balls can be drawn in 10C2 ways. Let X denote the number of
white balls drawn, X can take the values 0, 1 and 2.
If a random variable is a continuous variable, its probability distribution is called a Continuous Probability Distribution of
Random variables.