Statistics (Part 3)
Statistics (Part 3)
Probability Concepts
2
4. Probability Concepts
(cont.)
3.2 Events
Sample space: The collection of all possible
outcomes for an experiment.
Event: A collection of outcomes for the
experiment, that is, any subset of the sample
space.
An event occurs if and only if the outcome of the
experiment is a member of the event.
Notation and Graphical Displays for Events:
3
4. Probability Concepts
(cont.)
Relationships Among Events
(not E ): The event “E does not occur”
(A & B ): The event “both A and B occur”
(A or B ): The event “either A or B or both occur”
4
4. Probability Concepts
(cont.)
Mutually Exclusive Events: Two or more events are
mutually exclusive events if no two of them have
outcomes in common.
5
4. Probability Concepts
(cont.)
4.3 Some rules of Probability
Probability Notation:
If E is an event, then P(E) represents the probability
that event E occurs. It is read “the probability of E.”
The Special Addition Rule
If event A and event B are mutually exclusive, then
P(A or B) = P(A) + P(B).
More generally, if events A, B, C, ... are mutually
exclusive, then P(A or B or C or ···) = P(A) + P(B) +
P(C) +··· .
6
4. Probability Concepts
(cont.)
The Complementation Rule
For any event E, P(E) = 1 − P(not E).
7
4. Probability Concepts
(cont.)
4.4 Contingency Table
Data from one variable of a population are called
univariate data
Data from two variables of a population are
called bivariate data
A frequency distribution for bivariate data is
called a contingency table or two-way table
8
9
4. Probability Concepts
(cont.)
Joint and Marginal Probabilities
10
4. Probability Concepts
(cont.)
P(A1), P(A2)…and P(R1), P(R2)…: marginal
probabilities
Marginal probabilities correspond to events
represented in the margin of the contingency
table
P(A1&R1), P(A1&R2)…: joint probabilities
11
4. Probability Concepts
(cont.)
12
4. Probability Concepts
(cont.)
4.5 Conditional Probability
The probability that event B occurs given that
event A occurs is called a conditional probability.
It is denoted P (B |A), which is read “the
probability of B given A.” We call A the given
event.
Example
The Conditional Probability Rule
If A and B are any two events with P(A)> 0, then
13
4. Probability Concepts
(cont.)
4.6 The Multiplication Rule
If A and B are any two events, then
P(A & B) = P(A) · P(B|A)
Independent Events
Event B is said to be independent of event A if P(B | A) =
P(B).
The Special Multiplication Rule
If A and B are independent events, then
P(A & B) = P(A) · P(B),
and conversely, if P(A & B) = P(A) · P(B), then A and B are
independent events.
If events A, B, C, ... are independent, then
14
P(A & B & C & ···) = P(A) · P(B) · P(C) ··· .
Mutually Exclusive Versus Independent Events
4. Probability Concepts
(cont.)
4.7 Bayes’ s Rule
The Rule of Total Probability
Suppose that events A1, A2,..., Ak are mutually
exclusive and exhaustive, that is, exactly one of the
events must occur. Then for any event B,
16
4. Probability Concepts
(cont.)
4.8 Counting Rule
The Basic Counting Rule (BCR)
Suppose that r actions are to be performed in a
definite order. Further suppose that there are m1
possibilities for the first action and that
corresponding to each of these possibilities are m 2
possibilities for the second action, and so on. Then
there are m1· m2 ···mr possibilities altogether for the r
actions.
Factorials
k! = k(k − 1) ··· 2 · 1.
We also define 0! = 1.
17
4. Probability Concepts
(cont.)
A permutation of r objects from a collection of m
objects is any ordered arrangement of r of the m
objects.
The number of possible permutations of r objects
that can be formed from a collection of m objects
is denoted mPr (read “m permute r”).
The Permutations Rule
The number of possible permutations of r objects
from a collection of m objects is given by the formula
18
4. Probability Concepts
(cont.)
The Special Permutations Rule
The number of possible permutations of m objects
among themselves is m!.
Combinations
A combination of r objects from a collection of m
objects is any unordered arrangement of r of the m
objects—in other words, any subset of r objects from
the collection of m objects.
The Combinations Rule
The number of possible combinations of r objects
from a collection of m objects is given by the formula
19
4. Probability Concepts
(cont.)
Number of Possible Samples
The number of possible samples of size n from a
population of size N is NCn.
Applications to Probability
Examples
20
Review
24
5. Discrete Random
Variables (cont.)
Sum of the Probabilities of a Discrete Random
Variable
For any discrete random variable X, we have P(X =
x) = 1.
Interpretation of a Probability Distribution
In a large number of independent observations of a
random variable X,the proportion of times each
possible value occurs will approximate the
probability distribution of X; or, equivalently, the
proportion histogram will approximate the
probability histogram for X.
25
5. Discrete Random
Variables (cont.)
5.2 Mean and Standard Deviation of a Discrete
Random Variable
Mean of a Discrete Random Variable
Denoted as μX or simply μ. It is defined by
μ = xP(X = x).
The terms expected value and expectation are
commonly used in place of the term mean.
Interpretation of the Mean of a Random Variable
In a large number of independent observations of a
random variable X, the average value of those
observations will approximately equal the mean, μ, of
X. The larger the number of observations, the closer
the average tends to be to μ.
26
5. Discrete Random
Variables (cont.)
The standard deviation of a discrete random
variable X is denoted σX or simply σ. It is defined
as
28
5. Discrete Random
Variables (cont.)
Bernoulli Trials
Repeated trials of an experiment are called Bernoulli
trials if the following three conditions are satisfied:
1. The experiment (each trial) has two possible
outcomes, denoted generically s, for success, and f, for
failure.
2. The trials are independent.
3. The probability of a success, called the success
probability and denoted p, remains the same from trial
to trial.
The binomial distribution is the probability
distribution for the number of successes in a
sequence of Bernoulli trials
29
5. Discrete Random
Variables (cont.)
Number of Outcomes Containing a Specified Number
of Successes
In n Bernoulli trials, the number of outcomes that contain
exactly x successes equals the binomial coefficient
Binomial Probability Formula
Let X denote the total number of successes in n Bernoulli
trials with success probability p. Then the probability
distribution of the random variable X is given by
30
5. Discrete Random
Variables (cont.)
Procedure To Find a Binomial Probability Formula
Assumptions:
1. n trials are to be performed.
2. Two outcomes, success or failure, are possible for each
trial.
3. The trials are independent.
4. The success probability, p, remains the same from trial to
trial.
Step 1 Identify a success.
Step 2 Determine p, the success probability.
Step 3 Determine n, the number of trials.
Step 4 The binomial probability formula for the number of
successes, X,is
31
5. Discrete Random
Variables (cont.)
Binomial Probability Tables
Eliminate most of the computations required in
working with the binomial distribution.
Such tables are of limited usefulness, because they
contain only a relatively small number of different
values of n and p.
Shape of a Binomial Distribution
32
5. Discrete Random
Variables (cont.)
Mean and Standard Deviation of a Binomial Random
Variable
33
5. Discrete Random
Variables (cont.)
Sampling and the Binomial Distribution
Suppose that a simple random sample of size n is
taken from a finite population in which the
proportion of members that have a specified
attribute is p. Then the number of members sampled
that have the specified attribute:
Has exactly a binomial distribution with parameters
n and p if the sampling is done with replacement and
Has approximately a binomial distribution with
parameters n and p if the sampling is done without
replacement and the sample size does not exceed
5% of the population size.
34
5. Discrete Random
Variables (cont.)
5.4 The Poisson Distribution
often used to model the frequency with which a specified
event occurs during a particular period of time
In addition, used to describe the probability distribution
of the number of misprints in a book, or the number of
bacterial colonies appearing on a petri dish smeared with
a bacterial suspension.
Poisson Probability Formula
Probabilities for a random variable X that has a Poisson
distribution are given by the formula