AU7022 Stochastic Methods in Systems & Control Xiang Yin
8 Parameter Estimation and Sufficient Statistics
A General Model for Statistics
▶ Many problems have the following common structure. A continuous signal {x(t) : t ∈ R}
is measured at t1 , . . . , tn producing vector x = (x1 , . . . , xn ), where xi = x(ti ). The vector
x is a realization of a random vector or a random process X = (X1 , . . . , Xn ) with a joint
distribution which is of known form but depends on some unknown parameters
θ = (θ1, . . . , θp). Estimation theory aims to estimate these unknown parameters θ
based on the observed realization x.
▶ Formally, the above problem has the following ingredients:
– X = (X1 , . . . , Xn ) is a vector of random measurements or observations taken over
the course of the experiment
– X is the sample or measurement space of realizations x of X, e.g., X = R × · · · × R
– θ = (θ1 , . . . , θp ) is an unknown parameter vector of interest
– Θ is the parameter space for the experiment
– Pθ : B(R^n) → [0, 1] is a probability measure such that, for any Borel set or event
B ⊆ X, we have

    Pθ(B) = probability of event B = ∫_B f(x; θ) dx       if X is continuous,
    Pθ(B) = probability of event B = Σ_{x∈B} p(x; θ)      if X is discrete.
Such {Pθ }θ∈Θ is called the statistical model of the experiment.
The probability model also induces the joint C.D.F. associated with X
F (x; θ) = Pθ (X1 ≤ x1 , . . . , Xn ≤ xn ),
which is assumed to be known for each θ ∈ Θ. We denote by Eθ (X) the expectation of
random variable X given θ ∈ Θ.
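These notes contain no code; as a purely illustrative sketch (Python assumed, with hypothetical helper names such as `joint_pmf` and `P_theta`), the following builds the discrete statistical model {Pθ}θ∈Θ for n i.i.d. Bernoulli trials and evaluates Pθ(B) for an event B by summing the joint PMF, matching the discrete case of the definition above.

```python
from itertools import product

def joint_pmf(x, theta):
    """Joint PMF p(x; theta) of n i.i.d. Bernoulli(theta) observations x in {0,1}^n."""
    p = 1.0
    for xi in x:
        p *= theta if xi == 1 else (1 - theta)
    return p

def P_theta(event, theta, n):
    """P_theta(B): sum the joint PMF over all outcomes in the event B (discrete case)."""
    return sum(joint_pmf(x, theta) for x in product([0, 1], repeat=n) if x in event)

# Event B = "at least two successes in n = 3 trials"
n = 3
B = {x for x in product([0, 1], repeat=n) if sum(x) >= 2}
print(P_theta(B, theta=0.4, n=n))   # 3 * 0.4^2 * 0.6 + 0.4^3 = 0.352
```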
Parametric Statistics (Estimation Theory)
▶ The basic estimation problem is as follows. The observations X = (X1, . . . , Xn) are
generated by a true parameter θ0 ∈ Θ. In case the Xi are i.i.d., we have Xi ∼ Pθ0(·).
Then we want to find an estimator θ̂ : X → Θ such that the estimate θ̂(X1 , . . . , Xn )
approximates θ0 “optimally”.
▶ The question is how to assess whether θ̂ is a good estimator. Depending on
whether or not we have prior knowledge about the distribution of θ, we will discuss two
different approaches: Bayesian estimation and non-random estimation.
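As a hedged sketch of this setup (Python assumed; the name `estimator` is mine, not from the notes), the following draws i.i.d. Bernoulli data from a true parameter θ0 and applies a simple estimator θ̂(X1, . . . , Xn), the empirical mean, which approximates θ0 for large n.

```python
import random

def estimator(x):
    """A simple estimator theta_hat(x_1, ..., x_n): the empirical mean of the sample."""
    return sum(x) / len(x)

random.seed(0)
theta0 = 0.3                                        # the (unknown) true parameter
x = [1 if random.random() < theta0 else 0 for _ in range(10_000)]
print(estimator(x))                                 # close to 0.3 for large n
```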
Definition of Sufficient Statistics
▶ Let us consider i.i.d. observations X = (X1, . . . , Xn) with distribution Pθ from the
family {Pθ : θ ∈ Θ}. Imagine that there are two people A and B, and that
– A observes the entire sample (X1 , . . . , Xn );
– B observes only a smaller vector T = T (X1 , . . . , Xn ) which is a function of the
sample. In this case, the function T : R^n → R^m, with m ≤ n, is called a statistic.
Clearly, A has more information about the distribution of the data and, in particular,
about the unknown parameter θ. However, for some choices of the function T (called
sufficient statistics), B will have as much information about θ as A has.
▶ To see this more clearly, for observations X = (X1 , . . . , Xn ) and statistic T (X), the
conditional probability
fX|T (X) (x | t, θ) = Pθ (X1 = x1 , . . . , Xn = xn | T (X) = t)
is, typically, a function of both t and θ. For some choices of statistic T , however,
fX|T (X) (x | t, θ) can be θ-independent.
▶ To see the above argument, let us consider the case X = (X1, . . . , Xn), a sequence
of n Bernoulli trials with success probability θ, and the statistic T(X) = X1 +
· · · + Xn, the total number of successes. Then
    Pθ(X1 = x1, . . . , Xn = xn) = ∏_{i=1}^n θ^{x_i} (1 − θ)^{1−x_i} = θ^t (1 − θ)^{n−t},

where t = T(x1, . . . , xn) = x1 + · · · + xn. Therefore, if Σ_{i=1}^n x_i ≠ t, then we know that the
statistic is incompatible with the observation. Otherwise, since Pθ(T(X) = t) = C(n, t) θ^t (1 − θ)^{n−t},
where C(n, t) = n!/(t!(n − t)!) is the binomial coefficient, we have

    fX|T(X)(x | t, θ) = fX(x | θ) / fT(X)(t | θ)
                      = Pθ(X1 = x1, . . . , Xn = xn) / Pθ(T(X) = t)
                      = θ^t (1 − θ)^{n−t} / [ C(n, t) θ^t (1 − θ)^{n−t} ]
                      = 1 / C(n, t),
which does not depend on the parameter θ. This means that all information about θ in
X has been summarized by T (X). This motivates the following definition.
Definition: Sufficient Statistics
A statistic T = T (X) is said to be sufficient for parameter θ if
Pθ (X1 ≤ x1 , . . . , Xn ≤ xn | T (X) = t) = G(x, t)
where G(·, ·) is a function that does not depend on θ. Equivalently, we have
– p(x | t, θ) = Pθ (X = x | T (X) = t) = G(x, t) if X is discrete;
– f (x | t, θ) = G(x, t) if X is continuous.
▶ Thus, by the law of total probability,

    Pθ(X1 ≤ x1, . . . , Xn ≤ xn) = Σ_t P(X1 ≤ x1, . . . , Xn ≤ xn | T(X) = t) Pθ(T(X) = t),

where the conditional probability does not depend on θ. Once we know the value of the
sufficient statistic, we cannot obtain any additional information about the value of θ from
the observed values themselves.
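Returning to the Bernoulli example above, a small numerical check (an illustrative Python sketch, not part of the notes) confirms that Pθ(X = x | T(X) = t) equals 1/C(n, t) for every value of θ, as the definition requires.

```python
from math import comb

def cond_prob(x, theta):
    """P_theta(X = x | T(X) = t) for i.i.d. Bernoulli(theta) trials, with t = sum(x)."""
    n, t = len(x), sum(x)
    joint = theta**t * (1 - theta)**(n - t)                  # P_theta(X1 = x1, ..., Xn = xn)
    marginal = comb(n, t) * theta**t * (1 - theta)**(n - t)  # P_theta(T(X) = t)
    return joint / marginal

x = (1, 0, 1, 1, 0)                        # observed sample with t = 3 successes
for theta in (0.2, 0.5, 0.9):
    print(cond_prob(x, theta))             # always 1 / C(5, 3) = 0.1, independent of theta
```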
Neyman-Fisher Factorization Theorem
▶ The above definition of sufficient statistics is often difficult to use since it involves deriva-
tion of the conditional distribution of X given T . However, when the random variable X is
discrete or continuous, a simpler way to verify sufficiency is through the Neyman-Fisher
factorization criterion.
Theorem: Fisher Factorization Criterion
A statistic T = T (X) is sufficient for θ if and only if functions g and h can be found
such that
fX (x | θ) = g(T (x), θ)h(x)
We only prove the case of discrete random variables, i.e., fX(x | θ) is the PMF.
▶ (⇒) Because T is a function of x, we have
fX (x | θ) = fX,T (X) (x, T (x) | θ) = fX|T (X) (x | T (x), θ)fT (X) (T (x) | θ)
Since T is sufficient, fX|T(X)(x | T(x), θ) is not a function of θ, and we set it to be
h(x). The second term is a function of T(x) and θ; we write it as g(T(x), θ).
▶ (⇐) Suppose that we have the factorization. By the definition of conditional probability,

    fX|T(X)(x | t, θ) = fX,T(X)(x, t | θ) / fT(X)(t | θ)
For the numerator, we have
    fX,T(X)(x, t | θ) = 0                              if T(x) ≠ t,
    fX,T(X)(x, t | θ) = fX(x | θ) = g(t, θ)h(x)        otherwise.
Furthermore, for the denominator, we have
    fT(X)(t | θ) = Σ_{x̃ : T(x̃)=t} fX(x̃ | θ) = Σ_{x̃ : T(x̃)=t} g(t, θ)h(x̃)
Therefore, we have
    fX|T(X)(x | t, θ) = g(t, θ)h(x) / Σ_{x̃ : T(x̃)=t} g(t, θ)h(x̃) = h(x) / Σ_{x̃ : T(x̃)=t} h(x̃),
which is independent of θ and, therefore, T is sufficient.
▶ For example, in maximum likelihood estimation, we seek an estimate θ̂ ∈ Θ at which
the likelihood function
L(θ | x) = fX (x | θ)
is maximized for the observed sample x = (x1 , . . . , xn ). For sufficient statistics, since
fX (x | θ) = g(T (x), θ)h(x), maximizing the likelihood is equivalent to maximizing
g(T (x), θ) and the maximum likelihood estimator θ̂(T (x)) is a function of the sufficient
statistic.
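To illustrate this point, here is a minimal sketch (Python assumed; the Bernoulli model from earlier, with hypothetical names): maximizing g(T(x), θ) over a grid of θ values returns θ̂ ≈ t/n, which depends on the data only through the sufficient statistic T(x).

```python
def g(t, n, theta):
    """g(T(x), theta) from the Bernoulli factorization (h(x) = 1 in that example)."""
    return theta**t * (1 - theta)**(n - t)

n, t = 20, 7                                        # observed T(x) = 7 successes in 20 trials
grid = [k / 1000 for k in range(1, 1000)]           # crude grid search over (0, 1)
theta_hat = max(grid, key=lambda th: g(t, n, th))
print(theta_hat)                                    # 0.35 = t/n, a function of T(x) only
```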
General Examples of Sufficient Statistics
▶ Example 1: Entire Sample
X = (X1 , . . . , Xn ) is clearly sufficient but not very interesting.
▶ Example 2: Rank Ordered Sample
X(1), . . . , X(n) is sufficient when the Xi are i.i.d. This is because, under the i.i.d. setting,

    f(x1, . . . , xn | θ) = ∏_{i=1}^n f(xi | θ) = ∏_{i=1}^n f(x(i) | θ)
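A quick numerical illustration (Python sketch, assuming an i.i.d. Gaussian sample with known unit variance; not from the notes): sorting the sample, i.e., keeping only the order statistics, leaves the likelihood unchanged.

```python
from math import exp, pi, sqrt

def likelihood(x, theta, sigma=1.0):
    """Joint density of i.i.d. N(theta, sigma^2) observations."""
    L = 1.0
    for xi in x:
        L *= exp(-(xi - theta)**2 / (2 * sigma**2)) / (sqrt(2 * pi) * sigma)
    return L

x = [0.3, -1.2, 2.1, 0.7]
print(likelihood(x, theta=0.5))
print(likelihood(sorted(x), theta=0.5))   # identical: the order statistics suffice
```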
▶ Example 3: Binary Likelihood Ratios
Suppose that θ only takes two possible values Θ = {θ0, θ1}, or simply θ ∈ {0, 1}. This
gives the binary decision problem: “decide θ = 0 versus θ = 1”. Then the “likelihood
ratio” (assumed finite)

    Λ(X) = f1(X) / f0(X) = f(X | 1) / f(X | 0)

is sufficient for θ, because we can write

    fθ(X) = θ f1(X) + (1 − θ) f0(X) = [ θ Λ(X) + (1 − θ) ] · f0(X),

where the bracketed factor plays the role of g(T, θ) with T = Λ(X), and h(X) = f0(X).
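The decomposition can be checked numerically. Below is an illustrative Python sketch (assuming two unit-variance Gaussian hypotheses, f0 = N(0, 1) and f1 = N(1, 1); the names are mine): fθ(x) computed directly agrees with [θΛ(x) + (1 − θ)] f0(x) for both θ = 0 and θ = 1.

```python
from math import exp, pi, sqrt

def f(x, mean):
    """Density of a single N(mean, 1) observation (f0: mean 0, f1: mean 1)."""
    return exp(-(x - mean)**2 / 2) / sqrt(2 * pi)

def Lambda(x):
    """Likelihood ratio f1(x) / f0(x)."""
    return f(x, 1.0) / f(x, 0.0)

x = 0.4
for theta in (0, 1):
    direct = f(x, float(theta))                                # f_theta(x)
    factored = (theta * Lambda(x) + (1 - theta)) * f(x, 0.0)   # g(Lambda(x), theta) * h(x)
    print(direct, factored)                                    # equal for both theta
```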
▶ Example 4: Discrete Likelihood Ratios
Suppose that θ takes p possible values, i.e., Θ = {θ1, . . . , θp}. Then the vector of p − 1
likelihood ratios (assumed finite)

    Λ(X) = ( fθ1(X)/fθp(X), . . . , fθp−1(X)/fθp(X) ) = (Λ1(X), . . . , Λp−1(X))

is sufficient for θ. Try to prove this as homework.
▶ Example 5: Likelihood Ratio Trajectory
When Θ is a set of scalar parameters θ, the likelihood ratio trajectory over Θ,

    Λ(X) = ( fθ(X) / fθ0(X) )_{θ∈Θ} ,
is sufficient for θ. Here θ0 is an arbitrary reference point in Θ for which the trajectory is
finite for all X. When θ is not a scalar, this becomes a likelihood ratio surface, which is
also a sufficient statistic.
▶ We say Tmin is a minimal sufficient statistic if for any sufficient statistic T there
exists a function q such that Tmin = q(T). Finding a minimal sufficient statistic is in
general difficult; the following provides a sufficient condition for a sufficient statistic T(X)
to be minimal:

    ∀x, x′ ∈ X : Λ(T(x)) = Λ(T(x′)) ⇒ T(x) = T(x′).

Note that Λ(t) is well-defined because Λ(x) = Λ(T(x)) for any sufficient statistic T, as we
discussed above.
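As a sanity check of this condition (an illustrative Python sketch under the setting of Example 4, with a small finite parameter set chosen by me; not from the notes), one can enumerate all Bernoulli samples of length 3 and verify that equal likelihood-ratio vectors force equal values of T(x) = ΣXi, so T is minimal for this model.

```python
from itertools import product

thetas = [0.2, 0.5, 0.8]                      # a finite parameter set, as in Example 4

def pmf(x, theta):
    """Joint PMF of i.i.d. Bernoulli(theta) trials."""
    return theta**sum(x) * (1 - theta)**(len(x) - sum(x))

def Lambda(x):
    """Vector of likelihood ratios against the last parameter (Example 4)."""
    return tuple(round(pmf(x, th) / pmf(x, thetas[-1]), 12) for th in thetas[:-1])

T = sum                                       # candidate minimal sufficient statistic
ok = all(T(x) == T(xp)
         for x in product([0, 1], repeat=3)
         for xp in product([0, 1], repeat=3)
         if Lambda(x) == Lambda(xp))
print(ok)   # True: equal likelihood-ratio vectors imply equal T, so T is minimal here
```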
More Examples of Sufficient Statistics
▶ Example 1: Bernoulli Distribution
Suppose that X = (X1 , X2 , . . . , Xn ) is i.i.d. and each Xi satisfies the Bernoulli distribution
with unknown probability, i.e., Pθ(Xi = 1) = θ and Pθ(Xi = 0) = 1 − θ. Then we claim
that T(X) = Σ_{i=1}^n Xi is a sufficient statistic. To see this, we write the joint PMF as

    pX(X; θ) = ∏_{i=1}^n pXi(Xi; θ) = ∏_{i=1}^n θ^{Xi} (1 − θ)^{1−Xi} = ∏_{i=1}^n (1 − θ) ( θ/(1 − θ) )^{Xi}
             = (1 − θ)^n ( θ/(1 − θ) )^{T(X)} · 1,

where g(T(X), θ) = (1 − θ)^n (θ/(1 − θ))^{T(X)} and h(X) = 1.
Clearly, this sufficient statistic is minimal as it is already one-dimensional.
▶ Example 2: Uniform Distribution
Suppose that X = (X1 , X2 , . . . , Xn ) is i.i.d. and each Xi satisfies the uniform distribution
over [0, θ] with unknown length θ. Then we claim that T(X) = max_{1≤i≤n} Xi is a sufficient
statistic. To see this, we write

    fX(X; θ) = ∏_{i=1}^n fXi(Xi; θ) = ∏_{i=1}^n (1/θ) 1_{[0,θ]}(Xi) = (1/θ^n) ∏_{i=1}^n 1_{[Xi,∞)}(θ)
             = (1/θ^n) 1_{[T(X),∞)}(θ) · 1,

where g(T(X), θ) = (1/θ^n) 1_{[T(X),∞)}(θ) and h(X) = 1.
Note that the tricky part is the identity 1_{[0,θ]}(Xi) = 1_{[Xi,∞)}(θ): for Xi ≥ 0, the condition Xi ≤ θ is the same as θ ≥ Xi.
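A short numerical check (Python sketch, hypothetical names): two samples with the same maximum have identical likelihood functions θ ↦ fX(x; θ), reflecting that h(X) = 1 and that the θ-dependence enters only through T(X) = max Xi.

```python
def likelihood(x, theta):
    """Joint density of i.i.d. Uniform[0, theta] observations."""
    if max(x) > theta:
        return 0.0
    return theta ** (-len(x))

x1 = [0.2, 1.7, 0.9]          # maximum 1.7 ...
x2 = [1.7, 1.1, 0.4]          # ... same maximum, different sample
for theta in (1.5, 2.0, 5.0):
    print(likelihood(x1, theta), likelihood(x2, theta))   # identical for every theta
```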
▶ Example 3: Gaussian Distribution with Unknown Mean
Suppose that X = (X1 , X2 , . . . , Xn ) is i.i.d. and each Xi satisfies the Gaussian distribution
with unknown mean θ but known variance σ². Then we claim that T(X) = Σ_{i=1}^n Xi
is a sufficient statistic. To see this, we have

    fX(X; θ) = ∏_{i=1}^n fXi(Xi; θ) = ∏_{i=1}^n (1/(√(2π) σ)) exp( −(Xi − θ)²/(2σ²) )
             = (1/(√(2π) σ))^n exp( −Σ_{i=1}^n (Xi − θ)²/(2σ²) )
             = (1/(√(2π) σ))^n exp( θ T(X)/σ² ) exp( −nθ²/(2σ²) ) · exp( −Σ_{i=1}^n Xi²/(2σ²) ),

where g(T(X), θ) = (1/(√(2π) σ))^n exp( θ T(X)/σ² − nθ²/(2σ²) ) and h(X) = exp( −Σ_{i=1}^n Xi²/(2σ²) ).
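To see this factorization at work numerically, here is an illustrative Python sketch (σ = 1 assumed, names are mine): two samples with the same sum have likelihood functions that differ only by the θ-independent factor h(X), so their ratio is constant in θ.

```python
from math import exp, pi, sqrt

def likelihood(x, theta, sigma=1.0):
    """Joint density of i.i.d. N(theta, sigma^2) observations."""
    L = 1.0
    for xi in x:
        L *= exp(-(xi - theta)**2 / (2 * sigma**2)) / (sqrt(2 * pi) * sigma)
    return L

x1 = [0.0, 1.0, 2.0]          # sum = 3
x2 = [0.5, 1.0, 1.5]          # same sum, different sample
for theta in (-1.0, 0.0, 2.5):
    print(likelihood(x1, theta) / likelihood(x2, theta))   # same ratio for every theta
```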
▶ Example 4: Gaussian Distribution with Unknown Mean and Variance
When the unknown mean is µ = θ1 and the unknown variance is σ² = θ2, then we claim that
T(X) = (T1(X), T2(X)) = ( Σ_{i=1}^n Xi , Σ_{i=1}^n Xi² ) is a sufficient statistic. To see this, we have

    fX(X; θ) = ∏_{i=1}^n fXi(Xi; θ) = ∏_{i=1}^n (1/√(2πθ2)) exp( −(Xi − θ1)²/(2θ2) )
             = (1/√(2πθ2))^n exp( −Σ_{i=1}^n (Xi − θ1)²/(2θ2) )
             = (1/√(2πθ2))^n exp( (θ1/θ2) T1(X) − T2(X)/(2θ2) ) exp( −nθ1²/(2θ2) ) · 1,

where g(T(X), θ) = (1/√(2πθ2))^n exp( (θ1/θ2) T1(X) − T2(X)/(2θ2) − nθ1²/(2θ2) ) and h(X) = 1.
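As a final numerical illustration (Python sketch, not part of the notes): two different samples sharing the same pair (ΣXi, ΣXi²) have identical likelihoods for every (θ1, θ2), consistent with h(X) = 1 in the factorization above.

```python
from math import exp, pi, sqrt

def likelihood(x, mean, var):
    """Joint density of i.i.d. N(mean, var) observations."""
    L = 1.0
    for xi in x:
        L *= exp(-(xi - mean)**2 / (2 * var)) / sqrt(2 * pi * var)
    return L

x1 = [0.0, 3.0, 3.0]          # sum = 6, sum of squares = 18
x2 = [1.0, 1.0, 4.0]          # same (sum, sum of squares), different sample
for mean, var in [(0.0, 1.0), (2.0, 0.5), (5.0, 4.0)]:
    print(likelihood(x1, mean, var), likelihood(x2, mean, var))   # identical values
```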