Measures of uncertainty, and
the P-Value controversy
Roderick Little
Outline
• Widespread concerns about scientific replicability
• Perception that misunderstandings and misuses of hypothesis testing and P-values contribute to this problem
• American Statistical Association (ASA) “Statement on Statistical Significance and P-Values”
• Review these issues, and discuss alternative approaches for conveying statistical uncertainty: P-values, confidence intervals, Bayesian inference
Inference for a population based on a sample
• Statistical inference: the process of making inferences
about parameters of a population based on sample data.
[Diagram: Sample (mean x̄, SD s) → Statistical Inference → Population (mean µ, SD σ)]
• Inference crucially requires that the sample is “representative” of the population (e.g. randomly selected), or an assumption that it is
• Statistical inferences are subject to uncertainty –
quantifying uncertainty is an important objective
Tools for assessing uncertainty
• Hypothesis Testing: basic tool is P-value
– P-value = Pr(“data”|null hypothesis). A low value (e.g. P
< 0.05) is interpreted as evidence against the null
hypothesis
• Interval Estimation: basic tool is the
Confidence interval – random interval that
includes the true value of a parameter in a given
proportion of repeated samples (e.g. 95%)
• Bayesian methods: basic tool is the Posterior
Distribution
– More on this later
Hypothesis testing
• Assesses consistency of the data with a particular null
value of the parameter
• For example, for inference about a mean
– Confidence interval: set of values of the mean consistent with the
data
– Hypothesis test: are the data consistent with a particular value of
the mean?
• Often the null value corresponds to “no difference” or “no
association”
Elements of a hypothesis test
• A scientific hypothesis, e.g. “new treatment is better than old
treatment”
• An associated null hypothesis H0. The null hypothesis is
often counter to the scientific hypothesis, e.g. “the average
difference in outcomes between treatments is zero”.
• An alternative hypothesis Ha : legitimate values of the
parameter if H0 is not true.
• A test statistic T computed from the data, which (a) has a
known distribution if the null hypothesis is true and (b)
provides information about the truth of the null hypothesis.
• The P-Value for the test is:
P = Pr(test statistic the same as or more extreme than T | H0)
• Small P-values are evidence against the null hypothesis
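To make this concrete, here is a minimal Python sketch (not from the slides; data and group sizes are hypothetical) that computes a P-value both from the t reference distribution and by permutation, directly estimating Pr(T as or more extreme | H0):

```python
# Illustrative sketch (not from the slides): a P-value two ways, for
# hypothetical outcome data under two treatments.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
old = rng.normal(loc=10.0, scale=2.0, size=30)  # hypothetical outcomes, old treatment
new = rng.normal(loc=11.0, scale=2.0, size=30)  # hypothetical outcomes, new treatment

# Test statistic T with a known distribution under H0: Student's t
t_obs, p_value = stats.ttest_ind(new, old)
print(f"t = {t_obs:.2f}, P-value = {p_value:.4f}")

# The same idea by brute force: under H0 the group labels are exchangeable,
# so estimate P = Pr(|T| >= |t_obs| | H0) by permuting the labels.
pooled = np.concatenate([new, old])
t_null = [stats.ttest_ind(p[:30], p[30:]).statistic
          for p in (rng.permutation(pooled) for _ in range(5000))]
print(f"permutation P-value = {np.mean(np.abs(t_null) >= abs(t_obs)):.4f}")
```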
More on P-Value
P-Value = Pr(“data” | H0), where “data” = “values of T at least as extreme as that observed”.
Measures consistency of the data with H0.
The P-Value is not Pr(H0 | data); that is, it is not the probability that H0 is true given the data.
(The latter is computed in Bayesian hypothesis testing.)
The misinterpretation of p-values:
Experiment in McShane and Gal (2017 JASA)
“The study aimed to test how different interventions might
affect terminal cancer patients’ survival. Subjects were
randomly assigned to one of two groups. Group A was
instructed to write daily about positive things they were blessed
with while Group B was instructed to write daily about
misfortunes that others had to endure.
Subjects were then tracked until all had died. Subjects in Group
A lived, on average, 8.2 months post-diagnosis whereas
subjects in Group B lived, on average, 7.5 months post-
diagnosis (p = 0.01). Which statement is the most accurate
summary of the results?
McShane and Gal (2017 JASA)
Speaking only of the subjects who took part in this
particular study:
A. the average number of post-diagnosis months lived by
the subjects who were in Group A was greater than that
lived by the subjects who were in Group B.
B. the average number of post-diagnosis months lived by
the subjects who were in Group A was less than that lived
by the subjects who were in Group B.
C. The average number of post-diagnosis months lived by
the subjects who were in Group A was no different than
that lived by the subjects who were in Group B.
D. It cannot be determined whether the average number of
post-diagnosis months lived by the subjects who were in
Group A was greater/no different/less than that lived by
the subjects who were in Group B.
McShane and Gal (2017 JASA)
After seeing this question, each subject was
asked the same question again but p = 0.01 was
switched to p = 0.27 (or vice versa for the
subjects in the condition that presented the p =
0.27 version of the question first)”
Proportion choosing A (correct answer): NEJM readers [figure]
Proportion choosing A (correct answer): JASA readers [figure]
P-Values
P-values can indicate how incompatible the data are with a
specified statistical model.
P–values are not:
(a) The probability that the null hypothesis is true
(b) Good measures of the size of an effect:
Smaller deviations from the null can be detected with larger
sample sizes, so the P-Value is strongly dependent on
sample size
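A small sketch of point (b), with a hypothetical effect of 0.1 SD: the P-value for the same effect shrinks steadily as n grows:

```python
# Sketch (not from the slides): the same small true effect gives very
# different P-values as the sample size grows.
import numpy as np
from scipy import stats

effect = 0.1  # hypothetical true mean difference, in SD units -- tiny
for n in (20, 200, 2000, 20000):
    z = effect * np.sqrt(n)   # expected z-statistic for H0: mean = 0
    p = 2 * stats.norm.sf(z)  # two-sided P-value at that z
    print(f"n = {n:6d}: z = {z:5.2f}, P = {p:.4f}")
```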
Significance level
• A classical significance test sets a cut-off value α, and formally “rejects” the null hypothesis if P-value < α, “accepts” the null hypothesis if P-value > α
• The cut-off α is called the “significance level”, “size” or
“type 1 error” of the test, and has the property that
Pr(reject Null|Null true) = α
• The choice of significance level α is somewhat
arbitrary; a typical value by convention is 0.05 (but
more on this below).
• P = 0.049 is not substantively different from P=0.051,
but one “rejects” and the other “accepts” at the 5%
level.
• So I think it is better to avoid a cut-off and just report
the P-value
Redefining significance
Comparisons with Bayesian hypothesis testing by my ex-
colleague Val Johnson suggest that the common
“P<.05” significance level is weak evidence against the
null, contributing to the lack of replicability of results
Hence my limerick:
“In statistics one thing do we cherish,
P < .05 we publish, else perish.
Val says that’s so out-of-date,
Our studies don’t replicate,
P < .005, then null is rubbish!”
Redefining significance
• A recent 74-author (!) paper (Benjamin…V. Johnson. Redefine Statistical Significance. 2017 Nature Human Behaviour) argues for changing the threshold from .05 to .005, based on comparing P-values with Bayes Factors for a simple null
Let D = data, H = hypothesis.
Bayes’ rule converts Pr(D|H) into Pr(H|D), and is a simple consequence of basic rules of probability:
Pr(H, D) = Pr(D) × Pr(H | D) = Pr(H) × Pr(D | H)
Pr(H | D) = Pr(H) × Pr(D | H) / Pr(D)
Hence, Pr(H | D) / Pr(H′ | D) = [Pr(H) / Pr(H′)] × [Pr(D | H) / Pr(D | H′)]
That is, posterior odds = prior odds × Bayes factor
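A tiny numeric sketch of the identity (all numbers hypothetical):

```python
# Numeric sketch of posterior odds = prior odds x Bayes factor.
# All numbers are hypothetical.
prior_H = 0.5                # Pr(H): indifferent prior
prior_odds = prior_H / (1 - prior_H)
bayes_factor = 3.0           # Pr(D|H) / Pr(D|H'): modest evidence for H

posterior_odds = prior_odds * bayes_factor
print("Pr(H|D) =", posterior_odds / (1 + posterior_odds))  # 0.75
```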
Strength of evidence against null [figure]
More on significance level
• Regardless of the threshold, it is a bad idea to
publish only statistically significant results, since
this leads to publication bias
– for interpretation, we need to know about negative
studies too!
– journals should report results from methodologically
sound studies that address important questions,
whether or not results are significant
From ASA P-Value Statement
• “P-values can indicate how incompatible the data are with a
specified statistical model.
• P-values do not measure the probability that the studied
hypothesis is true, or the probability that the data were
produced by random chance alone.
• Scientific conclusions and business or policy decisions
should not be based only on whether a p-value passes a
specific threshold.
• Proper inference requires full reporting and transparency.
• A p-value, or statistical significance, does not measure the
size of an effect or the importance of a result.
• By itself, a p-value does not provide a good measure of
evidence regarding a model or hypothesis.”
Full reporting and transparency
• Bad practice: Carry out many statistical tests and only report
significant ones. Transparency here is to report all the tests
carried out, whether or not significant.
• With 20 independent tests, on average one will be significant at the 5% level even if all effects are null
• Question is whether interest is in controlling type 1 error of
each individual test, or over all the tests in the experiment.
• If latter, one simple (if crude) approach is the Bonferroni
correction: divide the significance level by number of tests
made; e.g. if 10 tests and sig level .05, test at .05/10 = .005
level
• Related: in genetics with many genes tested, significance
level is chosen to be very low.
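A minimal sketch of the Bonferroni correction and the 20-tests point (P-values hypothetical):

```python
# Sketch of the Bonferroni correction described above (hypothetical P-values).
import numpy as np

alpha = 0.05
p_values = np.array([0.001, 0.004, 0.02, 0.03, 0.2, 0.5])
m = len(p_values)

print("rejected at per-test alpha:    ", (p_values < alpha).sum())
print("rejected at Bonferroni alpha/m:", (p_values < alpha / m).sum())

# The "20 tests" point: chance of at least one P < .05 when all nulls are true
print("Pr(at least one false positive in 20 tests) =", 1 - 0.95**20)  # ~0.64
```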
P-Value is not the effect size
• P-value is poor measure of the size of an effect –
– size of P-value has no clinical meaning
– mixes estimate of effect and its uncertainty
– strongly determined by sample size – since nothing is exactly zero, anything is significant with a large enough sample … and we are entering the era of big data!
– One-sided or two-sided alternative – not always clear
– The more important question is the size of the effect,
not whether it differs from zero
Problems with P-Values
“Hypothesis testing, as performed in the applied
sciences, is criticized. Then assumptions that the
author believes should be axiomatic in all statistical
analyses are listed. These assumptions render
many hypothesis tests superfluous. The author
argues that the image of statisticians will not
improve until the nexus between hypothesis testing
and statistics is broken.”
Marks R. Nester, “An Applied Statistician’s Creed,”
Applied Statistics (1996) 45, No. 4, pp. 401–410
Confidence intervals
• A confidence interval is an estimate with an associated measure of uncertainty
• Confidence interval property – in hypothetical
repeated samples, the 95% interval includes the
true value of the parameter at least 95% of the
time. Here 95% is the “nominal coverage” of the CI
– Example: the 95% CI for the population mean in a normal sample of size n with mean x̄ and SD s is
x̄ ± t.975 × s/√n
where t.975 is the 97.5th percentile of the t distribution with n − 1 degrees of freedom. In particular, t.975 ≈ 1.96 if n > 50, and t.975 = 2.447 if n = 7.
Roughly “estimate ± two SEs” for moderate-sized n
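A short Python sketch of this interval, with a hypothetical sample of n = 7:

```python
# Sketch of the t-interval above, with a hypothetical sample of n = 7.
import numpy as np
from scipy import stats

x = np.array([4.2, 5.1, 3.8, 4.9, 5.3, 4.4, 4.6])
n, xbar, s = len(x), x.mean(), x.std(ddof=1)

t975 = stats.t.ppf(0.975, df=n - 1)  # = 2.447 for n = 7
half = t975 * s / np.sqrt(n)
print(f"95% CI: {xbar:.2f} +/- {half:.2f}")
```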
Confidence Intervals
[Figure: hypothetical repeated samples, each yielding a random interval around the unknown true value of parameter θ. Confidence interval property: (at least) 95% of these random intervals include the true value.]
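The coverage property is easy to check by simulation; a sketch with hypothetical settings:

```python
# Simulation sketch of the coverage property in the figure: in repeated
# samples, about 95% of the random t-intervals cover the fixed true mean.
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
theta, sigma, n, reps = 10.0, 2.0, 20, 10_000
t975 = stats.t.ppf(0.975, df=n - 1)

covered = 0
for _ in range(reps):
    x = rng.normal(theta, sigma, size=n)
    half = t975 * x.std(ddof=1) / np.sqrt(n)
    covered += (x.mean() - half) <= theta <= (x.mean() + half)
print(f"empirical coverage: {covered / reps:.3f}")  # close to 0.95
```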
Confidence Intervals: better for
inference than P-values
• Estimate has clinical meaning – closer to the
science. Good measurement is the heart of
statistics
• Width of interval captures uncertainty
• Confidence interval summarizes the evidence in
a natural way
Study A: small trial
Success Failure
Treatment 1 10 (50%) 10 (50%)
Treatment 2 15 (75%) 5 (25%)
• Null Hypothesis H0: Outcome independent of treatment, or treatments equally effective
• Chi-squared test of equality of proportions: P = 0.102
• P = Pr(tables with treatment differences as or more extreme than that observed | H0)
• Conclusion: “accept” H0 at 5% level
Study B: large trial
Success Failure
Treatment 1 500 (50%) 500 (50%)
Treatment 2 550 (55%) 450 (45%)
• Null Hypothesis H0: Outcome independent of treatment, or treatments equally effective
• Test of equality of proportions: P = 0.025
• Conclusion: Reject H0 at 5% level
Examples
• Study A: 95% CI for Diff = (-4.6%, 54.6%)
Wide, consistent with no difference, but large differences also possible
P-Value = .102. Not significant (NS), but this doesn’t mean there is no effect – NS does not mean the null hypothesis is true!
• Study B: 95% CI for Diff = (0.6%, 9.4%)
Narrow, not consistent with no difference, but a large difference is unlikely
P-Value = .025. Statistically significant, but the evidence is that the effect is not clinically significant!
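A sketch reproducing both studies in Python. The Pearson chi-squared test without continuity correction matches the quoted P-values; the slide’s Study A interval appears to use ±2 SEs rather than ±1.96, so the CI endpoints differ slightly:

```python
# Sketch reproducing Studies A and B (not the slides' exact code).
import numpy as np
from scipy import stats

def analyze(s1, f1, s2, f2):
    # Pearson chi-squared test without Yates correction
    chi2, p, _, _ = stats.chi2_contingency([[s1, f1], [s2, f2]],
                                           correction=False)
    n1, n2 = s1 + f1, s2 + f2
    p1, p2 = s1 / n1, s2 / n2
    se = np.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    diff = p2 - p1
    return p, (diff - 1.96 * se, diff + 1.96 * se)

print(analyze(10, 10, 15, 5))       # Study A: P ~ 0.102, CI ~ (-4%, 54%)
print(analyze(500, 500, 550, 450))  # Study B: P ~ 0.025, CI ~ (0.6%, 9.4%)
```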
Can warfarin be continued during dental extraction?
Results of a randomized controlled trial
• I. L. Evans, M. S. Sayers, A. J. Gibbons, G. Price, H. Snooks, A. W. Sugar. Brit. J. Oral &
Maxillofacial Surgery (2002) 40, 248–252
• SUMMARY. A randomized controlled trial was set up to
investigate whether patients who were taking warfarin …
require cessation of their anticoagulation drugs before dental
extractions.
• Of 109 patients who completed the trial, 52 were allocated to
the control group (warfarin stopped 2 days before extraction)
and 57 patients were allocated to the intervention group
(warfarin continued).
• The incidence of bleeding complications in the intervention
group was higher (15/57, 26%) than in the control group
(7/52, 14%)
• but this difference was not significant… we found no evidence
of an increase in clinically important bleeding. As there are
risks associated with stopping warfarin, the practice of
routinely discontinuing it before dental extractions should be
reconsidered.
Clinical vs statistical significance
• “Incidence of bleeding complications in the
intervention group was higher (15/57, 26%) than
in the control group (7/52, 14%) but this
difference was not significant ...we found no
evidence of an increase in clinically important
bleeding.”
– Is 26% vs 14% clinically significant? 95% confidence
interval for difference in proportions = (0, 0.28)
– Study seems underpowered (sample size too small)
– a common problem in clinical trials
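A quick check of these numbers (Wald interval sketch; the interval method used in the paper may differ slightly):

```python
# Quick check of the warfarin numbers: 15/57 vs 7/52.
import numpy as np

p1, n1 = 15 / 57, 57  # intervention: warfarin continued
p2, n2 = 7 / 52, 52   # control: warfarin stopped
diff = p1 - p2
se = np.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
print(f"diff = {diff:.1%}, 95% CI = ({diff - 1.96*se:.1%}, {diff + 1.96*se:.1%})")
# -> diff = 12.9%, 95% CI roughly (-1.9%, 27.6%): wide, so a clinically
#    important increase in bleeding is not ruled out.
```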
Some objections to CIs
• Confidence intervals are peculiar objects: the
interval is random, but the parameter is fixed
• For some basic problems there is no CI
procedure that gives exactly the nominal
coverage
– Behrens-Fisher problem: comparing means of two
normal samples with unknown means and
variances, not assumed to be equal.
• Basing inference on sampling distribution
violates the likelihood principle – experiments
leading to the same likelihood function should
have the same inference
A related problem with CIs
• What should be included in the set of hypothetical
repetitions -- the reference set -- is not always
clear
– and different choices give different confidence intervals
Example: Independence in a 2×2 Contingency Table

             Outcome
             S     F
Treatment A  170   2
Treatment B  162   9

H0: πA = πB;  Ha: πA > πB

Alternative tests:
Pearson chi-squared (C)         P = 0.016
Yates continuity corrected (Y)  P = 0.032
Fisher exact test (F)           P = 0.030
Bayes Pr(πA < πB | data)        Pr = 0.013
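A sketch reproducing the competing tests in Python; the chi-squared P-values are halved to make them one-sided, which is valid here because the observed difference is in Ha’s direction:

```python
# Sketch reproducing the competing tests for the 2x2 table above.
import numpy as np
from scipy import stats

table = np.array([[170, 2],   # treatment A: S, F
                  [162, 9]])  # treatment B: S, F

_, p_pearson, _, _ = stats.chi2_contingency(table, correction=False)
_, p_yates, _, _ = stats.chi2_contingency(table, correction=True)
_, p_fisher = stats.fisher_exact(table, alternative="greater")

print(f"Pearson (C): {p_pearson / 2:.3f}")  # ~0.016 (one-sided)
print(f"Yates (Y):   {p_yates / 2:.3f}")    # ~0.032 (one-sided)
print(f"Fisher (F):  {p_fisher:.3f}")       # ~0.030
```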
Independence in 2x2 tables
• Choice of test doesn’t matter in large samples,
but it does in small/moderate samples
• Fisher test is conservative when one margin is
fixed in repeated sampling (as is common in
many practical designs), but exact if both
margins are fixed
• Should the reference set condition on second
margin or not? It’s debatable (Yates 1984, Little
1989)
• Frequentist theory is ambiguous, and
frequentists disagree about which is the right
test
A CI is not a probability interval
Most people interpret a confidence interval as a
probability interval: a fixed interval that includes the
unknown parameter with 95% probability. That is, the
interval is fixed, the parameter is random. Unfortunately,
confidence intervals have some properties that are in
conflict with this idea:
For example, an interval A that includes an interval B on
a particular data set may have lower confidence
coverage!
Bayes turns confidence intervals into probability intervals, and Pr(D|H) (as in P-values) into Pr(H|D) (what we really want)…
Example: Inference for a mean with bound on
precision
A normal sample with n = 7, ȳ = 1, s = 1 yields
PI^BRP_.05(s = 1) = CI^F_.05(s = 1) = ȳ ± 2.447 s/√n = 1 ± 0.92   (1)
Experimenter E tells us that the true sd σ = 1.5:
PI^BRP_.05(σ = 1.5) = CI^F_.05(σ = 1.5) = 1 ± 1.96 × 1.5/√7 = 1 ± 1.11   (2)
E: oops, there’s more variance! In fact σ > 1.5!
PI^BRP_.05(σ > 1.5) = 1 ± 1.45   (3)
(Here PI^BRP denotes the 95% Bayesian probability interval under a reference prior, and CI^F the 95% frequentist confidence interval.)
What does a frequentist do? Pick your poison:
(1) is an exact 95% CI but is clearly the wrong inference!
(2) is an anti-conservative 95% CI (though it contains (1)!)
(3) is correctly wider than (2), but it’s Bayes, not a 95% CI,
and depends on the choice of prior
Pr(D|H) or Pr(H|D)?
• Pr(D|H) is easier, but Pr(H|D) is what we really care
about
• Classical or frequentist statistics (the stuff you learnt
in a basic statistics course) stops at Pr(D|H):
– P-value = Pr(D|H), not Pr(H|D)
– Confidence intervals: proportion of intervals in repeated
sampling that include a fixed parameter, not Pr(fixed
interval includes parameter)
• Bayesian statistics tries for Pr(H|D)
Pr(D|H) or Pr(H|D)?
• Bayesians boldly (rashly?) seek Pr(H|D)
• Getting from Pr(D|H) to Pr(H|D) is called the
inverse probability problem: Bayes’ rule is the
link…
Bayes’ rule
• Bayes’ rule converts Pr(D|H) into Pr(H|D), and is a simple
consequence of basic rules of probability:
Pr(H, D) = Pr(D) × Pr(H | D) = Pr(H) × Pr(D | H)
Pr(H | D) = Pr(H) × Pr(D | H) / Pr(D)
Hence, Pr(H | D) / Pr(H′ | D) = [Pr(H) / Pr(H′)] × [Pr(D | H) / Pr(D | H′)]
That is, posterior odds = prior odds × Bayes factor
• Bayes’ rule also converts confidence interval statements into posterior distributions for parameters θ:
p(θ | D) ∝ p(θ) × p(D | θ)
posterior ∝ prior × likelihood
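A sketch of posterior ∝ prior × likelihood in the simplest conjugate setting, a proportion with a Beta prior (numbers hypothetical):

```python
# Sketch of posterior = prior x likelihood in the simplest conjugate case:
# a Beta(a, b) prior for a proportion plus s successes in n Bernoulli
# trials gives a Beta(a + s, b + n - s) posterior. Numbers hypothetical.
from scipy import stats

a, b = 1, 1    # uniform prior on the success probability
s, n = 15, 20  # hypothetical data: 15 successes in 20 trials

posterior = stats.beta(a + s, b + n - s)
lo, hi = posterior.ppf([0.025, 0.975])
print(f"posterior mean = {posterior.mean():.3f}")
print(f"95% probability interval = ({lo:.3f}, {hi:.3f})")
```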
A simple application of Bayes:
Screening Tests
• A friend is diagnosed by a screening test (D = result
of test, + or -) to have an extremely rare form of
cancer (H = has cancer). Only one out of a million
people in his age group have the cancer.
• Naturally he is very upset as the test is pretty
accurate:
Sensitivity: Pr(+ | has cancer) = 0.99, implying Pr(− | has cancer) = 0.01 (false negative rate)
Specificity: Pr(− | no cancer) = 0.999, implying Pr(+ | no cancer) = 0.001 (false positive rate)
False Positive
• The probability that matters is the positive
predictive value, which by Bayes Rule is
Pr(has cancer | +) = Pr(+ | has cancer) × Pr(has cancer) / Pr(+)
= (0.99)(1/1000000) / [(0.99)(1/1000000) + (0.001)(999999/1000000)]
≈ 0.001 (!)
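The same calculation in code:

```python
# The slide's calculation in code: Bayes' rule for the positive
# predictive value of the screening test.
prevalence = 1 / 1_000_000
sensitivity = 0.99   # Pr(+ | cancer)
specificity = 0.999  # Pr(- | no cancer)

p_positive = (sensitivity * prevalence
              + (1 - specificity) * (1 - prevalence))
ppv = sensitivity * prevalence / p_positive
print(f"Pr(cancer | +) = {ppv:.6f}")  # ~0.001
```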
False Positive
Very likely, the friend does not have cancer.
Bayesian statistics treats all unknowns
(including fixed quantities) as random
• Frequentist statistics does not allow probability
statements about fixed quantities – such as the true value
of a parameter. Probability is the limit of the frequency of
events in repeated sampling
• Bayes uses probability statements to express
uncertainty about all unknowns, whether “fixed” or
“random”
• In this sense any unknown is treated as a random variable,
until its value is known.
• This idea greatly extends the reach of probabilistic
statements.
History of Bayes
• Much maligned in the last century, Bayesian
statistics has since experienced a dramatic
revival
• See for example “The theory that would not
die” by Sharon McGrayne
Bayes and the University of Michigan
• Arthur Bailey: BS in actuarial mathematics from U of M in 1928; affirmed the Bayesian roots of “credibility theory” for setting workers’ compensation insurance rates
• Allen Mayerson, actuarial professor at U-M, wrote about Bailey’s seminal role
• Howard Raiffa: enrolled in actuarial mathematics at U of M, got his Ph.D. in 1952. With Robert Schlaifer, wrote a highly influential book on Bayesian decision theory.
Bayes at U Michigan
• Leonard Jimmie Savage (mathematics PhD at U of M and professor at Chicago and later U of M) became a leader of the Bayesian revival
• In 1969 Bill Ericson (U of M Statistics Department) wrote the seminal paper on Bayes for sample surveys
[Photo: L. J. Savage]
Calibrated Bayes
“… frequency calculations are
useful for making Bayesian
statements scientific,
scientific in the sense of
capable of being shown
wrong by empirical test; here
the technique is the
calibration of Bayesian
probabilities to the
frequencies of actual events.”
Don Rubin (1984 Annals of Statistics)
Factoring in scientific plausibility
• Bayesian hypothesis testing formally allows prior
scientific plausibility to modify the assessment of
evidence, through the choice of prior distribution:
H = “Homeopathy works”, H′ = “Homeopathy doesn’t work”.
Pr(H | data) / Pr(H′ | data) = [Pr(H) / Pr(H′)] × [Pr(data | H) / Pr(data | H′)]
Posterior odds = Prior odds × Bayes factor
• For example, I’d give theories like homeopathy
based on dubious science “skeptical priors”.
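A sketch of how a skeptical prior dominates the arithmetic (numbers hypothetical):

```python
# Sketch of a skeptical prior at work (all numbers hypothetical): even a
# strong Bayes factor cannot rescue a scientifically implausible hypothesis.
prior_odds = 1 / 10_000  # skeptical: homeopathy very unlikely a priori
bayes_factor = 20        # strong evidence from the data, by most standards

posterior_odds = prior_odds * bayes_factor
print("Pr(homeopathy works | data) =",
      round(posterior_odds / (1 + posterior_odds), 4))  # still ~0.002
```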
What’s bad about Bayes?
• “OK for gambling, but too subjective for science”
– But frequentist methods can also make strong assumptions
– Bayes makes assumptions in a model explicit, subject to
criticism
– Bayesian methods differ greatly in degree of subjectivity, e.g. in choice of model or prior
• Requires a high degree of model specification
– Bad models yield bad answers
– Need to pay attention to developing a good statistical model
• Too much work, computationally intractable
– But computation is now feasible, using Monte Carlo simulation methods
Summary
• Hypothesis testing and associated P-Values are widely
viewed as flawed for assessing evidence
• Confidence intervals are better ways of assessing evidence,
but they have some problems.
• Bayesian methods provide direct answers to the questions
we really want to answer – what’s the probability that a
hypothesis is correct, or that an interval contains the
parameter of interest
• So Bayesian methods are an alternative or complement to
frequentist methods
• … although we would like our Bayesian methods to have
good frequentist properties (to be well calibrated).