How Can We Make Experimental Research
Results More Reliable and Replicable?
John A. List, U. Chicago, ANU, and NBER
@Econ_4_Everyone
Building Confidence in (and Knowledge from)
Experimental Results
1. Scientific research aims to create a stock of knowledge.
Optimally adding to this stock requires confidence in the
received estimates.
2. A key question concerning confidence revolves around the
query: after a research finding has been claimed, what is the
post-study probability that it is true?
3. Two unique features of the experimental approach situate it
well to deepen the stock of scientific knowledge: selective
data generation and the ability to enhance the notion, and role,
of replications.
A Simple Bayesian Framework
to Build Knowledge
PSP: Probability that a declaration of a research
finding, made upon reaching statistical
significance, is true.
α: Level of statistical significance
1 - β: Level of power
π: The prior (the pre-study probability that the tested relationship is true)
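In this framework, the post-study probability after a statistically significant finding follows from Bayes' rule (the same expression used in Maniadis, Tufano, and List 2014); the exhibit values below are consistent with α = 0.05:

$$
\mathrm{PSP} = \frac{(1-\beta)\,\pi}{(1-\beta)\,\pi + \alpha\,(1-\pi)}
$$

For example, with π = 0.10, power 1 − β = 0.50, and α = 0.05, PSP = 0.05 / (0.05 + 0.045) ≈ 0.53, the corresponding entry in Exhibit 15.5 below.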
Some Inference
Exhibit 15.5: PSPs With and Without a Statistically Significant Finding

PSP (reject null)
                        Power (1 − β)
Prior (π)    0.20    0.30    0.50    0.70    0.80
0.01         0.04    0.06    0.09    0.12    0.14
0.05         0.17    0.24    0.34    0.42    0.46
0.10         0.31    0.40    0.53    0.61    0.64
0.20         0.50    0.60    0.71    0.78    0.80
0.30         0.63    0.72    0.81    0.86    0.87
0.40         0.73    0.80    0.87    0.90    0.91
0.50         0.80    0.86    0.91    0.93    0.94
What Can Go Wrong?
Controlling the False Positive Rate
Statistical Error (alpha)
Human Error (how we generate/evaluate/interpret data)
Human Fraud (less rare than we hope)
The importance of replication becomes clear.
One Example of Human Error: Multiple Hypothesis Testing (MHT)
[Figure: Histograms of P-values across tests, with the fraction of tests on the vertical axis (0 to 0.5) and the P-value on the horizontal axis (0.05 to 1.0); one panel shows reported P-values, the other Holm-corrected P-values.]
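To make the correction concrete, here is a minimal sketch of the Holm step-down adjustment in Python; the function name and the raw P-values are illustrative, not taken from the underlying study.

```python
def holm_correction(pvalues):
    """Holm step-down adjustment for multiple hypothesis testing (MHT).

    The k-th smallest raw p-value is scaled by (m - k + 1), the adjusted
    values are made monotone non-decreasing in k, and each is capped at 1.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):          # rank is 0-based, so m - rank = m - k + 1
        adj = min((m - rank) * pvalues[idx], 1.0)
        running_max = max(running_max, adj)
        adjusted[idx] = running_max
    return adjusted

# Illustrative raw p-values: results that look significant at 0.05 often do not survive.
raw = [0.01, 0.04, 0.03, 0.20, 0.049]
print([round(p, 3) for p in holm_correction(raw)])   # -> [0.05, 0.12, 0.12, 0.2, 0.12]
```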
Building Confidence: What Can We Do?
1) Reduce Bias
2) Promote Transparency
3) Promote Scrutiny
1. One Kind of Bias
A common belief is that statistically significant results carry much greater import than null results.
Scientific journals might prefer statistically
significant “newsworthy” results
Funders might reward scholars who produce
noteworthy insights
Ultimately, scientists might conclude that journal
publications and streams of funding matter a great
deal in tenure decisions
Yet, from a scientific perspective of building
knowledge, such skewed preferences are flawed
(see List, 2024).
Null Results Are Informative Too
Exhibit 15.5: PSPs With and Without a Statistically Significant Finding

PSP (reject null)
                        Power (1 − β)
Prior (π)    0.20    0.30    0.50    0.70    0.80
0.01         0.04    0.06    0.09    0.12    0.14
0.05         0.17    0.24    0.34    0.42    0.46
0.10         0.31    0.40    0.53    0.61    0.64
0.20         0.50    0.60    0.71    0.78    0.80
0.30         0.63    0.72    0.81    0.86    0.87
0.40         0.73    0.80    0.87    0.90    0.91
0.50         0.80    0.86    0.91    0.93    0.94

PSP (null result)
                        Power (1 − β)
Prior (π)    0.20    0.30    0.50    0.70    0.80
0.01         0.01    0.01    0.01    0.00    0.00
0.05         0.04    0.04    0.03    0.02    0.01
0.10         0.09    0.08    0.06    0.03    0.02
0.20         0.17    0.16    0.12    0.07    0.05
0.30         0.27    0.24    0.18    0.12    0.08
0.40         0.36    0.33    0.26    0.17    0.12
0.50         0.46    0.42    0.34    0.24    0.17
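A minimal Python sketch of the Bayes'-rule calculations behind both panels, assuming α = 0.05 (consistent with the exhibit's values); the function names are mine:

```python
ALPHA = 0.05  # significance level assumed to underlie Exhibit 15.5

def psp_significant(prior, power, alpha=ALPHA):
    """Post-study probability the relationship is true, given a significant result."""
    return power * prior / (power * prior + alpha * (1 - prior))

def psp_null(prior, power, alpha=ALPHA):
    """Post-study probability the relationship is true, given a null result."""
    beta = 1 - power  # Type II error rate
    return beta * prior / (beta * prior + (1 - alpha) * (1 - prior))

# Reproduce both panels of Exhibit 15.5 (up to rounding).
powers = (0.20, 0.30, 0.50, 0.70, 0.80)
for prior in (0.01, 0.05, 0.10, 0.20, 0.30, 0.40, 0.50):
    sig = [round(psp_significant(prior, p), 2) for p in powers]
    null = [round(psp_null(prior, p), 2) for p in powers]
    print(f"{prior:.2f}  reject null: {sig}  null result: {null}")
```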
Implications
If our goal is to build scientific knowledge, then recognizing and rewarding null results, especially those that move priors, is important.
Side benefit: doing so will also reduce the level of bias in our science.
2. Promote Transparency
Pre-Registration (must be well timed)
Pre-Analysis Plans (must be well timed)
Registered Reports (not for all journals)
Scientific transparency alone does not verify the validity
of the received results. Rather, it permits an exploration
of the received claims.
In this manner, transparency and scrutiny are
complements in enhancing knowledge building.
Implications
When building scientific knowledge, it is important to
understand that there is a crucial distinction between
the probability that a reported significant finding in the
literature represents a real relationship and the
probability that an individual experiment has
uncovered a real relationship.
Side benefits of enhanced transparency: reduces bias
and provides a better depiction of what the literature is
finding.
3. Promote Scrutiny (Replications)
Pure replication: examine same question using the underlying original data set.
Robustness analysis: start from the exact same data as the original analysis but modify how the data are handled or the empirical methods used, to see whether the results are robust.
Same population replication: running a new experiment closely following the
original protocol to test whether similar results can be generated using random
draws from the same underlying population.
Similar population replication: conducting an experiment with UCLA
undergraduates to replicate a previous lab experiment conducted with University
of Maryland undergraduates.
Disparate population replication: examining the same question and model using
a population dissimilar from the original experiment.
Finally, the sixth and broadest replication category entails testing the hypotheses of the original study using a new research design; this is known as a conceptual replication.
How Fast Can We Build Confidence?
The Power of Replication
PSP after the original significant finding and k further successful replications
(k = 0 is the original study alone)

             Power (1 − β) = 0.80             Power (1 − β) = 0.50
Prior (π)    k=0   k=1   k=2   k=3            k=0   k=1   k=2   k=3
0.01         0.14  0.72  0.98  1.00           0.09  0.50  0.91  0.99
0.02         0.25  0.84  0.99  1.00           0.17  0.67  0.95  1.00
0.05         0.46  0.93  1.00  1.00           0.34  0.84  0.98  1.00
0.10         0.64  0.97  1.00  1.00           0.53  0.92  0.99  1.00
0.20         0.80  0.98  1.00  1.00           0.71  0.96  1.00  1.00
0.30         0.87  0.99  1.00  1.00           0.81  0.98  1.00  1.00
0.40         0.91  0.99  1.00  1.00           0.87  0.99  1.00  1.00
0.50         0.94  1.00  1.00  1.00           0.91  0.99  1.00  1.00
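The exhibit appears to follow from applying the same Bayes'-rule update sequentially, with each successful (significant) replication taking the previous PSP as its prior. A minimal sketch, assuming α = 0.05 and independent replications run at the stated power; the function names are mine:

```python
ALPHA = 0.05  # assumed significance level

def psp_significant(prior, power, alpha=ALPHA):
    """PSP after a single statistically significant finding."""
    return power * prior / (power * prior + alpha * (1 - prior))

def psp_after_replications(prior, power, k, alpha=ALPHA):
    """PSP after the original significant finding plus k significant replications,
    letting each new result take the previous PSP as its prior."""
    psp = psp_significant(prior, power, alpha)
    for _ in range(k):
        psp = psp_significant(psp, power, alpha)
    return psp

# A long-shot hypothesis (prior 0.01) studied at 0.80 power.
print([round(psp_after_replications(0.01, 0.80, k), 2) for k in range(4)])
# -> [0.14, 0.72, 0.98, 1.0]
```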
What About Other Types of Replication?
                        Power (1 − β)
Prior (π)    0.20    0.30    0.50    0.70    0.80

δ = 0.00
0.01         0.04    0.06    0.09    0.12    0.14
0.05         0.17    0.24    0.34    0.42    0.46
0.10         0.31    0.40    0.53    0.61    0.64
0.20         0.50    0.60    0.71    0.78    0.80
0.30         0.63    0.72    0.81    0.86    0.87
0.40         0.73    0.80    0.87    0.90    0.91
0.50         0.80    0.86    0.91    0.93    0.94

δ = 0.10
0.01         0.02    0.03    0.04    0.05    0.05
0.05         0.09    0.12    0.17    0.21    0.23
0.10         0.18    0.22    0.30    0.36    0.39
0.20         0.33    0.39    0.49    0.56    0.59
0.30         0.45    0.52    0.62    0.68    0.71
0.40         0.56    0.63    0.72    0.77    0.79
0.50         0.66    0.72    0.79    0.83    0.85

δ = 0.25
0.01         0.01    0.02    0.02    0.03    0.03
0.05         0.07    0.08    0.10    0.12    0.13
0.10         0.13    0.16    0.19    0.23    0.25
0.20         0.26    0.29    0.35    0.40    0.43
0.30         0.37    0.41    0.48    0.54    0.56
0.40         0.48    0.52    0.59    0.64    0.66
0.50         0.58    0.62    0.68    0.73    0.75

δ = 0.50
0.01         0.01    0.01    0.01    0.02    0.02
0.05         0.06    0.06    0.07    0.08    0.08
0.10         0.11    0.12    0.14    0.15    0.16
0.20         0.22    0.24    0.26    0.29    0.30
0.30         0.33    0.35    0.38    0.41    0.42
0.40         0.43    0.45    0.49    0.52    0.53
0.50         0.53    0.55    0.59    0.62    0.63
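These panels are consistent with the bias-augmented PSP expression used in Maniadis, Tufano, and List (2014), with α = 0.05 assumed and the panel parameter (rendered above as δ) entering as the share of findings affected by bias; a sketch of the formula:

$$
\mathrm{PSP} = \frac{(1-\beta)\,\pi + \delta\,\beta\,\pi}{(1-\beta)\,\pi + \delta\,\beta\,\pi + \left[\alpha + \delta\,(1-\alpha)\right](1-\pi)}
$$

With δ = 0 this reduces to the earlier expression; with δ = 0.10, π = 0.01, and power 0.20 it gives roughly 0.02, matching the second panel.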
Implications
When building knowledge, it is important to have rapid scrutiny so that the course of science can correct quickly.
We always focus on false positives, but one might argue that researchers' tolerance for false negatives has potentially irreversible effects on the development of scientific knowledge: since false negative results are less likely to be followed up than false positives, self-correction is less likely to occur in these cases.
Side benefits of scrutiny: reduces bias and provides a better
depiction of what the literature is finding.
What Could Go Wrong?
The Great Endangered Species!
Maniadis et al. (2017) survey experimental papers published between 1975 and 2014 in the top 150 journals in economics and estimate that the fraction of replication studies among all experimental papers in their sample is 4.2%.
Changing Incentives
Replications typically bring little recognition (few journals are interested) and can even invite scorn.
JESA and JPE: Micro are promising steps.
We need to change authors' incentives to collaborate with replicators. Should positive replications of one's work be considered a "super cite"?
Promoting Reproducibility
Butera and List (2017): original investigators of a study commit to publishing their results only as a working paper and offer coauthorship of a second paper to others who are willing to replicate.
Dreber et al. (2015) suggest using prediction markets with experts as a quick, low-cost way to obtain information about reproducibility.
Cites
Abrams, Eliot, Jonathan Libgober, and John A. List. 2020. "Research Registries: Facts, Myths, and Possible Improvements." NBER Working Paper.
Alevy, Jonathan, John List, and Wiktor Adamowicz. 2010. "How Can Behavioral Economics Inform Non-Market Valuation? An Example from the Preference Reversal Literature." NBER Working Paper 16036, January. https://doi.org/10.3386/w16036.
Benjamin, Daniel J., James O. Berger, Magnus Johannesson, Brian A. Nosek, E.-J. Wagenmakers, Richard Berk, Kenneth A. Bollen, et al. 2017. "Redefine Statistical Significance." Nature Human Behaviour 2 (1): 6–10. https://doi.org/10.1038/s41562-017-0189-z.
Butera, Luigi, Philip J. Grossman, Daniel Houser, John A. List, and Marie-Claire Villeval. 2020. "A New Mechanism to Alleviate the Crises of Confidence in Science: With an Application to the Public Goods Game." NBER Working Paper 26801, February. https://doi.org/10.3386/w26801.
Butera, Luigi, and John A. List. 2017. "An Economic Approach to Alleviate the Crises of Confidence in Science: With an Application to the Public Goods Game." NBER Working Paper 23335, April. https://doi.org/10.3386/w23335.
Cites
Camerer, Colin F., Anna Dreber, Eskil Forsell, Teck-Hua Ho, Jürgen Huber, Magnus Johannesson, Michael
Kirchler, et al. 2016. “Evaluating Replicability of Laboratory Experiments in Economics.” Science 351 (6280):
1433–36. https://doi.org/10.1126/science.aaf0918.
Dreber, Anna, Thomas Pfeiffer, Johan Almenberg, Siri Isaksson, Brad Wilson, Yiling Chen, Brian A. Nosek,
and Magnus Johannesson. 2015. “Using Prediction Markets to Estimate the Reproducibility of Scientific
Research.” Proceedings of the National Academy of Sciences 112 (50): 15343–47.
https://doi.org/10.1073/pnas.1516179112.
Levitt, Steven D., and John A. List. 2009. “Field Experiments in Economics: The Past, the Present, and the
Future.” European Economic Review 53 (1): 1–18. https://doi.org/10.1016/j.euroecorev.2008.12.001.
List, John A. 2004. “Neoclassical Theory versus Prospect Theory: Evidence from the Marketplace.”
Econometrica 72 (2): 615–25.
Maniadis, Zacharias, Fabio Tufano, and John A. List. 2014. “One Swallow Doesn’t Make a Summer: New
Evidence on Anchoring Effects.” American Economic Review 104 (1): 277–90.
https://doi.org/10.1257/aer.104.1.277.
———. 2015. “How to Make Experimental Economics Research More Reproducible: Lessons from Other
Disciplines and a New Proposal.” Research in Experimental Economics 18 (January).
https://doi.org/10.1108/S0193-230620150000018008.
———. 2017. “To Replicate or Not to Replicate? Exploring Reproducibility in Economics through the Lens of a
Model and a Pilot Study.” The Economic Journal 127 (605): F209–35. https://doi.org/10.1111/ecoj.12527.
Tufano, Fabio, and John A. List. 2021. “On the Importance of ‘Null Effects’ in Economics.” Unpublished
Manuscript.