Maria De Paola, Francesca Gioia, Vincenzo Scoppa
Cited by*: None Downloads*: None

We ran a field experiment to investigate whether competing in rank-order tournaments with different prize spreads affects individual performance. Our experiment involved students from an Italian University who took an exam that was partly evaluated on the basis of relative performance. Students were matched in pairs on the basis of their high school grades and each pair was randomly assigned to one of three different tournaments. Random assignment neutralizes selection effects and allows us to investigate if larger prize spreads increase individual effort. We do not find any positive effect of larger prizes on performance. Furthermore, we show that the effect of prize spreads on students' performance depends on their degree of risk-aversion: competing in tournaments with large spreads negatively affects the performance of risk-averse students, while it does not produce any effect on students who are more prone to taking risks.
Bharat Chandar, Ali Hortacsu, John A List, Ian Muir, Jeffrey M Wooldridge
Cited by*: None Downloads*: None

Field experiments conducted with the village, city, state, region, or even country as the unit of randomization are becoming commonplace in the social sciences. While convenient, subsequent data analysis may be complicated by the constraint on the number of clusters in treatment and control. Through a battery of Monte Carlo simulations, we examine best practices for estimating unit-level treatment effects in cluster-randomized field experiments, particularly in settings that generate short panel data. In most settings we consider, unit-level estimation with unit fixed effects and cluster-level estimation weighted by the number of units per cluster tend to be robust to potentially problematic features in the data while giving greater statistical power. Using insights from our analysis, we evaluate the effect of a unique field experiment: a nationwide tipping field experiment across markets on the Uber app. Beyond the import of showing how tipping affects aggregate outcomes, we provide several insights on aspects of generating and analyzing cluster-randomized experimental data when there are constraints on the number of experimental units in treatment and control.
Michal Krawczyk, Ernesto Reuben
Cited by*: None Downloads*: None

This article reports results of a field experiment in which two hundred e-mails were sent to authors of recent articles in economics that had promised to send the interested reader supplementary material, such as alternative econometric specifications, "upon request." The e-mails were sent either by a researcher affiliated at Columbia University, New York or the University of Warsaw, Poland; furthermore, the authors' position (assistant professor) was specified in half the e-mails only. Overall, 64% of the approached authors responded to our message, of which two thirds (44% of the entire sample) delivered the requested materials. The frequency and speed of responding and delivering were very weakly affected by the position and affiliation of the sender. Gender of affiliation of the author, number of citations or journal impact factory or the type of object in question seemed to make no difference. However, authors of published articles were much more likely to share than authors of working papers.
Daniel J Benjamin, James O Berger, Magnus Johannesson, Brian A Nosek, E. J Wagenmakers, Richard Berk, Kenneth A Bollen, Bjorn Brembs, Lawrence Brown, Colin F Camerer, David Cesarini, Christopher D. Chambers, Merlise Clyde, Thomas D Cook, Paul De Boeck, Zoltan Dienes, Anna Dreber, Kenny Easwaran, Charles Efferson, Ernst Fehr, Fiona Fidler, Andy P. Field, Malcom Forster, Edward I. George, Tarun Ramadorai, Richard Gonzalez, Steven Goodman, Edwin Green, Donald P Green, Anthony Greenwald, Jarrod D. Hadfield, Larry V. Hedges, Leonhard Held, Teck Hau Ho, Herbert Hoijtink, James Holland Jones, Daniel J Hruschka, Kosuke Imai, Guido Imbens, John P.A. Ioannidis, Minjeong Jeon, Michael Kirchler, David Laibson , John A List, Roderick Little, Arthur Lupia, Edouard Machery, Scott E. Maxwell, Michael McCarthy, Don Moore, Stephen L. Morgan, Marcus Munafo, Shinichi Nakagawa, Brendan Nyhan, Timothy H Parker, Luis Pericchi, Marco Perugini, Jeff Rouder, Judith Rousseau, Victoria Savalei, Felix D. Schonbrodt, Thomas Sellke, Betsy Sinclair, Dustin Tingley, Trisha Van Zandt, Simine Vazire, Duncan J. Watts, Christopher Winship, Robert L. Wolpert, Yu Xie, Cristobal Young, Jonathan Zinman, Valen E. Johnson
Cited by*: 1 Downloads*: 965

We propose to change the default P-value threshold for statistical significance for claims of new discoveries from 0.05 to 0.005.
Michael Fix, Raymond J Struyk
Cited by*: 32 Downloads*: 955

Auditing is a technique used to test for discrimination. The concept is straightforward: Two individuals are matched on all relevant characteristics except the one presumed to lead to discrimination. Each person then applies for the same job, housing, mortgage loan, or credit card. The differential treatment they receive provides a measure of discrimination. The authors argue that the value of auditing has grown in the current legal and political environment because it can detect subtle forms of discrimination.
John A List, Azeem M Shaikh, Yang Xu
Cited by*: 33 Downloads*: 278

Empiricism in the sciences allows us to test theories, formulate optimal policies, and learn how the world works. In this manner, it is critical that our empirical work provides accurate conclusions about underlying data patterns. False positives represent an especially important problem, as vast public and private resources can be misguided if we base decisions on false discovery. This study explores one especially pernicious influence on false positives-multiple hypothesis testing (MHT). While MHT potentially affects all types of empirical work, we consider three common scenarios where MHT influences inference within experimental economics: jointly identifying treatment effects for a set of outcomes, estimating heterogenous treatment effects through subgroup analysis, and conducting hypothesis testing for multiple treatment conditions. Building upon the work of Romano and Wolf (2010), we present a correction procedure that incorporates the three scenarios, and illustrate the improvement in power by comparing our results with those obtained by the classic studies due to Bonferroni (1935) and Holm (1979). Importantly, under weak assumptions, our testing procedure asymptotically controls the familywise error rate - the probability of one false rejection - and is asymptotically balanced. We showcase our approach by revisiting the data reported in Karlan and List (2007), to deepen our understanding of why people give to charitable causes.
John A List, Anya Samek, Dana L Suskind
Cited by*: 0 Downloads*: 258

Behavioral economics and field experiments within the social sciences have advanced well beyond academic curiosum. Governments around the globe as well as the most powerful firms in modern economies employ staffs of behavioralists and experimentalists to advance and test best practices. In this study, we combine behavioral economics with field experiments to reimagine a new model of early childhood education. Our approach has three distinct features. First, by focusing public policy dollars on prevention rather than remediation, we call for much earlier educational programs than currently conceived. Second, our approach has parents at the center of the education production function rather than at its periphery. Third, we advocate attacking the macro education problem using a public health methodology, rather than focusing on piecemeal advances.
Ufuk Akcigit, Fernando Alvarez, Stephane Bonhomme, George M Constantinides, Douglas W Diamond, Eugene F Fama, David W Galenson, Michael Greenstone, Lars Peter Hansen, Uhlig Harald, James J Heckman, Ali Hortacsu, Emir Kamenica, Greg Kaplan, Anil K Kashyap, Steven D Levitt, John A List, Robert E Lucas Jr., Magne Mogstad, Roger Myerson, Derek Neal, Canice Prendergast, Raghuram G Rajan, Philip J Reny, Azeem M Shaikh, Robert Shimer, Hugo F Sonnenschein, Nancy L Stokey, Richard H Thaler, Robert H Topel, Robert Vishny, Luigi Zingales
Cited by*: 0 Downloads*: 207

No abstract available
John A List, Robert D Metcalfe, Michael H Taylor, Ivo Vlaev
Cited by*: 44 Downloads*: 193

Tax collection problems date back to the earliest recorded history of mankind. This paper begins with a simple theoretical construct of paying (rather than declaring) taxes, which we argue has been an overlooked aspect of tax compliance. This construct is then tested in two large natural field experiments. Using administrative data from more than 200,000 individuals in the UK, we show that including social norms and public goods messages in standard tax payment reminder letters considerably enhances tax compliance. The field experiments increased taxes collected by the Government in the sample period and were cost-free to implement, demonstrating the potential importance of such interventions in increasing tax compliance.
Jie Bai
Cited by*: 0 Downloads*: 157

There is often a lack of reliable high quality provision in many markets in developing countries. I designed an experiment to understand this phenomenon in a setting that features typical market conditions in a developing country: the retail watermelon market in a major Chinese city. I begin by demonstrating empirically that there is substantial asymmetric information between sellers and buyers on sweetness, the key indicator of quality for watermelons, yet sellers do not sort and price watermelons by quality. I then randomly introduce one of two branding technologies into 40 out of 60 markets-one sticker label that is widely used and often counterfeited and one novel laser-cut label. I track sellers' quality, pricing and sales over an entire season and collect household panel purchasing data to examine the demand side's response. I find that laser branding induced sellers to provide higher quality and led to higher sales profits, establishing that reputational incentives are present and can be made to pay. However, after the intervention was withdrawn, all markets reverted back to baseline. To rationalize the experimental findings, I build an empirical model of consumer learning and seller reputation. The structural estimates suggest that consumers are hesitant to upgrade their perception about quality under the existing branding technology, which makes reputation building a low return investment. While the new technology enhances consumer learning, the resulting increase in profits is not sufficient to cover the fixed cost of the technology for small individual sellers. Counterfactual analysis shows that information frictions and fragmented markets lead to significant under-provision of quality. Third-party interventions that subsidize initial reputation building for sellers could improve welfare.
Shlomo Benartzi, Richard H Thaler
Cited by*: 298 Downloads*: 146

As firms switch from defined-benefit plans to defined-contribution plans, employees bear more responsibility for making decisions about how much to save. The employees who fail to join the plan or who participate at a very low level appear to be saving at less than the predicted life cycle savings rates. Behavioral explanations for this behavior stress bounded rationality and self-control and suggest that at least some of the low-saving households are making a mistake and would welcome aid in making decisions about their saving. In this paper, we propose such a prescriptive savings program, called Save More Tomorrow (hereafter, the SMarT program). The essence of the program is straightforward: people commit in advance to allocating a portion of their future salary increases toward retirement savings. We report evidence on the first three implementations of the SMarT program. Our key findings, from the first implementation, which has been in place for four annual raises, are as follows: (1) a high proportion (78 percent) of those offered the plan joined, (2) the vast majority of those enrolled in the SMarT plan (80 percent) remained in it through the fourth pay raise, and (3) the average saving rates for SMarT program participants increased from 3.5 percent to 13.6 percent over the course of 40 months. The results suggest that behavioral economics can be used to design effective prescriptive programs for important economic decisions.
Marsha Blumenthal, Charles Christian, Joel Slemrod
Cited by*: 148 Downloads*: 124

In 1995 a group of 1724 randomly selected Minnesota taxpayers was informed by letter that the returns they were about to file would be 'closely examined'. Compared to a control group that did not receive this letter, low and middle-income taxpayers in the treatment group on average increased tax payments compared to the previous year, which we interpret as indicating the presence of noncompliance. The effect was much stronger for those with more opportunity to evade; in fact, the difference in differences is not statistically significant for those who do not have self-employment or farm income, and do not pay estimated tax. Surprisingly, however, the reported tax liability of the high income treatment group fell sharply relative to the control group.
Jasper Knockaert, Stefanie Peer, Erik T Verhoef
Cited by*: 0 Downloads*: 119

If left unidentified and uncorrected, self-selection biases may greatly compromise the external validity of the outcomes of field experiments. We show that self-selection biases in terms of observed und unobserved characteristics can be well identified and corrected by means of a complementary stated preference (SP) experiment conducted among the participants and non-participants of a field experiment. In the SP experiment, respondents are confronted with hypothetical choice situations that closely resemble the choice situations present in the field experiment. The SP experiment does not only allow us to compare participants and non-participants with respect to their behavior and implied preferences in the hypothetical choice situations, but also renders it possible to infer how non-participants would have behaved if they had decided to participate, using an innovative modeling approach to elicit the corresponding preference structures. We apply this approach in the context of a large-scale field experiment in which train commuters received monetary rewards for traveling outside peak hours. We find strong self-selection biases, especially with respect to the marginal utility of income, which is significantly higher among participants of the field experiment.
John A List, Paramita Sinha, Michael H Taylor
Cited by*: 33 Downloads*: 116

Critics of stated preference methods argue that hypothetical bias precludes survey techniques from providing reliable economic values for non-market goods and services, rendering estimation of the total economic benefits of public programs fruitless. This paper explores a relatively new methodology to obtain the total value of non-market goods and services-choice experiments-which conveniently provide information on the purchase decision as well as the characteristic value vector. The empirical work revolves around examining behavior in two very different field settings. In the first field study, we explore hypothetical bias in the purchase decision by eliciting contributions for a threshold public good in an actual capital campaign. To extend the analysis a level deeper, in a second field experiment we examine both the purchase decision and the marginal value vector via inspection of consumption decisions in an actual marketplace. In support of the new valuation design, both field experiments provide some evidence that hypothetical choice experiments combined with ""cheap talk"" can yield credible estimates of the purchase decision. Furthermore, we find no evidence of hypothetical bias when estimating marginal attribute values. Yet, we do find that the ""cheap talk"" component might induce internal inconsistency of subjects' preferences in the choice experiment.
Orana Bandiera, Iwan Barankay, Imran Rasul
Cited by*: 236 Downloads*: 114

We present evidence on whether workers have social preferences by comparing workers' productivity under relative incentives, where individual effort imposes a negative externality on others, to their productivity under piece rates, where it does not. We find that the productivity of the average worker is at least 50 percent higher under piece rates than under relative incentives. We show that this is due to workers partially internalizing the negative externality their effort imposes on others under relative incentives, especially when working alongside their friends. Under piece rates, the relationship among workers does not affect productivity. Further analysis reveals that workers internalize the externality only when they can monitor others and be monitored. This rules out pure altruism as the underlying motive of workers' behavior.
David Laibson , John A List
Cited by*: 2 Downloads*: 114

There are many great ways to incorporate behavioral economics in a first-year undergraduate economics class-i.e., the course that is typically called "Principles of Economics." Our preferred approach integrates behavioral economics throughout the course (e.g., see Acemoglu, Laibson, and List 2015). With the integrated approach, behavioral content plays a role in many of the chapters of the principles of economics curriculum, including chapters on optimization, equilibrium, game theory, intertemporal choice, probability and risk, social preferences, household finance, the labor market, financial intermediation, monetary policy, economic fluctuations, and financial crises. We prefer the integrated approach because it enables the behavioral insights to show up where they are conceptually most relevant. By illustration, it is best to combine a discussion of downward nominal wage rigidity (i.e., the idea that workers strongly resist nominal wage declines) with the overall discussion of the labor market. Whether or not an instructor integrates behavioral economics throughout the principles of economics course, it makes sense to pull central materials together and dedicate a lecture (or more) to a focused discussion of behavioral economics. This note describes our approach to such a lecture, emphasizing six key principles of behavioral economics. Our choice of content for a behavioral lecture is motivated by three factors. First, we include ideas that are conceptually important. Second, we include material that is practically important and personally relevant to our students-we have found that such content resonates long after the course ends. Third, we include content that relates to what has been (or will be) taught in the rest of the course, and therefore serves as a complement. We want students to see that behavioral economics is an integrated part of economics, not a freak show that is isolated from "the standard ingredients" in the rest of the economics course. This paper summarizes our approach to such a focused behavioral lecture. In Section I, we define behavioral economics and place it in historical context. In Section II, we introduce six modular principles that can be used to teach behavioral economics. We provide PowerPoint notes on our home pages, which instructors should feel free to edit and use.
John A List
Cited by*: 0 Downloads*: 111

Not applicable.
Jeffrey A Flory, Andreas Leibbrandt, John A List
Cited by*: 0 Downloads*: 109

Workplace misbehaviors are often governed by explicit monitoring and strict punishment. Such enforcement activities can serve to lessen worker productivity and harm worker morale. We take a different approach to curbing worker misbehaviour - bonuses. Examining more than 6500 donor phone calls across more than 80 workers, we use a natural field experiment to investigate how different wage contracts influence workers' propensity to break workplace rules in harmful ways. Our findings show that even though standard relative performance pay contracts, relative to a fixed wage scheme, increase productivity, they have a dark side: they cause considerable cheating and sabotage of co-workers. Yet, even in such environments, by including an unexpected bonus, the employer can substantially curb worker misbehavior. In this manner, our findings reveal how employers can effectively leverage bonuses to eliminate undesired behaviors induced by performance pay contracts.
Glenn W Harrison, John A List
Cited by*: 644 Downloads*: 105

Experimental economists are leaving the reservation. They are recruiting subjects in the field rather than in the classroom, using field goods rather than induced valuations, and using field context rather than abstract terminology in instructions. We argue that there is something methodologically fundamental behind this trend. Field experiments differ from laboratory experiments in many ways. Although it is tempting to view field experiments as simply less controlled variants of laboratory experiments, we argue that to do so would be to seriously mischaracterize them. What passes for "control" in laboratory experiments might in fact be precisely the opposite if it is artificial to the subject or context of the task. We propose six factors that can be used to determine the field context of an experiment: the nature of the subject pool, the nature of the information that the subjects bring to the task, the nature of the commodity, the nature of the task or trading rules applied, the nature of the stakes, and the environment that subjects operate in.
Stephen Benard, Shelley Correll, In Paik
Cited by*: 14 Downloads*: 104

Survey research finds that mothers suffer a substantial wage penalty, although the causal mechanism producing it remains elusive. The authors employed a laboratory experiment to evaluate the hypothesis that status-based discrimination plays an important role and an audit study of actual employers to assess its real-world implications. In both studies, participants evaluated application materials for a pair of same-gender equally qualified job candidates who differed on parental status. The laboratory experiment found that mothers were penalized on a host of measures, including perceived competence and recommended starting salary. Men were not penalized for, and sometimes benefited from, being a parent. The audit study showed that actual employers discriminate against mothers, but not against fathers.