Maria De Paola, Francesca Gioia, Vincenzo Scoppa
Cited by*: None Downloads*: None

We ran a field experiment to investigate whether competing in rank-order tournaments with different prize spreads affects individual performance. Our experiment involved students from an Italian University who took an exam that was partly evaluated on the basis of relative performance. Students were matched in pairs on the basis of their high school grades and each pair was randomly assigned to one of three different tournaments. Random assignment neutralizes selection effects and allows us to investigate if larger prize spreads increase individual effort. We do not find any positive effect of larger prizes on performance. Furthermore, we show that the effect of prize spreads on students' performance depends on their degree of risk-aversion: competing in tournaments with large spreads negatively affects the performance of risk-averse students, while it does not produce any effect on students who are more prone to taking risks.
Greer K Gosnell, John A List, Robert D Metcalfe
Cited by*: None Downloads*: None

Increasing evidence indicates the importance of management in determining firms' productivity. Yet, causal evidence regarding the effectiveness of management practices is scarce, especially for high-skilled workers in the developed world. In an eight-month field experiment measuring the productivity of captains in the commercial aviation sector, we test four distinct management practices: (i) performance monitoring; (ii) performance feedback; (iii) target setting; and (iv) prosocial incentives. We find that these management practices -particularly performance monitoring and target setting- significantly increase captains' productivity with respect to the targeted fuel-saving dimensions. We identify positive spillovers of the tested management practices on job satisfaction and carbon dioxide emissions, and captains overwhelmingly express desire for deeper managerial engagement. Both the implementation and the results of the study reveal an uncharted opportunity for management researchers to delve into the black box of firms and rigorously examine the determinants of productivity amongst skilled labor.
Amanda Bayer, Syon Bhanot, Fernando Lozano
Cited by*: None Downloads*: None

No abstract available
Syon Bhanot
Cited by*: None Downloads*: None

Social norms messaging campaigns are increasingly used to influence human behavior, with social science research generally finding that they have modest but meaningful effects. One aspect of these campaigns in practice has been the inclusion of injunctive norms messaging, designed to convey a social judgement about one's behaviors (often in the form of encouraging or discouraging language, or a visual smiley or frowny face). While some prominent research has provided support for the use of such messaging as a tool for positive behavior change, causal evidence on the effect of injunctive norms messaging as a motivator (as opposed to just one part of a multifaceted messaging campaign) is limited. This paper presents a field experiment on water conservation behavior conducted by an organization in California, involving over 40,000 households, which provides some of the most precise evidence to date regarding the effect of injunctive norms on decision making. I find that not only do injunctive norms encourage conservation behavior, there is also no evidence that they discourage individuals from further attending norms messaging-regardless of whether the social judgement conveyed is negative or positive. Taken together, this suggests that injunctive norms are a useful tool in "nudge"-style campaigns tackling behavior change.
Michal Krawczyk, Ernesto Reuben
Cited by*: None Downloads*: None

This article reports results of a field experiment in which two hundred e-mails were sent to authors of recent articles in economics that had promised to send the interested reader supplementary material, such as alternative econometric specifications, "upon request." The e-mails were sent either by a researcher affiliated at Columbia University, New York or the University of Warsaw, Poland; furthermore, the authors' position (assistant professor) was specified in half the e-mails only. Overall, 64% of the approached authors responded to our message, of which two thirds (44% of the entire sample) delivered the requested materials. The frequency and speed of responding and delivering were very weakly affected by the position and affiliation of the sender. Gender of affiliation of the author, number of citations or journal impact factory or the type of object in question seemed to make no difference. However, authors of published articles were much more likely to share than authors of working papers.
Bharat Chandar, Ali Hortacsu, John A List, Ian Muir, Jeffrey M Wooldridge
Cited by*: None Downloads*: None

Field experiments conducted with the village, city, state, region, or even country as the unit of randomization are becoming commonplace in the social sciences. While convenient, subsequent data analysis may be complicated by the constraint on the number of clusters in treatment and control. Through a battery of Monte Carlo simulations, we examine best practices for estimating unit-level treatment effects in cluster-randomized field experiments, particularly in settings that generate short panel data. In most settings we consider, unit-level estimation with unit fixed effects and cluster-level estimation weighted by the number of units per cluster tend to be robust to potentially problematic features in the data while giving greater statistical power. Using insights from our analysis, we evaluate the effect of a unique field experiment: a nationwide tipping field experiment across markets on the Uber app. Beyond the import of showing how tipping affects aggregate outcomes, we provide several insights on aspects of generating and analyzing cluster-randomized experimental data when there are constraints on the number of experimental units in treatment and control.
Omar Isaac Asensio, Magali A Delmas
Cited by*: 3 Downloads*: None

Little is known about the effect of message framing on conservation behavior over time. In a randomized controlled trial with residential households, we use advanced metering and information technologies to test how different messages about household energy use impact the dynamics of conservation behavior down to the appliance level. Our results, based on 374 million panel observations of kilowatt-hour (kWh) electricity consumption for 118 households over 9 months, show that differences in behavioral responses due to message framing become more significant over time. We find that a health-based frame, in which households consider the human health effects of their marginal electricity use, induced persistent energy savings behavior of 8-10% over 100 days; whereas a more traditional cost savings frame, drove sharp attenuation of treatment effects after 2 weeks with no significant savings versus control after 7 weeks. We discuss the implications for the design of effective information campaigns to engage households in conservation behavior.
John A List
Cited by*: None Downloads*: None

A summary of artefactual field experiments on fieldexperiments.com.
Plamen Nikolov
Cited by*: None Downloads*: None

Attitudes toward risk underlie virtually every important economic decision an individual makes. In this experimental study, I examine how introducing a time delay into the execution of an investment plan influences individuals' risk preferences. The field experiment proceeded in three stages: a decision stage, an execution stage and a payout stage. At the outset, in the Decision Stage (Stage 1), each subject was asked to make an investment plan by splitting a monetary investment amount between a risky asset and a safe asset. Subjects were informed that the investment plans they made in the Decision Stage are binding and will be executed during the Execution Stage (Stage 2). The Payout Stage (Stage 3) was the payout date. The timing of the Decision Stage and Payout Stage was the same for each subject, but the timing of the Execution Stage varied experimentally. I find that individuals who were assigned to execute their investment plans later (i.e., for whom there was a greater delay prior to the Execution Stage) invested a greater amount in the risky asset during the Decision Stage.
Maya Haran Rosen , Orly Sade
Cited by*: None Downloads*: None

Define contribution mechanism combined with a dynamic job market can affect the sum of retirement savings and the choices of plans and products. Hence, it is important for regulators to engage servers to manage the accounts they accumulate over the years. In 2013-2014 the Israeli regulator reached out to the population, recommending the use of a website to help individuals find inactive retirement savings accounts and close them (withdraw the savings or transfer them to active accounts). The government's efforts did not result in the closure of most of the inactive accounts. Proprietary data indicate that those who closed the inactive accounts live in central locations with a higher socioeconomic index. Survey data indicate that those who lacked financial literacy and confidence in their financial knowledge were less likely to take financial actions. Using a controlled field experiment, we also provide evidence that an intervention with a human touch can promote greater involvement.
Indranil Goswami, Oleg Urminsky
Cited by*: None Downloads*: None

We present a complete empirical case study of fundraising decisions that demonstrates the importance of in-context field experiments. We first design novel matching-based fundraising appeals. We derive theory-based predictions from the standard impure altruism model and solicit expert opinion about the potential performance of our interventions. Both theory-based predictions and descriptive advice suggest improved fundraising performance from a framing intervention that credited donors for the matched funds (compared to a typical match framing). However, results from a natural field experiment with prior donors of a non-profit showed significantly poorer performance of this framing compared to a regularly framed matching intervention. This surprising finding was confirmed in a second natural field experiment, to establish the ground truth. Theoretically, our results highlight the limitations of both impure altruism models and of expert opinion in prediction complex "warm glow" motivation. More practically, our results question the availability of useful guidance, and suggest the indispensability of field testing for interventions in fundraising.
Syon Bhanot, Jiyoung Han, Chaning Jang
Cited by*: None Downloads*: None

Restrictions like work requirements and constraints on voucher transfers are often used in social welfare systems, but little empirical evidence exists on their impact on wellbeing. We conducted a 10-day randomized experiment with 432 individuals living below the poverty line in the Kawangware settlement of Nairobi, kenya, testing two elements of social welfare design: workfare versus welfare and restricted versus unrestricted vouchers. Participants were randomly assigned to a "Work" condition, involving daily work for unrestricted vouchers, or one of two "Wait" conditions, involving daily waiting for vouchers that were either unrestricted or partially restricted to staple foods. We find that working improved psychological wellbeing relative to waiting, suggesting that means of implementing welfare programs may have important effects on individuals beyond the impact of monetary benefit alone. Furthermore, although restrictions were inframarginal, partially restricted vouchers crowded-in spending on staple foods, suggesting the existence of a "flypaper effect" in spending from restricted vouchers.
Maria De Paola, Francesca Gioia, Vincenzo Scoppa
Cited by*: None Downloads*: None

We investigate whether and how social ties affect performance in teams by implementing a field experiment in which a sample of undergraduate students are randomly assigned either to teams composed by friends or to teams composed by individuals not linked by friendship relationships. Students undertake an intermediate exam divided into two parts: one graded on the basis of individual performance and the other graded on the basis of team performance. We find that students assigned to socially connected teams perform significantly better than control students in both the team part and the individual part of the exam, suggesting that social ties are relevant both for solving free-riding problems and for inducing knowledge spillovers among teammates. The positive effect of friendship persists over time: treated students obtain better grades also after the conclusion of the experiment.
Michal Krawczyk, Joanna Rachubik
Cited by*: None Downloads*: None

The representativeness heuristic (RH) has been proposed to be at the root of several types of biases in judgment. In this project, we ask whether the RH is relevant in two kinds of choices in the context of gambling. Specifically, in a field experiment with naturalistic stimuli and a potentially extremely high monetary pay-out, we give each of our subjects a choice between a lottery ticket with a random-looking number sequence and a ticket with a patterned sequence; we subsequently offer them a small cash bonus if they switch to the other ticket. In the second task, we investigate the gambler's fallacy, asking subjects what they believe the outcome of a fourth coin toss after a sequence of three identical outcomes will be. We find that most subjects prefer "random" sequences, and that approximately half believe in dependence between subsequent coin tosses. There is no correlation, though, between the initial choice of the lottery ticket and the prediction of the coin toss. Nonetheless, subjects who have a strong preference for certain number combinations (i.e., subjects who are willing to forgo the cash bonus and remain with their initial choice) also tend to predict a specific outcome (in particular a reversal, corresponding to the gambler's fallacy) in the coin task.
Yan Chen, Peter Cramton, John A List, Axel Ockenfels
Cited by*: None Downloads*: None

We review past research and discuss future directions on how the vibrant research areas of market design and behavioral economics have influenced and will continue to impact the science and practice of management in both the private and public sectors. Using examples from various auction markets, reputation and feedback systems in online markets, matching markets in education, and labor markets, we demonstrate that combining market design theory, behavioral insights, and experimental methods can lead to fruitful implementation of superior market designs in practice.
Michal Krawczyk
Cited by*: None Downloads*: None

Several studies have identified the "better than average" effect - the tendency of most people to think they are better than most other people on most dimensions. The effect would have profound consequences (see e.g. Barber and Odean (2001)). These findings are predominantly based on non-incentivized, non-verifiable self-reports. The current study looks at the impact of incentives to judge one's abilities accurately in a framed field experiment. Nearly 400 students were asked to predict whether they would do better or worse than average in an exam. The most important findings are that subjects tend to show more confidence when incentivized and when asked before the exam rather than afterwards. The first effect shows particularly in females.
John A List
Cited by*: None Downloads*: None

These are the slides from John A. List's keynote at the 2022 AFE conference.
Omar Al-Ubaydli, John A List, Claire Mackevicius, Min Sok Lee, Dana L Suskind
Cited by*: None Downloads*: None

Policymakers are increasingly turning to insights gained from the experimental method as a means to inform large scale public policies. Critics view this increased usage as premature, pointing to the fact that many experimentally-tested programs fail to deliver their promise at scale. Under this view, the experimental approach drives too much public policy. Yet, if policymakers could be more confident that the original research findings would be delivered at scale, even the staunchest critics would carve out a larger role for experiments to inform policy. Leveraging the economic framework of Al-Ubaydli et al. (2019), we put forward 12 simple proposals, spanning researchers, policymakers, funders, and stakeholders, which together tackle the most vexing scalability threats. The framework highlights that only after we deepen our understanding of the scale up problem will we be on solid ground to argue that scientific experiments should hold a more prominent place in the policymaker's quiver.
Daniel J Benjamin, James O Berger, Magnus Johannesson, Brian A Nosek, E. J Wagenmakers, Richard Berk, Kenneth A Bollen, Bjorn Brembs, Lawrence Brown, Colin F Camerer, David Cesarini, Christopher D. Chambers, Merlise Clyde, Thomas D Cook, Paul De Boeck, Zoltan Dienes, Anna Dreber, Kenny Easwaran, Charles Efferson, Ernst Fehr, Fiona Fidler, Andy P. Field, Malcom Forster, Edward I. George, Tarun Ramadorai, Richard Gonzalez, Steven Goodman, Edwin Green, Donald P Green, Anthony Greenwald, Jarrod D. Hadfield, Larry V. Hedges, Leonhard Held, Teck Hau Ho, Herbert Hoijtink, James Holland Jones, Daniel J Hruschka, Kosuke Imai, Guido Imbens, John P.A. Ioannidis, Minjeong Jeon, Michael Kirchler, David Laibson , John A List, Roderick Little, Arthur Lupia, Edouard Machery, Scott E. Maxwell, Michael McCarthy, Don Moore, Stephen L. Morgan, Marcus Munafo, Shinichi Nakagawa, Brendan Nyhan, Timothy H Parker, Luis Pericchi, Marco Perugini, Jeff Rouder, Judith Rousseau, Victoria Savalei, Felix D. Schonbrodt, Thomas Sellke, Betsy Sinclair, Dustin Tingley, Trisha Van Zandt, Simine Vazire, Duncan J. Watts, Christopher Winship, Robert L. Wolpert, Yu Xie, Cristobal Young, Jonathan Zinman, Valen E. Johnson
Cited by*: 1 Downloads*: 965

We propose to change the default P-value threshold for statistical significance for claims of new discoveries from 0.05 to 0.005.
Michael Fix, Raymond J Struyk
Cited by*: 32 Downloads*: 955

Auditing is a technique used to test for discrimination. The concept is straightforward: Two individuals are matched on all relevant characteristics except the one presumed to lead to discrimination. Each person then applies for the same job, housing, mortgage loan, or credit card. The differential treatment they receive provides a measure of discrimination. The authors argue that the value of auditing has grown in the current legal and political environment because it can detect subtle forms of discrimination.