Accepted Manuscript Carbon is forever: A climate change experiment on cooperation Giacomo Calzolari, Marco Casari, Riccardo Ghidoni
PII:
S0095-0696(16)30455-7
DOI:
10.1016/j.jeem.2018.09.002
Reference:
YJEEM 2173
To appear in:
Journal of Environmental Economics and Management
Received Date: 22 November 2016 Revised Date:
11 September 2018
Accepted Date: 14 September 2018
Please cite this article as: Calzolari, G., Casari, M., Ghidoni, R., Carbon is forever: A climate change experiment on cooperation, Journal of Environmental Economics and Management (2018), doi: https:// doi.org/10.1016/j.jeem.2018.09.002. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
RI PT
Carbon is forever:
SC
A climate change experiment on cooperation∗
Giacomo Calzolari University of Bologna
Marco Casari
Riccardo Ghidoni
University of Bologna
Tilburg University
& IZA
September 11, 2018
AC C
EP
TE
D
& CEPR
M AN U
Running title: Carbon is Forever
∗
We are thankful to Valentina Bosetti, Massimo Tavoni, Alessandro Tavoni, Peter Martinsson, and seminar participants at Fondazione Eni Enrico Mattei, University of Gothenburg, European University Institute, Fourth IAERE Annual Conference, Tilburg University, London School of Economics, EWEBE 2017, VU University Amsterdam, Collegio Carlo Alberto, International Workshop on the Economics of Climate Change and Sustainability at the University of Bologna, Goethe University Frankfurt, and 2018 ESA World Meeting. Furthermore, we would like to thank the editor and two anonymous referees for very helpful comments. Casari acknowledges financial support from the Italian Ministry of Education, Universities and Research through the FIRB-Futuro in Ricerca grant (Project No. RBFR084L83). Corresponding addresses: Calzolari,
[email protected], Tel: +39 051 2098489, Department of Economics, University of Bologna, Piazza Scaravilli 2, 40124 Bologna; Casari,
[email protected], Tel: +39 051 2098662, Department of Economics, University of Bologna, Piazza Scaravilli 2, 40124 Bologna; Ghidoni,
[email protected], Tel: +31 134662898, Tilburg School of Economics and Management, Department of Economics, PO Box 90153, 5000LE Tilburg.
1
ACCEPTED MANUSCRIPT
Carbon is forever:
Abstract
SC
RI PT
A climate change experiment on cooperation
M AN U
Greenhouse gases generate impacts that can last longer than human civilization itself. Such persistence may affect the behavioral ability to cooperate. In a laboratory experiment, we study mitigation efforts with dynamic externalities in a framework that reflects key features of climate change. In treatments with persistence, pollution cumulates and generates damages over time, while in another treatment it has only immediate effects and then disappears. We show that with
TE D
pollution persistence, cooperation is initially high but then systematically deteriorates with high stocks of pollution.
Keywords: Stock externalities; Public goods; Inequality; Dynamic games
AC C
EP
JEL codes: C70; C90; D03; Q54
1
ACCEPTED MANUSCRIPT
1
Introduction
Unless major efforts are undertaken to reduce greenhouse gas emissions (GHG), climate change will reach dangerous levels within this century. The impacts will be global, with uncertainties in their magnitude and geographical distributions, and with high degrees of
RI PT
irreversibility and persistence (IPCC, 2014). Coordinating international actions in regard to mitigation is notoriously difficult. A major reason is the temptation of opportunistic behavior by each single country that can benefit from the mitigation efforts of others without paying the costs of reducing the carbon intensity of their economy (Nordhaus,
SC
2013). Here we study the role of the long-run persistence of GHG emissions in the atmosphere, which makes climate change a dynamic social dilemma. In particular, carbon
M AN U
dioxide is the most important pollutant responsible for anthropogenic climate change, and a considerable portion of its current stock will last for well over a millennium, which – on the time scale of human civilization – basically means forever (Inman, 2008). The research question is whether and how such long-term persistence of pollution
TE D
affects the ability of societies to cooperate. This aspect has been singled out in the literature as one of the factors that hamper cooperative efforts in mitigating climate change (Wagner and Weitzman, 2015). There may be multiple reasons for it that relate to the strategies adopted and the learning process. Decision-makers may be myopic,
EP
in the sense that they consider only the short-run consequences of their actions but not those in the long-run.1 Coordinating on a specific equilibrium may be harder in a
AC C
dynamic as opposed to a static social dilemma. Furthermore, dynamic dilemmas involve irreversibility due to the lasting impact of pollution, and hence the consequences of past choices are harder or impossible to undo. These elements are tied to behavioral aspects, which we study by means of a laboratory experiment. We model the climate change game as a strategic interaction among long-lived decisionmakers in the absence of a legally binding treaty, where each decision-maker independently chooses its level of GHG emission at each point in time. Pollution benefits the 1
A noteworthy example, although not modeled here, is that politicians may be short-sighted because of re-election concerns and uncertain voters’ preferences (K¨oke and Lange, 2017).
2
ACCEPTED MANUSCRIPT decision-maker that emits it because it is linked to economic production, but it has negative externalities in the form of damage for all decision-makers. More precisely, we consider a public good game, and modify it in order to capture salient characteristics of the climate change dilemma.
RI PT
We characterize this platform from a theoretical point of view and study its behavioral properties with an experiment by varying the degree of persistence of GHG emissions. In a Persistent treatment, the global emissions generated in one round remain fully in the atmosphere in the next rounds and indefinitely cumulate into a stock of pollution. Con-
SC
versely, an Immediate treatment reproduces a static although repeated social dilemma, where the entire damage of the current emissions is suffered within the current round.
M AN U
Finally, a Halving treatment is an intermediate case, where emissions cumulate from one round to the next but the stock depreciates – halving in each round – because pollution dissipates. In all treatments the indefinite horizon gives rise to multiple equilibria, creating opportunities for partial or full cooperation. Our main theoretical benchmarks are the socially optimal level of emissions and the constant-actions Markov perfect equilibrium.
TE D
We calibrated the parameters of the experiment in a way such that these benchmarks are identical across treatments. Hence, differences in behavior will easily reveal which scenario is most conducive to cooperation in mitigation efforts.
EP
What emerges from the experiment are distinct patterns of behavior under static vs. dynamic externalities. More precisely, the persistence of pollution has effects on the trend
AC C
and strategies that decision-makers adopt to sustain mitigation policies. Under dynamic externalities, initial emission levels are low and they are followed by an upward trend steeper than under static externalities. This pattern seems consistent with individual strategies conditioning emissions on the level of the stock of pollution. When the situation becomes critical in terms of cumulated stock, participants in the Persistent treatment are increasingly less able to support high levels of cooperation. One must acknowledge the limitations to the external validity of most economic experiments on climate change. In the laboratory, decision-makers are individuals and not nations. The number of players involved is usually rather small. Laboratory communi-
3
ACCEPTED MANUSCRIPT cation hardly resembles the process of international climate negotiations. That said, the available field evidence exhibits major drawbacks, some of which can be circumvented through experiments. In the field, many key parameters are difficult to measure. Consider for instance the returns from cooperation, the discount factors, and the expectations
RI PT
about the damage caused by climate change. Laboratory experiments make it possible to set these parameters, render them observable to the experimenter and the participants, and establish causal relations among variables (Falk and Heckman, 2009).
We place the contribution of this paper in the context of the experimental literature
SC
on climate change and on cooperation with dynamic externalities (Section 2). We then present the theoretical platform of analysis (Section 3) and describe the experimental de-
of the results (Section 6).
2
Literature review
M AN U
sign (Section 4). Finally, we report the results (Section 5) and conclude with a discussion
TE D
This paper contributes to the experimental literature about dynamic social dilemmas and about cooperation in a climate change set-up. With few exceptions, existing public good experiments have a static set-up. Climate change externalities are instead dynamic
EP
because they depend on the stock of pollution accumulated in the atmosphere and not just on the yearly flow. Overall, cooperation in dynamic set-ups appears more difficult
AC C
than in static ones. The seminal paper by Herr et al. (1997) investigates extraction in a finite horizon common-pool resource experiment contrasting a static externality with a dynamic one, and reporting lower payoffs with dynamic externalities. Battaglini et al. (2016) show that when contributions to a non-depletable public good are irreversible they are lower than under reversibility. Our experiment furnishes novel evidence by focusing on a climate change game with an indefinite horizon. Comparing a static externality to a dynamic externality with different degrees of persistence, we identify higher initial cooperation followed by its significant deterioration when persistence is in place. We adopt a climate change framework that builds on the model of Dutta and Radner (2004,
4
ACCEPTED MANUSCRIPT 2009). The second of these theoretical papers identifies “greenhouse traps”, i.e. equilibria in which the current stock of pollution affects emissions decisions. Interestingly, in Section 5 we document behavior in our Halving treatment that can be interpreted as consistent with traps of this type.
RI PT
With respect to the standard public good experiment, we have modified the action space, the gain-loss frame, the duration, and players’ asymmetries. First, in our game, the theoretical benchmarks are interior points of the action space, in contrast with the typical public good experiment, where social optimum and Nash equilibrium are at the corners
SC
of the action space (Laury and Holt, 2008). Second, the choice concerns a public bad, as in the case of climate change: all endowments are in the common project by default,
M AN U
and everyone decides how much to withdraw from it to their private account (Andreoni, 1995; Khadjavi and Lange, 2015).2 Third, we implement a long-run interaction in the laboratory given that societies are long-lived entities and climate change has consequences in the distant future. The time horizon is indefinite, implemented through a continuation probability after every round.3 Fourth, we introduce an asymmetry in individual earnings
TE D
in order to mirror the wide income gaps among countries, which is a major issue in addressing climate change because it may undermine cooperation (Tavoni et al., 2011). Using a public good terminology, in our set-up the return from the private account is
EP
lower for a poor than for a rich player type.
The most closely related experiments are Pevnitskaya and Ryvkin (2013), Sherstyuk
AC C
et al. (2016), and Ghidoni et al. (2017). Their set-up is similar to ours but none of them studies the impact of pollution persistence. Pevnitskaya and Ryvkin (2013) study the role of framing and time horizon on cooperation. Under a finite horizon, participants were faster in learning to cooperate, but increased emissions in the last round. They have persistence rate of 0.75 and groups of 2 members. Sherstyuk et al. (2016) compare overlapping generations to long-lived agents, allowing for access to past history and 2
Andreoni (1995) and Khadjavi and Lange (2015) find that, if participants can only withdraw from the common project, cooperation is lower than when they can contribute to it. 3 Experiments with indefinite repetitions are common with the prisoner’s dilemma (e.g. Camera and Casari, 2009), but are few and far between with public goods (Battaglini et al., 2016). Climate negotiations typically involve numerous players and a wide action space.
5
ACCEPTED MANUSCRIPT intergenerational advices. With overlapping generations, cooperation became harder to sustain due to both limited incentives and greater strategic uncertainty. They have a persistence rate of 0.3 and groups of 3 members. Finally, Ghidoni et al. (2017) consider an indefinite horizon, like the other studies above, but, unlike them, only consider static
RI PT
externalities. The study decouples emission choices from realized damage through the introduction of randomness and delay so as better to identify individuals’ cooperative strategies. Their groups comprise 4 members. We will further discuss these papers in Section 5.
SC
The impact of income inequality studied in Ghidoni et al. (2017) and Tavoni et al. (2011) is instead absent from Pevnitskaya and Ryvkin (2013) and Sherstyuk et al. (2016).
M AN U
Tavoni et al. (2011) is one of the first papers to study inequality in climate change experiments. Strong inequality in earnings within a group hampered cooperation. Their study belongs to a branch of the literature pioneered by Milinski et al. (2008) that models climate change as a collective catastrophe that can happen if cooperation remains below a threshold. Participants could either keep their endowment in a private fund or invest
TE D
it in mitigation. If, by the end of the experiment, the group’s cumulated investment in mitigation is below a known threshold, the catastrophe of losing everything takes place with some probability.4
EP
Within the Milinski et al. (2008) approach to modeling the dynamic climate game, two other experiments investigate aspects that the present paper neglects. Bosetti et al.
AC C
(2017) study the interplay between mitigation and investments in a clean technology. In our paper, instead, the only available action concerns mitigation and no other tool is available. Finally, Hauser et al. (2014) investigate the role of voting mechanisms when managing a common pool with threshold under an indefinite horizon and with a catastrophic risk that would fall on the next generation. Here instead we only consider individual voluntary contributions. 4
Some experiments embed scientific uncertainty and ambiguity in climate tipping points. Empirically, the threat of a catastrophe enhances cooperation if the uncertainty on the tipping point is low. However, this deterrence effect disappears for high levels of uncertainty (Barrett and Dannenberg, 2012, 2014) or ambiguity about the tipping point (Dannenberg et al., 2015).
6
ACCEPTED MANUSCRIPT
3
The model
3.1
The climate game
We consider a group of N ≥ 2 long-lived decision-makers interacting over an indefinite
RI PT
number of rounds in a game (sequence, henceforth). At any round t ∈ {0, 1, . . .}, there will be an additional round with probability δ ∈ [0, 1), or else the sequence will end at t with probability (1 − δ). Decision-makers never know whether or not they are in the last round. The continuation probability δ, as well as all other parameters of the game, are
SC
public information.
In every round t, each decision-maker i = 1, 2, .., N chooses its level of emission ei (t)
M AN U
from an interval ranging from 1 to a common finite upper-bound. Choices are simultaneous. Emissions are the sole input in the production of an output that is exclusively enjoyed by the emitting subject according to a production function specified below. After every round, decision-makers observe the current individual emission ei (t) of everyone in the group.
TE D
Over the rounds, global emissions accumulate into a stock of pollution according to the following dynamic equation:
E(t) = σE(t − 1) +
N X
ei (t) ,
(1)
EP
i=1
AC C
where σ ∈ [0, 1] is the persistence rate of past emissions. Hence, if σ = 0 all emissions dissipate at the end of every round, while if σ = 1 emissions last forever. At the beginning of the sequence, the initial stock of pollution is E(0) = E0 ≥ 0. Each decision-maker privately benefits from the output produced through its own current emission, but, in absolute terms, all N decision-makers are equally damaged by the stock of global emissions accumulated up to the current round. The instantaneous payoff of decision-maker i is given by benefits from own output minus climate damages:
ui (t) ≡ γln (ai ei (t)) −
7
c E(t) . N
(2)
ACCEPTED MANUSCRIPT Emission intensity of production is linear and time-invariant: one unit of emission grants ai > 0 units of income. The natural logarithm implies a declining marginal utility of income.5 The parameter γ > 0 serves only for rescaling. We introduce heterogeneity by assuming two types of decision-makers in equal num-
RI PT
bers: poor (type p) and rich (type r), which differ only in the level of one parameter, ar > ap . Note that, for the same emission level, type p decision-makers contribute just as much to climate change as type r ones. Hence, the aggregate volume of emissions is in potency the same for types p and r, because the upper-bound in emission is the same.6
SC
Finally, emissions damages increase linearly in the stock of pollution according to the parameter c > 0. Unlike in a standard public good game, payoffs can be negative
M AN U
in the case of a lack of international cooperation, because Equation (2) is an additive function.7 Although in absolute terms the damage is equal for rich and poor decisionmakers (parameter c), in relative terms it will harm poor decision-makers more than rich ones since the latter can always obtain higher benefits from their emission. This feature reproduces an aspect of the field: poor countries are predicted to suffer from
TE D
climate change relatively more than rich countries.8 Describing the difference between rich and poor decision-makers only in terms of their income and relative damages is a simplification. However, in the experiment, this will allow us to precisely pin down the
EP
impacts of this source of inequality, abstracting from other sources (e.g., differences in available technologies or environmental footprints). 5
AC C
In their climate cost-benefit analyses, Nordhaus (2013) and Stern et al. (2006) assume a utility function that takes the natural logarithm of GDP per capita. 6 Tavoni et al. (2011) model countries heterogeneity through pre-determined contributions to a climate protection account that leave countries with a different inheritance in terms of endowment level when they begin to make choices. In the present study, instead, heterogeneity stems from different population weights ai , which implies a structural country difference originating from the lower economic yields from the same level of emissions. 7 Some scholars have employed a multiplicative function for damages (Nordhaus, 2013), while others have used an additive one (Dutta and Radner, 2004). There is consensus that the stock of pollution linearly impacts on temperature. Most scholars argue for a convex damage function in temperature (Burke et al., 2015) but others have used a linear approximation (Dutta and Radner, 2004). Linearity makes it possible to keep the experiment simple, as further discussed below. The number of decisionmakers at the denominator of the damage rescales payoffs so that the social optimum emission e∗ does not depend on N . 8 The channels are somewhat different, however, since in the field damages will be higher for poor countries because they are located in warmer climates (IPCC, 2014).
8
ACCEPTED MANUSCRIPT
3.2
Theoretical benchmarks
Here we outline the social optimum and three different equilibrium strategies of the game described in the first part of this section. For each of these benchmarks we will emphasize key properties that we will then contrast with the experimental evidence. As
RI PT
we will see, the Markov perfect equilibrium (C-MPE) of the climate game is far from delivering the socially optimal level of emissions. Participants, however, could achieve the social optimum by coordinating on a constant trigger equilibrium (C-TE). They could also support full or partial cooperation by following non-constant Markov strategies (NC-
SC
MPE) where emissions depend on the current level of the stock of pollution. Proofs are in Appendix A.
M AN U
We will interpret the continuation probability δ as the discount factor of an (intertemporally) risk-neutral decision-maker (Camera and Casari, 2009). Hence, each decisionmaker maximizes the present expected value of its current and future payoffs (Equation 2),
vi =
∞ X
δ t ui (t) .
(3)
TE D
t=0
Social optimum. If decision-makers jointly maximize the unweighted sum of individual present-valued payoffs:
EP
v=
N (vr + vp ), 2
they set a socially optimal emission that is constant over the rounds, and which, for any
AC C
type of decision-maker, is equal to
e∗ = γ
1 − σδ . c
The socially optimal emission e∗ is time-invariant and independent of the stock of pollution E because of the linearity of the damage in E. It is obtained by equating the marginal benefit from the individual emission γ/ei to the marginal present-valued group’s damage, c σ c N× 1+δ = , N (1 − δσ) 1 − δσ 9
ACCEPTED MANUSCRIPT which is itself independent of the stock E.9 The socially optimal emission e∗ is the same for all decision-makers because with our payoffs the marginal benefit does not depend on ai , and is decreasing in emission persistence σ. Constant-actions Markov perfect equilibrium (C-MPE). In general, at any round
RI PT
t, Markov strategies map the current state, i.e. the stock of pollution E(t), into the set of emissions. Simple and interesting equilibrium candidates are the constant-actions strategies that depend neither on the stock E nor on the history of past emissions.
SC
Proposition 1. For any σ there exists a constant-actions Markov perfect equilibrium (C-MPE) that contemplates a constant level of emission for both r and p decision-makers
M AN U
that is N times the socially optimal emission, i.e. eF = N × e∗ .
In a C-MPE each decision-maker equates the marginal benefit γ/ei to the marginal individual damage
c , N (1−δσ)
which does not depend on E because of the linear damage.
When emissions entirely dissipate at the end of each round (σ = 0), the game becomes
TE D
a repeated one, and the C-MPE corresponds to the Nash equilibrium of the stage game. Although when 0 < σ ≤ 1 the game is a dynamic one, both marginal benefits and marginal damages from emissions do not depend on the stock. This property – also discussed in footnote 7 – simplifies the environment for the decision-makers and makes
EP
it possible to sustain the simple C-MPE.
Note that for any emission level e constant over the rounds, the stock E converges
AC C
to the steady-state N e/(1 − σ) under partial persistence (0 < σ < 1). When instead pollution persists forever (σ = 1), the stock associated with constant emissions follows a trajectory E(t, e) that diverges.
Constant trigger equilibria (C-TE). In our settings there also exist equilibria based on the history of play that may support a wide range of (cooperative) outcomes. In particular, we are interested to identify under which conditions the social optimum can 9
The constant social optimum is also the consequence of an implicit assumption of no technological change. We rule out technological change to keep the experiment simple, although it could help to achieve climate policy objectives (Bosetti et al., 2012; Gerlagh and Van der Heijden, 2015).
10
ACCEPTED MANUSCRIPT be supported as an equilibrium outcome when decision-makers punish deviations on larger emissions with a reversion to constant strategies, in particular the C-MPE, eF . We label the associated equilibrium as “constant trigger” equilibrium. Proposition 2. If δ is higher than a threshold δ¯ > 0, then a (subgame perfect) Constant
e∗ for any decision-maker i.
RI PT
Trigger Equilibrium (C-TE) exists with individual emissions equal to the social optimum
With the punishment of permanent reversion to eF , the threshold is N 1 ln(N ) −1 , N −1 N −1
SC
δ¯ =
M AN U
that, for the structure of the payoffs in our model, only depends on N . More precisely, ¯ all emissions levels in [e∗ , eF ] can be supported as equilibrium outcomes. For when δ > δ, future reference, we note that with N = 4 (as in our experiment), the threshold δ¯ is close to 0.28. In addition, emissions in [e∗ , eF ] can be supported in (subgame perfect) equilibria with punishments that are milder than in the C-TE: after a deviation, decision-makers
TE D
revert to emission eF for a finite number of rounds T . As usual, this possibility comes with the requirement of a higher threshold δ¯ for any lower T .
EP
Equilibria with non-constant Markov strategies (NC-MPE). When the persistence rate σ is not nil, decision-makers may follow stock-dependent strategies that specify
AC C
different emissions ei (E) depending on the current level of the stock, i.e. non-constant Markov strategies. This is particularly interesting in our environment. As Dutta and Radner (2009) have shown in a similar model, non-constant Markov equilibria may in fact determine a rich pattern of emissions and stock of pollution. The dependence of strategies on the stock of pollution can take many different forms. As said, with payoffs as in (2), the marginal effect of ei (t) on payoffs is independent of E(t). This intuitively implies that there exist no equilibria based on “simple” nonconstant Markov strategies such as a proportionality rule ei (E) = βi × E.10 However, 10
The proof of non-existence of equilibria with “simple” Markov strategies is available upon request.
11
ACCEPTED MANUSCRIPT with σ > 0 decision-makers can use the stock level as a visible coordination device. They may “target” a trajectory of the stock (which could be constant when σ < 1) and use it as a reference and monitoring device to support cooperation. Define E(t, e) as the “long-run” stock of pollution when decision-makers constantly
and with σ = 1, E(t, e) = E(t − 1, e) + N × e.
RI PT
emit e in any period, from t onward. In particular, with σ < 1, it is E(t, e) = N e/(1 − σ),
Proposition 3. Sufficiently many and patient decision-makers (respectively N ≥
eF eF −e∗
¯ may coordinate on the following non-constant Markov perfect equilibrium and δ ≥ δ)
SC
(NC-MPE):
constant at e∗ thereafter;
M AN U
• if E(t) ≤ E(t, e∗ ), emissions lead to a stock E(t0 , e∗ ) at some t0 > t and remain
• if instead E(t) > E(t, e∗ ), decision-makers emit eF and the stock converges to E(t, eF ).
TE D
Proposition 3 considers the possibility to implement the socially optimal emission e∗ . Analogous propositions could be obtained when implementing higher and less efficient levels of emission in the interval [e∗ , eF ], which would then be associated to different thresholds of N and δ. The idea of supporting this NC-MPE is the following. Consider
EP
first the case with σ < 1 and constant long-run stock. Decision-makers set the target stock of pollution equal to the long-run socially optimal level, i.e. E(t, e∗ ) = N e∗ /(1 − σ). If
AC C
the stock is below E(t, e∗ ), decision-makers guarantee that it reaches it – either smoothly or with a single “jump” in emissions – and remains forever at this “target” level with ¯ exactly individual emissions e∗ . Emissions higher than e∗ are dominated when δ ≥ δ, the same threshold identified above for Proposition 2. If instead the stock “has gone too far”, i.e. E(t) ≥ N e∗ /(1 − σ), the decision-makers act non-cooperatively forever onward with an individual emission equal to eF . In this case the stock converges to the associated non-cooperative steady-state level E(t, eF ) = N eF /(1 − σ). A single decisionmaker may want to try and push E back into the “good region” (i.e. below E(t, e∗ )). However, if there are sufficiently many other decision-makers emitting eF , then even the 12
ACCEPTED MANUSCRIPT strongest attempt (i.e. the decision-maker setting e = 0) in the ideal situation (i.e. a stock of pollution just above E ∗ ) would be a sacrifice made in vain. With the other N − 1 decision-makers emitting eF , the stock of pollution would remain above the target E ∗ in any case, if (N − 1)eF ≥ N e∗ . This condition is equivalent to the one in the Proposition
RI PT
and is satisfied in our experimental set-up with N = 4, δ = 0.92, eF = 12, and e∗ = 3. Dutta and Radner (2009) referred to this pattern of emissions as a “greenhouse trap”. Interestingly, the stock of pollution could converge to E(t, eF ) cyclically if, for some reasons, emissions undershoot and/or overshoot that target.11
SC
When emissions persist forever (σ = 1), the socially optimal emission e∗ is still implementable, even though the target must now be a trajectory E(t, e∗ ) of the stock. As
M AN U
long as the associated trajectory of the stock “does not go too far”, i.e. E(t) ≤ E(t, e∗ ), decision-makers have no individual incentive to deviate from the socially optimal emission e∗ . If instead, at any t, the stock overshoot E(t, e∗ ), then cooperation breaks down and emissions revert to the C-MPE forever as in a “greenhouse trap”. These NC-MPE exhibit some interesting properties. Whatever the level of persistence
TE D
σ > 0, the emissions of a decision-maker may remain constant with no immediate reactions to others’ changes in emissions. This happens as long as global emissions keep the stock below the target level E(t, e˜). Here the stock of pollution plays the role of allowing
EP
decision-makers to coordinate their actions with a target stock level and with no need to check all individual emissions systematically. This desirable property of NC-MPE must
AC C
be contrasted with at least two difficulties in coordinating on a level of stock. First, the target level could be a trajectory that changes at any round. Second, a stock is not completely under a decision-maker’s control: even if a decision-maker is willing to reduce its own emissions, cooperation may still break down because the stock has already “gone too far”. We will explore these possibilities in Section 5. 11
Emissions and stock respectively larger than eF and E(t, eF ) may temporarily occur off equilibrium in the punishment phase of non-Markov equilibria. We discuss the possibility of these more sophisticated strategies and equilibria at the end of Section 5.
13
ACCEPTED MANUSCRIPT
4
Experimental design
4.1
Treatments
We designed an experiment based on our climate game that comprises three treatments
RI PT
(Table I): Persistent and Halving – where the externality is dynamic –, and Immediate – where the externality is static. The three treatments cover the full range of possible persistence levels. In the Persistent treatment emissions never dissipate (σ = 1). This scenario roughly approximates the climate change problem where the most significant
SC
GHG persist in the atmosphere for a very long time. Out of one ton of carbon dioxide emitted today about 50% will remain after 30 years and from 20% to 40% will remain after
M AN U
1,000 years (IPCC, 2007). The corresponding value of σ that will generate this degree of persistence is 0.98 for the 30-year horizon and 0.998-0.999 for the 1,000-year horizon. Other GHG, instead, are short-lived. Methane, for instance, can be removed from the atmosphere by a much faster chemical reaction and persists for decades. In the Halving treatment half of the stock of pollution dissipates after every round (σ = 0.5). Finally,
TE D
in the Immediate treatment there is no stock accumulation and all the damage happens in the current round (σ = 0). Some pollution externalities are best approximated by the Immediate treatment, such as noise pollution, and others by the Persistent treatment,
EP
such as radioactivity from nuclear waste elements like Plutonium 244 (with a half-life of 80 million years). See Figure B.1 of the Appendix for a graphical illustration of the
AC C
damage profiles across treatments. Climate change is a long-term problem, and the experimental set-up incorporates this aspect through indefinite repetitions. After every round there is a continuation probability of interaction of δ = 0.92. Many simulations of climate change consider scenarios for the year 2100, which is 85 years distant from today. In the experiment, each sequence has an expected length of 1/(1 − δ) = 12.5 rounds, which can be interpreted as a decision every 7 years up to 2100.12 Consider also that for risk-neutral agents this 12
The initial round starts with no past history of emissions, and we set the initial stock of pollution E0 equal to 0. Varying the initial stock of pollution can be an interesting treatment dimension (e.g. Tavoni et al., 2011) that we leave for future research.
14
ACCEPTED MANUSCRIPT set-up is theoretically equivalent to an infinite time horizon with a discount factor of 0.92 (Dal B´o and Fr´echette, 2017). While individuals have a finite life, societies can be treated as long-lived. This calibration of δ creates conditions favorable for the emergence of cooperation because the shadow of the future is sufficiently large to sustain the socially
RI PT
optimal emission. From a theoretical point of view, what is required is that δ be larger than 0.28 (Proposition 2).
In our design the damage from the marginal emission is identical across treatments in terms of its present value. For each unit of emission, the expected damage is 33.375 tokens
SC
in all treatments but varies in how it is spread over time. In the Immediate treatment the damage occurs entirely in the round in which the emission is done (c = 33.375). In
M AN U
the Halving treatment, half of the damage occurs in the current round (c = 18.0225) and the other half in the future. In the Persistent treatment, there is a damage of c = 2.67 in the current round and of 2.67 in each one of all future rounds. The expected value of the damage in the Persistent treatment is 33.375, given an expected duration of 12.5 rounds. Given this calibration of σ and c, the socially optimal level of emission and the C-
TE D
MPE are the same in all treatments (Proposition 1), allowing for easy comparison of the experimental results. Both theoretical benchmarks are internal elements of the action space: participants could emit any integer amount between 1 and 18, where the social
EP
optimum is e∗ = 3 and C-MPE is eF = 12. Thus, participants were allowed to implement excessive restraint or overshoot in emissions.13
AC C
A group comprises N = 4 decision-makers. This is a simplification in comparison to the over 190 countries who meet for climate negotiations. Consider, however, that in 2010 eight regions accounted for almost 2/3 of annual GHG emissions worldwide (EDGAR, JRC and PBL, 2016). Four regions accounted for more than half (China, USA, European Union 28, and Brazil). Limiting group size to four makes it possible to increase the number of independent observations while retaining the possibility of overcoming coordi13
The action space is similar to most public good experiments in terms of number of elements. The condition ensuring that eF and e∗ are the same for any σ is determined as follows. Indicate with σh the persistence rate in the Persistent treatment, and let σl be 0 when l = Immediate or 0.5 when h l = Halving. Then the damage coefficient in treatment l, cl , is set to ch 1−δσ 1−δσl , where ch is the damage coefficient in Persistent.
15
ACCEPTED MANUSCRIPT nation issues and some degree of heterogeneity. Settings with four players are typical in many public goods games.14 There are two rich and two poor members in each group, randomly assigned, with an equal potential for emissions. The RICE model, for instance, has five poor regions
RI PT
and seven rich regions. Our design incorporates income inequalities of a comparable magnitude of those in the field. On average, the per-capita income in rich regions is 4.8 times higher than in poor regions ($34,085 vs. $7,125.9).15 In our design, for any level of emissions, the per-capita output of a rich participant is five times larger than that
SC
of a poor one. This is achieved by setting appropriate weights ar and ap (Table I). By design there is no intrinsic conflict between concerns for efficiency and equality, since
M AN U
both motivations will induce participants to increase cooperation levels toward the social optimum level (in the Persistent treatment, this is true only in the long run).16
4.2
Procedures
TE D
In our experiment, the instructions were expressed in a neutral language and did not use any term that could recall climate change (Appendix D).17 Overall 225 volunteers participated in the experiment. There were 25 participants in each session: 20 engaged
EP
in the main task described in Section 4.1, while 5 performed a side task with no other purpose than to keep them busy during the session. Participants assigned to the side task
AC C
were those that displayed the lowest level of understanding of the instructions. These 5 participants had to guess the level of damage in the current round and in ten rounds 14
Meta-studies on public goods experiments detect a small impact of group size on cooperation (e.g. Fiala and Suetens, 2017). 15 We consider the 2010 average per-capita GNIs according to World Bank data for the regions of the RICE model. We label “rich” a region with a per capita GNI greater than $12,745 in 2010 (World Bank’s threshold for high income countries). Otherwise, the region is labeled “poor”. Poor regions in RICE turn out to be Africa, China, Eurasia, India, and Other Asia (N = 5). Rich regions are EU, Japan, Latin America, Middle East, Russia, USA, and Other High Income (N = 7). 16 In fact, poor decision-makers suffer more for the damages of higher emissions than rich ones only in relative terms, i.e. with respect to the individual benefits. 17 For instance, we referred to emission choices as “production”, and explained that one participant’s production had two effects: to increase personal “revenue” and to “damage” earnings for everyone.
16
ACCEPTED MANUSCRIPT Table I: Overview of the experiment. Treatments Immediate
Halving
Persistent
0.92
0.92
0.92
0.5
1
18.0225
2.67
40.05
40.05
δ
Discount factor (continuation probability)
σ
Pollution persistence
c
Damage in the current round
33.375
ar
Population weight for rich decision-maker
40.05
ap
Population weight for poor decision-maker
γ
Utility rescaling
E0
Initial stock of pollution
N
Number of players in a group
SC 8.01
8.01
8.01
100
100
100
0
0
0
4
4
4
M AN U
Benchmarks
0
RI PT
Parameters
Social optimum (individual emission)
3
3
3
eF
Constant Markov perfect equilibrium
12
12
12
60+15
60+15
60+15
55
60
60
11
12
12
10.8
8.3
8.8
Observations
TE D
e∗
Number of participants (main task + side task)
EP
Number of groups
Number of sequences
AC C
Average length of a sequence
Note: All sessions but one were performed in May 2015: Immediate (20, 21, 27), Halving (28, 29, June 18), Persistent (14, 25, 28). Session 20/05/2015 (Immediate) was interrupted during the third sequence for time constraints following the protocol described in footnote 20. The unit for the average length of a sequence is a round in a sequence. According to Wilcoxon-Mann-Whitney (WMW) tests differences in the length of the sequences are not statistically significant when comparing Immediate vs. Persistent (p-value= 0.802) and Immediate vs. Halving (p-value= 0.366). When comparing Halving vs. Persistent, the WMW test detects a marginally significant difference (p-value= 0.089).
from the current one for a given group of active participants.18 The alternative would 18 They received e0.25 for each round of all sequences plus a show-up fee of e5. On average, low score participants overestimated damages in the current round of 50 tokens in the Immediate treatment, 17 tokens in the Halving treatment, and 12 tokens in the Persistent treatment. The lower precision of guesses in the Immediate treatment may be due to the higher sensibility of damages to the marginal
17
ACCEPTED MANUSCRIPT have been to send them away, but this might have upset them. Moreover, we opted to exclude them instead of simply controlling for their quiz performance in the statistical analyses, because we thought that it was a cleaner way to obtain high quality data, since it prevented low score participants’ actions from influencing others’ behavior.
RI PT
The 20 participants in the main task performed up to four repetitions (or sequences) of our climate change game. This allowed participants to gain familiarity with the complexities of the climate game.19 A sequence comprised an indefinite number of rounds, depending on random draws by the computer that were ex-ante unknown to participants
SC
and experimenter alike.20 Within each sequence, participants interacted with the same group (partner protocol). After every sequence, new groups were formed so that in fol-
met (perfect stranger protocol).
M AN U
lowing sequences no participant ever interacted again with a person that she had already
Climate change negotiations are performed by professionals, but we recruited college students. We aimed at ensuring that participants were well-qualified in the sense of having a good understanding of the rules. To this end: (i) instructions explained the task in a
TE D
simple way and with extensive use of figures; (ii) the software was a user-friendly interface and a built-in simulator tool with which participants could compute present and future consequences of hypothetical emissions; (iii) all participants completed a comprehension
EP
quiz on the instructions (Appendix E); (iv) those with a poor understanding of the instructions were excluded from the main task; (v) after the quiz, everyone underwent a
AC C
fifteen-round practice sequence playing against robots in order to familiarize themselves with the main task; (vi) after every round, we asked participants to write down on a record emission (Table I, parameter c). Due the random duration of the sequences, we can only evaluate the precision of guesses on the damage in ten rounds in a handful of cases. These guesses are quite noisy and inaccurate. The lack of proper incentives to predict damages may explain imprecisions. 19 Participants in experiments with an indefinite horizon may need some repetitions of the supergame before converging to a stable outcome (Dal B´o and Fr´echette, 2017). While the number of sequences per session may appear low, previous experiments have documented that sharp changes in behavior often take place already in early sequences. 20 Participants were recruited for a maximum of three hours and a half. If a session was still running after 2 hours and 40 minutes, the experimenter announced that the current sequence was the last one and that the session would finish within the next 30 minutes. The experimenter told participants that the exact minute at which the sequence was stopped was determined by a 30 faces dice roll, which was immediately rolled and observed by the experimenter but not by the participants.
18
ACCEPTED MANUSCRIPT sheet their own and other group members’ emissions in order to make sure that they had actively followed the play; (vii) participants experienced four separate sequences that were new restarts with identical rules; hence, they could learn by doing; (viii) the instructions explicitly stated that if everyone emitted 3, group earnings would be maximized (social
RI PT
optimum). More details on procedures (i)–(viii) are in Appendix B. Table I reports some statistics on the experimental sessions. In every session five groups simultaneously played during a sequence. Overall, we managed to implement 11 sequences for the Immediate treatment, yielding 55 different groups (5 × 4 + 5 × 4 + 5 × 3).
SC
For the Halving and Persistent treatments, instead, we implemented 12 sequences yielding 60 groups for each one of these treatments (5 × 4 + 5 × 4 + 5 × 4). Participants in the
M AN U
main task were paid according to the total amount of tokens that they had earned in all the sequences: they received e0.01 for every 6 tokens plus a show-up fee of e4 in the Persistent treatment, e5 in the Halving treatment, and e6 in the Immediate treatment. Since cumulate earnings could be negative at the end of the experiment because of the damage, we ensured participants a minimum payment of e10. Average earnings for those
TE D
who participated in the main task were e17.6; overall, 40 participants (20%) earned e10. Recruitment was done via ORSEE (Greiner, 2015) and participants were involved in at most one session. Instructions were read aloud and participants had a hard copy on
EP
their desks. Sessions took place at the BLESS laboratory of the University of Bologna
5
AC C
using zTree (Fischbacher, 2007).
Results
After providing an overview of aggregate emissions across treatments, we present seven main results, where we compare emission levels of rich and poor participants (Result 1), show the effect of persistence on initial emissions (Result 2) and on emission trends (Result 3), as well as reporting disaggregated analyses on the strategies adopted by participants (Results 4–7). The aggregate levels of emissions are 9.4 for the Immediate, 8.8 for the Halving, and
19
ACCEPTED MANUSCRIPT 8.5 for the Persistent treatment when considering all rounds. They are computed as the means of the individual emissions within a group in a sequence. These emission levels are statistically indistinguishable across treatments, as one can see from a Tobit regression on groups’ aggregate emissions (Table II, col. 3). Non-parametric tests on groups’ aggregate
RI PT
emissions in sequence 1 confirm the lack of significant treatment differences (Table C.1 in Appendix). One can express aggregate emissions in terms of cooperation rates, 1 − [(ei − e∗ )/(eF − e∗ )] for ei in-between e∗ and eF , to facilitate comparisons with other studies. Cooperation rates are 29% in Immediate, 35% in Halving, and 33% in Persistent.21
SC
The analysis of aggregate emissions may be too coarse to properly identity the treatment effects of dynamic externalities. For this reason, we further explore our data, first by
M AN U
contrasting emissions of rich and poor participants and then by studying initial emissions and trends.
Result 1 (Rich vs. poor types). Average emissions of rich participants are usually lower than those of poor participants, but the difference is quantitatively small.
TE D
Support for Result 1 is provided in Figure 1 and Table III. Recall that theory predicts equal emissions for rich and poor participants (Proposition 1). In the experiment, rich participants emitted on average 6% to 15% less than poor participants depending on
EP
treatments (Figure 1). Table III reports Tobit regressions of individual emission choices controlling for a variety of relevant factors. The coefficient for a dummy that takes value
AC C
1 for rich participants is generally not statistically significant.22 We can place this finding in the context of two related studies. In a similar setup, Ghidoni et al. (2017) find small differences in the same direction but generally not statistically significant. In a threshold public good game, Tavoni et al. (2011) report larger differences in the same direction and also a statistically significant link between 21
These cooperation levels are in line with those found in the dynamic public bad experiments of Sherstyuk et al. (2016) and Pevnitskaya and Ryvkin (2013) in comparable treatments (49% and 10-20%, respectively). Long-run participants in Sherstyuk et al. (2016) achieved average individual emissions of 4.5 in a possible range of 1-11, with a social optimal at 3 and a C-MPE at 6. In Pevnitskaya and Ryvkin (2013) average individual emissions with indefinite horizon were between 8 and 9 in a possible range between 0 (social optimal) and 10 (C-MPE). 22 See Tables C.2 and C.3 in Appendix for robustness analyses using non-parametric tests on sequence 1 and additional regressions.
20
ACCEPTED MANUSCRIPT
Table II: Emissions under static vs. dynamic externalities. (2) Average group
(3) Average group
emission
emission in
emission in
in round 1 of
round 1 of
all rounds of
sequence 1
all sequences
all sequences
Treatment dummies
Persistent
–0.816
–0.792
–0.169
(0.659)
(1.048)
(0.851)
–1.358*
–1.921**
–0.625
(0.740) Rich type
–0.929
Sequence number
0.592***
Length of current sequence
EP
(0.177)
(0.175)
–0.018
–0.035
(0.027)
(0.055)
7.033***
7.594***
(0.312)
(0.973)
(1.368)
0.414
0.247
0.607
180
175
175
AC C
Observations
0.292*
(0.051)
7.538***
Wald test p-value: Having vs. Persistent
(1.018)
0.128**
TE D
Length of past sequence
Constant
(0.955)
M AN U
(0.709)
SC
Halving
RI PT
(1) Individual
Note: Tobit regressions with observations censored at 1 and 18. The unit of observation is a participant’s emission in the first round (col. 1), the average group emission in the first round of a sequence (col. 2), and the average group emission in all rounds of a sequence (col. 3). Standard errors are clustered at the session level. The variable “Halving” (“Persistent”) is a dummy taking value 1 in the Halving (Persistent) treatment, 0 otherwise. The variable “Length of past sequence” counts the number of rounds in the previous sequence; in sequence 1 it is set to 12.5. * p < 0.1, ** p < 0.05, *** p < 0.01.
the within-group earnings inequality and the ability to cooperate of a group: among those groups where rich participants contributed relatively more to the public good than poor participants did, groups that successfully cooperated are overrepresented. In our experiment this association is not statistically significant. On using as dependent variable 21
ACCEPTED MANUSCRIPT
10 11 12 13 14 15 16 17 18
C-MPE 9.7
9.6
9
8.1
6
7
8
7.9
4
RI PT
9
9.1
5
Average emission
Figure 1: Average emissions of rich and poor by treatment.
1
2
3
Social optimum
Halving Rich
Persistent
SC
Immediate
Poor
M AN U
Note: The unit of observation is one type of participants of a group in the sequence. All sequences are included (N = 55 in Immediate, N = 60 in Halving, N = 60 in Persistent). We consider the average emission of the two rich (poor) participants of a group over all rounds of the sequence. The vertical segments represent the 95% confidence interval.
the average emission of the rich-types divided by the overall group emissions, a regression shows a statistically insignificant coefficient for a variable denoting those groups that
TE D
successfully cooperated (p-value = 0.997, N = 175).23
In terms of cumulated earnings, rich participants ended up considerably better off than poor participants. On average, the relative earnings of a poor participant with
EP
respect to a rich one range from 38% to 66% depending on the treatment. In the CMPE equilibrium, the predicted earnings gap ranges from 41% to 78% depending on the
AC C
treatment.
We now zoom in on our data in order to highlight important treatment effects on the emission dynamics that remain undetected when considering aggregate emissions. Result 2 (Cooperation in round 1). Persistence lowers emissions in the initial decision of a sequence. Support for Result 2 is provided by Figure 2 and Tables II and IV. Focusing on the 23
OLS regression with clustered standard errors at session-level and controls for treatment dummies, sequence number, and length of current and past sequence. We define a group as successfully cooperating if its aggregate emission was lower or equal to seven. A robustness check where cooperative groups are those with an aggregate emission below the median one in that treatment confirms this result (pvalue= 0.721).
22
Table III: Tobit regressions of individual emission.
ACCEPTED MANUSCRIPT Immediate
Individual emission in a round
(1)
Rich type
Halving (2)
Persistent (3)
(4)
–0.223
–0.989*** –0.676
–0.892
(0.860)
(0.488)
(0.338)
(1.507)
Round number within a sequence
0.014 (0.037)
0.245*** –0.062* (0.087)
Stock of pollution at the beginning of a round
(1.582)
0.673*** –0.352
(0.035)
(0.067)
0.106***
(0.012)
0.209 (0.336)
(0.102)
Mistakes in the quiz
0.220**
(0.090)
Limited liability
4.053***
TE D
(0.189)
Constant
9.360***
(1.620)
Observations
0.986**
(0.417)
–0.148*** –0.005 (0.040)
0.609***
(0.170)
0.881**
(0.428)
6.278***
(0.043) 0.486*** (0.062)
0.501*
(0.272) 0.024*** (0.005) 0.312***
(0.266)
(0.102)
–0.004
–0.002
(0.082)
(0.029)
0.450
0.202
(0.346)
(0.200)
–1.397*** (0.405) 2.280***
4.264**
7.051***
(0.604)
(0.703)
(2.107)
(1.564)
2380
2000
2000
2120
2120
0.008
0.041
0.084
0.048
0.069
EP
Pseudo R2
(0.375)
M AN U
–0.014
1.192***
SC
Sequence number
Length of past sequence
(5)
1.164
RI PT
Dependent variable:
AC C
Note: Tobit regressions with observations censored at 1 and 18. The unit of observation is a participant in a round. All sequences are included. Standard errors are clustered at the session level. “Rich type” is a dummy taking value 1 if a participant is of type r, and 0 otherwise. “Length of past sequence” counts the number of rounds in the previous sequence; in sequence 1 it is set to 12.5. “Mistakes in the quiz” counts the number of mistakes made by a participant in the quiz on the instructions. “Limited liability” is a dummy taking value 1 if the emission decision was made under limited liability (see footnote 24), and 0 otherwise. * p < 0.1, ** p < 0.05, *** p < 0.01.
first round of all sequences, the average emission is 8.3 in the Immediate treatment, 7.5 in the Halving treatment, and 6.4 in the Persistent treatment. First round emissions exhibit an ascending order from high to no persistence in all sequences (Figure 2), which is statistically significant at 5% level in sequence 1 (Table IV, JT test). The bilateral difference between Immediate and Persistent treatments in first round emissions of sequence 1 is also statistically significant at 5% level (Table IV, WMW test). 23
ACCEPTED MANUSCRIPT
17
Immediate
16
Halving
C-MPE
4
5
6
7
8
9
10
RI PT
11
12
13
14
15
Persistent
1
2
Sequence number
3
M AN U
1
SC
3
Social optimum 2
Average individual emissions in round 1
18
Figure 2: Average emissions in round 1 across sequences.
4
Note: The treatments’ ranking remains the same in all sequences. The unit of observation is an individual emission in the first round of a sequence.
Table IV: Tests on treatment differences in round 1 emissions. Average emission
p-value
Observations
6.9, 6.2, 5.7
0.020
60, 60, 60
6.9, 5.7
0.043
60, 60
Immediate vs. Halving
6.9, 6.2
0.423
60, 60
Halving vs. Persistent
6.2, 5.7
0.193
60, 60
TE D
Jonckheere-Terpstra test (JT)
Immediate > Halving > Persistent
Wilcoxon-Mann-Whitney tests (WMW)
AC C
EP
Immediate vs. Persistent
Note: First round of sequence 1 only. The unit of observation is a participant. The null hypothesis in JT and WMW tests is that the samples come from the same population. In JT, the alternative hypothesis is that the medians are ordered by persistence as shown in the table.
Tobit regressions on emissions in the first round confirm the finding that Persistent is lower than Immediate when controlling for a host of relevant variables (Table II). The finding is statistically significant both when considering only sequence 1 (col. 1) and when considering all sequences (col. 2).24 24
Result 2 – as well as the other results in this section – is robust when controlling for situations where a participant’s cumulative earnings were negative, thus in the region of limited liability. The dataset always includes choices under limited liability, which is the situation where the participant’s cumulate
24
ACCEPTED MANUSCRIPT As we will see in the next result, while the Persistent treatment starts with emissions lower than the other treatments, it also exhibits a steeper trend. Result 3 (Trend). Emissions in the Persistent and Halving treatments exhibit a significantly steeper trend over the rounds of a sequence than in the Immediate treatment.
RI PT
Support for Result 3 is provided by Figure 3 and Table III. The trends of average emissions over the rounds within a sequence are shown in Figure 3. One can see steep linear trends for both the Halving and the Persistent treatments, which are statistically
SC
significant at 1% level according to a Tobit regression on individual choices (Table III, col. 2 and 4). Instead, no statistically significant linear trend is present in the Immediate treatment (col. 1).
M AN U
At the sub-sequence level, Figure 3 suggests that an upward trend in emissions within a sequence is also present in the Immediate treatment approximately until round 10. That trend then flattens in later rounds. Tobit regressions where we pool data from all treatments confirm the graphical evidence (Table C.5 of the Appendix, col. 3): when
TE D
considering only the first 10 rounds of all sequences, the estimated coefficient of the trend is positive and statistically significant also in Immediate (p-value < 0.001). However, even within the first 10 rounds the trends of Persistent and Halving are significantly steeper than the trend in Immediate (p-value < 0.001 and p-value = 0.037, respectively). These
EP
treatment differences are robust to a number of relevant controls, including the evolution over sequences, the length of past sequences, and level of participants’ understanding.25
AC C
Given the expected duration of a sequence set by design at 12.5 rounds, the combination of Results 2 and 3 explains why we do not find an aggregate treatment effect. Dynamic externalities induce participants to reduce initial emissions, as compared with earnings dropped below e10 or started monotonically decreasing after reaching a maximum below e10. In terms of number of choices, 235 in Immediate, 342 in Halving, and 0 in Persistent. None of the results changes when choices under limited liability are omitted. 25 Due to the indefinite horizon, our dataset is unbalanced. Table C.5 in Appendix also reports a robustness check where we truncate observations at round 20 in order to deal with a more balanced dataset (col. 2). Focusing only on the first 20 rounds does not change the essence of any of the results although a positive but weaker trend is here found in the Immediate treatment too. The difference in trends between Immediate and Persistent remains strong (p-value < 0.001). The difference between Immediate and Halving trends is instead somewhat weaker (p-value = 0.107). Finally, checking for nonlinear trends, Persistent and Halving treatments still exhibit significantly steeper trends than Immediate.
25
ACCEPTED MANUSCRIPT
18
Figure 3: Aggregate emissions over the rounds.
C-MPE Immediate
5
6
7
8
9
RI PT
10
11
12
13
14
Halving
4
Average individual emission
15
16
17
Persistent
1
2
3
Social optimum
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
SC
1
Round number within a sequence
28
29
30 (truncated)
M AN U
Note: The unit of observation is a group in a round. Every round contains observations from all sessions and all sequences implemented in a session (3 or 4 depending on the session). Due to the random termination of the sequences, the number of observations decreases over rounds: in round 1 N = 55 in Immediate, and N = 60 in Halving and Persistent, see also footnote 25.
static externalities, but at the same time they are associated with emissions that increase more over the rounds and thus take-over those in the Immediate treatment. In the Immediate treatment participants also start from a somewhat low emissions level possibly
TE D
to give cooperation a chance. However, they begin scaling up emissions when observing deviations to higher emissions. It is notable that the upward trend that we identified in Immediate disappears around round 10, when the average emission is close to the C-MPE.
EP
At that point participants play the same best-response emission to the Nash equilibrium and emissions stop growing.
AC C
To uncover the behavioral channels behind Results 2 and 3, the next three results give an assessment of the strategies adopted by the participants in the different treatments as compared to our theoretical benchmarks. We will first evaluate the share of participants that adopt constant strategies (Proposition 1) against the percentage of those who display non-constant patterns. We will then investigate whether changes in emissions are more in line with a behavior ensuing from the adoption of trigger strategies (Proposition 2) or if they are instead the outcome of stock-dependent strategies (Proposition 3). Result 4 (C-MPE). The constant Markov perfect equilibrium describes at most 5% of the individual strategies in any treatment. 26
ACCEPTED MANUSCRIPT The C-MPE equilibrium is a poor predictor of behavior in our experiment. This conclusion is remarkably similar across treatments. When taking to the data the CMPE equilibrium of an individual emission of 12 in every round (Proposition 1), one can hardly find any support: less than 1% of participants fit this definition.26 A more
RI PT
generous definition of C-MPE that includes all individuals emitting 12 plus or minus 2 in every round fits 4% of participants in Immediate, 2% in Halving, and 3% in Persistent. When taking as reference the average emission of a group in a sequence, this classification yields similar results: 4% in Immediate and Halving, 5% in Persistent.
SC
How many participants instead managed to sustain the social optimum? Result 5 addresses this question by applying the same technique as employed in Result 4.
M AN U
Result 5 (Social optimum). A minority of participants – less than one third – always choose near-socially optimal emission levels.
The socially optimal outcome has some empirical attraction, especially in the Persistent treatment. At the group level, nobody in Immediate or Halving was able to achieve
TE D
a group average emission of 3 in every round of a sequence. In Persistent only 2% of groups managed to do so. When applying a more generous definition of social optimum, which is a constant emission of 5 plus or minus 2, one can classify as closer to the social
EP
optimum about 13% of groups in Immediate, 12% in Halving, and 28% in Persistent. This is similar to the classification of individuals with this more generous definition: 13%
AC C
of participants in Immediate, 8% in Halving, and 31% in Persistent.27 In Results 4 and 5 we attempted to classify each individual in a sequence considering just one constant emission level. Here we generalize this exercise by classifying an individual as constant if she follows any of the eighteen possible levels of constant emissions. In the experiment, only a minority of individuals actually followed a constant strategy when allowing a bandwidth of +/ − 2: 25% in Immediate, 17% in Halving, and 47% in 26
The unit of observation is a participant in a sequence: N = 180 in Immediate, N = 100 in Halving, N = 240 in Persistent. Only sequences of three or more rounds are considered. 27 The socially optimal emission of 3 remains at the lower bound of the interval considered. When focusing on longer sequences treatment differences are similar. For example, when considering only sequences that lasted at least 6 rounds (instead of 3), classified participants are 8% in Immediate, 3% in Halving, and 14% in Persistent.
27
ACCEPTED MANUSCRIPT Persistent. An additional analysis at the group level provides further insights into non-constant behavior. Table V partitions the emission space into sixteen cells based on the initial and last emission, where groups exhibiting no trend in emissions are on the main diagonal
RI PT
(shaded area). Among the groups that interacted for at least two rounds, those with roughly constant emissions are 36% in Immediate (16/45), 51% in Halving (23/45), and 37% in Persistent (22/60). Consistently with the previous analyses, many groups do not fit a constant strategy classification. Moreover, Table V shows that groups starting with
SC
high emissions rarely became cooperative. The ratio of the number of groups with strictly increasing vs. strictly decreasing trend is about 4:1 in Immediate, 10:1 in Halving, and
M AN U
9:1 in Persistent. In line with Result 2, groups in Halving and Persistent are statistically significantly more likely to start with an average emission close to the social optimum (interval 1-5) than in Immediate (Probit regression: Immediate vs. Halving p-value= 0.173, Immediate vs. Persistent p-value= 0.013, N = 175).28 Consistently with Result 3, approximately 50% of these cooperative groups exhibit higher emissions by the end of
TE D
the game in both dynamic treatments.
Table V: Classification of groups based on emissions in the first and last round. First round emission, by treatment
1–5
6–10
5 (2)
6–10
11–14
15–18
3
14 (7)
AC C
emission
Last round
1–5
EP
Immediate (N = 55)
2
Halving (N = 60)
1–5
6–10
7 (2)
1
5
23 (8)
11–14
Persistent (N = 60) 15–18
1–5
6–10
11–14
11
3
1
7
7
1
11–14
15
5
1
7
8 (5)
2
7
3
15–18
4
4
2 (1)
5
3
4
12
2
15–18
1
Note: The unit of observation is a group in a sequence. All sequences are included. In parentheses are the number of groups that interacted for only one round.
With our theory we can identify two explanations for participants changing and increasing their emissions. They may react to others’ deviations, as with trigger strategies (Proposition 2 and Result 6) or they may follow stock-dependent strategies (Proposition 28 The Probit regression includes data from all sequences and controls for treatment dummies, sequence number, length of current and past sequence. Standard errors are clustered at session-level.
28
ACCEPTED MANUSCRIPT 3 and Result 7). Result 6 (Trigger strategies). Participants using trigger-strategies activated by deviations were estimated at between 21% and 36% depending on the treatment. Participants following a trigger strategy should punish others’ defections from the co-
RI PT
operative play by increasing their own emissions (Proposition 2). We follow the algorithm employed in Ghidoni et al. (2017) to identifying emission patterns in line with strategies of a trigger-type, that is, an individual who transitions from a cooperative mode to a
SC
punishment mode in the round following a defection event. The unit of observation of this analysis is a participant in a sequence that lasted three or more rounds. The algorithm focuses on all the instances where one observes the maximal positive jump in
M AN U
emissions made by a participant between two subsequent rounds. We interpret these maximal jumps as possible candidates for the switch to a punishment mode. An individual is hence classified as a trigger-type if two conditions are met. First, immediately before the first instance of maximal jump the individual experienced a defection by other
TE D
group members. We consider as defection any positive increase in others’ average emissions between two rounds before and the round preceding that with the maximal jump, i.e. e−i (t − 1) − e−i (t − 2) > 0 where e−i is the average emission of the other three group members. Second, in the presence of multiple instances of maximal jump within the same
EP
sequence, the algorithm requires that the mean emission of the others in the two rounds before the jumps was strictly increasing on average.
AC C
According to the algorithm, participants in a sequence that can be identified as following a trigger strategy are 34% (75/220) in Immediate, 21% (51/240) in Halving, and 36% (86/240) in Persistent. These shares are statistically indistinguishable when comparing Immediate vs. Halving and Immediate vs. Persistent. The share of trigger-types in Halving is significantly lower than in Persistent (Wald test in Table C.6 of the Appendix). While this algorithm clearly relies on some behavioral assumptions, it can capture emission patterns that are consistent with both grim trigger and T -rounds punishment strategies.29 The outcome presented should not, however, be regarded as an exact esti29
As discussed with Proposition 2, the social optimum can be also sustained with a limited period of
29
ACCEPTED MANUSCRIPT mate but rather an approximate one. The algorithm classifies as trigger-types individuals who reverted to (higher) emissions different from the C-MPE (as it should instead be the case in C-TE strategies). The classification of the algorithm is also not a necessary condition for trigger-types. It may fail to classify as such an individual who follows a trigger
RI PT
strategy if the trigger is never activated. Overall, this procedure may overestimate or underestimate the number of participants using these strategies. Hence, we performed a further analysis focusing on non-constant types, where the algorithm has better chances to identify trigger strategies.
SC
In particular, one can intersect the two classification criteria, trigger-strategy vs. constant strategy with a bandwidth of plus or minus 2. In this new analysis, about
M AN U
half of participants that follow a non-constant strategy can be classified as trigger-types. More precisely, the shares are about 49% (63/128) in Immediate, 55% (46/83) in Halving, and 52% (67/128) in Persistent. Instead, much smaller shares of individuals that follow a constant strategy can be classified as trigger-types: 13% (12/92) in Immediate, 5% (5/157) in Halving, and 17% (19/112) in Persistent.
TE D
Alternatively to trigger strategies, participants who exhibit non-constant emissions may have followed stock-dependent strategies (Proposition 3). Result 7 (Stock-dependent strategies). In the Halving and Persistent treatments,
EP
the lagged level of the stock in a sequence is on average positively correlated with the current individual emissions.
AC C
Support for Result 7 is provided by Figure 4 and Table III. In the Halving and Persistent treatments, the emissions of the average participant positively correlate with the level of the stock of pollution in the group as shown by Tobit regressions in Table III (col. 3 and 5). This empirical exercise suffers from the limitation that – due to the experimental design – the correlation between rounds and the stock of pollution is rather strong. Nevertheless, by exploiting the data variability across groups, we can disentangle the correlation between emissions and the stock from that between emissions and a simple time trend. When including both the rounds in a sequence and the stock of pollution punishment because in our experiment the discount factor parameter δ is high.
30
ACCEPTED MANUSCRIPT as regressors, the coefficient of the time trend (“Round number within a sequence”) becomes negative, while the coefficient of the stock is positive and statistically significant, which suggests a preeminence of the stock over the time trend variable in explaining the increasing dynamic of emissions.30 This result suggests that on average participants
RI PT
adopted stock-dependent strategies. However, considering the large heterogeneity in individual strategies highlighted by Results 4–6, our analysis cannot rule out that the share of participants using stock-dependent strategies is still a minority.
Figure 4: Participants’ emissions depend on the stock of pollution.
19 20
C-MPE
4 1
2
5
6
7
8
9
10 11
14 12 13 15
Average emission in the round
17
18
16
3
Social optimum
200
400
600
Stock of pollution
800
1000
C-MPE
32 27 2829 31 33 23 24 25 30 34 22 15 17 2635 16 20 1918 21 9 1410 813 7 12 11
6 5
3
4
2
1
Social optimum
0
20
40
C-MPE steady state
60
Stock of pollution
80
100
120
TE D
0
SC
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
(b) Halving
M AN U
Average emission in the round
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
(a) Persistent
EP
Note: One observation is the average emission of each group in a round of a sequence. All sequences are included. Each dot is the average of all groups in a round over all sequences. The number next to each dot indicates the round number within a sequence. The stock of pollution is measured at the beginning of the round.
Figure 4 illustrates average emissions in a given round as a function of the stock of
AC C
pollution. The theoretical benchmarks of social optimum and C-MPE are represented by flat lines. Empirically, in the Persistent treatment, the higher the stock, the higher were average emissions (Figure 4a). As a possible consequence of the ever-increasing stock in Persistent, about one third of groups end up with average emissions above 15 (19 out of 60 groups, in contrast with 9 out of 45 in Immediate and 8 out of 45 in Halving; see Table V). In the Halving treatment participants’ strategies are generally increasing in the stock of pollution but we frequently observe “cycles” at given levels of stock (Figure 4b). A 30
Differences in the magnitude of the stock coefficients are attributable to the rescaling of the damage coefficient c across treatments (Table I).
31
ACCEPTED MANUSCRIPT cycle takes place when both the stock of pollution and the global emissions of the group simultaneously decrease from one round to the next, but later they start increasing again. In the Halving treatment, 15 out of 20 groups that played more than three rounds experience a cycle. The decreasing phase of a cycle takes place at stock levels between 34
RI PT
and 142, with an average of 101, which is close to the predicted steady-state stock of 96 under C-MPE.31 This pattern is consistent with the strategies illustrated in Proposition 3 and with the “greenhouse trap” suggested in Dutta and Radner (2009). When following stock-dependent strategies, the convergence to a stock may take place from below (with
SC
increasing emissions) or from above (with decreasing emissions).32
As shown in Figure 4, in the dynamic treatments, the pollution stock often exceeds
M AN U
C-MPE levels, which is not predicted by equilibrium behavior of the theoretical analysis. This pattern is confirmed by more disaggregated analyses. The stock was above the CMPE level for at least three rounds, not necessarily consecutive, in 60% of the groups of the Halving treatment and in 13% of the groups of the Persistent treatment.33 A possible behavioral interpretation for the higher frequency of stock overshooting in Halving vs.
TE D
Persistent may be that participants in Halving have the impression of being able to control the stock of pollution and reduce it if necessary, as shown by the “cycles”. In Persistent,
Discussion and conclusions
AC C
6
EP
instead, participants may be more cautious because of stronger persistence.34
To foster international efforts to combat climate change, we need a thorough grasp of those factors that hinder or favor cooperation. Here we employ the experimental method to 31 A group-by-group graphical analysis is in Figures C.1 and C.2 of the Appendix. In Halving, 40% of the groups display more than one decreasing phase over the sequence. In Persistent, instead, it can be observed that the majority of the groups display a positive relationship between stock and emissions. 32 This pattern is the one illustrated in Figure 4b and Figure C.2, precisely “around” the C-MPE steady-state stock. 33 These percentages refer to groups that interacted for at least three rounds: N = 25 in Halving and N = 60 in Persistent. 34 Stock and emissions above C-MPE levels may be the outcome of temporary punishment phases of sophisticated non-Markov behavior off equilibrium. This is a theoretical possibility which, however, requires high levels of sophistication and coordination among individuals and strategies that depend not only on the stock of pollution but also on the precise date of the period.
32
ACCEPTED MANUSCRIPT gain an understanding of behavioral drivers that are considered crucial in overcoming this special type of social dilemma. In particular, the focus is on a central issue of climate change: the long-term persistence of key greenhouse gases in the atmosphere, which generates irreversibility. We developed an experimental platform carefully calibrated to
RI PT
identify the causal effect of different degrees of persistence of pollution on the ability to cooperate in mitigating emissions. We compare settings that cover the full range of possibilities: a static (although repeated) treatment with no persistence, a dynamic treatment where emissions cumulate and last forever, and an intermediate treatment with
SC
a decay in pollution at a rate of 50% every round.
We report three main findings. First, the persistence of pollution leads to a high initial
M AN U
cooperation level (Result 2). This contrasts with what one may conjecture: a scenario where today’s actions have only immediate consequences could be expected to elicit more cooperation than a complex scenario with diluted and persistent consequences over time. Second, although the previous result seems reassuring, we also report that, when pollution accumulates because of persistence, cooperation declines (Result 3). This may be
TE D
explained with the adoption of trigger strategies reacting to (deviating) current emissions or strategies that are increasing in the stock of pollution. These strategies are indeed equilibria of the game with dynamic externalities. Our evidence suggests that the
EP
increasing trend in emissions emerging in the Halving and Persistent treatments is linked to the stock of pollution rather than to time or experience (Result 7). With Markov
AC C
strategies, the level of the stock of pollution can become a coordination device that is not available when persistence is nil (Proposition 3). However, this type of coordination is a difficult endeavor. With persistence, lower initial emissions may be a clever way to learn others’ intentions toward cooperation in the presence of irreversibility. However, cooperation is not perfectly controllable by a single decision-maker because the stock of pollution reflects the emissions of all decision-makers. Hence, it can be easily depleted and, when cooperation breaks down, the stock keeps growing. Third, behavioral differences in emissions between rich and poor types as modeled in the experiment reported here are of second-order importance (Result 1). The extent to
33
ACCEPTED MANUSCRIPT which the rich participants emit less than poor ones is rather small, and it is especially evident with dynamic externalities. This empirical result is consistent with the theoretical predictions of our environment and suggests that rich participants are not willing to contribute much more than poor participants to the mitigation efforts, even if mitigation
RI PT
is more demanding for the poor in terms of welfare. As a consequence, there is a wide earnings gap between poor and rich types.
Any economic experiment on climate change is necessarily a study in a highly simplified setting with some restrictive assumptions. This is also true in our case. Take
SC
for example the damage function: in the laboratory, impacts are immediate and deterministic instead of delayed and stochastic (Ghidoni et al., 2017). Uncertainty or limited
M AN U
perception of the actual environment may impair decision-makers’ ability to cooperate. Also the initial level of pollution stock, zero in all our treatments, may play a role. Future work can remedy these and other limitations of our approach. Nevertheless, we believe that experiments are additional means to further our understanding of how to foster international cooperation in tackling climate change.
TE D
This study makes a novel contribution to the behavioral components of cooperation to solve the climate change problem by addressing the issue of pollution persistence. One cannot take findings from laboratory experiments and use them directly to give policy
EP
advice. Caution should be exercised in this transfer of knowledge because the external validity of economic experiments on climate change is not automatic. Yet our results have
AC C
some relevance for the research agenda on climate change and can complement available field evidence. Scholars have claimed that persistence severely affects the ability of nations to cooperate, but the empirical support is lacking. With a simple model of climate change, our study suggests that policy-makers should not delay mitigation actions until the situation deteriorates, because the experimental evidence suggests that cooperation becomes harder than it was early on. In the experiment, the climate model involves bottom-up mitigation efforts but does not include a system of pledges similar to the one that led to the Paris agreement. Voluntary pledges allow for common but differentiated responsibilities in mitigation of poor and rich regions. In our climate game, satisfying the
34
ACCEPTED MANUSCRIPT right for development of poor regions risks derailing the overall cooperative equilibrium. Rich regions should be willing to restrict emissions voluntarily in order to ensure the right to development of poor regions, but this tendency is rather weak in the experiment. One solution could be explicitly agreeing on distinct targets of emissions by income level,
RI PT
which would at the same time sustain a strong aggregate mitigation and the right to development. Carbon is forever, but the good news is that, at least in the short-term, this will not initially condemn us to suffer from lack of cooperation any more than in the usual static social dilemmas. Nevertheless, when taking a long-run perspective, policy
SC
interventions are urgent and must start as soon as possible.
M AN U
References
Andreoni, J. (1995). Warm-glow versus cold-prickle: the effects of positive and negative framing on cooperation in experiments. Quarterly Journal of Economics 110 (1), 1–21. Barrett, S. and A. Dannenberg (2012). Climate negotiations under scientific uncertainty.
TE D
PNAS 109 (43), 17372–17376.
Barrett, S. and A. Dannenberg (2014). Sensitivity of collective action to uncertainty
EP
about climate tipping points. Nature Climate Change 4 (1), 36–39. Battaglini, M., S. Nunnari, and T. R. Palfrey (2016). The dynamic free rider problem:
AC C
A laboratory study. American Economic Journal: Microeconomics 8 (4), 268–308. Bosetti, V., C. Carraro, and M. Tavoni (2012). Timing of mitigation and technology availability in achieving a low-carbon world. Env. & Resource Econ. 51 (3), 353–369. Bosetti, V., M. Heugues, and A. Tavoni (2017). Luring others into climate action: Coalition formation games with threshold and spillover effects. Oxford Economic Papers 69 (2), 410–431. Burke, M., S. Hsiang, and E. Miguel (2015). Global non-linear effect of temperature on economic production. Nature 527 (7577), 235–239. 35
ACCEPTED MANUSCRIPT Camera, G. and M. Casari (2009). Cooperation among strangers under the shadow of the future. American Economic Review 99 (3), 979–1005. Dal B´o, P. and G. R. Fr´echette (2017). On the determinants of cooperation in infinitely repeated games: A survey. Journal of Economic Literature (Forthcoming).
RI PT
Dannenberg, A., A. L¨oschel, G. Paolacci, C. Reif, and A. Tavoni (2015). On the provision of public goods with probabilistic and ambiguous thresholds. Environmental and Resource Economics 61 (3), 365–383.
Self-enforcing climate-change treaties.
M AN U
PNAS 101 (14), 5174–5179.
SC
Dutta, P. K. and R. Radner (2004).
Dutta, P. K. and R. Radner (2009). A strategic analysis of global warming: Theory and some numbers. Journal of Economic Behavior & Organization 71 (2), 187–209. Falk, A. and J. J. Heckman (2009). Lab experiments are a major source of knowledge in
TE D
the social sciences. Science 326 (5952), 535–538.
Fiala, L. and S. Suetens (2017). Transparency and cooperation in repeated dilemma games: a meta study. Experimental economics 20 (4), 755–771.
EP
Fischbacher, U. (2007). z-Tree: Zurich toolbox for ready-made economic experiments. Experimental Economics 10 (2), 171–178.
AC C
Gerlagh, R. and E. Van der Heijden (2015). Going green: Framing effects in a dynamic coordination game. CentER Discussion Paper 2015-054. Ghidoni, R., G. Calzolari, and M. Casari (2017). Climate change: Behavioral responses from extreme events and delayed damages. Energy Economics 68 (1), 103–115. Greiner, B. (2015). Subject pool recruitment procedures: Organizing experiments with ORSEE. Journal of the Economic Science Association 1 (1), 114–125. Hauser, O. P., D. G. Rand, A. Peysakhovich, and M. A. Nowak (2014). Cooperating with the future. Nature 511 (7508), 220–223. 36
ACCEPTED MANUSCRIPT Herr, A., R. Gardner, and J. M. Walker (1997).
An experimental study of time-
independent and time-dependent externalities in the commons. Games and Economic Behavior 19 (1), 77–96. Inman, M. (2008). Carbon is forever. Nature Reports Climate Change, 156–158.
RI PT
IPCC (2007). Climate Change 2007: The Physical Science Basis. Contribution of WGI to 4th Assessment Report of IPCC. Cambridge University Press.
IPCC (2014). Climate Change 2014: Mitigation of Climate Change. Contribution of
SC
WGIII to 5th Assessment Report of IPCC. Cambridge University Press.
M AN U
JRC and PBL (2016). The emissions database for global atmospheric research (EDGAR). Available at http://edgar.jrc.ec.europa.eu/.
Khadjavi, M. and A. Lange (2015). Doing good or doing harm: experimental evidence on giving and taking in public good games. Experimental Economics 18 (3), 432–441.
TE D
K¨oke, S. and A. Lange (2017). Negotiating environmental agreements under ratification constraints. Journal of Environmental Economics and Management 83, 90–106. Laury, S. and C. Holt (2008). Voluntary provision of public goods: Experimental results
EP
with interior nash equilibria. In C. Plott and V. Smith (Eds.), Handbook of Experimental Economics Results, Volume 1, pp. 792–801. North-Holland.
AC C
Milinski, M., R. D. Sommerfeld, H.-J. Krambeck, F. A. Reed, and J. Marotzke (2008). The collective-risk social dilemma and the prevention of simulated dangerous climate change. PNAS 105 (7), 2291–2294. Nordhaus, W. D. (2013). The climate casino: Risk, uncertainty, and economics for a warming world. Yale University Press. Pevnitskaya, S. and D. Ryvkin (2013). Environmental context and termination uncertainty in games with a dynamic public bad. Environment and Development Economics 18 (01), 27–49.
37
ACCEPTED MANUSCRIPT Sherstyuk, K., N. Tarui, M.-L. V. Ravago, and T. Saijo (2016). Intergenerational games with dynamic externalities and climate change experiments. Journal of the Association of Environmental and Resource Economists 3 (2), 247–281. Stern, N. H., H. M. Treasury, et al. (2006). Stern Review: The economics of climate
RI PT
change. Cambridge University Press. Tavoni, A., A. Dannenberg, G. Kallis, and A. L¨oschel (2011). Inequality, communication,
11825–11829.
SC
and the avoidance of disastrous climate change in a public goods game. PNAS 108 (29),
Wagner, G. and M. L. Weitzman (2015). Climate shock: The economic consequences of
AC C
EP
TE D
M AN U
a hotter planet. Princeton University Press.
38