Subjective probabilities on “small” domains

Subjective probabilities on “small” domains

Journal of Economic Theory 133 (2007) 236 – 265 www.elsevier.com/locate/jet Subjective probabilities on “small” domains Igor Kopylova, b,∗,1 a Instit...

385KB Sizes 5 Downloads 99 Views

Journal of Economic Theory 133 (2007) 236 – 265 www.elsevier.com/locate/jet

Subjective probabilities on “small” domains Igor Kopylova, b,∗,1 a Institute for Mathematical Behavioral Sciences, University of California, Irvine, CA 92697, USA b Department of Economics, University of California, Irvine, CA 92697, USA

Received 16 March 2005 Available online 18 January 2006

Abstract The Savagian choice-theoretic construction of subjective probability does not apply to preferences, like those in the Ellsberg Paradox, that reflect a distinction between risk and ambiguity. We formulate two representation results—one for expected utility, the other for probabilistic sophistication—that derive subjective probabilities but only on a “small” domain of risky events. Risky events can be either specified exogenously or in terms of choice behavior; in the latter case, both the values and the domain of probability are subjective. The analysis identifies a mathematical structure—called a mosaic—that is intuitive for both exogenous and behavioral specifications of risky events. This structure is weaker than an algebra or even a -system. © 2005 Elsevier Inc. All rights reserved. JEL classification: D80; D81 Keywords: Subjective probability; Expected utility; Probabilistic sophistication; Ellsberg Paradox; Mosaic

1. Introduction Savage [19] provides foundations for the use of subjective probability in decision making. More precisely, he axiomatizes an expected utility representation for preference over uncertain prospects—acts defined on a set S of states of nature—in a way that does not rely on any extraneous randomizing device. One component of this representation is a probabilistic belief assigned subjectively on a universal -algebra of events , such as the power set of all subsets of S. ∗ Corresponding author at: Department of Economics, University of California, Irvine, CA 92697, USA.

E-mail address: [email protected]. 1 This paper is based on a chapter of my doctoral thesis (University of Rochester, 2003) supervised by Larry Epstein.

I owe special thanks to him for guidance and generous advice. I am grateful to Peter Wakker, Eddie Dekel, Itzhak Gilboa, Klaus Nehring, Mark Machina, Peter Klibanoff, Arthur Robson, and anonymous referees for helpful comments. 0022-0531/$ - see front matter © 2005 Elsevier Inc. All rights reserved. doi:10.1016/j.jet.2005.11.002

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

237

Savage’s theory has some well-known limitations. In a paradox due to Allais [2], all events have explicit numerical probabilities, but preferences are not represented by expected utility. Motivated by theAllais paradox, Machina and Schmeidler [17] derive subjective probabilities separately from expected utility. In their model of probabilistic sophistication, the decision maker ranks acts in two stages: first, she uses subjective probabilities to translate each act into a lottery (a distribution over prizes), and then she ranks the induced lotteries but not necessarily via expected utility. Note that both Savage’s theory and Machina–Schmeidler’s extension derive subjective probabilities for all events in the universal -algebra . In another paradox, due to Ellsberg [4], the decision maker is unwilling to assign probabilities to some events in . For example, consider an urn that contains balls of three possible colors—blue (B), green (G), and red (R). The decision maker is told only that the total number of balls is 90 and R = 30. Then the typical preference is to bet on the event {R} rather than on the event {B} because the frequency of {R} is known to be 13 , while the frequency of {B} is not known precisely and lies between 0 and 23 . Analogously, it is typical to bet on {B, G} rather than on {R, G}. This preference reversal shows that the decision maker does not rely on any belief p because otherwise p({R}) > p({B}) but also p({B}) + p({G}) > p({R}) + p({G}). Thus, the Ellsberg Paradox demonstrates the behavioral significance of the distinction between risk, which can be represented by numerical probabilities, and ambiguity, which cannot. 2 In the light of his findings, Ellsberg states that “both the predictive and normative use of the Savage or equivalent postulates might be improved by avoiding attempts to apply them in certain, specifiable circumstances where they do not seem acceptable”. For instance, one might restrict Savage’s (or Machina–Schmeidler’s) axioms to acts measurable with respect to a -algebra ∗ ⊆  generated by an extraneous randomizing device, such as a fair coin or a roulette wheel. Then one could still obtain subjective probabilities, but only on the “small” domain ∗ ⊆ . Our main objective is to adapt Savage’s and Machina–Schmeidler’s axioms and characterize the use of subjective probability on a suitable domain R ⊆  of risky events that need not be a algebra. Weakening the mathematical structure of the domain R (which may seem a technical issue at first) is well-motivated in the literature on decision making under ambiguity. In this literature, R is often understood as an exogenous class of events for which objective probabilities (frequencies) are given to the decision maker. As pointed out by Zhang [22], such R is only a -system, that is, closed under complements and disjoint unions, but not necessarily under intersections. For example, consider an urn containing balls of four possible colors—blue (B), green (G), red (R), and yellow (Y )—and suppose that the decision maker is told only that B + G + R + Y = 100 and B + G = B + Y = 50. Then R = {∅, {B, G}, {R, Y }, {B, Y }, {G, R}, S} is a -system, but not an algebra. Alternatively, risky events can be defined endogenously. In other words, one can define whether an event A ∈  is subjectively risky or ambiguous in terms of the decision maker’s preference over acts. Zhang [23] and Epstein–Zhang [5] formulate two leading definitions of subjectively risky events. 3 However, neither of these definitions implies that the domain of such events, RZ ⊆  2 This distinction is often attributed to Knight [14]. He refers to uncertainty rather than ambiguity. We adopt Ellsberg’s terminology, in which uncertainty is comprehensive and includes both risk and ambiguity. 3 These authors use the term unambiguous rather than risky. The difference between the two definitions in Zhang [23] and Epstein–Zhang [5] reflects the distinction between expected utility and probabilistic sophistication. Ghirardato [8] reviews other possible approaches.

238

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

or REZ ⊆ , respectively, is a -algebra or even a -system. In general, both RZ and REZ have a weaker structure that is called mosaic and is defined later. In a more abstract vein, assuming countable unions or intersections can be a problem, even for the universal class of events . 4 Indeed, the decision maker must conceive of all events in  prior to ranking all -measurable acts. However, a -algebra, even when generated by simple events, often contains very complex ones. To illustrate, let each state of the world s be determined by an infinite sequence of coin tosses, let  be the algebra of cylinders—events that rely on finitely many coin tosses—and let () be the smallest -algebra that contains . It is well-known that many events in () are not readily obtained via combinations of cylinders. More precisely, for any countable ordinal , there are sets in () that cannot be arrived at from  by a -sequence of set-theoretic operations, each operation being a complement, a countable union or a countable intersection [3, pp. 31–32]. In this sense, the class () is substantially more complex than . Probability theorists avoid this complexity and do not construct measures directly on a -algebra; rather, the typical procedure is to describe the measure explicitly on “simple” events and then to apply a measure extension theorem (recall the construction of the Lebesgue measure on the Borel -algebra). Similarly, the decision maker who conceives of all events in an algebra  may be unable or unwilling to conceive of some sets in (). Therefore, reliance on all events in a algebra may be problematic for a normative theory of subjective probability. Our model alleviates this problematic aspect by taking both  and R to be finitely additive domains (an algebra and a mosaic, respectively). We formulate two main representation results—one for expected utility (Theorem 3.1), the other for probabilistic sophistication (Theorem 4.1)—that derive subjective probabilities on a given mosaic R ⊆  from preference over R-measurable acts. In particular, one can apply these results to R = RZ and to R = REZ , respectively, and thus, obtain fully subjective models of expected utility and probabilistic sophistication. Both models are constructive and derive the values and the domain of the subjective probability measure from preference. However, the two models use different definitions of risk and obtain different representations for preference. The former delivers expected utility over RZ -measurable acts, while the latter delivers probabilistic sophistication over REZ -measurable acts. The noted Theorems 3.1 and 4.1 show that the content of Savage’s and Machina–Schmeidler’s axioms can be preserved almost entirely when restricted to a “small” domain of risky events. An essential modification is required only for Savage’s P6; all the other axioms are restricted in a straightforward fashion. (Other extensions of Savage’s result, such as Gul [12] and Abdellaoui and Wakker [1], adopt arguably less transparent axioms for preference and additional structure for outcomes of acts.) However, the construction of subjective probability, which is a key part of Savage’s proof and is used also by Machina–Schmeidler, does not carry through to mosaics— or even to algebras—and needs to be replaced by an alternative strategy. We propose a compact formula that computes subjective probabilities for all risky events from the ranking of bets on such events. This formula is intuitive and reveals a general connection between the betting behavior and the underlying numerical probability. Our results allow a lot of flexibility for applications. In particular, choice outside the domain of risky acts is not restricted by any conditions, or a fortiori by any parametric model, such as Choquet expected utility [20] or the multiple priors model [10]. Also, there is freedom in the 4 Savage acknowledges that the assumption that  is a -algebra is essential for his proof, but notes that it is peculiar that “one should use countable unions of events in order to derive a finitely additive probability measure.” Machina–Schmeidler and Epstein–Zhang also take  to be a -algebra.

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

239

identity of the mosaic R where subjective probabilities are derived. We focus on applications where R = RZ and R = REZ , but in general, R need not equal either of these domains. This paper proceeds as follows. Next we present our formal framework and define the notion of a mosaic. In Sections 3 and 4, we formulate the two main representation results—one for expected utility (Theorem 3.1), the other for probabilistic sophistication (Theorem 4.1). In Section 5, we apply these results to R = RZ and to R = REZ , respectively. Proofs are sketched in Section 6 and are presented in detail in the Appendix. 2. Preliminaries We use a version of Savage’s framework. Given are • a set X of deterministic outcomes, • a set S of states of the world, • a class  of subsets of S that are called events. No formal structure is imposed on X and S. The class  is assumed to be an algebra rather than a -algebra. Call a function f : S → X an act if it is -measurable and has finite range. Identify outcomes x ∈ X with the corresponding constant acts. Let F be the set of all acts. Interpret each act f ∈ F as a physical action that yields the outcome f (s) contingent on the state s ∈ S. The decision maker’s preference over such actions is given as a binary relation  on F. 5 In addition to the primitives S, X, , and , fix a class R ⊆  of risky events; call all other events in  ambiguous. Say that S = ∪m i=1 Si is a grand partition if the events S1 , . . . Sm are disjoint and risky. Assume that R is a mosaic, that is, satisfies the following conditions: (∗) S ∈ R ; (∗∗) A ∈ R ⇒ Ac ∈ R; and () S = ∪m i=1 Si is a grand partition ⇒ Si ∪ Sj ∈ R for all i, j = 1 . . . m. Equivalently, () requires that any partition coarser than a grand partition is also a grand partition. Note that a mosaic need not be closed under intersections, or unions, or even disjoint unions. For example, if S = {R, G, B}, then the mosaic {∅, {R}, {G}, {G, B}, {R, B}, S} does not contain the disjoint union of its elements {R} and {G}. The mosaic structure—unlike the more restrictive -system—is implied by the behavioral definitions of risky events due to Zhang [23] and Epstein–Zhang [5] (see Section 5 for further discussion). Call an act f ∈ F risky if it is R-measurable, that is, f −1 (x) ∈ R for all x ∈ X; call f ambiguous otherwise. Note that each constant act x ∈ X is risky because S ∈ R and ∅ = S c ∈ R. Let G ⊆ F be the set of all risky acts. The fact that R may not be closed under intersections makes the following notation useful. Given a collection of risky events E ⊆ R, let RE = {A ∈ R : A ∩ E ∈ R for all E ∈ E}, 5 Savage’s original framework differs in two respects. First, he assumes  to be a -algebra; second, he models preference on the domain of all -measurable mappings from S into X, calling any f ∈ F a simple act to reflect the fact that f has finite range. We do not use acts other than simple (hence, this term becomes redundant). Accordingly, we do not need an extra axiom, such as Savage’s P7, to extend our representations from F to the larger domain of choice. However, we retain a “technical” axiom analogous to Savage’s P6 that puts some implicit restrictions on S and ; in particular, these sets cannot be finite.

240

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

and let GE be the set of (RE)-measurable acts. In other words, GE consists of risky acts that remain R-measurable when restricted to any event E from the given collection E. Note that if R is an algebra, then the above notation is redundant because R = RE and G = GE. For any acts f, h ∈ F and any event A ∈ , define a composite act f Ah that yields f (s) if s ∈ A and h(s) if s ∈ Ac . Note that f Ah may be ambiguous even when the event A and the acts f and h are risky. For example, take outcomes x = y and risky events A, B ∈ R such that A∩B ∈ / R; then f = xBy and h = y are risky acts, but f Ah = x(A ∩ B)y is not. One way to guarantee that f Ah ∈ G is to require that A ∈ R,

f ∈ G  {A}

and

h ∈ G  {Ac }.

(1)

Under these conditions, the events f −1 (x) ∩ A and h−1 (x) ∩ Ac form a grand partition as x varies over the finite range of the acts f and h; by (), (f −1 (x) ∩ A) ∪ (h−1 (x) ∩ Ac ) = (f Ah)−1 (x) is a risky event for each x ∈ X, and the act f Ah is risky. 3. Subjective expected utility To characterize subjective expected utility, Savage [19] (see also [7,16]) imposes a list of axioms P1–P6 on the preference  over the set of all acts F. However, the Ellsberg-type behavior violates one of these axioms, P2. Axiom P2 (Sure-Thing Principle). For all events A ∈ , acts f, g ∈ F, and outcomes x, y ∈ X, 6 f Ax  gAx ⇒ f Ay  gAy.

(3)

P2 states that preference is separable across mutually exclusive events and can be conditioned on any event A independently of the outcome obtained on the complement of A. To avoid Ellsberg-type preference reversals, rewrite P2 so that all the composite acts f Ax, f Ay, gAx, and gAy in the invariance (3) are risky, that is, belong to the class G of R-measurable acts. By (1), it is sufficient to require that A ∈ R and f, g ∈ G  {A}. Axiom P2(R). For all risky events A ∈ R, risky acts f, g ∈ G  {A}, and outcomes x, y ∈ X, f Ax  gAx ⇒ f Ay  gAy.

(4)

P2(R) asserts separability of preference but only across risky events. More precisely, P2(R) requires that preference over risky acts that have R-measurable restrictions to a risky event A can be conditioned on A independently of the outcome obtained on the complement of A. Note that P2 implies P2(R), and the two axioms are equivalent if R = . 6 Savage uses an equivalent formulation: for all A ∈  and f, g, h, h ∈ F ,

f Ah  gAh ⇒ f Ah  gAh .

(2)

The invariance (3) is a special case of (2) with constant h and h . Conversely, (2) follows from (3) by induction over the number of outcomes in the range of h and h .

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

241

Even though the Ellsberg-type behavior may comply with Savage’s axioms other than P2, it does not seem intuitive to restrain choices among ambiguous acts in order to derive expected utility for risky acts. Thus, we rewrite the rest of the list P1–P6 so that the rewritten axioms P1(R)–P6(R) involve only risky acts and events. Axiom P1(R).  is complete and transitive on G. If R = , then P1(R) is equivalent to Savage’s P1. In general, P1(R) is weaker than P1 and allows an incomplete or intransitive choice among ambiguous acts. Axiom P3(R). For each risky event A ∈ R, at least one of the following statements holds simultaneously for all outcomes x, y ∈ X and for all acts h ∈ G  {Ac }: (i) x  y ⇔ xAh  yAh; (ii) xAh ∼ yAh. P3(R) is a version of state-independence for the ranking of deterministic outcomes. This axiom classifies each risky event as either non-null or null so that the ranking of X remains unchanged when conditioned on any non-null risky event and becomes degenerate when conditioned on any null risky event. This axiom uses only risky acts and does not classify ambiguous events. P3(R) follows from Savage’s P3; the two axioms are equivalent if R = . Axiom P4(R). For all risky events A, B ∈ R, for all outcomes x  x and z  z , if xAx  xBx , then zAz  zBz . P4(R) requires that if A and B are risky events, then the preference to bet A rather than on B should reflect exclusively the subjective relative likelihoods of A and B and be independent of the stakes involved in such bets. P4(R) follows from Savage’s P4; the two axioms are equivalent if R = . Axiom P5(R). There exist outcomes x and x such that x  x . P5(R) is identical to Savage’s P5. Before stating P6(R), we briefly discuss Savage’s Axiom P6 (Small Event Continuity). For any outcome x and for any acts f  g, there exists a partition S = ∪m i=1 Si such that for all i = 1 . . . m, xSi f  g and f  xSi g. P6 requires that for any outcome x and for any acts f  g, the universal event S can be partitioned into small events Si so that the strict preference f  g is not reversed when the outcomes f (s), or alternatively g(s), are replaced by x for all s ∈ Si . To motivate P6, consider a coin such that all finite sequences (s1 , . . . , sn ) of its heads and tails are events in . It is intuitive that for a sufficiently large n, each sequence (s1 , . . . , sn ) is a small event. Note that P6 is a condition on the preference, but also on the other primitives: for instance, P6 implies that both S and  are infinite. Rewrite P6 in terms of risky events and risky acts as follows.

242

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

Axiom P6(R). For any outcome x, for any finite collection of risky events E ⊆ R, and for any E-measurable acts f  g, there exists a grand partition {S1 , . . . , Sm } ⊆ R  E such that for all i = 1 . . . m, xSi f  g and f  xSi g. 7 When R = , P6(R) is equivalent to P6 because   E = , but in the general case when R ⊂ , P6(R) does not follow from P6 because the small events in P6(R), unlike those in P6, are restricted to a particular subclass R  E ⊆ R ⊂ . To motivate this restriction, fix a finite collection E ⊆ R and consider a coin such that all finite sequences (s1 , . . . , sn ) of its heads and tails are risky events in R and are independent of all events in E. In other words, suppose that the decision maker’s perception of the likelihoods of the events in the finite collection E is not affected by the results of the coin tosses (this notion of independence is used only to motivate P6(R) and is not a formal part of our model). It is intuitive that for a sufficiently large n, every event (s1 , . . . , sn ) is small, and for each E ∈ E, the event (s1 , . . . , sn ) ∩ E is risky because it can be assigned probability p(s1 , . . . , sn ) · p(E).  Call a function p : R → [0, 1] a probability measure if m i=1 p(Si ) = 1 for all grand partitions m S = ∪i=1 Si . Call a probability measure p : R → [0, 1] finely ranged if for any finite collection E ⊆ R and for any ε > 0, there exists a grand partition {S1 , . . . , Sm } ⊆ R  E such that p(Si ) < ε for all i = 1 . . . m. Our first main result is Theorem 3.1. Let  be an algebra, and let R ⊆  be a mosaic. Then the following two statements are equivalent: (1)  satisfies P 1(R), P 2(R), P 3(R), P 4(R), P 5(R), P 6(R); (2)  is represented on the set G by expected utility U (f ) =



u(x) · p (f −1 (x)) for f ∈ G,

(5)

x∈X

where u : X → R is a non-constant utility index, and p : R → [0, 1] is a finely ranged probability measure. In this representation, the index u is unique up to a positive linear transformation, and the probability measure p is unique. The decision maker as portrayed by (5) assigns probabilities to all risky events A ∈ R, attaches utility indices to all outcomes x ∈ X, and then ranks all risky acts f ∈ G via expected utility. Therefore, Theorem 3.1 provides foundations for the use of probabilities on risky events and for expected utility maximization on risky acts. The Savage Theorem is a special case of Theorem 3.1, where  is a -algebra and R = . In this case, the two results use equivalent axioms and deliver identical representations (this identity follows from uniqueness). Moreover, Theorem 3.1 implies that even if the algebra  is not countably additive, Savage’s axioms taken “as is” are still sufficient for an expected utility

7 All the acts xS f and xS g are risky because f, g ∈ G  {S c }. To show that f ∈ G  {S c }, consider the grand partition i i i i of S into risky events f −1 (z) ∩ Sj , where z ∈ X and 1  j  m; here, f −1 (z) ∩ Sj ∈ R because f −1 (z) ∈ E and Sj ∈ R  E. By (), f −1 (y) ∩ Sic = ∪j =i (f −1 (y) ∩ Sj ) ∈ R for all y ∈ X.

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

243

representation on the set F of all acts. 8 In other words, the assumption that  is countably additive is not crucial for axiomatizing expected utility via P1–P6. In general, Theorem 3.1 uses axioms that are parallel to Savage’s counterparts but applies them only to risky acts and events. Most importantly, P2(R) does not postulate separability of preference across ambiguous events. Accordingly, in representation (5), only risky events are assigned subjective probabilities and only risky acts are ranked via expected utility. Note that choice among ambiguous acts is not restricted by ambiguity aversion or by any parametric utility representation, such as Choquet expected utility [20] or maxmin expected utility [10]. The proof of Theorem 3.1—most importantly, the construction of the probability measure p—requires a new approach that is sketched in Section 6. 4. Probabilistic sophistication In order to accommodate Allais-type behavior, Machina and Schmeidler [17] extend Savage’s theory. They model a probabilistically sophisticated decision maker who ranks acts in two stages: first, she uses subjective probabilities to reduce each act to a lottery—a distribution over outcomes—and then she ranks the induced lotteries via a risk preference which may have no expected utility representation. The fundamental difference between expected utility maximization and probabilistic sophistication is that the former implies a strong form of separability across mutually exclusive events (such as P2 in Savage’s framework), while the latter does not. Motivated by this observation, Machina and Schmeidler relax the Sure-Thing Principle. Axiom P4∗ (Strong Comparative Probability). For all events A, B, C ∈ , for all outcomes x  x and z  z , and for all acts h, h ∈ F, 9 (xAx )Ch  (xBx )Ch ⇒ (zAz )Ch  (zBz )Ch .

(6)

P4 is a special case for C = S. In addition to this special case, P4∗ requires that the preference to bet on A∩C rather than on B ∩C is invariant of the outcomes that are obtained when C does not occur. This invariance reflects exclusively the subjective relative likelihoods of the events A ∩ C and B ∩ C, and does not imply that the decision maker’s risk preference is separable (i.e. satisfies Independence over induced lotteries). The Ellsberg Paradox motivates rewriting P4∗ for preference over risky acts. Axiom P4∗ (R). For all risky events A, B ∈ R and C ∈ R  {A, Ac , B, B c }, for all outcomes x  x and z  z , and for all acts h, h ∈ G  {C c }, (xAx )Ch  (xBx )Ch ⇒ (zAz )Ch  (zBz )Ch .

(7)

8 If  is a -algebra, then a probability measure p :  → [0, 1] is finely ranged if an only if it is convex-ranged, that is for each event B ∈ , the range {p(A) : A ∈ , A ⊆ B} equals the interval [0, p(B)]. This equivalence does not hold if  is an algebra, in which case p is finely ranged if and only if for each event B ∈ , {p(A) : A ∈ , A ⊆ B} is dense in [0, p(B)]. 9 Machina–Schmeidler assume that A and B are disjoint and C = A ∪ B. This assumption is wlog because all the composite acts in (6) are unchanged when A and B are replaced by the disjoint events A = (A\B)∩C and B = (B\A)∩C, C by the union C = A ∪ B , h by g = xCh, and h by g = x Ch .

244

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

P4(R) is a special case for C = S. In addition to this special case, P4∗ (R) requires that if the events A ∩ C, B ∩ C, and C are risky (and some extra R-measurability constraints hold), then the preference to bet on A ∩ C rather than on B ∩ C is invariant of the outcomes that are obtained when C does not occur. This invariance reflects exclusively the subjective relative likelihoods of the risky events A ∩ C and B ∩ C. It does not imply that the decision maker’s risk preference is separable over lotteries induced by risky acts, or that her preference to bet is separable across ambiguous events. Accordingly, P4∗ (R) is weaker than the combination of P2(R) and P4(R), and weaker than P4∗ (P4∗ (R) is equivalent to P4∗ if R = ). To state our second representation result, we need a few preliminaries. Let L be the set of all lotteries—probability distributions l : X → [0, 1] with finite support. Fix a probability measure p : R → [0, 1]. Every risky act f ∈ G induces a lottery, written [f ]p , such that [f ]p (x) = p(f −1 (x)) for all x ∈ X. Let Lp = {l ∈ L : l = [f ]p for some f ∈ G}

 be the set of lotteries induced by risky acts. Given Y ⊆ X, let l(Y ) = x∈Y l(x) for all l ∈ L, and let Lp (Y ) = {l ∈ Lp : l(Y ) = 1}. Fix a risk preference relation 1 on Lp . Let X be the restriction of 1 to the set X of degenerate lotteries, and let Yx = {y ∈ X : y X x} be the upper contour sets of the ranking X . Say that a lottery l weakly dominates l , written l l , if l(Yx ) l (Yx ) for all x ∈ X. Say that l strictly dominates l , written l  l , if l  l and l(Yx ) > l (Yx ) for some x ∈ X. Say that the relation 1 is (i) strictly monotonic if for all l, l ∈ Lp , l l implies l 1 l , and l  l implies l 1 l ; (ii) continuous if for all finite Y ⊆ X and l ∈ Lp (Y ), the upper and the lower contour sets {l ∈ Lp (Y ) : l 1 l} and {l ∈ Lp (Y ) : l 1 l} are closed in the finite-dimensional set Lp (Y ). Theorem 4.1. Let  be an algebra, and let R ⊆  be a mosaic. Then the following two statements are equivalent: (i)  satisfies P 1(R), P 3(R), P 4∗ (R), P 5(R), P 6(R); (ii)  on the set G is represented by f  g ⇔ [f ]p 1 [g]p

for f, g ∈ G,

(8)

where p : R → [0, 1] is a finely ranged probability measure, and the binary relation 1 on Lp is non-degenerate, complete, transitive, continuous, and strictly monotonic. In this representation, the probability measure p and the strictly monotonic binary relation 1 are unique. Similar to Theorem 3.1, Theorem 4.1 models choice only among risky acts, and derives subjective probabilities only for risky events. In representation (8), the probabilities are used to translate risky acts f ∈ G into lotteries [f ]p ∈ Lp , and the induced lotteries are ranked via a continuous and strictly monotonic weak order rather than via expected utility. Therefore, Theorem 4.1 provides foundations for probabilistic sophistication on risky acts. The axioms in Theorem 4.1 are parallel to Machina–Schmeidler’s counterparts but apply only to risky acts and events. Most importantly, P4∗ (R) does not restrict the preference to bet on ambiguous events. Accordingly, only risky events A ∈ R are assigned subjective probabilities, only risky acts f ∈ G are translated into lotteries, and only lotteries [f ]p ∈ Lp that are induced by risky acts are ranked by the risk preference 1 .

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

245

Note that representation (8) retains the risk preference as a binary relation 1 on the subdomain Lp ⊆ L. Thus, (8) constitutes a notion of probabilistic sophistication different from, albeit closely related to the one used by Machina–Schmeidler, who specify a suitable utility representation V for the risk preference. It is an open question whether conditions of Theorem 4.1 are sufficient for 1 to have any utility representation. In particular, one cannot use Machina–Schmeidler’s construction of V (see the example in Section 6.2). Nevertheless, it seems appropriate to describe the use of probabilities by the strictly monotonic risk preference 1 alone. If  is a -algebra and R = , then p is convex-ranged and Lp = L. In this case, Theorem 4.1 becomes a version of the Machina–Schmeidler Theorem: both results use equivalent axioms and deliver the same subjective probability p and the same risk preference 1 on the entire L. It is straightforward to show that such 1 has a continuous utility representation V (this is the last step of Machina–Schmeidler’s proof). More generally, Theorem 4.1 shows that even if the algebra  is not countably additive, axioms P1, P3, P4∗ , P5, P6 taken “as is” characterize a version of probabilistic sophistication over -measurable acts—a version that does not specify a utility representation for the risk preference. 5. Subjective definitions of risk Similar to the results of Savage and Machina–Schmeidler, Theorems 3.1 and 4.1 are formulated for a given class of events. Accordingly, subjective probabilities in representations (5) and (8) are derived on a mosaic R which is exogenous to the model. The exogenous specification of risky events may be problematic because decision makers may disagree about the identity of events to which they assign probabilities. In order to model the subjective nature of risk, Zhang [23] defines risky events in terms of the preference . His definition is motivated by the Sure-Thing Principle rather than by any particular attitude towards ambiguity, such as ambiguity aversion. 10 Definition (Zhang). Call an event E ∈  subjectively risky if for all outcomes x, y ∈ X and for all acts f, g ∈ F, xEf  xEg ⇒ yEf  yEg and f Ex  gEx ⇒ f Ey  gEy.

(9)

Otherwise, call E subjectively ambiguous. This definition takes complementary events E and E c to be subjectively risky if the preference  is separable across E and E c and can be conditioned on either of these events independently of the outcome obtained on the other. The use of acts that are constant on E (or on E c ) reflects the intuition that subsets of a subjectively risky event may be subjectively ambiguous. Zhang’s definition may be problematic because it attributes all violations of Savage’s model to ambiguity. For example, a probabilistically sophisticated decision maker may assign probabilities to all events in , but violate conditions (9) for some E ∈  because of a non-linear risk preference. To address this concern, Epstein–Zhang [5] formulate a less stringent definition of subjectively 10 In contrast, Ghirardato–Marinacci [9] use a definition that is at least in part motivated by such attitude. Ghirardato [8] reviews other possible approaches.

246

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

risky events that is motivated by Strong Comparative Probability rather than by the Sure-Thing Principle. Call acts f, g ∈ F joint bets if there exist events A, B, C ∈ , outcomes z  z , and an act h ∈ F such that f = (zAz )Ch and g = (zBz )Ch. Definition (Epstein–Zhang). An event E ∈  is called subjectively risky if for all outcomes x, y ∈ X and for all joint bets f, g ∈ F, xEf  xEg ⇒ yEf  yEg and f Ex  gEx ⇒ f Ey  gEy.

(10)

Otherwise, E is called subjectively ambiguous. This definition takes complementary events E and E c to be subjectively risky if the preference to bet on subsets of E c (or on subsets of E) is invariant of the outcome obtained on E (or on E c , respectively). 11 The focus on comparisons between joint bets that are constant on E (or on E c ), reflects the intuition that (i) the risk preference may be non-linear, and (ii) subsets of a subjectively risky event may be subjectively ambiguous. Denote by RZ and REZ the classes of subjectively risky events defined by Zhang and by Epstein–Zhang, respectively. Both RZ and REZ are uniquely derived from the preference . Let GZ and GEZ be the associated sets of risky acts. By definition, RZ ⊆ REZ ⊆  and GZ ⊆ GEZ ⊆ F, but in the special case when X has only two elements x  x , any two acts f, g ∈ F are joint bets and hence, RZ = REZ and GZ = GEZ . Next we apply Theorems 3.1 and 4.1 to the subjective domains RZ and REZ , respectively. Theorem 5.1. Let  be an algebra, and let  be a reflexive preference on F. Then the following statements are true. I. The classes RZ and REZ are mosaics. II. The preference  satisfies P 1(RZ ), P 3(RZ ), P 4(RZ ), P 5(RZ ), P 6(RZ ) if and only if  is represented on the set GZ by expected utility  u(x) · p (f −1 (x)) for f ∈ GZ , (11) U (f ) = x∈X

where u : X → R is a non-constant utility index, and p : RZ → [0, 1] is a finely ranged probability measure. In this representation, u is unique up to a positive linear transformation, and the probability measure p is unique. III. The preference  satisfies P 1(REZ ), P 3(REZ ), P 4(REZ ), P 5(REZ ), and P 6(REZ ) if and only if  is represented on the set GEZ by f  g ⇔ [f ]p 1 [g]p

for f, g ∈ GEZ ,

(12)

11 If f and g are joint bets, then both rankings xEf  xEg and yEf  yEg reveal a preference to bet on A∩C∩E c ⊆ E c rather than on B ∩ C ∩ E c ⊆ E c . Similarly, the rankings f Ex  gEx and f Ey  gEy reveal a preference to bet on A ∩ C ∩ E ⊆ E rather than on B ∩ C ∩ E ⊆ E.

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

247

where p : REZ → [0, 1] is a finely ranged probability measure, and the binary relation 1 on Lp is non-degenerate, complete, transitive, continuous, and strictly monotonic. In this representation, the probability measure p and the strictly monotonic binary relation 1 are unique. Note that all components of representations (11) and (12), including the domains RZ , REZ and the values p(·) of subjective probabilities, are derived from preference. Therefore, Theorem 5.1 provides fully subjective theories of expected utility and probabilistic sophistication. If  satisfies P2, then RZ =  and representation (11) holds on the set F of all acts. More generally, when P2 does not hold, preference is still separable across events in RZ , and the axioms P1(RZ ), P3(RZ ), P4(RZ ), P5(RZ ), P6(RZ ) provide foundations for the use of subjective probability on RZ as a component of an expected utility representation on GZ . Alternatively, if  satisfies P4∗ , then REZ = , and representation (12) holds on the set F of all acts. If P4∗ does not hold, then betting preference is separable across events in REZ , and the axioms P1(REZ ), P3(REZ ), P4(REZ ), P5(REZ ), P6(REZ ) provide foundations for probabilistic sophistication on GEZ . In principle, one can apply Theorems 3.1 and 4.1 on domains other than RZ and REZ . The selection of RZ or REZ has two major benefits. First, this selection is done via explicit and arguably intuitive behavioral definitions. Second, fewer axioms need to be imposed on the preference over GZ and GEZ because the definitions of these domains imply P2(RZ ) and P4∗ (REZ ), respectively. 12 To relate representation (12) to Epstein–Zhang’s main result, consider a special case of mosaics. Call a mosaic R a (finitely additive) -system if it is closed under arbitrary disjoint unions rather than only under unions of elements of a grand partition. The class REZ (or RZ ) need not be a -system. For example, take S = {R, G, B}, X = {x, x }, and the following ranking: x  x{R, G}x  x{R, B}x  x{G, B}x  x{G}x  x{R}x  x{B}x  x . Then the events {R} and {G} are subjectively risky, but the event {B} and hence, its complement {R} ∪ {G} are subjectively ambiguous. It follows that the class REZ is not a -system. It is arguably intuitive that the decision maker, who assigns probabilities p(A) and p(B) to disjoint events A and B, should also assign the sum p(A) + p(B) to the union A ∪ B. Motivated by this intuition, Epstein–Zhang impose additional conditions on preference that guarantee that REZ is a -system. 13 It is, however, an open question whether the -system structure for risky events has a simple behavioral foundation in Savage’s general framework. Theorem 5.1 assumes a weaker structure for the class of risky events, but also differs substantially from Epstein–Zhang in the axioms adopted. Roughly, it dispenses with two of their arguably least appealing conditions—Monotone Continuity and Strong Partition Neutrality—that are not implied by the definition of REZ . Both axioms are even less appealing if  is finitely additive, because the former uses countable intersections, and the latter relies on equipartitions that need not exist (see the example in Section 6.1).

12 However, the richness conditions P6(R ) and P6(R ) do not follow from the definitions of R and R Z EZ Z EZ and are violated in some settings. This concern can be addressed by strengthening the behavioral definitions of risky events, which is a current research objective. 13 In fact, their conditions are too weak and their theorem is not valid as stated (we omit a formal counterexample). To correct Epstein–Zhang’s theorem, strengthen their Axiom 4 so that, in their notation, An and Bm can vary over all of  and not only over A.

248

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

6. Sketch of proofs In this section we discuss the main aspects that differentiate the proofs of Theorems 3.1 and 4.1 from those of Savage and Machina–Schmeidler. First, we focus on the construction of subjective probabilities for risky events. 6.1. Construction of subjective probability Analogously to de Finetti [6] and Savage [19], we seek a probability measure p that represents the comparative likelihood relation 0 . For arbitrary risky events A and B, the relation A 0 B is defined in terms of preference by A 0 B ⇔ xAx  xBx

for all outcomes x  x ,

and is interpreted as A being subjectively at least as likely as B. We propose the following construction for the finely ranged probability measure p : R → [0, 1] that represents 0 on R. Fix an arbitrary risky event A ∈ R. Say that a grand partition S = ∪m i=1 Si is finer than A if A 0 Si for all i = 1 . . . m. Among all grand partitions finer than A, take one that has a minimal number of elements. Let (A) be this minimal number; let (A) = +∞ if there exists no grand partition finer than A. The subjective probability p(A) can be constructed via the following formula:  n   1 p(A) = sup , (13) (Ai ) i=1

where the least upper bound is taken over all partitions of A into risky events A1 , . . . , An . To motivate formula (13), suppose that a finely ranged probability measure p does represent 1 1 0 . Then (A) < p(A)  (A)−1 for all A ∈ A. 14 Fix a risky event A and vary both the size  1 are lower bounds for and the elements of a risky partition A = ∪ni=1 Ai . The sums ni=1 (A i) n p(A ) = p(A). Moreover, these sums become arbitrarily close to p(A) for sufficiently fine i i=1  1 risky partitions because the ratios (A p(A ) approach 1 as the probabilities p(Ai ) approach i i) zero. This argument suggests constructing p(A) via formula (13). Even though the above motivation for formula (13) is relatively transparent, it takes a lot of work in the Appendix (Theorem A.1) to show that the function p : R → [0, 1] given by (13) is indeed a finely ranged probability measure that represents 0 . In contrast with the construction of subjective probability by Savage (or by Fishburn [7]) on a -algebra and by Zhang [22] on a countably additive -system, formula (13) does not rely on 14 Take N such that 1 < p(A)  1 . Show that (A) = N . First, (A)  N because S can be partitioned into (A) N N−1 1 . To prove (A)  N , construct an N -element grand partition finer than events that all have probabilities smaller than N−1 A. As p is finely ranged, there is a grand partition {S1 , . . . , Sm } such that p(Si ) < p(A) − N1 for all i = 1 . . . m. Divide

this partition into groups

S = S1 ∪ · · · ∪ Sk1 ∪ · · · ∪ SkN−2 +1 ∪ · · · ∪ SkN−1 ∪ SkN−1 +1 ∪ · · · ∪ Sm  

 

 

B1

BN−1

BN

1 so that N1 < p(Bi ) < p(A) for all i = 1 . . . N − 1. Then p(BN ) < 1 − N−1 N = N < p(A), and hence, the grand partition {B1 , . . . , BN } is finer than A.

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

249

partitioning the state space S into an arbitrarily large number N of equiprobable events. As the following example illustrates, such equipartitions need not exist if the domain of risky events is a mosaic or even a finitely additive algebra. Adopt the setting, where S = ∞ k=1 {Hk , Tk }. For an arbitrary finite n, identify coin-tossing n every A ⊆ k=1 {Hk , Tk } with the obvious subset of S, and call A a cylinder of rank n. Let  be the set of all cylinders, and R = . Construct a finely ranged probability measure p ∗ :  → [0, 1] such that for all A, B ∈ , A = B ⇒ p∗ (A) = p∗ (B).

(14)

Let p∗ (s1 , . . . , sn ) = 1 (s1 ) · 2 (s2 ) · · · · · n (sn ), where each i is a probability measure on {Hi , Ti }. Construct 1 , 2 , . . . so that (14) holds. Use induction with respect to the rank n of the cylinders A and B. For n = 0, p ∗ (S) = p ∗ (∅). For n > 0, assume that (14) holds for all cylinders of rank n − 1. Fix arbitrary cylinders A and B of rank n such that A = B. Write A = (Hn ∩ AH ) ∪ (Tn ∩ AT ) and B = (Hn ∩ BH ) ∪ (Tn ∩ BT ), where AH , AT , BH , BT are cylinders of rank n − 1. Then either p ∗ (AH ) = p∗ (BH ) or p∗ (AT ) = p∗ (BT ), and hence, there exists at most one n such that n (Hn ) · p ∗ (AH ) + n (Tn ) · p ∗ (AT ) = n (Hn ) · p ∗ (BH ) + n (Tn ) · p ∗ (BT ). As the number of cylinders of rank n is finite, it is possible to take n such n (Hn ) ∈ [0.4, 0.6], and (14) holds for all cylinders of rank n. By induction, there exists a sequence 1 , 2 , . . . such that p∗ :  → [0, 1] satisfies (14) for all A, B ∈  and is finely ranged because p ∗ (s1 , . . . , sn ) 0.6n for all (s1 , . . . , sn ). It is obvious that (14) rules out equipartitions. Therefore, Savage’s construction of subjective probability fails for mosaics and even for finitely additive algebras. There is another lesson to be drawn from this example. Note that if (14) holds, then f ↔ [f ]p∗ is a bijection between the set G of risky acts and the set Lp of induced lotteries because f = g implies [f ]p∗ = [g]p∗ . Therefore, every complete and transitive preference  on G is represented by f  g ⇔ [f ]p∗ 1 [g]p∗

for f, g ∈ G

for some complete and transitive relation 1 on Lp . This observation shows that if  is not a -algebra, then any meaningful notion of probabilistic sophistication must impose additional conditions on the risk preference besides completeness and transitivity. In Machina–Schmeidler’s paradigm, these conditions are monotonicity and continuity. (See Grant [11] for a model of probabilistic sophistication where monotonicity of risk preference is relaxed.) 6.2. Utility representations for risk preference The natural step after the construction of a subjective probability measure p is to define the risk preference 1 by taking l 1 l whenever l = [f ]p and l = [g]p for some risky acts f  g. However, the analysis of the relation 1 is complicated by the fact that 1 is complete only on the set Lp ⊆ L that need not have the convenient algebraic structure of L. Accordingly, our next step is to extend 1 from Lp , which is dense in L, to all of L by continuity. We show that the list of axioms in Theorem 3.1 guarantees that such an extension exists, satisfies von Neumann–Morgenstern’s conditions, and hence, is represented by expected utility. However, the weaker list of axioms in Theorem 4.1 does not guarantee the existence of a continuous extension of the risk preference 1 to the set L of all lotteries. For example, let

250

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

x

x

2 1 x + z 3 3 l(y)

l(z) l(x)

z

y

z

1 1 y + z 2 2

y

Fig. 1. Non-extendable risk preference.

X = {x, y, z} and suppose that no risky event is assigned the probability equal to 13 , which is the case if risky events are cylinders obtained by tossing a fair coin. Suppose that 1 has indifference curves illustrated by Fig. 1. It is easy to show that such a risk preference cannot be extended by continuity to all of L. Therefore, one cannot use Machina–Schmeidler’s construction of a utility representation. Thus, Theorem 4.1 retains the risk preference as a binary relation 1 on Lp . Alternatively, one can impose a stronger continuity condition on the underlying preference  that would be sufficient for the risk preference 1 to have a uniformly continuous utility representation (we omit a formal statement of this result). Appendix A. Proofs of Theorems 3.1 and 4.1 A.1. Preliminaries Fix a set of states S, a set of outcomes X, a mosaic of risky events R ⊆ 2S , and a binary relation  over the class G of risky (R-measurable) acts. No other primitives, such as the entire algebra  or the ranking of acts other than risky, are necessary to formulate and prove Theorems 3.1 and 4.1. Hereafter in Appendix A, the adjective risky is omitted because it refers to all relevant objects. Thus, events—written A, B, C, D, E, F , H , S—are elements of the mosaic R, partitions belong to R, and acts—written f , g, h—are elements of G. Write the complement of an event A as ¬A rather than Ac (superscripts are reserved for numbering). Let A ⊕ B denote the union A ∪ B together with the fact that the sets A and B are disjoint. Let AB denote the set difference A \ B together with the fact that B is a subset of A. Write a partition A = A1 ⊕ A2 ⊕ · · · ⊕ An briefly as {A = An1 }. For all indices i, j 0, j j j j let Ai = k=i Ak (if i > j , let Ai = ∅). Then Ai ∈ R because R is a mosaic and S = A1 ⊕ A2 ⊕ · · · ⊕ An ⊕ ¬A is a grand partition. For any collection E ⊆ R of events, let (E) be the smallest algebra that contains E. If E is finite, then (E) is also finite. For A, B ∈ R, write (A) instead of ({A}), (A, B) instead of ({A, B}), etc. Note that (A) = {∅, A, ¬A, S} ⊆ R because R is a mosaic, but (A, B) need

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

251

n not belong to R. For all grand partitions {S = Am 1 } and {S = B1 },

(A1 , A2 , . . . , Am , B1 , B2 , . . . , Bn ) ⊆ R ⇔ Ai ∩ Bj ∈ R for all i = 1 . . . m and j = 1 . . . n ⇔ {A1 , A2 , . . . , Am } ⊆ R  {B1 , B2 , . . . , Bn }.

(A.1)

A.2. Construction of subjective probability Suppose that  satisfies P1(R), P3(R), P4∗ (R), P5(R), P6(R). Fix outcomes x  x . For all A, B ∈ R, let A 0 B if and only if xAx  xBx . Call the binary relation 0 comparative likelihood. By P4∗ (R), this relation is invariant if the pair of outcomes x  x is replaced by another pair z  z . We seek a probability measure p : R → [0, 1] that represents 0 . Such p is called quantitative probability. Call an event A ∈ R non-null if A 0 ∅; call A null otherwise. Say that a partition {A = An1 } is non-null if Ai 0 ∅ for all i = 1 . . . n. Say that a partition {A = An1 } is finer than an event B if Ai ≺0 B for all i = 1 . . . n; in this case, write {A = An1 } ≺0 B. Axioms P1(R), P3(R), P4∗ (R), P5(R), P6(R) imply that 0 satisfies the following properties that generalize de Finetti’s [6] definition of qualitative probability and Savage’s conditions of fineness (QF) and tightness (QT). Q1. 0 is complete and transitive. Q2. S 0 ∅. Q3. For all events A ∈ R, S 0 A 0 ∅. Conditions Q1–Q3 follow immediately from the ranking x  x by P1(R) and P3(R). Q4. For all partitions {A = A21 } and {B = B12 } such that (A1 , A2 , B1 , B2 ) ⊆ R, A1 0 ( 0 )B1

and A2 0 B2 ⇒ A 0 ( 0 ) B.

By induction, Q4 implies that for all partitions {A = An1 } and {B = B1n } such that (A1 , . . . , An , B1 , . . . , Bn ) ⊆ R, A1 0 ( 0 )B1

and Ai 0 Bi for all i > 1 ⇒ A 0 ( 0 ) B.

Thus, Q4 asserts that 0 is additive over any subalgebra of the mosaic R. Proof. Fix partitions {A = A21 } and {B = B12 } such that (A1 , A2 , B1 , B2 ) ⊆ R. Let ∗ = (A1 , A2 , B1 , B2 ). Let F∗ be the class of ∗ -measurable functions that map S into {x, x }. Then F∗ ⊆ G. When restricted to F∗ ,  satisfies Savage’s axioms P1–P5 (but violates P6 because ∗ is finite). These axioms imply that the comparative likelihood 0 is additive on ∗ and hence satisfies Q4 (see property C3 in [7]).  QF (Fineness). For any non-null event A and finite collection E ⊆ R, there exists a grand partition {S = S1m } ⊆ R  E finer than A. Proof. Fix A 0 ∅ and finite E ⊆ R. Let f = xAx , g = x , and E = (A) ∪ E. Then E is finite, f and g are E -measurable, and A 0 ∅ implies f  g. By P6(R), there exists a grand partition {S = S1m } ⊆ R  E ⊆ R  E such that for all i, f  xSi g, that is, A 0 Si . 

252

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

QT (Tightness). For any events A 0 B, there exists a non-null partition A = C ⊕ D such that C = AD 0 B and a non-null partition ¬B = C ⊕ D such that A 0 B ⊕ C . Proof. Fix A 0 B. Let f = xAx , g = xBx , and E = (A) ∪ (B). Then E is finite, f and g are E-measurable, and A 0 B implies f  g. By P6(R), there exists a grand partition {S = S1m } ⊆ R  E such that x Si f  g for all i. Let Ci = A \ Si and Di = A ∩ Si . By (A.1), (S1 , . . . , Sm , A, ¬A) ⊆ R and Ci , Di ∈ R. Take k such that Dk 0 ∅; if there is no such k, then by Q4, ∅  D1m = A, which contradicts A 0 B 0 ∅. As x Sk f  g, then Ck 0 B, and A = Ck ⊕ Dk is a non-null partition. By P6(R), there exists (another) grand partition {S = S1m } ⊆ R  E such that f  xSi g for all i. Let Ci = (¬B) ∩ Si and Di = (¬B) \ Si . By (A.1), (S1 , . . . , Sm , B, ¬B) ⊆ R and Ci , Di ∈ R. Take k such that Ck 0 ∅; if there is no such k, then by Q4, B 0 B ⊕ C1m = S, which contradicts S 0 A 0 B. As f  xSk g, then A 0 B ⊕Ck . The partition ¬B = Ck ⊕Dk is non-null; otherwise Dk is null, and by Q4 there is a contradiction B ⊕ Ck 0 (B ⊕ Ck ) ⊕ Dk = S 0 A.  While a quantitative probability representation need not exist for an arbitrary qualitative probability [15], Savage shows that such a representation exists for any fine and tight qualitative probability on a -algebra. Niiniluoto [18] and Wakker [21] extend this result to algebras. We extend it further to mosaics. Thus, we derive subjective probability from the comparative likelihood relation 0 rather than directly from the preference . This construction applies in the proofs of both Theorems 3.1 and 4.1; it applies also in a de Finetti-type setting where 0 is taken as a primitive and Q1–Q4, QF, QT are imposed as axioms. Deduce the following properties Q5–Q8 from Q1 to Q4, QF, QT. Q5. For all partitions {A = A21 } and {B = B12 }, if A1 0 ( 0 )B1 and A2 0 B2 , then A 0 ( 0 ) B. In contrast with Q4, Q5 does not require that (A1 , A2 , B1 , B2 ) ⊆ R. From Q5, it follows by induction that for all partitions {A = An1 } and {B = B1n }, A1 0 ( 0 )B1

and Ai 0 Bi

for all i > 1 ⇒ A 0 ( 0 ) B.

Proof. Fix partitions {A = A21 } and {B = B12 } and consider three cases. Case I: A1 0 B1 and A2 0 B2 . By QT, there are non-null partitions A1 = C1 ⊕ D1 and A2 = C2 ⊕ D2 such that C1 0 B1 and C2 0 B2 . Let E = (C1 , D1 , C2 , D2 , ¬A) ∪ (B1 , B2 , ¬B). Take a grand partition {S = S1m } ⊆ R  E that is finer than both D1 and D2 . Let k1 be the minimal index such that S1k1 0 C1 , and k2 be the maximal index such that Skm2 0 C2 . By (A.1), (S1 , . . . , Sm , C1 , D1 , C2 , D2 , ¬A) ⊆ R. Then by Q4

A1 = C1 ⊕ D1 0 S1k1 −1 ⊕ Sk1 = S1k1 0 C1 0 B1 and A2 = C2 ⊕ D2 0 Skm2 +1 ⊕ Sk2 = Skm2 0 C2 0 B2 . by Q4

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

253

Note that S1k1 and Skm2 are disjoint; otherwise k2 k1 , A2 0 Skm2 0 Skm1 +1 and by Q4, there is a contradiction: A = A1 ⊕ A2 0 S1k1 ⊕ Skm1 +1 = S. Thus, by Q4

by Q4

A = A1 ⊕ A2 0 S1k1 ⊕ Skm2 0 B1 ⊕ B2 = B because (S1k1 , Skm2 , A1 , A2 ) ⊆ R and (S1k1 , Skm2 , B1 , B2 ) ⊆ R. Case II: A1 0 B1 and A2 0 B2 . By QT, there is a non-null partition A1 = C1 ⊕ D1 such that C1 0 B1 . By Case I, A = C1 ⊕ (A2 ⊕ D1 ) 0 B1 ⊕ B2 = B. Case III: A1 0 B1 and A2 0 B2 . Suppose that B 0 A. Then there is a non-null partition ¬A = C ⊕D such that B 0 A⊕C .Yet by Case II, A⊕C = (A1 ⊕C )⊕A2 0 B1 ⊕B2 = B. This contradiction implies A 0 B.  Q6. For any non-null A and for any B, there exists a partition {B = B1m } finer than A. Proof. By QF, there exists a grand partition {S = S1m } ⊆ R  {B, ¬B} finer than A. For all i, the sets Bi = B ∩ Si and ¬B ∩ Si belong to R and partition Si . Therefore, Bi 0 Si ≺0 A, and the partition {B = B1m } is finer than A.  Q7. For any events A 0 B and for any non-null partition {A = An1 }, there exists an n-element partition {B = B1n } such that Ai 0 Bi for all i = 1 . . . n. Proof. Take n = 2. Fix events A 0 B and a non-null partition {A = A21 }. By QT, there is a nonnull partition ¬B = C ⊕ D such that A 0 B ⊕ C . By Q6, there is a partition {B = E1m } finer m than C . Let k be the maximal index in [0, m] such that A1 0 E1k . Suppose that Ek+1  0 A2 . Then k < m, by Q5

E1k ⊕ C 0 E1k ⊕ Ek+1 = E1k+1

def. of k 0 A1

and m B ⊕ C = (E1k ⊕ C ) ⊕ Ek+1 0 A1 ⊕ A2 = A. by Q5

m , and B = E k ⊕ E m is the required two-element This contradiction implies that A2 0 Ek+1 1 k+1 partition of B. Complete the proof by induction with respect to n. 

Q8. For any events A 0 B and for any partition {B = B1n }, there exists an n-element partition {A = An1 } such that Ai 0 Bi for all i = 1 . . . n. Proof. Take n = 2. Fix events A 0 B and a partition {B = B12 }. By QT, there is a non-null partition A = C⊕D such that C 0 B. By Q6, there is a partition {C = E1m } finer than D. Let k be the maximal index in [0, m] such that B1 0 E1k . Then k < m, and E1k ⊕D 0 E1k ⊕Ek+1 0 B1 . m . Then by Q5, B ⊕ B  E k ⊕ E m = C. This contradiction implies Suppose that B2 0 Ek+1 1 2 0 1 k+1 m m is the required partition of A. Complete the proof that Ek+1 0 B2 , and A = (E1k ⊕ D) ⊕ Ek+1 by induction with respect to n. 

254

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

For every A ∈ R, let (A) =

min

{S=S1m } ≺0 A

m

and r(A) =

1 . (A)

By definition, (A) is the minimal number of events that may constitute a grand partition finer than A. If A is non-null, then by QF, (A) is finite; if A is null, then (A) = +∞. Thus, the value r(A) belongs to the set 21 , 13 , . . . , 0 ⊆ [0, 1], and r(A) = 0 if and only if A is null. The function r : R → R satisfies the following properties R1–R7. R1. The function r almost agrees with 0 , that is, for all events A and B, A 0 B ⇒ r(A) r(B) and r(A) > r(B) ⇒ A 0 B. Proof. Any partition finer than B 0 A is also finer than A. Therefore, A 0 B implies (A)(B) and r(A)r(B). Conversely, r(A) > r(B) implies A 0 B.  R2. For all partitions {A = An1 }, max (Ai ) > n, or equivalently, min r(Ai ) < n1 . i=1...n

i=1...n

Proof. Fix a partition {A = An1 }. Pick Ak so that Ai 0 Ak for all i = 1 . . . n. Suppose that (A ) n(Ak ). Take a grand partition {S = S1 k } finer than Ak . Then Ai 0 Ak 0 Si for all (Ak ) (Ak ) 0 S1 = S. This contradiction implies (Ak ) > n.  i = 1 . . . (Ak ). By Q5, A1 R3. For any n, there exists A 0 ∅ such that (A) > n, that is, r(A) < n1 . In other words, the function r takes arbitrarily small positive values. (A)

Proof. For any A 0 ∅, take a grand partition {S = S1 } finer than A. This partition is non-null; otherwise S could be partitioned into (A) − 1 events finer than A. By R2, max (Si ) > (A). Thus, the function  is unbounded on non-null events.

i=1...(A)



R4. For any A 0 ∅, there exists a non-null partition A = C ⊕ D such that r(C) = r(A). (A)

Proof. Fix an event A 0 ∅ and a grand partition {S = S1 } finer than A. Pick Sk so that Sk 0 Si for all i. By QT, there exists a non-null partition A = C ⊕ D 0 Sk such that C 0 Sk . (A) Then {S = S1 } is finer than C. Therefore, (C) (A). On the other hand, A 0 C implies (C)(A). Thus, r(C) = r(A).  Given a non-null event B, say that {A = B1n } is a B-partition if (i) r(Bi ) = r(B) for all i = 1 . . . n − 1 (but not necessarily for i = n), and (ii) this partition is finer than B. R5. For any A ∈ R and for any non-null B ∈ R, there exists a B-partition {A = B1n }. Proof. Say that a partition {A = An1 } is B-acceptable, if r(Ai ) = r(B) and Ai ≺0 B for all i = 1 . . . n − 1 (but not necessarily for i = n). For example, the trivial partition {A = A} is

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

255

B-acceptable. If {A = An1 } is B-acceptable, then by R2, n−1 <

max (Ai ) = (B). Therefore, i=1...n−1 n there exists a B-acceptable partition, written {A = B1 }, that has the maximal number of elements among all B-acceptable partitions. Suppose that Bn 0 B. By R4 and Q6, there exists a non-null partition B = C ⊕ D such that r(C) = r(B) and a partition {Bn = E1m } finer than D. Let k be the maximal index in [0, m] such that C 0 E1k . Then B = C ⊕ D 0 E1k ⊕ Ek+1 = E1k+1 0 C. It follows that r(B)r(E1k+1 ) r(C), that is, r(B) = r(E1k+1 ). Therefore, the partition m of A into (n + 1) events B1 , . . . , Bn−1 , E1k+1 , Ek+2 is B-acceptable, but has more elements n than the partition {A = B1 }. This contradiction implies that B 0 Bn . Thus, {A = B1n } is a B-partition.



R6. For any event A, there exists ε > 0 such that for all non-null events B and for all B-partitions {A = B1n }, r(B) < ε ⇒ r(A)(n − 1) · r(B). Proof. Wlog, fix a non-null event A. By R4, there is a non-null partition A = C ⊕ D such that r(C) = r(A). Let ε = r(D). Fix a non-null event B such that r(B) < ε and a B-partition {A = B1n }. Note that r(A)(n − 1) · r(B) is equivalent to (n − 1) · (A) (B). To show the latter inequality, partition S into (n − 1) · (A) events finer than B. As Bn ≺0 B ≺0 D and A = C ⊕ D = B1n−1 ⊕ Bn , then A 0 B1n−1 0 C. By Q1, r(A) r(B1n−1 ) r(C), that is, (A) (B1n−1 ) = (A). Take a grand partition {S = S1 } finer than B1n−1 . By Q7, each event Sj ≺0 B1n−1 can be partitioned into n − 1 events, Sj,1 , Sj,2 , . . . , Sj,n−1 such that Sj,i ≺0 Bi ≺0 B for all i = 1 . . . n − 1. The partition of S into (n − 1) · (A) elements Sj,i is finer than B.  R7. For all grand partitions {S = S1m },

m

i=1 r(Si )

< 1.

Proof. Fix a grand partition {S = S1m }. By R3, R5 and R6, there exists a non-null event B and B-partitions {Si = B(i)n1 i } such that r(Si ) < (ni − 1) · r(B) for all i = 1 . . . m. By R2, m 

(ni − 1) <

i=1

Thus,

m

i=1 r(Si )

<

max

i=1...m j =1...(ni −1)

m

i=1 (ni

(B(i)j ) =

1 . r(B)

− 1) · r(B) <

1 r(B)

· r(B) = 1.



The following theorem delivers a quantitative probability representation for 0 . Theorem A.1. A binary relation 0 on a mosaic R satisfies Q1–Q4, QF , and QT if and only if 0 is represented by a finely ranged probability measure p : R → [0, 1]. The probability measure p that represents 0 is unique and for all A ∈ R, p(A) = sup

n 

{A=An1 } i=1

r(Ai ).

(A.2)

Proof. Suppose first that a finely ranged probability measure p : R → [0, 1] represents 0 . The proof of Q1–Q4, QF is straightforward. To show QT, fix arbitrary events A 0 B, and let ε = p(A) − p(B) > 0. Take a grand partition {S = S1m } ⊆ R  {A, ¬A, B, ¬B} such that

256

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

p(Si ) < ε for all i = 1 . . . m. Then {A = E1m }, where Ei = A ∩ Si , and p(Ek ) > 0 for some k. Let D = Ek . Then p(AD) > p(A) − ε = p(B), and AD 0 B. The proof of the second part of QT is analogous. Suppose instead that 0 satisfies Q1–Q4, QF, QT. Define p : R → [0, 1] by (A.2). Show that p is a probability measure. Fix a grand partition {S = S1m }, a non-null event B, and B-partitions  m 1 {Si = B(i)n1 i }. Then m i=1 ni (B) = r(B) because the partition of S into i=1 ni events B(i)j is finer than B. It follows that  m  m m    p(Si ) (ni − 1) · r(B) = ni · r(B) − m · r(B) 1 − m · r(B). i=1

i=1

m

i=1

As r(B) can small, i=1 p(Si ) 1. be arbitrarily mi mi By R7, m r(A(i) j ) < 1 for all partitions {Si = A(i)1 }. Hence, i=1 j =1 ⎛ ⎞ mi m m    ⎝ sup p(Si ) = r(A(i)j )⎠ 1, i=1

i=1

m

{Si =A(i)1 i } j =1

 that is, m i=1 p(Si ) = 1. As p is non-negative, p is a probability measure. Show that p represents 0 . Fix events A 0 B. Then by QT, there exists a non-null partition A = C ⊕ D such that C 0 B. By Q8, for any partition {B = B1n }, there exists a partition {C = C1n } such that Ci 0 Bi for all i = 1 . . . n. Thus, p(A)r(D) + sup

n 

{C=C1n } i=1

r(Ci ) r(D) + sup

n 

{B=B1n } i=1

r(Bi ) = r(D) + p(B) > p(B).

 Fix (other) events A, B such that p(A) > p(B). Take a partition {A = An1 } such that ni=1 r(Ai ) > p(B) and r(A1 ) > 0. By R4, there exists a non-null partition A1 = C⊕D such that r(A1 ) = r(C). Then p(AD) ni=1 r(Ai ) > p(B), and AD 0 B. Thus, A 0 B, and p represents 0 . Show that p is finely ranged. Fix a finite collection E ⊆ R and ε > 0. Take a non-null (B) event B such that r(B) < ε and a (non-null) grand partition {S = B1 } finer than B. Then (B) 1 i=1 p(Bi ) = 1. Therefore, there exists Bi 0 ∅ such that p(Bi )  (B) < ε. By QF, there m exists a grand partition {S = S1 } ⊆ R  E finer than Bi . It follows that p(Si ) < p(Bi ) < ε for all i = 1 . . . m. Show that p is the unique probability measure that represents 0 . Suppose that another probability measure p∗ : R → [0, 1] represents 0 . Fix an event A ∈ R. If A is null, then p ∗ (A) = (A) p∗ (∅) = 0 = p(A). If A is non-null, take a grand partition {S = S1 } finer than A. As p∗ is   (A) additive and represents 0 , then 1 = p ∗ (S) = i=1 p ∗ (Si ) < (A) · p∗ (A), that is p ∗ (A) > n n 1 n ∗ ∗ i=1 p (Ai ) > i=1 r(Ai ) and (A) = r(A). Therefore, for all partitions {A = A1 }, p (A) = p∗ (A)p(A). Similarly, p ∗ (¬A) p(¬A). Thus, p(A) = p ∗ (A).  Next, we formulate one useful property of finely ranged probability measures. Lemma A.2. If p : R → [0, 1] is a finely ranged probability measure, then for all n > 0, for all finite collections E ⊆ R, and for all events A ∈ R  E, the set

 p(A, n, E) = (p(A1 ), p(A2 ), . . . , p(An )) : {A = An1 } ⊆ R  E

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

257

of vectors (p(A1 ), p(A2 ), . . . , p(An )) ∈ Rn+ is dense in the simplex   n  (A, n) = (v1 , v2 , . . . , vn ) ∈ Rn+ : vi = p(A) . i=1

Proof. Fix a finite collection E ⊆ R, an event A ∈ R  E, and numbers v ∈ [0, p(A)] and ε > 0. Show that there exists a partition {A = A21 } ⊆ R  E such that v − ε < p(A1 ) v. Take a finite collection E that contains (A) and the intersections A ∩ E for all E ∈ E. As p is finely ranged, there exists a grand partition {S = S1m } ⊆ R  E such that p(Si ) < ε for all i = 1 . . . m. For all i = 1 . . . m, the sets A ∩ Si and ¬A ∩ Si belong to R, and p(A ∩ Si ) p(Si ) < ε. Let k ∈ [0, m] be the minimal index such that p(A ∩ S1k ) > v − ε. Then p(A ∩ S1k ) v because m . For otherwise p(A ∩ S1k−1 ) > v − p(A ∩ Sk ) > v − ε. Let A1 = A ∩ S1k and A2 = A ∩ Sk+1 each E ∈ E, the sets (A ∩ Si ) ∩ E = Si ∩ (A ∩ E) belong to R for all i = 1 . . . m and, together with ¬(A ∩ E), form a grand partition. The sets A1 ∩ E and A2 ∩ E are unions of elements of this grand partition and hence, are also events in R. Thus, A 1 ∈ R  E and A2 ∈ R  E. Fix an arbitrary vector (v1 , v2 , . . . , vn ) ∈ Rn+ such that ni=1 vi = p(A). By induction with ε respect to n, there exists a partition {A = An1 } ⊆ R  E such that vi − 2n < p(Ai ) vi for all i = 1 . . . n − 1. Then n 

|p(Ai ) − vi | < (n − 1) ·

i=1

ε ε + |p(An ) − vn | < 2 · (n − 1) · = ε. 2n 2n

Thus, p(A, n, E) is dense in (A, n).



A.3. Preference and first-order stochastic dominance Suppose that  satisfies P1(R), P3(R), P4∗ (R), P5(R), P6(R). Then the finely ranged probability measure p : R → [0, 1] defined by (A.2) represents the preference over binary acts that have outcomes x  x . The following lemma characterizes the preference  between arbitrary acts f and g when the lotteries [f ]p and [g]p induced by these acts via the measure p satisfy the first-order stochastic dominance. Lemma A.3. For all acts f and g, [f ]p  [g]p ⇒ f  g,

(A.3)

[f ]p ? [g]p ⇒ f  g.

(A.4)

Proof. Write arbitrary acts f and g as ⎡ ⎤ ⎡ z1 if s ∈ F1 z1 ⎢ z2 if s ∈ F2 ⎥ ⎢ z2 ⎢ ⎥ ⎢ ⎢ ⎥ ⎢ .. f =⎢ and g = ⎥ ⎢ . ⎢ ⎥ ⎢ ⎣ zn if s ∈ Fn ⎦ ⎣ zn h(s) if s ∈ H h(s)

⎤ if s ∈ G1 if s ∈ G2 ⎥ ⎥ ⎥ .. ⎥ . ⎥ if s ∈ Gn ⎦ if s ∈ H

for some n, for some outcomes z1  z2  · · ·  zn , for some event H (that can be empty), for some act h ∈ G  H , and for some partitions S = F1 ⊕ F2 ⊕ · · · ⊕ Fn ⊕ H and S = G1 ⊕ G2 ⊕ · · · ⊕ Gn ⊕ H . Say that acts f and g differ among n outcomes.

258

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

Prove (A.3) and (A.4) by induction with respect to n. If n = 1, then f = g, and (A.3) and (A.4) hold. Fix n > 1 and suppose that (A.3) and (A.4) hold for all acts that differ among n − 1 outcomes. Fix arbitrary acts f and g that differ among n outcomes. If zi ∼ zj for some i = j , then by P3(R), zi Fj f ∼ f and zi Gj g ∼ g. The dominance [f ]p  ( ? )[g]p implies [zi Fj f ]p  ( ? )[zi Gj g]p , and hence, the ranking zi Fj f  ()zi Gj g because the acts zi Fj f and zi Gj g differ among n − 1 outcomes. By transitivity, f  g. Wlog, z1  z2  · · ·  zn . Then [f ]p  [g]p if and only if p(F1k ) p(Gk1 ) for all k = 1 . . . n; [f ]p ? [g]p if and only if p(F1k ) p(Gk1 ) for all k = 1 . . . n and p(F1k ) > p(Gk1 ) for some k. Consider several cases. Case I: p(F1k ) > p(Gk1 ) for all k = 1 . . . n − 1, and (F1 , F2 , . . . , Fn , G1 , G2 , . . . , Gn ) ⊆ R. Take ε > 0 such that p(F1k ) > p(Gk1 ) + ε for all k = 1 . . . n − 1. For all i = 2, . . . , n, let Bi = Fi ∩ G1 and vi = p(Bi ). Then p(F1 \ G1 ) = ni=1 vi , where v1 = p(F1 \ G1 ) −

n 

vi = p(F1 \ G1 ) − p(G1 \ F1 ) = p(F1 ) − p(G1 ).

i=2

By Lemma A.2, there exists a partition {F1 \ G1 = An1 } such that p(A1 ) > v1 − ε and p(Ai ) > vi for i = 2, . . . , n. Let f1 = z2 A1 f and fi = (z1 Bi zi )(Ai ∪ Bi )fi−1 for all i = 2 . . . n. By P3(R), f  f1 . As Ai 0 Bi , then by P4∗ (R), fi−1 = (z1 Ai zi )(Ai ∪ Bi )fi−1  fi . By transitivity, f  fn . The acts fn and g can be written as ⎡ ⎤ ⎡ ⎤ z2 if s ∈ E2 z2 if s ∈ G2 ⎢ ⎥ ⎢ ⎥ .. .. ⎢ ⎥ ⎢ ⎥ . . ⎢ ⎥ ⎢ ⎥ fn = ⎢ zn if s ∈ En ⎥ and g = ⎢ zn if s ∈ Gn ⎥ , ⎢ ⎥ ⎢ ⎥ ⎣ z1 if s ∈ G1 ⎦ ⎣ z1 if s ∈ G1 ⎦ h(s) if s ∈ H h(s) if s ∈ H where E2 = (F2 ⊕ A1 ⊕ A2 )B2 , and Ei = (Fi ⊕ Ai )Bi for i = 3 . . . n. The dominance [fn ]p  [g]p holds because for all k = 2 . . . n − 1, p(E2k ) > p(A1 ) +

k 

p(Fi ) > p(F1 ) − p(G1 ) − ε +

i=2 k 

= −p(G1 ) − ε +

k 

p(Fi )

i=2

p(Fi ) > p(Gk2 ).

i=1

Then fn  g because fn and g differ among n − 1 outcomes. By transitivity, f  g. Case II: p(F1k ) > p(Gk1 ) for all k = 1 . . . n−1. By LemmaA.2, there exists a partition {¬H = n A1 } that belongs to R  ((F1 , . . . , Fn )∪(G1 , . . . , Gn )) and satisfies p(F1k ) > p(Ak1 ) > p(Gk1 ) for all k = 1 . . . n − 1. Take an act f such that f (s) = zi if s ∈ Ai for i = 1 . . . n, and f (s) = h(s) if s ∈ H . Case I implies that f  f  g and hence, that f  g.

Case III: p(F1k )p(Gk1 ) for all k = 1 . . . n. Suppose that, contrary to (A.3), g  f . If Fn is null, then [zn−1 Fn f ]p  [zn−1 Gn g]p and zn−1 Fn f  zn−1 Gn g because the acts zn−1 Fn f and zn−1 Gn g differ among n − 1 outcomes. By P3(R), zn−1 Fn f ∼ f and zn−1 Gn g  g and hence, f  g. If Fn is non-null, then by P6(R), there exists a non-null partition {Fn = E1m } such that g  x1 E1 f . Then p(F1k ⊕ E1 ) > p(Gk1 ) for all k = 1 . . . n − 1. By Case II, x1 E1 f  g. This contradiction proves (A.3).

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

259

Case IV: p(F1k )p(Gk1 ) for all k = 1 . . . n and p(F1k ) > p(Gk1 ) for some k ∈ [1, n]. There exists k such that p(F1k ) > p(Gk1 ) and p(F1k+1 ) = p(Gk+1 1 ). Note that Gk+1 is non-null. By Lemma A.2, there exists a non-null partition Gk+1 = C ⊕ D such that p(D) < p(F1k ) − p(Gk1 ). Then [f ]p  [zk Dg]p , and by Case III, f  zk Dg. By P3(R), zk Dg  g. Thus f  g, which proves (A.4).  Define a metric on the set L of all lotteries by  l − l  = |l(x) − l (x)| for l, l ∈ L. x∈X

The next two lemmas relate this metric and the first-order stochastic dominance. Assume that the set of outcomes X is finite. Then there exist x ∗ , x∗ ∈ X such that x ∗  x  x∗ for all x ∈ X. Write l  l + o if lotteries l, l ∈ L are such that l  l for all l ∈ L in some neighborhood of l ; write l + o  l if l  l for all l ∈ L in some neighborhood of l. Note that x ∗  x ∗ + o and x∗ + o  x∗ . Thus, neither l  l + o nor l + o  l implies l ? l . Lemma A.4. If X is finite, then for any acts g  g , there exist acts h  h such that [g]p + o  [h]p and [h ]p  [g ]p + o. Proof. Fix acts g  g . Let the support of the lottery l = [g]p consist of outcomes y1  y2  · · ·  yn . By P6(R), the non-null event g −1 (y1 ) has a non-null partition {g −1 (y1 ) = E1m } such that x∗ E1 g  g . Let h = x∗ E1 g. For each lottery l ∈ L such that [g]p − l  < p(E1 ) and for each x ∈ X, l (Yx ) [h]p (Yx ). Indeed, if x ∼ x∗ , then l (Yx ) = 1 = [h]p (Yx ); if y1  x  x∗ , then l (Yx ) [g]p (Yx ) − p(E1 ) = [h]p (Yx ) because [g]p − l  < p(E1 ) and g −1 (Yx ) = h−1 (Yx ) ⊕ E1 ; if x  y1 , then l (Yx ) 0 = [h]p (Yx ). Therefore, l  [h]p , and hence, [g]p + o  [h]p . Analogously, h  g implies that there exists a non-null E1 such that h  h = x ∗ E1 g and [h ]p  [g ]p + o.  Lemma A.5. If X is finite, then for any lottery l ∈ L and for any non-null event E, there exists an act h ∈ G  {¬E} such that [h]p − l < p(E) and [x ∗ Eh]p  l + o  [x∗ Eh]p . Proof. Fix a lottery l and a non-null event E. Let the support of l consist of outcomes y1  y2  · · ·  yn . Take ε = 21 · p(E) · min{l(y1 ), l(yn )}. By Lemma A.2, there exist partitions {E = An1 } and {¬E = B1n } such that for all i = 1 . . . n, p(E) · (1 + ε) · l(yi ) p(Ai ) p(E) · (1 − ε) · l(yi ) and (1 − p(E)) · (1 + ε) · l(yi ) p(Bi ) (1 − p(E)) · (1 − ε) · l(yi ). Take h ∈ G such that h(s) = yi if s ∈ Ai ⊕ Bi for i = 1 . . . n. Then [h]p − l =

n  i=1

|p(Ai ) + p(Bi ) − l(yi )|ε ·

n 

l(yi ) = ε < p(E).

i=1

Let l ∗ = [x ∗ Eh]p and l∗ = [x∗ Eh]p . Fix an arbitrary lottery l ∈ L such that l − l  < ε. For each x ∈ X, one of the following four cases applies.

260

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

Case I: x ∼ x∗ . Then l ∗ (Yx ) = 1 = l (Yx ) = l∗ (Yx ). Case II: yn  x  x∗ . Then l ∗ (Yx ) = 1 l (Yx ) l(Yx ) − ε = 1 − ε 1 − p(E) = l∗ (Yx ). Case III: y1  x  yn . Then p(E) · (1 − l(Yx ))p(E) · l(yn ) 2 · ε and  p(Bi ) p(E) + (1 − p(E)) · (1 − ε) · l(Yx ) l ∗ (Yx ) = p(E) + i:yi x

 l(Yx ) + p(E) · (1 − l(Yx )) − ε l(Yx ) + 2 · ε − ε = l(Yx ) + ε l (Yx ). Also, p(E) · l(Yx )p(E) · l(y1 ) 2 · ε and  p(Bi ) (1 − p(E)) · (1 + ε) · l(Yx ) l∗ (Yx ) = i:yi x

 l(Yx ) − p(E) · l(Yx ) + ε l(Yx ) − 2 · ε + ε = l(Yx ) − ε l (Yx ). Case IV: x  y1 . Then l ∗ (Yx ) = p(E) 0 + ε = l(Yx ) + ε l (Yx ) 0 = l∗ (Yx ). Thus, l ∗ (Yx )l (Yx )l∗ (Yx ) for all x ∈ X, and l ∗  l  l∗ for all l in the ε-neighborhood of l. Thus, l ∗  l + o  l∗ .  A.4. Necessity of axioms in Theorem 4.1 Suppose that for all f, g ∈ G, f  g ⇔ [f ]p 1 [g]p ,

(A.5)

where p : R → [0, 1] is a finely ranged probability measure, and the binary relation 1 on Lp is non-degenerate, complete, transitive, continuous, and strictly monotonic. Show that  satisfies the list of axioms in Theorem 4.1. P1(R) and P5(R) are trivial. For each event A ∈ R, either p(A) = 0 or p(A) > 0. If p(A) = 0, then for all outcomes x, y ∈ X and for all acts h ∈ G  {¬A}, [xAh]p = [yAh]p , [xAh]p ∼1 [yAh]p , and xAh ∼ yAh. If p(A) > 0, then for all acts h ∈ G  {¬A} and for all outcomes x  ()y, the dominance [xAh]p  ( ? )[yAh]p holds, which implies [xAh]p 1 ( 1 )[yAh]p because 1 is strictly monotonic, and by (A.5), xAh  ()yAh. In this case, x  y if and only if xAh  yAh, Thus,  satisfies P3(R). For all events A, B, ∈ R, and C ∈ R  {A, Ac , B, B c }, for all outcomes x  x and z  z , and for all acts h ∈ G  {C c }, the inequality p(A)(>)p(B) implies the dominance [(xAx )Ch]p  ( ? )[(xBx )Ch]p , and rankings [(xAx )Ch]p 1 ( 1 )[(xBx )Ch]p and (xAx )Ch  () (xBx )Ch accordingly. Therefore, for all acts h, h ∈ G  {C c }, (xAx )Ch  (xBx )Ch ⇔ p(A) p(B) ⇔ (zAz )Ch  (zAz )Ch . Thus,  satisfies P4∗ (R). Show that  satisfies P6(R). Without loss of generality, X is finite. Fix an outcome x ∈ X, a finite collection E ⊆ R, and a pair of E-measurable acts f  g. By continuity of the risk preference 1 , there exists > 0 such that l 1 [g]p for all l ∈ Lp in the -neighborhood of [f ]p , and [f ]p 1 l for all l ∈ Lp in the -neighborhood of [g]p . Partition S into events {S1 , . . . , Sm } ⊆ R  E such that p(Si ) < 2 for all i = 1 . . . m. Then [xSi f ]p lies in the neighborhood of [f ]p ; hence, [xSi f ]p 1 [g]p and xSi f  g. Analogously, f  xSi g.

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

261

A.5. Sufficiency of axioms in Theorem 4.1 Suppose that  satisfies P1(R), P3(R), P4∗ (R), P5(R), P6(R). Let p : R → [0, 1] be the probability measure given by (A.2). Let Lp be the set of lotteries that are induced by acts via the probability p. For all lotteries l, l ∈ Lp , let l 1 l ⇔ l = [g]p

and l = [g ]p

for some acts g  g .

For all acts f and f , f  f implies [f ]p 1 [f ]p . On the other hand, [f ]p 1 [f ]p implies that [f ]p = [g]p and [f ]p = [g ]p for some g  g , and by (A.3), that f ∼ g  g ∼ f . Thus, representation (A.5) holds. The relation 1 is non-degenerate, complete and transitive on Lp because  is non-degenerate, complete and transitive on G. By (A.3) and (A.4), 1 is strictly monotonic. Show that 1 is continuous. Suppose first that X is finite. Fix arbitrary lotteries l, l ∈ Lp such that l 1 l . Then l = [g]p and l = [g ]p for some acts g  g . By Lemma A.4, there exist acts h  h such that l + o  [h]p and [h ]p  l + o. Therefore, for all l ∈ Lp in some neighborhood of l, l  [h]p 1 [h ]p  [g ]p = l and l 1 l because 1 is monotonic and transitive. Thus, the set {l ∈ Lp : l 1 l } is open in Lp for all l ∈ Lp . Analogously, the set {l ∈ Lp : l 1 l } is open in Lp for all l ∈ Lp . If X is infinite, then given a finite Y ⊆ X, 1 is continuous on Lp (Y ) and by definition, is continuous on Lp . Finally, suppose that f  g ⇔ [f ]p∗ ∗1 [g]p∗ for a finely ranged probability measure p ∗ : R → [0, 1] and a strictly monotonic ∗1 . Strict monotonicity of ∗1 implies that p ∗ represents the comparative likelihood relation 0 and hence, p∗ = p. Then [f ]p 1 [g]p if and only if [f ]p ∗1 [g]p . Thus, ∗1 = 1 . A.6. Necessity of axioms in Theorem 3.1 Suppose that  is represented by expected utility  U (f ) = u(x) · p(f −1 (x)) for f ∈ G,

(A.6)

x∈X

where p : R → [0, 1] is a finely ranged probability measure, and u maps X into R. Then  is also represented by (A.5) with the same p and with the risk preference 1 induced on Lp via the expected utility function U (all necessary properties of 1 are easily verified). Thus,  satisfies the list of axioms in Theorem 4.1. Moreover, for all events A ∈ R, for all acts f, g ∈ G  {A} and for all outcomes x, y, U (f Ax) U (gAx) if and only if U (f Ay) U (gAy) because U (f Ax) − U (f Ay) = U (gAx) − U (gAy) = (u(x) − u(y))p(¬A). Thus,  satisfies P2(R). A.7. Sufficiency of axioms in Theorem 3.1 Suppose that  satisfies P1(R), P2(R), P3(R), P4(R), P5(R), P6(R). Then P4∗ (R) is implied by P2(R) and P4(R), and hence all of our previous results apply. Let p : R → [0, 1] be the probability measure given by (A.2).

262

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

Suppose that X is finite. Construct the extended risk preference 1 on the set L of all lotteries rather than only on Lp . For all l, l ∈ L, let l 1 l ⇔ ∃ acts gi  gi =l

for i = 1, 2, . . . : lim [gi ]p i→∞

lim [gi ]p = l .

and

(A.7)

i→∞

Show that 1 is non-degenerate, complete, transitive, and satisfies a version of the Independence Axiom: for all l, l , l ∈ L, l 1 l ⇒

1 2l

+ 21 l 1 21 l + 21 l .

(A.8)

Show also that for all f, f ∈ G,  is represented by f  f ⇔ [f ]p 1 [f ]p .

(A.9)

Conclude that both 1 on L and  on G can be represented by expected utility. Step 1: To show that 1 is complete, fix arbitrary lotteries l, l and consider two mutually exclusive cases. (1) The weak preference g  g holds for all acts g and g such that [g]p  l+o and l +o  [g ]p . Fix a sequence of non-null events Ei such that lim p(Ei ) = 0. By Lemma A.5, there exist i→∞

∞ sequences {hi }∞ i=1 and {hi }i=1 such that for all i, [hi ]p − l < p(Ei ), [hi ]p − l  < p(Ei ), ∗ [x Ei hi ]p  l + o, and l + o  [x∗ Ei hi ]p . It follows that the weak preference x ∗ Ei hi  x∗ Ei h i holds. Note that

[x ∗ Ei hi ]p − l [x ∗ Ei hi ]p − [hi ]p  + [hi ]p − l < 3 · p(Ei ), that is, lim [x ∗ Ei hi ]p = l. Analogously, lim [x∗ Ei h i ]p = l . Then gi = x ∗ Ei hi and i→∞

i→∞

gi = x∗ Ei h i satisfy (A.7), and hence, l  l . (2) The strict preference g  g holds for some acts g and g such that [g]p  l + o and ∞ l + o  [g ]p . Suppose that some sequences {gi }∞ i=1 and {gi }i=1 satisfy (A.7). Then [g]p  lim [gi ]p + o and lim [gi ]p + o  [g ]p imply that [g]p  [gi ]p and [gi ]p  [g ]p for i→∞

i→∞

sufficiently large i. By (A.3), g  gi  gi  g which contradicts g  g. Thus, no sequences ∞ {gi }∞ i=1 and {gi }i=1 satisfy (A.7) and l 1 l does not hold. To show that l 1 l, fix a sequence of non-null events Ei such that lim p(Ei ) = 0 and sequences of acts {hi } and {h i } such that i→∞

lim [x∗ Ei hi ]p = l, lim [x ∗ Ei h i ]p = l , l +o  [x∗ Ei hi ]p and [x ∗ Ei h i ]p  l +o. Then for

i→∞

i→∞

all i, x ∗ Ei h i  g  g  x∗ Ei hi because [x ∗ Ei h i ]p  l  [g ]p and [g]p  l  [x∗ Ei hi ]p . By definition, l 1 l. Thus, l 1 l.

Thus, either l  l or l  l must hold. Moreover, for all lotteries l, l ∈ L, l 1 l ⇔ g  g [g]p  l + o

and

and

such that

l + o  [g ]p

l 1 l ⇔ g  g [g]p  l + o

for all g, g ∈ G

for some g, g ∈ G

l + o  [g ]p .

(A.10) such that (A.11)

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

263

Step 2: To show that 1 is transitive, fix arbitrary lotteries l, l , l such that l 1 l 1 l . Suppose that l 1 l. By (A.11), there exist acts g  g such that [g]p  l +o and l +o  [g ]p . Construct a non-null event E and an act h ∈ G  {¬E} such that g  x ∗ Eh  x∗ Eh  g. Take a non-null event g −1 (x). By P6(R), there exists a non-null partition {g −1 (x) = Am 1 } such that g  x ∗ A1 g. Let h = x ∗ A1 g. Then h  xA1 g = g. By P6(R), there exists a non-null partition {A1 = B1n } such that x∗ B1 h  g. Let E = B1 . Then g  x ∗ A1 h = h = x ∗ Eh  x∗ Eh  g. By Lemma A.5, there exists an act h ∈ G  {¬E} such that [x ∗ Eh ]p  l + o  [x∗ Eh ]p . Two cases are possible: either x ∗ Eh  x ∗ Eh or x ∗ Eh  x ∗ Eh. If x ∗ Eh  x ∗ Eh , then g  x ∗ Eh  x ∗ Eh and by (A.11), l 1 l . If x ∗ Eh  x ∗ Eh, then by P2(R), x∗ Eh  x∗ Eh  g and by (A.11), l 1 l. Either case contradicts l 1 l 1 l . Thus, l 1 l is impossible and l 1 l holds. Step 3: To show that 1 is continuous, fix arbitrary lotteries l and l such that l 1 l . By (A.11), there exist acts g  g such that l + o  [g]p and [g ]p  l + o. For all l in some neighborhood of l, l + o  [g]p and by (A.11), l 1 l . Thus, the set {l ∈ L : l 1 l } is open in L for all l ∈ L. Analogously, the set {l ∈ L : l 1 l } is open in L for all l ∈ L. Step 4: To show representation (A.9), fix arbitrary acts f and f . If f  f , then (A.7) holds for gi = f and gi = f ; therefore, [f ]p 1 [f ]p . If f  f , then by Lemma A.4, there exist acts h  h such that [f ]p + o  [h]p and [h ]p  [f ]p + o. By (A.11), [f ]p 1 [f ]p . Step 5: To show that 1 satisfies invariance (A.8), fix arbitrary lotteries l, l , and l such that l 1 l . Write the supports of l, l , and l as y1  · · ·  yn , y1  · · ·  yn and y1  · · ·  yn , respectively. Fix arbitrary ε > 0 and an event H such that 21 − ε < p(H ) < 21 + ε. Let  = p(H ). By Lemma A.2, there exist partitions p(Bi ) i) (i) {H = An1 } and {¬H = B1n } such that 1  p(A ·l(yi ) 1 + ε and 1  (1−)·l(yi ) 1 + ε for all

i = 1 . . . n − 1; n

p(Bn ) n) 1 − ε  p(A ·l(yn ) 1 and 1 − ε  (1−)·l(yn ) 1;

i) i) 1 and 1 − ε  (1−p(D 1 for all (ii) {H = C1 } and {¬H = D1n } such that 1 − ε  p(C ·l (y ) )·l (y )

i = 1 . . . n − 1;

p(C )

i

i

p(D )

1  ·l (yn ) 1 + ε and 1 (1−)·ln (y ) 1 + ε; n

n

i) i) 1+ε and 1−ε  (1−p(G 1+ε (iii) {H = F1n } and {¬H = Gn1 } such that 1−ε  p(F ·l (y ) )·l (y )

for all i = 1 . . . n .

i

i

Take acts f , f and f such that f (s) = yi if s ∈ Ai ⊕Bi for i = 1 . . . n, f (s) = yi if s ∈ Ci ⊕Di for i = 1 . . . n , and f (s) = yi if s ∈ Fi ⊕ Gi for i = 1 . . . n . Then [f ]p  l 1 l  [f ]p . It follows that [f ]p 1 [f ]p and by (A.9), f  f . By P2(R), the strict rankings f Hf  f Hf and f Hf  f Hf imply f = f Hf  f Hf  f Hf = f , and hence, cannot hold together. If f Hf  f Hf , let g = f Hf and g = f Hf . Then [g]p − ( 21 l + 21 l ) [f Hf ]p − (l + (1 − )l ) +(l + (1 − )l ) − ( 21 l + 21 l ) 3ε. Analogously, [g ]p − ( 21 l + 21 l ) 3ε. If f Hf  f Hf , let g = f Hf and g = f Hf . Then [g]p − ( 21 l + 21 l ) 3ε and [g ]p − ( 21 l + 21 l ) 3ε. Take ε = 1i for i = 1, 2, . . . and obtain acts gi  gi such that gi → 21 l+ 21 l and gi → 21 l + 21 l . Thus, 21 l + 21 l 1 21 l + 21 l .

264

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

By the main result in Herstein–Milnor [13], which is an extension of the von Neumann– Morgenstern Theorem, 1 can be represented by expected utility for all l, l ∈ L l 1 l ⇔



u(x) · l(x)

x∈X



u(x) · l (x),

x∈X

where u : X → R is unique up to a positive linear transformation. This representation, together with (A.9), implies (A.6). The uniqueness of p follows from Theorem A.1. Finally, if X is infinite, then for all finite Y ⊆ X such that x, x ∈ Y , the preference over acts f ∈ G that have range in Y , f (S) ⊆ Y , has a unique expected utility representation U (f ) =  −1 (y)) where u : Y → R satisfies u (x) = 1 and u (x ) = 0. As all u ’s are Y Y Y Y y∈Y uY (y) · p(f unique, then there exist a function u : X → R that extends all of these indices simultaneously.  Then U (f ) = x∈X u(x) · p(f −1 (x)) represents  on the entire G. Appendix B. Proof of Theorem 5.1 Fix a set of states S, a set of outcomes X, an algebra of events  ⊆ 2S , and a reflexive preference  over the set F of all acts. Derive the domains RZ ⊆  and REZ ⊆  from  via Zhang’s and Epstein–Zhang’s definitions, respectively. Show that RZ is a mosaic. S ∈ RZ because x  x for all x ∈ X by reflexivity. By definition, E ∈ RZ is equivalent to ¬E ∈ RZ . Partition S into m 2 subjectively risky events {S1 , . . . , Sm } ⊆ RZ . Then for all x, y ∈ X and f, g ∈ F, x(S1 ∪ S2 )f  x(S1 ∪ S2 )g ⇒ xS1 (xS2 f )  xS1 (xS2 g) ⇒ yS1 (xS2 f )  yS1 (xS2 g) ⇒ xS2 (yS1 f )  xS2 (yS1 g)

S1 ∈RZ

⇒ yS2 (yS1 f )  yS2 (yS1 g) ⇒ y(S1 ∪ S2 )f  y(S1 ∪ S2 )g.

S2 ∈RZ

A similar argument repeated inductively for S3 , . . . , Sm ∈ REZ shows that for all x, y ∈ X and m m m f, g ∈ F, x(∪m i=3 Si )f  x(∪i=3 Si )g ⇒ y(∪i=3 Si )f  y(∪i=3 Si )g, or equivalently, f (S1 ∪ S2 )x  g(S1 ∪ S2 )x ⇒ f (S1 ∪ S2 )y  g(S1 ∪ S2 )y. Thus, S1 ∪ S2 ∈ RZ . The axiom P2(RZ ) follows immediately from the definition of RZ . Theorem 3.1 delivers representation (11). Analogously, one can show that REZ is a mosaic, the only difference being that f and g are required everywhere to be joint bets. P4(REZ ) implies P4∗ (REZ ) because for all A, B ∈ REZ , C ∈ REZ  {A, Ac , B, B c }, x  x , z  z , and h, h ∈ GEZ  {C c }, (xAx )Ch  (xBx )Ch (xAx )Cx  (xBx )Cx (zAz )Cz  (zBz )Cz

⇒ {induction by the size of the range h(C)}, ⇒ {P4(REZ )}, ⇒ {induction by the size of the range h (C)},

(zAz )Ch  (zBz )Ch . Thus, Theorem 4.1 delivers representation (12).

I. Kopylov / Journal of Economic Theory 133 (2007) 236 – 265

265

References [1] M. Abdellaoui, P. Wakker, The likelihood method for decision under uncertainty, Theory and Decision 58 (2005) 3–76. [2] M.Allais, La psychologie de l’home rationnel devant le risque: critique des postulats et axiomes de l’écoleAméricaine, Econometrica 21 (1953) 503–546 (translated and reprinted in Allais and Hagen, 1979). [3] P. Billingsley, Probability and Measure, third ed., Wiley, New York, 1995. [4] D. Ellsberg, Risk, ambiguity, and the Savage axioms, Quar. J. Econ. 75 (1961) 643–669. [5] L. Epstein, J. Zhang, Subjective probabilities on subjectively unambiguous events, Econometrica 69 (2001) 265–306. [6] B. de Finetti, La prévision: ses lois logiques, ses sources subjectives, Annales de l’Institute Henri Poincare 7 (1937) 1–68 (translated and reprinted in Kyburg and Smokler, 1964). [7] P.C. Fishburn, Utility Theory for Decision-Making, Wiley, New York, 1970. [8] P. Ghirardato, Defining ambiguity and ambiguity attitude, in: I. Gilboa (Ed.), Uncertainty in Economic Theory: A Collection of Essays in Honor of David Schmeidler’s 65th Birthday, Routledge, London, 2004. [9] P. Ghirardato, M. Marinacci, Ambiguity mase precise: a comparative foundation, J. Econ. Theory 102 (2002) 251–289. [10] I. Gilboa, D. Schmeidler, Maxmin expected utility with non-unique prior, J. Math. Eco. 18 (1989) 141–153. [11] S. Grant, Subjective probability without monotonicity: or how Machina’s Mom may also be probabilistically sophisticated, Econometrica 63 (1995) 159–180. [12] F. Gul, Savage’s theorem with a finite number of states, J. Econ. Theory 61 (1993) 99–110. [13] I. Herstein, J. Milnor, An axiomatic approach to measurable utility, Econometrica 21 (1953) 291–297. [14] F. Knight, Risk, Uncertainty and Profit, Houghton Mifflin, Boston, 1921. [15] C. Kraft, J. Pratt, A. Seidenberg, Intuitive probability on finite sets, Ann. Math. Statist. 30 (1959) 408–430. [16] D.M. Kreps, Notes on the Theory of Choice, Westview Press, Boulder and London, 1988. [17] M. Machina, D. Schmeidler, A more robust definition of subjective probability, Econometrica 57 (1992) 558–571. [18] I. Niiniluoto, A note on fine and tight qualitative probabilities, Ann. Math. Statist. 43 (1972) 1581–1591. [19] L.J. Savage, The Foundations of Statistics, second revised ed., Dover Publications Inc., New York, 1972. [20] D. Schmeidler, Subjective probability and expected utility without additivity, Econometrica 57 (1989) 571–585. [21] P. Wakker, Agreeing probability measures for comparative probability structures, Ann. Statist. 9 (1981) 658–662. [22] J. Zhang, Qualitative probabilities on -systems, Math. Soc. Sci. 38 (1999) 11–20. [23] J. Zhang, Subjective ambiguity, expected utility and Choquet expected utility, Economic Theory 20 (2002) 159–181.