Towards Effects in Mathematical Operational Semantics

Available online at www.sciencedirect.com Electronic Notes in Theoretical Computer Science 276 (2011) 81–104 www.elsevier.com/locate/entcs Towards E...

Download PDF

350KB Sizes 1 Downloads 86 Views

Report

PDF Reader
Full Text

Available online at www.sciencedirect.com

Electronic Notes in Theoretical Computer Science 276 (2011) 81–104 www.elsevier.com/locate/entcs

Towards Eﬀects in Mathematical Operational Semantics Faris Abou-Saleh Dirk Pattinson Department of Computing Imperial College London

Abstract In this paper, we study extensions of mathematical operational semantics with algebraic eﬀects. Our starting point is an eﬀect-free coalgebraic operational semantics, given by a natural transformation of syntax over behaviour. The operational semantics of the extended language arises by distributing program syntax over eﬀects, again inducing a coalgebraic operational semantics, but this time in the Kleisli category for the monad derived from the algebraic eﬀects. The ﬁnal coalgebra in this Kleisli category then serves as the denotational model. For it to exist, we ensure that the the Kleisli category is enriched over CPOs by considering the monad of possibly inﬁnite terms, extended with a bottom element. Unlike the eﬀectless setting, not all operational speciﬁcations give rise to adequate and compositional semantics. We give a proof of adequacy and compositionality provided the speciﬁcations can be described by evaluation-in-context. We illustrate our techniques with a simple extension of (stateless) while programs with global store, i.e. variable lookup. Keywords: Operational semantics, Coalgebras, Eﬀects, Kleisli category, Comodels

1

Introduction

Operational and denotational semantics provide orthogonal descriptions of a programming language: operational semantics speciﬁes the execution of programs, and denotational semantics associates a mathematical meaning to programs. Crucially, denotational semantics is compositional so that the denotation of a program can be inferred from the interpretation of its subprograms, and so facilitates the task of compositional veriﬁcation, either of program properties, or of features of the language at large. The operational and denotational description of a language are connected via the notion of adequacy – operationally equivalent programs should have the same denotation, and vice-versa. The veriﬁcation of adequacy is usually done on a language-by-language basis and is often tedious for non-trivial languages. A more general approach was proposed by Turi and Plotkin [24] in the case of structural operational semantics, assuming that the operational behaviour of a language can be modelled coalgebraically. Then, the semantic domain can be taken as a ﬁnal coalgebra, and mapping programs to their denotations provides an 1571-0661/$ – see front matter © 2011 Elsevier B.V. All rights reserved. doi:10.1016/j.entcs.2011.09.016

82

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

adequate denotational semantics. The main result of op.cit. is that this semantics is compositional if the operational rules can be given in the form of a natural transformation of syntax over behaviour, i.e. they form an abstract operational semantics. While this programme of mathematical operational semantics is well-suited to process-calculi-like languages, it is not yet clear how imperative languages, or indeed any language that produces computational eﬀects [14,15,5], can be treated adequately, and a number of diﬃculties present themselves. Firstly, giving an abstract operational semantics relies on syntax and behaviour being deﬁned on the same category, whereas computational eﬀects are are most naturally expressed in the Kleisli category of a monad [12], which is unsuitable for representating syntax. Second, many of the standard techniques for constructing ﬁnal coalgebras (that play the roles of denotational domains) fail in a Kleisli category. Finally, working with a Kleisli category to obtain adequate denotational models, we need new proof techniques to show compositionality of the denotational semantics. In this paper, we make ﬁrst steps towards abstract operational semantics for languages with eﬀects and address the diﬃculties outlined above. We begin with a pure, eﬀectless language, and an operational semantics given by a natural transformation. We extend the syntax with computational eﬀects as given by an algebraic theory [19], and extend the operational semantics to handle these eﬀects; this induces an eﬀectful operational model by structural recursion. To obtain the extended operational semantics, we introduce a notion of ‘dependency-support’ that records which subterms appear in the premises of the operational rules. In particular, this covers the case where operational rules have at most one premise that exhibits nontrivial behaviour. Once it is obtained, an eﬀectful operational model is considered as a coalgebra in the Kleisli category induced by the algebraic eﬀects. This ensures that when the coalgebra behaviour is iterated, one accumulates the eﬀects arising during program execution. The existence of ﬁnal coalgebras (denotational models) in this setting is subject to this Kleisli-category being enriched over chain-complete partial orders; we explore the issue of satisfying this requirement in a generic way. Lastly, unlike the eﬀectless setting, we demonstrate that not all abstract operational semantics are compositional. We give some abstract conditions that guarantee compositionality; in particular, we show that operational semantics described by evaluation-in-context give rise to compositional semantics. At this point, our approach has several limitations. We neglect equational speciﬁcations of algebraic eﬀects which results in a too ﬁne-grained notion of operational equivalence. Moreover, one often wishes to abstract away some aspects of program execution traces, such as the number of steps before termination; we do not do this here. We plan to address these points in future work following [18] along with more discussion of comodels [16]. Related Work. Algebraic eﬀects have been considered in [14] in the context of PCF, which has recently been extended to account for a larger class of operational phenomena [8]. Both papers are mainly operational, and indeed [14] concludes

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

83

with the remark that ‘one would wish to reconcile this work with the co-algebraic treatment of operational semantics’ in [24]. The framework of abstract operational semantics itself has been extended into various directions [9,10], but eﬀects have so far been elusive. Our treatment of computational eﬀects is inspired by [19,6]. Final coalgebras in Kleisli categories were studied in [4].

2

Introducing Eﬀects into Syntax and Behaviour

We recall the mathematical operational semantics of Turi and Plotkin [23,24] applied to multi-sorted syntax signatures in a category C S . In general, we assume that C is cartesian closed, with countable products and coproducts (ω-arities are needed for eﬀects, e.g. the read-operation of [15]). We also assume that countable polynomial functors have initial algebras and ﬁnal coalgebras. Deﬁnition 2.1 If F : C → C is a functor, an F -algebra is a pair A, α where A ∈ C and α : F A → A. Dually, an F -coalgebra is a pair C, γ where γ : C → F C. Algebras and coalgebras form categories that we denote by Alg(F ) and Coalg(F ), respectively. An S-sorted signature Sig over a set S of sorts has function symbols with (at most countable) arities, written as f : (si )0≤i<α → sf where si ∈ S, and α ≤ ω is the arity of f . If arities are clear from the context, we write f : (si ) → sf . Every S-sorted signature Sig induces a functor ΣSig : C S → C S given by ΣSig (Xs )s∈S = ( f :(si )→s i<α Xsi )s∈S where α is the arity of f . For our running example, we take while programs in C = Set that we describe via an extension of an eﬀectless language – the fragment of while without variable lookup x or update x:=n – with eﬀects for global state. The base language that we call stateless while mainly serves the purpose of exemplifying our techniques. Example 2.2 Let S = {Num, Bool, Prog} denote numerical, boolean and program expressions, respectively. The signature of stateless while is given by

N ::=n | N + N | N ∗ N | +n (N ) | ∗n (N ) E ::=b | N = N | N ≤ N | =n (N ) | ≤n (N ) | ¬E | E ∧ E P ::=skip | P ; P | while (E) do {P } | if (E) then {P } else {P } where n is a numeral in N, and b is a boolean in B = {true, f alse}. The auxiliary operators +n (−), ∗n (−), etc. record partial results of evaluating expressions like N + M from left-to-rightAs explained in Section 3, these auxiliary operators are introduced to give a correct eﬀectful operational semantics for + and ∗. The grammar above induces a syntax functor Σ : Set3 → Set3 in a straightforward way. Remark 2.3 If Σ is a syntax functor induced by a signature, the assumptions on C provide us with a free construction F U where U : Alg(Σ) → C S is the forgetful functor on Σ-algebras. We write TΣ = U F ; intuitively, the s-component (TΣ X)s contains the terms of sort s over sorted variables (Xs ).

84

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

Operational models of languages given by a signature Sig are coalgebras TΣ 0 → BTΣ 0 where B : C S → C S is an endofunctor describing the behaviour of programs and TΣ 0 is the set of closed programs. Their operational semantics may be speciﬁed by an abstract operational semantics (abstract OS): Deﬁnition 2.4 An abstract operational semantics (abstract OS) for syntax and behaviour functors Σ, B is a natural transformation ρX : Σ(X × BX) −→ BTΣ X. Example 2.5 The observable behaviours of stateless while are deterministic transitions that may produce a value (for boolean and arithmetic expressions) whereas for programs, only termination is observable. That is, we let B(N, E, P ) = (N + N, E + B, P + 1), or equivalently (N, E, P ) + (N, B, 1). For numeric expressions u, u ∈ N , we write u → u to represent a transition √ from u to u – formally, u has behaviour inr(u ) ∈ N + N. We write u → n if u terminates with value n ∈ N√ – formally inr(n) ∈ N + N. Similar notation applies for other types. We write p → if program p ∈ P terminates. The abstract operational semantics for stateless while may be presented by standard operational rules such as:

√

n→n

√

b→b

√

skip →

u → u u + v → u + v

√

u→n u + v → +n (v)

e → e if (e) then {p} else {q} → if (e ) then {p} else {q} p → p p ; q → p ; q

√

p→ p;q →q

v → v +n (v) → +n (v ) √

√

v→m √

+n (v) → n + m

e → true if (e) then {p} else {q} → p

(etc.)

while (e) do {p} → if (e) then {p; while (e) do {p}} else {skip}

and other familiar rules for the remaining operators ∗, ¬, =, <= , ∧ (see e.g. [13]). It can easily be veriﬁed that these rules induce a natural transformation ρ : Σ(X × BX) → BTΣ X that distributes syntax over behaviour. To illustrate, we deﬁne ρ for some operators. We suppose X = (N, E, P ) and abbreviate if (e) then {p} else {q} to if(e, p, q), and similarly for while(e, p): + ((u, bu ), (v, bv )) +n ((v, bv )) if ((e, be ), (p, bp ), (q, bq )) while ((e, be ), (p, bp ))

→ Cases{ − −→ Cases{ −→ Cases{ −→ if(e,

bu = n in N : +n (v), bu = u in N : (u + v)} bv = m in N : (n + m), bv = v in N : +n (v )} be = f alse in B : (q), be = true in B : (p), be = e in E : (if(e , p, q))} (p ; while(e, p)), skip)

Theorem 5.1 of [24] shows how ρ induces operational models by structural recursion. with closed programs being the carrier of the model. Proposition 2.6 Suppose ρX : Σ(X × BX) → BTΣ X is a natural transformation. For every coalgebra Y, γ : Y → BY , there is a unique morphism, which we denote T˜γ, such that the following diagram commutes. Here, η and μ are the unit and multiplication of T , and ψ is the Σ-algebra structure of T Y .

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

Y γ

BY

ηY

BηY

/ TY o

ψ

T˜γ

/ BT Y o

ΣT Y

BμY ◦ρT Y

85

Σid,T˜γ

Σ(T Y × BT Y )

To induce the desired operational model – a coalgebra structure for closed programs, T 0 → BT 0 – we take Y = 0 and γ =?BY : 0 → BY the initial map. Example 2.7 For Example 2.5, this operational model describes the transition behaviour of while programs without variable lookup or update. Coalgebraic bisimilarity is given by a map from TΣ 0 into the ﬁnal B-coalgebra, which is (N × N, N × B, N × 1) – it maps expressions of each type to the number n ∈ N of steps-to-termination and the terminal value v ∈ N, B, 1 for that expression. Two expressions p, q are then behaviourally equivalent iﬀ they terminate in the same number of steps, and with the same terminal value. 2.1

Eﬀects as Syntax

We now extend an abstract operational semantics with eﬀectful commands, such as variable lookup/update for while programs. We extend the signature by adding constructors that describe eﬀects such as variable lookup at every sort. Deﬁnition 2.8 An eﬀect signature is a single-sorted signature Eﬀ representing eﬀects of arity at most ω. The eﬀectful extension Sig ⊕ Eﬀ of a language Sig with eﬀects Eﬀ consists of the function symbols in Σ, together with a function symbol δs : (s)0≤j<α → s (i.e. all arguments of type s) for each δ ∈ Eﬀ of arity α and all ˜ for sorts s of Sig. If Δ : C → C is the signature functor induced by Eﬀ, we write Δ n n n ˜ is the signature functor induced by Sig ⊕ Eﬀ. Δ : C → C . Thus Σ + Δ Key examples of eﬀects include global state, interactive I/O, and non-determinism. We focus on global state [15] as required for while programs. Example 2.9 Given a ﬁnite set of variable-locations L with values in V , the eﬀects for global state are ‘read’ rd : v → l and ‘write’ wr : 1 → l×v operations on variables, where |L| = l < ω and |V | = v ≤ ω. Thus the eﬀect signature Eﬀ has function symbols rdx of arity v, and wrx,n of arity 1, for all x ∈ L, n ∈ V . We read rdx ((cn )n∈V ) as the computation that reads the value n of x in the store and then executes the command cn . The command wrx,n (c) assigns n to x and proceeds with command c. The free-algebra functor TΣ+Δ ˜ (Remark 2.3) describes the program terms of the extended language, allowing eﬀects to be freely incorporated into program syntax. In particular, an extension of stateless while with eﬀects for global state allows us to describe ‘standard’ while-programs in the following way. Example 2.10 Extending the signature Sigsl for stateless while with Eﬀ given by global state introduces new syntax operators rdx and wrx,n of arity ω and 1 respectively (for all x ∈ L and n ∈ N), i.e. ΔX = L×X ω +L×N×X where L is the

86

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

set of store locations. We can thus write numeric expressions like 3 + rdx (0, 1, 2, . . .) – corresponding to 3 + x – and we can express x:=3 as wrx,3 (skip). 2.2

Eﬀectful Behaviour and the Final Coalgebra in Kl(M)

We argue that eﬀects are most naturally captured by moving from the underlying category C S to a Kleisli category Kl(M ) for a suitable monad M , where M X constructs an appropriate set of eﬀect-trees with leaves in X. As described in [8], program execution can be understood as producing a syntactic eﬀect-tree whose leaves are programs. If M constructs these eﬀect-trees, then a suitable functor for eﬀectful behaviour is M B, where B is as before. An eﬀectful operational model is thus an M B-coalgebra with carrier TΣ+Δ ˜ 0. This structure is appropriate for describing the one-step evolution of programs as they are executed. However, it does not directly yield the right notion of multi-step evaluation; iterating a coalgebra map γ : X → M BX in the underlying category γ

M Bγ

C gives a map X −→ M BX −→ M BM BX that fails to accumulate the eﬀects. However, a distributive law λ : BM → M B allows us to accumulate eﬀects correctly as follows:

γ

M Bγ

Mλ

μ

2

BX B X X −→ M BX −→ M BM BX −→ M 2 B 2 X −→ M B2X

(1)

The eﬀects arising in the ﬁrst two execution steps have been combined, giving a single eﬀect tree whose leaves (in B 2 X) describe two eﬀectless transition steps. This result can be achieved naturally by moving from C into a Kleisli category Kl(M ); M B-coalgebras in C are interpreted as B-algebras in Kl(M ) for a behaviour functor B obtained by ‘lifting’ B from C into Kl(M ). The above chain of morphisms then corresponds to the iteration Bγ ◦ γ; we can think of Kleisli-morphisms as accumulating and propagating eﬀects. Recall the objects of the Kleisli category Kl(M ) are the same as the underlying category C, but with morphisms f : X → Y in 1-1 correspondence with the (‘underlying’) morphisms f : X → M Y in C. Composition g ◦ f , for g : Y → Z and f : X → Y , is given by the arrow h : X → Z corresponding to the morphism f

Mg

μ

Z X → MY → MMZ → M Z (its ‘overlying’ arrow). We write g † for the latter part, μZ ◦ M g : M Y → M Z, and use notation in the natural way – e.g. the above identity becomes g ◦ f = (g † ◦ f ) . We write J for the canonical (left-adjoint) inclusion functor C → Kl(M ), identityon-objects and sending f : X → Y to Jf = ηY ◦ f : X → Y [3]. One ﬁnds that g ◦ Jf = (g ◦ f ) and Jf ◦ h = (M f ◦ h) . A ‘lifting’ B of the behaviour functor B into Kl(M ) satisﬁes JB = BJ. In particular, this implies that B is identity-on-objects – so a Kleisli-morphism γ : X → BX has underlying type γ : X → M BX, establishing the 1-1 correspondence between M B-coalgebras and B-coalgebras mentioned above. Liftings can be described in terms of distributive laws [21,3]:

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

87

Lemma 2.11 There is a 1-1 correspondence between liftings B of B into Kl(M ) and distributive laws λ : BM → M B. For B deﬁned by sums and copowers (i.e. isomorphic to BX = V + A × X for some V, A), these distributive laws exist. Given a distributive law λ, the corresponding B sends an arrow f : X → Y Bf

λ

Y to the arrow Bf : BX → BY corresponding to BX → BM Y → M BY . Thus for γ : X → BX, the composition Bγ ◦ γ corresponds to the chain of arrows anticipated in (1):

(Bγ)† ◦ γ = (λX ◦ Bγ)† ◦ γ = μBX ◦ M λX ◦ M Bγ ◦ γ We later use the easy fact that if f is an M B-coalgebra morphism, then Jf is a B-coalgebra morphism. The terminal sequence for B in the Kleisli category suggests that the ﬁnal Bcoalgebra in Kl(M ), if it exists, is a natural candidate for program denotations. Standard results guaranteeing its existence are reviewed below [21,4,7]. Deﬁnition 2.12 A category C is Cppo-enriched if all hom-sets are complete partial orders with least element ⊥ and composition is ω-continuous in both arguments (g ◦ n fn = n (g ◦ fn ) and similarly for the ﬁrst argument). Composition in C is left-strict if ⊥X,Y ◦ f = ⊥X,Z for f : Y → Z. A functor B : C → D between Cppoenriched categories C, D is locally continuous if it preserves suprema of ω-chains, i.e. B( n fn ) = n B(fn ), and locally monotone if Bf ≤ Bg whenever f ≤ g. The following is shown in [4, Theorem 3.3 and Proposition 3.9] and readily extends componentwise to the multi-sorted setting. Proposition 2.13 Let M be a monad on C, Kl(M ) a Cppo-enriched Kleisli category with left-strict composition, and F a locally-continuous lifting of F into Kl(M ). If the initial F -algebra D, α exists in C, then the ﬁnal F -coalgebra in Kl(M ) is given by D, Jα−1 . If C = Setn , F need only be locally monotonic. Proof. Follows from [4] Propositions 3.2 and 3.9. Note that in Proposition 3.2 the initial F -algebra in Kl(M ) is given by η ◦ α, where η is the unit of M ; its inverse in 2 Kl(C) is easily shown to be η ◦ α−1 , or equivalently Jα−1 . Particular eﬀect monads M may satisfy these constraints, or be modiﬁed to do so. However, ideally one would like to generate such monads from a given eﬀect signature. We must equip the Kleisli category with a Cppo-enriched structure with left-strict composition; obtaining appropriate B is usually not a problem. For a given eﬀect signature Δ without equations, deﬁning a Cppo-enriched structure is not problematic. In Setn , we may deﬁne: ˜ on C n , we deﬁne F ˜ to be Deﬁnition 2.14 For a countably-polynomial functor Δ Δ n ˜ + ⊥-coalgebra functor on C where ⊥ ∼ the cofree Δ = 1. FΔ ˜ is indeed a monad (Theorems 2 and 4 of [11]), and is given component-wise by FΔ : (FΔ ˜ X)s = FΔ (Xs ). The objects FΔ ˜ may be given the natural partial order ˜ structure. In Cppo, this structure appears as the initial Δ-algebra [2].

88

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

By contrast, left-strictness is not so straightforward. Intuitively, this requires that if we replace all leaves of an eﬀect-tree with ⊥, the resulting tree is identiﬁed with ⊥. We anticipate a better solution in Cppo, perhaps by introducing divergence as an eﬀect ([5] Example 6); for now, we give a strategy in Set for adjusting FΔ ˜ in Set to ensure left-strictness.

2.2.1 Left-strictness in Set In Set, we may achieve left-strictness by restricting FΔ X as follows: Deﬁnition 2.15 We write GΔ for the functor with GΔ X ⊆ FΔ X consisting of the eﬀect-trees which do not have a non-trivial subtree with all leaves equal to ⊥. On morphisms, GΔ f is the restriction of FΔ f . We write GΔ ˜ for GΔ applied componentwise in Setn . It is straightforward to show that GΔ is a functor on Set. An ordering G on GΔ X is inherited from that of FΔ X: for t, t ∈ GΔ X, t G t iﬀ t is obtained from t by replacing some ⊥ leaves with subtrees. GΔ is easily deﬁned on arrows f : X → Y as the restriction of FΔ f : FΔ X → FΔ Y to GΔ X, which simply relabels each leaf x ∈ X with f (x). (The result is a tree in GΔ Y .) GΔ has a monad structure similar to that of FΔ : the unit ηX maps x ∈ X to the singleton tree with leaf x (which lies in GΔ X); the multiplication μX : G2Δ X → GΔ X ‘plugs in’ the trees at each leaf and then prunes any resulting all-⊥ subtrees. The monad axioms are straightforward to verify, and the deﬁnition of multiplication ensures that ⊥† = μ◦GΔ ⊥ indeed maps every tree to ⊥, so that Kl(GΔ ) is left-strict. We have that GΔ X ⊆ FΔ X is a sub-cppo of FΔ X, as the limit of an ω-chain in GΔ X also lies in GΔ X. This follows from the contrapositive: if the limit tω of an ω-chain (tn ) in FΔ X has a non-trivial all-⊥ subtree (tω ∈ / GΔ X) then some tn must also have non-trivial all-⊥ subtree (tn ∈ / GΔ X). Thus Kl(GΔ ) has a Cppo-enriched structure, obtained pointwise from that given by GΔ . To apply proposition 2.13 for behaviour functor BX = V + A × X, it remains to supply a locally monotonic lifting B, or equivalently a distributive law λX : BGΔ X → GΔ BX. This may be deﬁned as before: λX (inl(v)) = ηX (inl(v)), and λX (inr(a, t)) = t where t is obtained by replacing every non-⊥ leaf x with inr(a, x). Local monotonicity follows from the fact that for an arrow f : X → GΔ Y , Bf = λY ◦ Bf takes as input either a value inl(v) or a pair (a, x) ∈ A × X; it correspondingly returns the singleton tree with leaf inl(v), or the tree obtained from f (x) ∈ GΔ Y by replacing every non-⊥ leaf y with (a, y).

Corollary 2.16 There is a lifting of BX = V + A × X to Kl(GΔ ), and Kl(GΔ ) has a ﬁnal B-coalgebra as given by Proposition 2.13.

n Lastly, this result extends componentwise to GΔ ˜ on Set .

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

3

89

From Eﬀectless to Eﬀectful Abstract Operational Semantics

Given a suitable monad M , we consider the problem of obtaining a suitable operational model, an M B-coalgebra with carrier TΣ+Δ ˜ 0, the set of closed program syntax terms mixed with eﬀects. Theorem 2.6 allows us to do this via structural recursion; it requires an operational speciﬁcation – an abstract operational semantics ˜ and behaviour – for the eﬀectfully extended language with syntax functor Σ + Δ Eﬀ ˜ M , i.e. a natural transformation ρ : (Σ + Δ)(X × M BX) → M BTΣ+Δ ˜ X. Eﬀ This section describes a ﬁrst attempt at systematically obtaining ρ from an abstract operational semantics ρ for the eﬀectless fragment of the language. Note that one does not have to use ρ; one could directly specify ρEﬀ for the language in question, avoiding the need for dependency-functions below, but the operational rules become more intricate; we hope to elaborate on this in future work. Remark 3.1 In addition to Proposition 2.13, we assume M is given componentwise by a monad M0 , and has a (componentwise) strength and a natural transformation ˜ X → M X. In Set this holds for M = G ˜ X, using the Δ-part ˜ of the φX : ΔM Δ ˜ inverse of the Δ + ⊥ + X coalgebra structure of F ˜ X. Δ

˜ To deﬁne ρEﬀ from ρ, we must handle program syntax Σ and eﬀect syntax Δ. 1 One readily obtains a natural transformation (later called ψ ) for the latter, but it 2 : Σ(X × M BX) −→ M BT X to handle existing is less easy to deﬁne suitable ψX Σ syntax. We soon show how an assumption that ρ is ‘dependency-supported’ allows us to deﬁne ψ 2 correctly. Let us attempt to describe the eﬀectful behaviour of if (e) then {p} else {q} when e, p, q exhibit eﬀectful behaviour. For instance, e might have behaviour rdx (true, e , f alse, . . .) – depending on whether x is 0, 1, 2, e terminates as true, evolves to e , terminates as f alse, and so on. We would expect the behaviour of if (e) then {p} else {q} to depend similarly on the value of x – it would respectively evolve to p, if (e ) then {p} else {q}, q, and so on. Note that we never consider the behaviour of p or q, eﬀectful or otherwise; eﬀects present in their behaviour should only be considered once the condition e is evaluated. Now we consider how to formalise this in Set3 . Assume a tuple X = (N, E, P ) with e ∈ E and p, q ∈ P . The possible behaviours of e and p, q are the second and third components of BX = (N + N, E + B, P + 1). Recall that eﬀectless operational rules are commonly represented by a natural transformation Σ(X × BX) → BTΣ X. For if statements, this gives a function ρ : (E × BE) × (P × BP )2 → BTΣ X whose arguments are respectively a pair (e, b) of a boolean variable e and an (eﬀectless) behaviour for it, and similarly for programs p, q ∈ P . Suppose we now have eﬀectful behaviours for e, p, q – elements of M BE and M BP , rather than BE and BP . We aim to deﬁne a function analogous to ρ, but incorporating eﬀectful behaviour: (E × M BE) × (P × M BP )2 −→ M BTΣ X. The informal argument involves ‘pulling out’ the eﬀects present in the behaviour of e,

90

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

which corresponds to two applications of monad strength [12]: (E × M BE) × (P × M BP )2 −→ M ((E × BE) × (P × M BP )2 ) However, now we are stuck. We need something of form (E ×BE)×(P ×BP )2 so that we can apply ρ; however, repeatedly using the monad strength would propagate the behaviours of p and q, which is incorrect as argued above. At this point, the informal argument used the fact that the behaviours of p and q are irrelevant. Thus one might simply discard the eﬀectful behaviours M BP and replace them with arbitrary values in BP : M ((E × BE) × (P × M BP )2 ) → M ((E × BE) × (P × BP )2 ) Now applying M ρ to if (e) then {p} else {q} gives the correct eﬀectful behaviour. However, it is a crude way to exploit the information that ‘the behaviour of if (e) then {p} else {q} does not depend on that of p or q’. We express this dependency more systematically, assuming that ρ has an appropriate factorisation – deducible from the operational rules – discarding irrelevant arguments. Without loss of generality, we assume the ﬁrst n arguments must be kept. This gives a canonical means of deﬁning ψ 2 if the behaviours of every program term depends on that of at most one subterm – i.e. operational rules have at most one premise. However, two issues arise in the presence of multiple premises. Consider a ‘synchronous execution’ operator: p → p , q → q p × q → p × q

√

p → p , q → p × q → p

√

√

p →, q → q p × q → q

√

p →, q → √

p×q →

If p and q both introduce variable updates (e.g. to the same variable), obviously there can be no canonical choice for the eﬀectful behaviour of p × q; one must make a choice whether to apply the variable update of p ﬁrst, or of q. In the presence of equations, if the eﬀects are commutative (e.g. non-determinism), this choice makes no diﬀerence and the extension is canonical; otherwise one is forced to put an ordering on the sub-terms specifying the order of propagation. Without loss of generality, we will suppose this ordering is simply left-to-right, and deﬁne a natural transformation comb to propagate eﬀects in this way using monadic strength. The second issue is that a term may not always depend on the same number of sub-terms. In while, + and ∗ are an example of this; without auxiliary operators +n and ∗n , a standard operational semantics would contain the rules u → u u + v → u + v

√

u → n, v → v u+v →u+v

√

√

u → n, v → m √

u+v →n+m

In applying the ﬁrst rule, we must not propagate any eﬀects of v, unlike the other cases. This would require a more ﬁne-grained approach than the one above; however, introducing auxiliaries can ensure the behaviour of every syntax constructor once

91

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

again always depends on the same sub-terms – here, we achieve this by introducing +n and ∗n as above. For the rest of this section, we will assume that the behaviour of every program term of arity α depends on some number n ≤ α of subterms, whose eﬀects are to be propagated from left-to-right. We make this formal: Deﬁnition 3.2 For a signature Sig, a dependency function dep : Sig → N is a function satisfying 0 ≤ dep(f ) < ar(f ) for every symbol f ∈ Sig. Deﬁnition 3.3 Given a signature Sig with corresponding syntax functor Σ and a dependency function dep, for each Y in C S , the restriction resY,X is a natural transformation in X, whose s-component is given by: (ΣX)s = (X × Y ) resY,X : si f :(s )→s ∈Sig 0≤i
When dep(f ) = 0, we take π0 as the terminal map and Ys0 as the initial object 1. Deﬁnition 3.4 Given a dependency function dep, an abstract operational semantics ρX : Σ(X ×BX) → BTΣ X is dep-supported if there is a natural transformation ρ such that ρX is given by the composition ⎛ resBX,X ΣX −−−−−→ ⎝

f :(si )→s

⎛ ⎝

0≤i
(BX)si ×

0≤i
⎞⎞ Xsi ⎠⎠

ρ

X −−− → BTΣ X

s

Trivially, every language is dep-supported if we set dep(f ) = ar(f ) for all f ; but this will not necessarily give the desired behaviour, as illustrated above. Example 3.5 For while programs, only the behaviour of the ﬁrst argument matters – except for while statements, and constants, which have no arguments. Thus we deﬁne a dependency function by dep(f ) = 1 for all f except the constants (n ∈ N, b ∈ B, and skip), and while (e) do {p} – which have dep(f ) = 0. It is easily shown that the abstract operational semantics for While is then dep-supported. To handle multiple premises, one needs a method comb of combining eﬀect-trees. One may use the monad strength stX,Y , of type X × (M Y ) −→ M (X × Y ). Here, it takes an x and an eﬀect-tree with leaves in Y , and pairs each leaf y with x, giving an eﬀect-tree with leaves in X × Y . Deﬁnition 3.6 For all n < ω, given objects Y0, . . . , Yn−1 weinductively deﬁne an arrow combY0 ,...,Yn−1 : 0≤i≤n−1 (M Yi ) −→ M 0≤i≤n−1 Yi by combY0 = idM Y0 , and combY0 ,...,Yn+1 in terms of combY0 ,...,Yn as follows:

92

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

M Yn × st

−−−−−→ M st

−−−−→ 3.1

id×combY

,...,Yn

−−−−−−−0−−−→

0≤i
M (M Yn × 0≤i
∼ =

−−−−−−−−→ μ

−−−−−−−−→

M Yn × M 0≤i
Abstract Operational Semantics for Eﬀectful Behaviour

Now we deﬁne the ρEﬀ from the beginning of this section, assuming ρ is depsupported. We express it in terms of the following components: 1 ˜ : Δ(X × M BX) → M BTΣ X ψX

2 ψX : Σ(X × M BX) → M BTΣ X

1 , ψ 2 ] where inc ‘includes’ terms given by We may then deﬁne ρEﬀ = M B inc ◦ [ψX X TΣ X into the ‘bigger language’ TΣ+Δ ˜ X. We can do this by giving TΣ+Δ ˜ X the evident Σ-algebra structure; then inc is the initial Σ-algebra map TΣ X → TΣ+Δ ˜ X. 1 by M Bη ◦ φ ˜ Deﬁnition 3.7 We deﬁne ψX X BX ◦ Δπ2 , where π2 is the projection X × Y → Y , φBX is as given by Remark 3.1, and η is the unit of TΣ . 2 is deﬁned below. In the second map, ζ isomorphically replaces Deﬁnition 3.8 ψX (M BX)si with M0 ((BX)si ) (Remark 3.1) if dep(f ) > 0, otherwise it is the unit η1 : 1 → M 1 (recall (−)s0 = 1 by Deﬁnition 3.3). The fourth uses strength of TΔ . The ﬁfth map ‘swaps the M and the coproduct f ∈Sig ’ as follows. For each f we write injf for the injection into the f -component of the coproduct Y → g∈Sig Yg ; we apply M to injf and take the coproduct [M inj]f over all f ∈ Sig.

res(M BX),X

−−−−−−−→ ( f (ζ×id))s

−−−−−−−−→

( f (comb×id))s

−−−−−−−−−−→ ( f st)s

−−−−−−−−−→ ([M injf ]f ∈Sig )s

−−−−−−−−−→ M ρ

−−−−−−→

2 : ψX

Σ(X × M BX)

(M BX) × X s s i i f :(si )→s ∈Sig 0≤i
M BTΣ X

M inc

−−−−−−−→

M BTΣ+Δ ˜X

Lemma 3.9 ψ 1 and ψ 2 are natural transformations. Proof. All the components of the deﬁnition – namely φB X, ηX , resY,X , ρX , incX , combXs1 ,...,Xsn , and stBX, Xsi and injf for any f ∈ Sig – are natural in X. 2 Thus we obtain ρEﬀ from Deﬁnition 3.7. It is natural because it is deﬁned in terms of natural transformations ψ 1 , ψ 2 , and incX : TΣ X → TΣ+Δ ˜ X. Example 3.10 We describe the action of ρEﬀ for stateless while programs extended with global state. It takes as input terms of form σ((x1 , t1 ), (x2 , t2 ), . . .) where σ is

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

93

either a syntax constructor for stateless while (Example 2.2), or eﬀect syntax for global state, viz. rdx or wrx,n . Each xi is a variable (of appropriate sort), and ti a corresponding eﬀectful behaviour – an eﬀect-tree (of reads rdx and updates wrx,n ) whose leaves are variables yi or terminal values vi . The output is an element of M BTΣ+Δ ˜ X – an eﬀect-tree with each leaf either an expression t, or a terminal value v of the same sort as σ. ρEﬀ acts straightforwardly (via ψ 1 ) on eﬀect terms such as wrx,n ((p, tp )), where p is an expression and tp an eﬀectful behaviour. This term is simply mapped to wrx,n (p); similarly, rdx ((x1 , t1 ), (x2 , t2 ), . . .) is mapped to rdx (x1 , x2 , . . .). The eﬀect on syntax terms like +((m, tm ), (n, tn )) follows from the deﬁnition of ψ 2 . The restriction res discards the behaviour tn as it is irrelevant; the strength comb ‘pulls’ the tree tm out, giving an eﬀect tree tm of the same shape as tm , but whose leaves are of form +((m, bm ), n) where bm is a leaf of tm , an eﬀectless behaviour. Finally, the standard operational semantics (Example 2.5) is applied to each leaf of tm . As an illustration, we might have +((n, rdx (n , 5 + 2, 3, . . .)), (m, 5)) → rdx (n + m, +5 (2) + m, +3 (m), . . .) Given ρEﬀ , Theorem 2.6 yields an operational model by structural recursion: Corollary 3.11 Given an eﬀect signature Eﬀ, every dep-supported abstract operational semantics ρ : Σ(X × BX) → BTΣ X gives rise to an eﬀectful operational model, an M B-coalgebra with carrier carrier TΣ+Δ ˜ 0. Example 3.12 We illustrate the operational behaviour of the program p = if (x = 1) then {y:=2} else {skip} where x is shorthand for rdx (0, 1, 2, . . .), and y:=2 shorthand for wry,2 (skip): −→ rdx (if (0 = 1) then {y:=2} else {skip}, if (1 = 1) then {y:=2} else {skip}, if (2 = 1) then {y:=2} else {skip}, . . .)) −→ rdx (if (=0 (1)) then {y:=2} else {skip}, if (=1 (1)) then {y:=2} else {skip}, if (=2 (1)) then {y:=2} else {skip}, . . .)) −→ rdx (∗, wry,2 (skip), ∗, ∗, . . .) −→ rdx (∗, wry,2 (∗), ∗, ∗, . . .)

If B has a lifting to Kl(M ), the operational model can be seen as a B-coalgebra living in the Kleisli category. If in addition the ﬁnal B-coalgebra exists, we may thus deﬁne an operational equivalence for programs in terms of the unique Kleislicoalgebra map [[−]] into the ﬁnal B-coalgebra. For C = Set and BX = V + A × X, Proposition 2.16 implies both these conditions. Deﬁnition 3.13 For C = Set, two programs p, q ∈ TΣ+Δ ˜ 0 are operationally equiv∼ alent, p =op q, if [[p]] = [[q]] where [[−]] is the unique B-coalgebra morphism into the ﬁnal M -coalgebra. Example 3.14 Theorem 2.13 implies the carrier of the ﬁnal B-coalgebra is the initial B-algebra, D. The underlying arrow therefore maps into M D. For While expressions, D is (N × N, N × B, N × 1); hence (at sort s) the elements of (M D)s

94

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

are eﬀect-trees with leaves in (n, v) ∈ Ds . One may show (by uniqueness) that the ﬁnal B-coalgebra arrow ! maps each term to the overall eﬀect-tree it produces during execution, with leaves (n, v) ∈ D describing executions – n is the number of steps before termination, and v the ﬁnal value of that execution. For example, the program p of Example 3.12 is mapped to rdx ((3, ∗), wry,2 (4, ∗), (3, ∗), (3, ∗), . . .).

4

Towards Adequacy for Eﬀectful Operational Semantics

In the previous section, we argued that an operational model for an eﬀectful language ﬁnds its natural place in the Kleisli-category Kl(M ). This section is devoted to the problem of obtaining an adequate eﬀectful denotational semantics. To discuss adequacy, it is convenient to change notation. From now on, we ˜ simply to Σ (i.e. Σ relabel the syntax functor of the extended language, Σ + Δ, now incorporates eﬀectful commands directly). We now write T for the free Σalgebra functor. In the same vein, we forget about the eﬀectless abstract OS ρ : Σ(X × BX) → BT X and relabel the eﬀectful abstract OS ρEﬀ to ρ. The eﬀectful denotational model arises by mapping the operational model into the ﬁnal coalgebra D in the Kleisli category Kl(M ). In the underlying category this gives a map T 0 → M D, so we aim to take M D as our denotational model. 4.1

A Couple of Counterexamples

There is a key diﬀerence between the eﬀectful and eﬀectless settings: for some eﬀectful operational speciﬁcations (abstract OS), the operational semantics is not compositional, and adequacy fails. Example 4.1 Consider an interleaving operator | or a ‘partial’ (one-step) evaluator :>, deﬁned by the following rules (at any type): √

√

x → x

x→v

x → x

x→v

x | y → y | x

x|y→y

x :> y → y

x :> y → y

These rules are single-premise, so they may be eﬀectfully extended as in the previous section. Operationally, the eﬀectful extension x | y branches according to the eﬀects given by the ﬁrst step of behaviour of x, then by that of y; then more eﬀects are introduced by the successors of x, then by the successors of y, and so on. x :> y exhibits eﬀects from the ﬁrst step of x’s execution only. Now consider the following two programs, p1 = wry,1 (skip; skip) and p2 = skip; wry,1 (skip). They are identiﬁed by the map into the ﬁnal Kleisli coalgebra (as they both assign y = 1 and terminate in 3 steps); thus are operationally equivalent. Yet if we put them in the contexts [−] | q or [−] :> q, where q = rdy (wrz,0 (skip), wrz,42 (skip), wrz,42 (skip), . . .)

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

95

the results are not operationally equivalent; if y is initially 0, then p1 | q will assign z = 42, and p2 | q will assign z = 0. p1 :> q and p2 :> q are similarly diﬀerent. Thus, the operational semantics is not compositional with respect to | or :>. Compositionality fails because our denotational semantics intentionally discards information about precisely when eﬀects occur during program execution. If the operational semantics of a syntax operator like | depends on this information, our denotational semantics clearly cannot be adequate. Thus we must restrict ρ to forbid too-ﬁne-grained operational interaction between subterms. One option is a form of evaluation-in-context, where a subterm xi is evaluated until all its execution branches have terminated, before evaluating any other terms on those branches. We formalise this notion below. 4.2

Extending The Original Semantics

We review the adequacy proof of Turi and Plotkin in our eﬀectful setting, in terms of Σ-algebras and M B-coalgebras; our denotational and operational semantics arise as a quotient of theirs. We write D, s for the ﬁnal M B-coalgebra – the denotational model in Turi and Plotkin’s original setting. We say D is a layered semantics, in that it describes all possible layerings of eﬀects M interleaved with eﬀectless transitions B. Recall that D, α is the initial B-algebra. As before, we assume Proposition 2.13 holds for F = B, so that D, Jα−1 : D → M BD is the ﬁnal B-coalgebra. We equip the new denotational model M D with an M B-coalgebra structure, s = M Bη M ◦ Jα−1 . Like the ﬁnal M B-coalgebra, this formally describes sequences of eﬀect-layers M and eﬀectless transitions B, except that every eﬀect layer after the ﬁrst is trivial (non-branching), as they arise from the unit η M . In a sense, the eﬀects are ‘collected’ into the ﬁrst step. By Proposition 2.6, ρ induces M B-coalgebra structures T D, T˜s and T M D, T˜s for syntax terms over the ‘layered’ and ‘collected’ denotational models respectively. By taking the syntax functor to be Σ, the behaviour functor to be M B, and the abstract OS to be ρ, we obtain the following diagram (cf. [24], p.84). ΣT 0 ψT 0

T0 T˜?

M BT 0

Σden

den op M Bop /

/ ΣD β

/D

s

M BD

where we write ψT 0 for the Σ-algebra structure of free-syntax ters T 0. Proposition 2.6 induces (via ρ) the coalgebra structure T˜? on closed terms T 0, where ? : 0 → M BT 0 . The Σ-algebra structure β on the denotational model M D is obtained from the ﬁnal M B-coalgebra morphism βT D from T D, T˜s into D, as follows: Ση T

ψ

β

D TD T D −→ D. β : ΣD −→ ΣT D −→

96

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

The initiality of T 0 induces the Σ-algebra morphism den into D, β, giving a (necessarily) compositional map into the denotational model. op is the M Bcoalgebra morphism from T 0 into D induced by ﬁnality of the latter, giving an operational characterisation of program behaviour. Turi and Plotkin’s work implies that the operational map op is a Σ-algebra morphism, so that the initiality of T 0 implies that den = op (as there is only one such map), so that the operational and denotational maps coincide. In particular, it implies compositionality of the operational equivalence ∼ =op induced by op, as deﬁned in 3.13 – i.e. ∼ =op is a congruence with respect to syntax given by Σ. It also implies that the denotational semantics is adequate with respect to ∼ =op . 4.3

An Adaptation to the Kleisli Setting

Our strategy is to map from the layered denotational model D into M D. As the former is an M B-coalgebra – and thus a B-coalgebra – we obtain a unique Bcoalgebra morphism c : D → D in Kl(M ) which eﬀectively collects all the eﬀects from every step of the behaviours in D into the ﬁrst step, giving a single layer of eﬀects and allowing the former diagram to be extended: Σop

ΣT 0 ψ0

op

M Bop

T0 T˜?

M BT 0

/ ΣD

Σc

β

(∗)

/D

/ ΣM D

c

s

/D

(Bc)†/

(2)

β

/ MD

M α−1

M BD

The Σ-algebra structure β is induced in a similar manner to β, via the ﬁnal B-coalgebra morphism β T M D from T M D into D: β = β T M D ◦ ψM D ◦ Ση T . This gives an algebraic structure to the denotational model, and thus by initiality of T 0, induces a denotational map – a Σ-algebra morphism – from T 0 into M D. The operational map in our semantics – given by the ﬁnal B-coalgebra morphism from the operational model T 0 into D (in Kl(M )) – factors through the original one, op, as follows. The bottom-right square commutes by deﬁnition of c as a Bcoalgebra morphism. By deﬁnition op is an M B-coalgebra morphism, hence Jop is also a B-coalgebra morphism; thus the composition c ◦ Jop, equal to c ◦ op in the underlying category, is a B-coalgebra morphism. By ﬁnality of D as a B-coalgebra, c ◦ op is necessarily equal to the ﬁnal B-coalgebra morphism from T 0 into M D. Now we aim to show the top-right square (*) commutes – i.e. that c is a Σalgebra morphism. This would imply the denotational maps and operational maps coincide, giving adequacy and compositionality as for the original semantics. This is because op is known to be a Σ-algebra morphism, so the operational map c ◦ op would also be a Σ-algebra morphism, so must coincide with the denotational map. The Σ’s in condition (*) may be replaced with T ’s as follows, giving the square (+) below. (The top line is β, the bottom β.)

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

ΣD

Ση T

/ ΣT D

Σc

Ση T

ψD

/ TD

ΣT c

ψM D

Tc

βT D

/D

(+) βT M D

97

c

/ ΣT M D / TMD / MD ΣM D The ﬁrst square is the image under Σ of the naturality of η. The second commutes because ψX , the Σ-algebra structure of T X – the free Σ-algebra over X – is a natural transformation. This is an easy consequence of the deﬁnition of T by adjunction, T = U F . One may interpret the condition (+) as an abstract constraint on the abstract OS ρ as follows. T D describes terms over the ‘old’ denotational model, whose arguments may introduce eﬀects at each transition step. The upper path assigns an overall eﬀect-tree to these terms, based on their behaviour given by ρ. The lower path (via T c) collects the eﬀects of each argument into its ﬁrst execution step, giving an element of T M D, and then similarly assigns an overall eﬀect-tree. Thus, (+) implies that the stage at which eﬀects occur (in the arguments of terms T D) is irrelevant, as they might as well all be in the ﬁrst step.

4.4

A Condition on Cones

One may characterise the condition (+) somewhat more concretely, in terms of cones in the Kleisli category. For simplicity we assume the initial sequence of B in C converges after ω steps (as it does for BX = V + A × X). Note that left-strictness implies M 0 = 1, so 0 is the ﬁnal object in Kl(M ) ([3, Lemma 2.3.5]). Deﬁnition 4.2 The cone generated by a B-coalgebra X, γ (over the ﬁnal sequence n up to ω) consists of the arrows (γn : X → B 0)n<ω obtainable by composition in the following diagram: X

γ

!X

0o

!B0

/ BX

Bγ

B!X

B0 o

B!B0

/ B2X

/···

2

B !X

2

2

B γ

B 0o

2

B !B0

··· n

The limit-colimit coincidence described in [4] implies that (sn : D → B 0)n<ω is a limiting cone [3]. It is straightforward to show that the B-coalgebra morphism from any X, γ into D must coincide with the mediating morphism between their generated cones. Revisiting the condition (+), the path c ◦ βT D is a composition of B-coalgebra morphisms, so it is also one, and must coincide with the mediating morphism from ((T˜s)n ) to (sn ). In the other path, βT M D coincides with the mediating morphism from ((T˜s)n ) to (sn ), so precomposing with T c gives another cone over the ﬁnal sequence ((T c ◦ T˜s)n ), with mediating morphism βT M D ◦ T c. Our strategy is to show this cone is the same as the cone generated by T D, T˜s – i.e., that (T˜s)n = (T c ◦ T˜s)n . This would imply the mediating morphisms are

98

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

the same, giving equality of the upper and lower paths. The situation is depicted n below; we must show the two paths T D → B 0 coincide for all n. T˜s

T D!T D Tc

T˜s

/ BT M D B!T M D

~

2

/···

B !T D

B T˜s

/ B2T M D 2

B !T M D

|

2 B T˜s

/ B2T D

B!T D

TMD !T M D

B T˜s

/ BT D

2

2 B T˜s

/···

~

0 B0 B 0 ··· We will focus on the case where ρ is in evaluation-in-context format, a perspective which is often helpful for specifying eﬀectful operational semantics – (cf. [8,14]): Deﬁnition 4.3 An abstract OS ρ is in evaluation-in-context format if it arises as an extension of eﬀectless operational rules corresponding to the following templates: √

x1 → x1

x1 → v

σ(x1 , . . . , xn ) → σ(x1 , . . . , xn )

σ(x1 , . . . , xn ) → t

σ(x1 , . . . , xn ) → t

or

√

or

x1 → v σ(x1 , . . . , xn ) → u

σ(x1 , . . . , xn ) → v

where t is an arbitrary term over v and the xi . Thus a term’s behaviour depends on at most one subterm – without loss of generality, the ﬁrst. It is executed in its place until termination, at which point the term evolves to another term depending on the ﬁnal value. Theorem 4.4 (C = Set) For BX = V + X, M X = GΔ ˜ X, and ρ in evaluationin-context format, we have (T˜s)n = (T c ◦ T˜s)n . Thus condition (+) holds, and the denotational and operational semantics induced by the initial and ﬁnal horizontal (co)algebra morphisms in (2) coincide. n

Proof. First, note that the horizontal arrows T X → B T X for X = D, M D have (underlying) codomain M B n T X, i.e. eﬀect-trees over n-step behaviour traces; their leaves may be terminal traces – ending with a value in V – or not, ending with a n n term in T X. When the horizontal arrows are composed with vertical B X → B 0, the non-terminal leaves are replaced with ⊥. (This may be proven by induction on n, using the fact that the distributive law for GΔ ˜ and B maps non-terminal inr(M 0) ∈ BM 0 into ⊥.) n Thus, for any n, to show the two paths T D → B 0 agree, it is enough to show n that when applied to any term t(d1 , . . .) ∈ T D, the horizontal paths to B T D and n B T M D produce eﬀect-trees which share the same terminal leaves (and agree in shape before those leaves). This is because all non-terminal leaves will be mapped to ⊥ by the ensuing vertical arrows (and any resulting all-⊥ subtrees removed by left-strictness as given by the deﬁnition of GΔ ˜ ).

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

99

We prove this by induction on n. The n = 0 case is immediate as both vertical arrows map into the ﬁnal object 0 in Kl(M ); now assume the inductive hypothesis for n − 1. The action of both horizontal paths consist of iterating B-coalgebra maps induced by the derivation rules of ρ, applied to syntax terms over D or M D. By assumption, ρ is obtained from eﬀectless rules, which are single-premise and the premise must be the ﬁrst argument. When applying ρ to a syntax term σ(x1 , . . .), there are several cases to consider. If σ is an eﬀectless command, and the behaviour of x1 in M BX is relevant (dep(σ) = 1), the eﬀectless rules are applied to each leaf of x1 ’s behaviour (in BX); the rule conclusions become the leaves of the behaviour of σ(x1 , . . .). As the rule conclusions are eﬀectless, so are any further deductions. If there is no dependency on x1 (dep(σ) = 0), the behaviour of σ(x1 , . . .) is given directly by the rule conclusion. If σ is an eﬀect, a layer of branching will be introduced, and σ(x1 , . . .) will evolve to some xi on each branch. As a result, we may class any eﬀectless derivation – as occurs at the leaves of the behaviour of a general term t(x1 , . . .) – into the following four types. (The conventions for will become clearer later.) x1 → x1 .. . t(x1 , . . .) → t(x1 , . . .)

σ(x1 , . . .) → t0 (y1 , . . .) .. . t(x1 , . . .) → t (y1 , . . .) or u x1 → v .. . t(x1 , . . .) → t (y1 , . . .)

x1 → v .. . t(x1 , . . .) → u

By assumption on ρ, the top-left is the only form of derivation possible from a premise x1 → x1 ; all other arguments of t are ﬁxed throughout. If the premise is x1 → v for some v, the rule conclusions may similarly indicate termination (bottomleft) or give rise to a new term with a maybe-new ﬁrst argument, t (y1 , . . .) (bottomright). Similarly, eﬀect syntax and premiseless terms give rise to terms t (y1 , . . .), but the former extends the eﬀect-tree. Such deductions deﬁne the coalgebra structure-maps of T D and T M D; we consider how these maps are iterated in terms of these deductions. Take a term t(d1 , . . .) n in T D, and consider the action of the horizontal map T D → B T D. We may represent d1 schematically as follows: d1

m1

d1 v

m2

d1 v

m3

d 1

···

v

Here, m1 represents the eﬀectful behaviour of d1 ∈ D. Its leaves, elements of BD, are either elements of D – collectively represented by the label d1 – or terminal values represented by v. Likewise, m2 represents the eﬀectful behaviour of each leaf d1 , and so on. (Each d1 has its own eﬀect-tree m2 , but to keep diagrams simple, we

100

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

do not attempt to represent this information.) We ﬁrst focus on the case where the behaviour of the term t(d1 , . . .) depends on its ﬁrst argument. Its eﬀectful behaviour resembles that of d1 – ‘the same branches’ – but with each leaf l derived via the operational rules, with premises given by the corresponding leaf r of d1 . Depending on whether they are terminal values v or other elements of D, l will take one of the forms listed above (with x, y relabelled d, e). Thus we may partially represent the behaviour of t(d1 , . . .) in terms of that of d1 as follows, omitting continuations which depend on other arguments e , e , etc: t(d1 , . . .)

t(d1 , . . .) m1

t(d1 , . . .) m2

t (e1 , . . .)

t(d 1 , . . .) m3

t (e1 , . . .)

u

u

···

t (e 1 , . . .) u

(3) Note that the terms t and denotations e depend only on t and the corresponding leaf v of d1 , and similarly for t , e and so on. Now let us consider the other n Tc ‘horizontal’ path T D → T M D → B T M D. We write f1 for the result in M D of applying c to (d1 ), and similarly g1 = c(e1 ), etc. It may be represented thus: etc. m3 m2 f1

(2, v ) (1, v )

m1

v

where mi represent the eﬀect-trees occurring in the behaviours of the ith successors of d1 ; they have been combined into a single layer of eﬀects. We write (m, v) for the element of the initial algebra D which terminates in m steps with value v. Now let us consider the same term t(f1 , . . .) as before, but with arguments di replaced by their images under c. As before, the behaviour of t(f1 , . . .) is obtained by applying the eﬀectless rules to each leaf of f1 , retaining the conclusions in an eﬀect-tree of the same shape. Writing (m) for m primes··· , ﬁrst note that for every m, and every leaf v (m) of (m) the mth successor of d1 of d1 , there is a corresponding leaf (m − 1, v) of f1 (where we identify (0, v) with v). Now for any series of eﬀectful transitions t(d1 , . . .) → (m−1) (m) . . . → t(d1 , . . .) → u(m) or t(m) (e1 , . . .), the premises of their derivations √

→ v (m) (for some successor d1 must respectively be d1 → d1 , d1 → d1 , . . ., d1 (m) of d1 , and so on). In addition, by naturality of ρ the arguments ei of t(m) must leaf come from those of t(d1 , . . .). By the previous remark, there is a corresponding √ (m − 1, v (m) ) of f1 with the same transitions: (m − 1, v (m) ) → . . . → v (m) . This allows us to derive the following transitions: (m−1)

t((m − 1, u(m) )) → . . . → t(1, u(m) ) →

(m)

u(m) or t(m) (g1 , . . .)

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

101

This implies the behaviour of t(f1 , . . .) is as follows, where the u(m) match the corresponding leaves of (3), t(m) match the corresponding syntax-terms, and f (m) , g (m) are the image under c of d(m) , e(m) . etc. m3

t(f1 , . . .) → t(f1 , . . .) → t (g1 , . . .) t(f1 , . . .) → t(f1 , . . .) → u

m2 f1

m1

t(f1 , . . .) → t (g1 , . . .) t(f1 , . . .) → u t (g1 , . . .) u

Now we show the required induction step. First, consider the case where the behaviour of t(d1 , . . .) depends on d1 . The diagram (3) shows that after iterating the behaviour of t(d1 , . . .) for n steps, the resulting terminal leaves can come from two sources – either from the terminal leaves v (m) of d1 (for m < n), resulting in terminal leaves u(m) in the behaviour of t(d1 , . . .); or from the unshown execution (m) of successor terms, t(m) (e1 , . . .). The above diagram shows that for every terminal leaf arising from the former case, there is a matching leaf in the behaviour of t(f1 , . . .), and vice versa. For (m) the latter case, it also shows that the successor terms t(m) (e1 , . . .) reachable in (m) m steps by t(d1 , . . .) correspond with terms t(m) (g1 , . . .) reachable by t(f1 , . . .) in m steps. Any terminal leaves arising in n steps via these successors must arise in less than n steps when evaluating them directly; so the inductive hypothesis implies both horizontal paths agree at those leaves too. Finally, if the behaviour of term t(d1 , . . .) does not depend on d1 , the derivation of its behaviour t(d1 , . . .) → t (e1 , . . .) may be repeated exactly for the term obtained by applying T c: t(f1 , . . .) → t (g1 , . . .), introducing the same eﬀects. The inductive hypothesis applied to these successor terms tells us that for t(d1 , . . .) and t(f1 , . . .), the terminal leaves produced by n steps of behaviour agree. 2 Example 4.5 By inspection of the operational rules for stateless while, we see the abstract OS for the full while language are in evaluation-in-context format; Theorem 4.4 implies compositionality of the eﬀectful operational semantics illustrated by Examples 3.10, 3.12 and 3.14. Adequacy follows from the fact that two programs are operationally equivalence p ∼ =op q iﬀ they are identiﬁed by the ﬁnal coalgebra map, which coincides with the denotational map; hence denotational equivalence implies operational equivalence. However, the induced denotational semantics is more ﬁne-grained than the traditional semantics for While, in two respects. Firstly, we distinguish expressions like 0 and 0 + 0 + 0 because they respectively terminate in 0 and 2 steps, even though

102

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

the ﬁnal value is the same; this is usually not the desired semantics. Secondly, in the absence of equations relating eﬀect-trees, denotations such as wrx,2 (1, ∗) and wrx,2 (wrx,2 (1, ∗)) are inappropriately distinguished. However, the semantics does abstract away from the timing of eﬀects provided they stay in the right order, as programs p1 and p2 of Example 4.1 demonstrate.

5

Discussion and Future Work

In summary, we started with premise-supported operational speciﬁcations for multisorted, eﬀectless languages, and demonstrated how to extend them with purely syntactic eﬀects. This gives an operational model as a coalgebra in a Kleisli category, if the behaviour functor has a lifting. Under certain conditions, the ﬁnal Kleislicoalgebra exists, and we deﬁne an operational semantics by the unique Kleislicoalgebra morphism into the ﬁnal Kleisli-coalgebra. We may take this coalgebra to be the denotational model, by giving it an algebra structure. By mapping from the ﬁnal coalgebra in the underlying category into the ﬁnal Kleisli-coalgebra, we showed that our semantic maps are as a quotient of those in Turi and Plotkin’s framework, and that our denotational semantics is adequate when the eﬀectful operational semantics is in the evaluation-in-context format. However there are several key omissions that we would like to address. Firstly, we have not considered equations on eﬀects. Equations for ﬁnite eﬀect-trees TΔ ˜X amounts to quotienting the algebra; in a Lawvere theory, they correspond to sketches [1]. A less syntax-driven approach would be to represent eﬀect-trees via the free model of the Lawvere theory in the base category. This would mean discarding the ‘ﬁnal Δ-coalgebra’ approach of Deﬁnition 2.14 and taking M as the free Δalgebra functor in Cppo; with equations, this corresponds to a free model functor on Cppo. This approach may also clarify how one might guarantee left-strictness of the resulting Kleisli category in a more canonical way. Secondly, a more conceptual, abstract adequacy proof would allow easier generalisation. It may help to have a more semantic characterisation of evaluationin-context, possibly in terms of the operational speciﬁcations ρ rather than on the induced operational models T D, T M D. Also, it is not yet clear how useful dependency-functions will be for obtaining eﬀectful operational semantics ρEﬀ from eﬀectless ones. They work well for single-premise languages like while, and the multi-premise situation seems more straightforward when the eﬀects are commutative. Otherwise, it may be easier to specify them directly; more examples will clarify this situation. Lastly, our denotational semantics is sensitive to the number of steps-totermination, which may be too ﬁne-grained for some applications. However, in a Kleisli category it may be possible to to discard this information. Another key question we did not have space to discuss is the relationship of the above semantics with comodels [16], which implement eﬀects and are suitable for e.g. global state and interactive I/O [20]. However, they do not account for non-determinism, unless we exclude equations. By contrast, the conventional form

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

103

of operational semantics for nondeterminism is obtained by imposing equations on the eﬀect-trees in the operational model we obtain in this paper. For each comodel with carrier C there is a notion of ‘comodel-induced operational model’, this time a coalgebra in the Kleisli category of the side-eﬀect monad with bottom, SC X = (⊥ + C × X)C . Analogously to Kl(GΔ ˜ ), one may give a Cppo-enrichment – so that there is a ﬁnal Kleisli-coalgebra which we might take as a denotational model, and the target of an operational equivalence. Given the eﬀectful operational model – a GΔ ˜ B-coalgebra – one may obtain a comodel-induced operational model, an SC B-coalgebra, via a natural transformation : GΔ ˜ → SC allowing comodels to ‘traverse’ trees and produce a result x in X, as well as a new comodel state. Example 5.1 The comodels for global state are transition systems with implementations of variable lookup and update; the cofree comodel is the canonical example, the standard implementation S of global store with carrier NL . Given a comodel, applying to the operational model for While in Example 3.12, we obtain an arrow which, transposed, is of form T 0 × C → BT 0 × C. This closely corresponds to the √ standard transition-form of While programs: p, c → p , c or p, c → c where each c in C is a state of the comodel – the ‘store’. Consider p = if (x = 1) then {y:=2} else {skip} as in example 3.12, with the same shorthand notation. Suppose the store consists of two variables and is initially c = [x : 1, y : 0]: → → → →

[x : 1, y [x : 1, y [x : 1, y [x : 1, y [x : 1, y

: 0], : 0], : 0], : 0], : 2],

if (x = 1) then {y:=2} else {skip} if (1 = 1) then {y:=2} else {skip} if (=1 (1)) then {y:=2} else {skip} y:=2 ∗

The map ! into the ﬁnal SC -coalgebra is of type T 0 → (⊥ + C × D)C , where D = (N × N, N × B, N × 1) as explained after Example 3.12. By uniqueness, one may check that it gives for each program p and initial comodel state c, either (a.) a tuple (c , n, v) giving the ﬁnal comodel state, steps to termination, and output value; or (b.) ⊥ if p diverges in state c. Indeed, one ﬁnds !(p)([x : 1, y : 0]) = ([x : 1, y : 2], 4) in agreement with the above behaviour. It may be that adequacy of the eﬀectful semantics in this paper implies adequacy of the comodel-based semantics; if the combination of eﬀect-trees comb is by monad strength, it may be possible to exploit the fact that ρEﬀ is deﬁned ‘in the same way for each monad’ in terms of monad morphisms.

References [1] M. Barr and C. Wells. Toposes, triples, and theories. Reprints in Theory and Applications of Categories no.1, pages 1–289, 2005.

104

F. Abou-Saleh, D. Pattinson / Electronic Notes in Theoretical Computer Science 276 (2011) 81–104

[2] J. A. Goguen, J. W. Thatcher, E. G. Wagner, and J. B. Wright. Initial algebra semantics and continuous algebras. J. ACM, 24:68–95, January 1977. [3] I. Hasuo. Tracing Anonymity with Coalgebras. PhD thesis, Radboud University, Nijmegen, March 2008. [4] I. Hasuo, B. Jacobs, and A. Sokolova. Generic trace semantics via coinduction. Logical Methods in Computer Science, 3(4), 2007. [5] M. Hyland, G. Plotkin, and J. Power. Combining eﬀects: sum and tensor. Theor. Comput. Sci., 357:70–99, 2006. [6] M. Hyland and J. Power. Discrete lawvere theories and computational eﬀects. Theor. Comput. Sci., 366(1-2):144–162, 2006. [7] B. Jacobs. Trace semantics for coalgebras. Electr. Notes Theor. Comput. Sci., 106:167–184, 2004. [8] P. Johann, A. Simpson, and J. Voigtl¨ ander. A generic operational metatheory for algebraic eﬀects. In Proc. LICS 2010, pages 209–218. IEEE Computer Society, 2010. [9] B. Klin. Bialgebraic methods and modal logic in structural operational semantics. Inf. Comput., 207(2):237–257, 2009. [10] B. Klin and V. Sassone. Structural operational semantics for stochastic process calculi. In R. M. Amadio, editor, Proc. FoSSaCS 2008, volume 4962 of Lecture Notes in Computer Science, pages 428– 442. Springer, 2008. [11] R. Matthes and T. Uustalu. Substitution in non-wellfounded syntax with variable binding. Theor. Comput. Sci., 327(1-2):155–174, 2004. [12] E. Moggi. Notions of computation and monads. Inf. Comput., 93(1):55–92, 1991. [13] H. R. Nielson and F. Nielson. Semantics with Applications: A Formal Introduction. Wiley Professional Computing. Wiley, 1992. [14] G. D. Plotkin and J. Power. Adequacy for algebraic eﬀects. In FoSSaCS ’01: Proceedings of the 4th International Conference on Foundations of Software Science and Computation Structures, pages 1–24, London, UK, 2001. Springer-Verlag. [15] G. D. Plotkin and J. Power. Notions of computation determine monads. In FoSSaCS ’02: Proceedings of the 5th International Conference on Foundations of Software Science and Computation Structures, London, UK, 2002. Springer-Verlag. [16] G. D. Plotkin and J. Power. Tensors of comodels and models for operational semantics. Electr. Notes Theor. Comput. Sci., 218:295–311, 2008. [17] M. B. Smyth and G. D. Plotkin. The Category-Theoretic Solution of Recursive Domain Equations In SIAM J. Comput., 11(4):761–83, 1982. [18] J. Power. Enriched lawvere theories. Theory and Applications of Categories, 6(7):83–93, 1999. [19] J. Power. Countable lawvere theories and computational eﬀects. Electr. Notes Theor. Comput. Sci., 161:59–71, 2006. [20] J. Power and O. Shkaravska. From comodels to coalgebras: State and arrays. Electron. Notes Theor. Comput. Sci., 106, 2004. [21] J. Power and D. Turi. A coalgebraic foundation for linear time semantics. In In Category Theory and Computer Science. Elsevier, 1999. [22] J. J. M. M. Rutten and D. Turi. Initial algebra and ﬁnal coalgebra semantics for concurrency. In J. W. de Bakker, W. P. de Roever, and G. Rozenberg, editors, Proc. REX School/Symposium 1993, volume 803 of Lecture Notes in Computer Science, pages 530–582. Springer, 1994. [23] D. Turi. Functorial Operational Semantics and its Denotational Dual. PhD thesis, Free University, Amsterdam, June 1996. [24] D. Turi and G. D. Plotkin. Towards a mathematical operational semantics. In LICS, pages 280–291, 1997.

Towards Effects in Mathematical Operational Semantics

Towards Effects in Mathematical Operational Semantics

Recommend Documents