Extension of M-Convexity and L-Convexity to Polyhedral Convex Functions

Advances in Applied Mathematics 25, 352–427 (2000) doi:10.1006/aama.2000.0702, available online at http://www.idealibrary.com on Extension of M-Conve...

Download PDF

454KB Sizes 7 Downloads 45 Views

Report

PDF Reader
Full Text

Advances in Applied Mathematics 25, 352–427 (2000) doi:10.1006/aama.2000.0702, available online at http://www.idealibrary.com on

Extension of M-Convexity and L-Convexity to Polyhedral Convex Functions1 Kazuo Murota Research Institute for Mathematical Sciences, Kyoto University, Kyoto 606-8502, Japan E-mail: [email protected]

and Akiyoshi Shioura Department of Mechanical Engineering, Sophia University, 7-1 Kioi-cho, Chiyoda-ku, Tokyo 102-8554, Japan E-mail: [email protected] Received March 30, 1999; accepted May 24, 2000

The concepts of M-convex and L-convex functions were proposed by Murota in 1996 as two mutually conjugate classes of discrete functions over integer lattice points. M/L-convex functions are deeply connected with the well-solvability in nonlinear combinatorial optimization with integer variables. In this paper, we extend the concept of M-convexity and L-convexity to polyhedral convex functions, aiming at clarifying the well-behaved structure in well-solved nonlinear combinatorial optimization problems in real variables. The extended M/L-convexity often appears in nonlinear combinatorial optimization problems with piecewise-linear convex cost. We investigate the structure of polyhedral M-convex and L-convex functions from the dual viewpoint of analysis and combinatorics and provide some properties and characterizations. It is also shown that polyhedral M/L-convex functions have nice conjugacy relationships. © 2000 Academic Press Key Words: combinatorial optimization; matroid; base polyhedron; convex analysis; polyhedral convex function.

1 This work is supported by Grant-in-Aid of the Ministry of Education, Science, Sports and Culture of Japan.

352 0196-8858/00 $35.00 Copyright © 2000 by Academic Press All rights of reproduction in any form reserved.

polyhedral m/l-convex functions

353

1. INTRODUCTION In the area of combinatorial optimization, there exist many “well-solved” problems, i.e., the problems which have nice combinatorial structure and which can be solved efﬁciently (see [5, 23, 39]). Many researchers have been trying to identify the well-behaved structure in combinatorial optimization problems. The concept of matroid, introduced by Whitney [47], plays an important role in the ﬁeld of combinatorial optimization (see [46, 48–50]). Matroidal structure is closely related to the well-solvability of combinatorial optimization problems such as those on graphs and matroids and can be found in a fairly large number of efﬁciently solvable problems. Matroidal structure yields the tractability of problems in the following way: • Global optimality is equivalent to local optimality, which implies the success of the so-called greedy algorithm for the problem of optimizing a linear function over a single matroid. • A nice duality theorem, Edmonds’ intersection theorem [9], guarantees the existence of a certiﬁcate for the optimality in the matroid intersection problem in terms of dual variables. In 1970, Edmonds introduced the concept of polymatroid by extending that of matroid to sets of real vectors ([9], see also [46]). A polymatroid V P ⊆ R+ is a polyhedron given as V P = x ∈ R+ xw ≤ ρX ∀X ⊆ V w∈V

by a submodular set function ρ 2V → R with certain additional conditions, where R+ denotes the set of nonnegative reals, and ρ is called submodular if ρX + ρY ≥ ρX ∩ Y + ρX ∪ Y

∀X Y ⊆ V

Polymatroids share nice combinatorial properties of matroids: for example, the greedy algorithm for matroids still works for polymatroids, and a duality holds for the polymatroid intersection problem. Fujishige, emphasizing the essential role of submodularity of ρ, generalized the concept of polymatroid to that of a submodular system [16, 17]. In recent years, nonlinear combinatorial optimization problems have been investigated more often due to theoretical interest and necessity in practical application. The nonlinear resource allocation problem (see [19, 22]) and the convex cost submodular ﬂow problem (see [17, 21]) are examples of nonlinear combinatorial optimization problems. Both problems have nice combinatorial structures, which lead to efﬁcient combinatorial algorithms. These results, however, do not completely ﬁt in the framework of matroid, polymatroid, and submodular systems.

354

murota and shioura

The concepts of M-convex and L-convex functions, introduced by Murota [29, 30, 32], afford a nice framework for well-solved nonlinear combinatorial optimization problems. An M-convex function is a natural extension of the concept of valuated matroid introduced by Dress and Wenzel [7, 8] (see also [26–28, 36]) as well as a quantitative generalization of the set of integral points in an integral base polyhedron [17]. An L-convex function is an extension of the submodular set function. Let V be a ﬁnite set. A function f ZV → R ∪ +∞ is called M-convex if domZ f = and it satisﬁes (M-EXC[Z]): (M-EXC[Z]) ∀x y ∈ domZ f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that f x + f y ≥ f x − χu + χv + f y + χu − χv where domZ f = x ∈ ZV −∞ < f x < +∞ supp+ x − y = w ∈ V xw > yw supp− x − y = w ∈ V xw < yw and χw ∈ 0 1V is the characteristic vector of w ∈ V . A function g ZV → R ∪ +∞ is called L-convex2 if domZ g = and it satisﬁes (LF1[Z]) and (LF2[Z]):

λ ∈ Z,

(LF1[Z])

gp + gq ≥ gp ∧ q + gp ∨ q

∀p q ∈ domZ g,

(LF2[Z])

∃r ∈ R such that gp + λ1 = gp + λr ∀p ∈ domZ g

where p ∧ q, p ∨ q ∈ R V denote the vectors with p ∧ qv = minpv qv, p ∨ qv = maxpv qv v ∈ V , and 1 ∈ R V is the vector with each component being equal to one. M/L-convex functions have nice properties: • Local optimality is equivalent to global optimality. • M/L-convex functions can be extended to ordinary convex functions. • M/L-convex functions are conjugate to each other. • A (discrete) separation theorem and a Fenchel-type duality theorem hold for a pair of M-convex/M-concave (L-convex/L-concave) functions. 2 In the original deﬁnition [32], an L-convex function is assumed to be integer-valued. See Remark 6.1.

polyhedral m/l-convex functions

355

The minimization of M/L-convex functions can be done in polynomial time [11, 43]. Application of M-convex functions can be found in system analysis through polynomial matrices [25, 31, 33, 36], and in mathematical economics [6]. M-convexity and L-convexity appear in various nonlinear combinatorial optimization problems with integer variables. Such nice combinatorial properties, however, are enjoyed not only by combinatorial optimization problems in integer variables but also by those in real variables. We dwell on this point by considering the minimum cost ﬂow/tension problems. Let G = V A be a directed graph with a speciﬁed vertex subset T ⊆ V . Suppose we are given a family of piecewise-linear convex functions fa R → R ∪ +∞ a ∈ A, each of which represents the cost of ﬂow on the arc a. A function ξ A → R is called a ﬂow. The boundary ∂ξ V → R of a ﬂow ξ is given by ∂ξv = ξa a∈ A leaves v − ξa a∈ A enters v v ∈ V (1) Then, the cost function f R T → R ∪±∞ of the minimum cost ﬂow that realizes a supply/demand vector x ∈ R T is deﬁned by   ξ ∈ RA   f x = inf x ∈ R T (2) fa ξa ∂ξw = −xw w ∈ T a∈A  ∂ξw = 0 w ∈ V \ T Suppose we are given another family of piecewise-linear convex functions ga R → R ∪ +∞ a ∈ A, each of which represents the cost of tension on the arc a. Any function p V → R is called a potential. Given a potential p, its coboundary δp A → R is deﬁned by δpa = pu − pv

a = u v ∈ A

(3)

Then, the cost function g R T → R ∪±∞ of the minimum cost tension that realizes a potential vector p ∈ R T is written as p ∈ RV gp = inf p ∈ R T (4) ga −δpa pw = p w w ∈ T a∈A It is well known that the minimum cost ﬂow/tension problems with piecewise-linear convex cost can be solved efﬁciently by various combinatorial algorithms (see [1, 42]). It can be shown that both f and g are polyhedral convex functions (see Section 4 for the deﬁnition of a polyhedral convex function), which is a direct extension of results in Iri [20] and Rockafellar [42] for the case of T = 2.

356

murota and shioura

We consider here the cost functions fZ and gZ for the integer version of the minimum cost ﬂow/tension problems:   ξ ∈ ZA   fZ x = inf x ∈ ZT fa ξa ∂ξw = −xw w ∈ T  a∈A ∂ξw = 0 w ∈ V \ T p ∈ ZV gZ p = inf p ∈ ZT ga −δpa w w ∈ T pw = p a∈A It is shown in [32, 34, 35] that fZ satisﬁes (M-EXC[Z]) and gZ satisﬁes (LF1[Z]) and (LF2[Z]); i.e., fZ and gZ are M-convex and L-convex, respectively. These results indicate that the polyhedral convex functions f and g deﬁned by (2) and (4) must have nice combinatorial properties like Mconvexity and L-convexity, respectively. As is shown later in Example 2.4, f satisﬁes the property (M-EXC): (M-EXC) ∀x y ∈ dom f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y, ∃α0 > 0 such that f x + f y ≥ f x − αχu − χv + f y + αχu − χv

0 ≤ ∀α ≤ α0

which is a generalization of (M-EXC[Z]), and g satisﬁes (LF1) and (LF2): (LF1) (LF2) ∀λ ∈ R ,

gp + gq ≥ gp ∧ q + gp ∨ q

∀p q ∈ dom g,

∃r ∈ R such that gp + λ1 = gp + λr

∀p ∈ dom g

which can be obtained by generalizing (LF1[Z]) and (LF2[Z]), where dom f = x ∈ R V −∞ < f x < +∞ dom g = p ∈ R V −∞ < gp < +∞ The observation above indicates the possibility of extending the concepts of M-convexity and L-convexity to polyhedral convex functions. This can be done in the following way. For a polyhedral convex function f R V → R ∪ +∞, we call f M-convex if dom f = and f satisﬁes the property (M-EXC). Similarly, for a polyhedral convex function g R V → R ∪ +∞ we call g L-convex if dom g = and g satisﬁes (LF1) and (LF2). The aim of this paper is to investigate the structures of polyhedral Mconvex and L-convex functions from the dual viewpoint of analysis and combinatorics and to provide a nice framework for well-solvable nonlinear combinatorial optimization problems in real variables. The organization of this paper is as follows.

polyhedral m/l-convex functions

357

Section 2 provides some natural classes of polyhedral M/L-convex functions. We also prove the polyhedral M-convexity and L-convexity of the functions f and g deﬁned in (2) and (4). To investigate polyhedral M/L-convex functions, we need to consider the set version of M/L-convexity. A polyhedron B ⊆ R V is called M-convex if it is nonempty and satisﬁes (B-EXC): (B-EXC) such that

∀x y ∈ B, ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y, ∃α0 > 0

x − αχu − χv ∈ B y + αχu − χv ∈ B

0 ≤ ∀α ≤ α0

As is explained later in Theorem 3.3, an M-convex polyhedron is nothing but the base polyhedron of a submodular system [17]. Similarly, a polyhedron D ⊆ R V is called L-convex if it is nonempty and satisﬁes (LS1) and (LS2): (LS1)

p q ∈ D ⇒ p ∧ q p ∨ q ∈ D,

(LS2)

p ∈ D ⇒ p + λ1 ∈ D ∀λ ∈ R .

We investigate the structure of M/L-convex polyhedra in Section 3. Section 4 shows fundamental properties of polyhedral M/L-convex functions. We present various equivalent axioms for polyhedral M/L-convex functions and give some properties on local structure of polyhedral M/L-convex functions such as directional derivatives, subdifferentials, minimizers, etc. In Section 4, we also investigate positively homogeneous polyhedral M/L-convex functions, which are important subclasses of polyhedral M/L-convex functions. It is shown that positively homogeneous polyhedral M/L-convex functions have one-to-one correspondences with certain set functions, and also with L/M-convex polyhedra. For a function f R V → R ∪ +∞, its conjugate function f • R V → R ∪±∞ is deﬁned by f • p = sup p x − f x x∈R V

p ∈ R V

where p x = pvxv v ∈ V . It is shown in [32, 35] that there is a conjugacy relationship between integer-valued M/L-convex functions over the integer lattice. In Section 5, we show that the conjugacy relationship also exists for polyhedral M/L-convex functions. Section 5 also provides various characterization of polyhedral M/L-convex functions by local structures such as directional derivative, the set of minimizers, and subdifferentials. As is mentioned above, the concepts of M/L-convexity were originally introduced for functions deﬁned over the integer lattice [29, 30, 32]. In Section 6, we clarify the relationship of M/L-convexity over the integer lattice and polyhedral M/L-convexity discussed in this paper.

358

murota and shioura

Finally, some duality theorems on polyhedral M/L-convex functions are presented in Section 7. Although duality theorems in this section can be obtained by application of the existing theorems in convex analysis to polyhedral M/L-convex functions, they are worth stating in their own right in connection to integrality properties. We also explain a more general and stronger result on the transformation of polyhedral M/L-convex function called “network induction.” The validity of network induction is proved by using the duality for polyhedral M/L-convex functions. In this paper, we focus on polyhedral convex functions with M/Lconvexity. The concepts of M/L-convexity can be further extended to general (not necessarily polyhedral) convex functions by considering the property (M-EXC) and properties (LF1), (LF2), respectively. Results in this direction will be reported elsewhere. 2. EXAMPLES OF POLYHEDRAL M/L-CONVEX FUNCTIONS Polyhedral M-convex and L-convex functions have various examples. Let V be a nonempty ﬁnite set. Example 2.1 (Afﬁne Functions). Let B ⊆ R V be an M-convex polyhedron, p0 ∈ R V , and α ∈ R . Then the function f R V → R ∪ +∞ such that dom f = B

f x = p0 x + α

x ∈ B

is polyhedral M-convex with equality in (M-EXC), which is an immediate consequence of (B-EXC) for B. Let D ⊆ R V be an L-convex polyhedron, x0 ∈ R V , and ν ∈ R . Then the function g R V → R ∪ +∞ such that dom g = D

gp = p x0 + ν

p ∈ D

is polyhedral L-convex with equality in the submodular inequality (LF1) and r = 1 x0 in (LF2). We denote by 1 the class of one-dimensional piecewise-linear convex functions with a nonempty effective domain, i.e., 1 = ϕ ϕ R → R ∪ +∞ piecewise-linear convex dom ϕ =

(5)

A piecewise-linear convex function (in this paper) is nothing but a onedimensional polyhedral convex function. Note that the effective domain dom ϕ of a function ϕ ∈ 1 is a closed interval on R .

polyhedral m/l-convex functions

359

Example 2.2. For ϕ ∈ 1 , the function f R 2 → R ∪ +∞ deﬁned by ϕx1 x1 + x2 = 0, f x1 x2 = +∞ (otherwise) is polyhedral M-convex. For ψ ∈ 1 , the function g R 2 → R ∪ +∞ deﬁned by gp1 p2 = ψp1 − p2

p1 p2 ∈ R 2

is polyhedral L-convex. Furthermore, f and g are conjugate to each other if ϕ and ψ are conjugate to each other. Example 2.3 (Separable-Convex Functions). Let B ⊆ R V be an Mconvex polyhedron. For a family of functions fw ∈ 1 w ∈ V , the function f R V → R ∪ +∞ deﬁned by fw xw x ∈ B, f x = w∈V +∞ x ∈ B is polyhedral M-convex if dom f = . Let D ⊆ R V be an L-convex polyhedron. For functions guv ∈ 1 indexed by u v ∈ V , the function g R V → R ∪ +∞ deﬁned by guv pv − pu p ∈ D, gp = u v∈V +∞ p ∈ D is polyhedral L-convex with r = 0 in (LF2) if dom g = . Example 2.4 (Minimum Cost Flow/Tension Problems). In the Introduction, we have deﬁned the cost functions f and g of the minimum cost ﬂow/tension problems by (2) and (4), respectively. We here state the following properties of f and g: (i) f is polyhedral M-convex if f x0 is ﬁnite for some x0 ∈ R T . (ii) g is polyhedral L-convex if gp0 is ﬁnite for some p0 ∈ R T . (iii) Suppose that fa and ga are conjugate to each other for all a ∈ A. Then, f and g are conjugate to each other if one of the following conditions holds: (a) f x0 is ﬁnite (i.e., −∞ < f x0 < +∞) for some x0 ∈ R T , (b) gp0 is ﬁnite (i.e., −∞ < gp0 < +∞) for some p0 ∈ R T , (c) f x0 < +∞, gp0 < +∞ for some x0 ∈ R T , p0 ∈ R T . In the following, we prove (M-EXC) for f and (LF1), (LF2) for g. We omit the proofs of (iii) and the following properties: • if f x0 is ﬁnite for some x0 ∈ R T , then dom f = and f > −∞,

360

murota and shioura • if gp0 is ﬁnite for some p0 ∈ R T , then dom g = and g > −∞,

since their proofs are just a straightforward extension of the results in [20, 42] for the case T = 2. [(M-EXC) for f ] Let x y ∈ dom f and u ∈ supp+ x − y. Also, let ξ η ∈ R A be ﬂows such that −xw w ∈ T , −yw w ∈ T , ∂ξw = ∂ηw = 0 w ∈ V \ T , 0 w ∈ V \ T , f x = fa ξa f y = fa ηa a∈A

a∈A

Note that the existence of such ξ and η is assured by the polyhedral convexity of each fa . By a standard augmenting path argument we see that there exist π A → 0 ±1 and v ∈ supp− x − y ⊆ T such that supp+ π ⊆ supp+ ξ − η

supp− π ⊆ supp− ξ − η ∂π = χv − χu

Put α0 = minξa − ηa a ∈ A πa = 0 > 0 Let α be any value with 0 ≤ α ≤ α0 . Then, we have ∂ξ − απ = −x + αχu − χv ∂η + απ = −y − αχu − χv fa ξa − απa + fa ηa + απa ≤ fa ξa + fa ηa

a ∈ A

which implies the inequality f x − αχu − χv + f y + αχu − χv ≤ fa ξa − απa + fa ηa + απa a∈A

≤

fa ξa + fa ηa = f x + f y

a∈A

Thus, f satisﬁes (M-EXC). [(LF1) for g] Let p q ∈ dom g, and p q ∈ R V be vectors with pw = p w w ∈ T ga −δpa = gp

a∈A

qw = q w w ∈ T ga −δqa = gq

a∈A

Note that the existence of such p and q is assured by the polyhedral convexity of each ga . For a = u v ∈ A, we have ga −δp ∧ qa + ga −δp ∨ qa ≤ ga −δpa + ga −δqa

polyhedral m/l-convex functions

361

by the convexity of ga . It is obvious that p ∧ qw = p ∧ q w Hence, we have gp ∧ q + gp ∨ q ≤

a∈A

≤

a∈A

p ∨ qw = p ∨ q w w ∈ T

ga −δp ∧ qa + ga −δpa +

a∈A

a∈A

ga −δp ∨ qa

ga −δqa = gp + gq

[(LF2) for g] Let 1T (resp. 1V ) be the vector in R T (resp. R V ) with each component being equal to one. For p ∈ R T and λ ∈ R , we have gp + λ1T = inf ga −δpa ˜ a∈A

p˜ ∈ R V pw ˜ = p w + λw ∈ T

p ∈ RV = inf ga −δp + λ1V a pw + λ = p w + λw ∈ T a∈A p ∈ RV = inf = gp ga −δpa pw = p ww ∈ T a∈A

3. M-CONVEX AND L-CONVEX POLYHEDRA In this section, we introduce two classes of polyhedra, called M-convex polyhedra and L-convex polyhedra. In fact, M-convex and L-convex polyhedra are familiar objects in combinatorial optimization, and this section is a recapitulation of almost known facts on these polyhedra from our point of view. The results in this section will be the basis of argument on polyhedral M-convex and L-convex functions beginning in Section 4. 3.1. Deﬁnitions and Notation We denote by R the set of reals and by Z the set of integers. Also, denote by R+ the set of nonnegative reals. Throughout this paper, we assume that V is a nonempty ﬁnite set. For any ﬁnite set X, its cardinality is denoted by X. The characteristic vector of a subset X ⊆ V is denoted by χX ∈ 0 1V , i.e., 1 w ∈ X, χX w = 0 w ∈ V \ X. In particular, we use the notation 0 = χ , 1 = χV .

362

murota and shioura

Let x = xw w ∈ V ∈ R V . We deﬁne suppx = v ∈ V xv = 0 +

supp x = v ∈ V xv > 0 !x!1 = xv p x =

v∈V

v∈V

pvxv p ∈ R V

supp− x = v ∈ V xv < 0 !x!∞ = max xv v∈V

xX =

xv X ⊆ V

v∈X

We sometimes write 0V , 1V , and p xV to indicate that 0, 1, p, and x are vectors in R V . For ε > 0, we deﬁne N∞ x ε = y ∈ R V !y − x!∞ < ε For any p q ∈ R V , p ∧ q and p ∨ q denote the vectors in R V such that p ∧ qw = minpw qw p ∨ qw = maxpw qw w ∈ V For a V → R ∪−∞ and b V → R ∪+∞ with av ≤ bv v ∈ V , we deﬁne the interval "a b# ⊆ R V by "a b# = x ∈ R V av ≤ xv ≤ bv v ∈ V For any two vectors p q ∈ R V , we denote Box"p q# = "p ∧ q p ∨ q#. Let S ⊆ R V . The set S is called convex if 1 − αx + αy ∈ S holds for any x y ∈ S and any α ∈ "0 1#, and called conic if αx ∈ S holds for any x ∈ S and any α > 0. A conic set is also called a cone. The convex hull of S, denoted by convS, is the smallest convex set containing S; that is, k k convS = λi xi k ≥ 1 xi ∈ S λi ≥ 0 λi = 1 i=1

i=1

Note that S is convex if and only if S = convS. For any set S ⊆ R V , the convex closure of S, denoted by $ S, is the smallest closed convex set containing S. Hence, N∞ x ε denotes a closed neighborhood y ∈ R V !y − x!∞ ≤ ε. A set S ⊆ R V is called polyhedral if it is represented as the intersection of a ﬁnite number of closed half-spaces, i.e., there exist some pi ki=1 ⊆ R V and αi ki=1 ⊆ R k ≥ 0 such that S = x ∈ R V pi x ≤ αi ∀i = 1 k

(6)

A polyhedral set is also called a polyhedron. Obviously, any polyhedron is a closed convex set.

polyhedral m/l-convex functions

363

We use the following convention involving ±∞: infα α ∈ = +∞ supα α ∈ = −∞ +∞ ≤ +∞ −∞ ≤ −∞ 0 × +∞ = +∞ × 0 = 0 × −∞ = −∞ × 0 = 0 In this paper, we never use expressions such as +∞ − ∞ or −∞ + ∞. 3.2. M-Convex Polyhedra A polyhedron B ⊆ R V is called M-convex if it is nonempty and satisﬁes (B-EXC): (B-EXC) such that

∀x y ∈ B, ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y, ∃α0 > 0

x − αχu − χv ∈ B

y + αχu − χv ∈ B

0 ≤ ∀α ≤ α0

We denote by 0 the family of M-convex polyhedra, i.e., 0 = B ⊆ R V B M-convex polyhedron It is well known as a folklore that what we call an “M-convex polyhedron” is nothing but the base polyhedron of a submodular system [17] (see also Theorem 3.3). We use the term “M-convex polyhedron” for denotational symmetry to “L-convex polyhedron.” 3.2.1. Properties of M-Convex Polyhedra For any S ⊆ R V and x ∈ S, the exchange capacity c˜S x v u u v ∈ V is deﬁned by c˜S x v u = supα α ∈ R+ x + αχv − χu ∈ S where the subscript S to c˜ may be omitted when there is no ambiguity. The property (B-EXC) can be rewritten in terms of exchange capacity, as follows: (B-EXC ) ∀x y ∈ B, ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that c˜B x v u > 0, c˜B y u v > 0. We also consider a one-sided exchange property: (B-EXC− ) ∀x y ∈ B, ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that c˜B x v u > 0. The exchange property (B-EXC− ), apparently weaker than (B-EXC), is in fact equivalent to (B-EXC). Theorem 3.1. For a polyhedron B ⊆ R V , (B-EXC) ⇐⇒ (B-EXC− ). Proof.

Proof is given in Subsection 3.2.3.

364

murota and shioura

Fundamental properties of M-convex polyhedra are shown below. An Mconvex polyhedron is contained in a hyperplane x ∈ R V xV = r for some r ∈ R . Theorem 3.2. Let B ⊆ R V be a polyhedron with (B-EXC− ). Then, xV = yV ∀x y ∈ B. Proof. Assume, to the contrary, that x0 V > y0 V holds for some x0 y0 ∈ B. Let x∗ ∈ B be a vector with !x∗ − y0 !1 = inf!x − y0 !1 x ∈ B xV = x0 V Note that supp+ x∗ − y0 = . Let u ∈ supp+ x∗ − y0 . By (B-EXC− ), there exist some v ∈ supp− x∗ − y0 and a sufﬁciently small α > 0 such that x = x∗ − αχu − χv ∈ B. It, however, is a contradiction to the choice of x∗ since x V = x0 V and !x − y0 !1 = !x∗ − y0 !1 − 2α. We shall show that an M-convex polyhedron is described by a submodular set function. Let ρ 2V → R ∪ +∞ and denote dom ρ = X ⊆ V ρX < +∞. A function ρ is said to be submodular if it satisﬁes ρX + ρY ≥ ρX ∩ Y + ρX ∪ Y

∀X Y ⊆ V

(7)

We denote the class of (normalized) submodular set functions by = ρ 2V → R ∪ +∞ ρ submodular, ρ = 0 ρV < +∞ A set function µ 2V → R ∪−∞ is called supermodular if −µ is submodular. For any nonempty B ⊆ R V , deﬁne ρB 2V → R ∪ +∞ by ρB X = sup xX x∈B

X ⊆ V

(8)

For a set function ρ 2V → R ∪ +∞, we deﬁne Bρ ⊆ R V by V Bρ = x ∈ R xX ≤ ρX X ⊆ V xV = ρV

(9)

Note that the exchange capacity associated with x ∈ Bρ can be written as [17]: c˜Bρ x v u = minρX − xX X ⊆ V u ∈ X v ∈ X

u v ∈ V (10) The following fact has been known to experts (cf. [4, 9, 46, Chap. 18]), but the precise statement cannot be found in the literature. Theorem 3.3. (i) For B ∈ 0 , we have ρB ∈ and BρB = B. (ii) For ρ ∈ , we have Bρ ∈ 0 and ρBρ = ρ. (iii) The mappings B (→ ρB B ∈ 0 and ρ (→ Bρ ρ ∈ provide a one-to-one correspondence between 0 and and are the inverse of each other.

polyhedral m/l-convex functions Proof.

365

Proof is given later in Subsection 3.2.3.

For any sets S1 S2 ⊆ R V , we denote by S1 + S2 ⊆ R V the Minkowskisum of S1 and S2 , i.e., S1 + S2 = p1 + p2 pi ∈ Si i = 1 2 Theorem 3.4 [17]. For B1 B2 ∈ 0 , the set B1 + B2 is M-convex with B1 + B2 = BρB1 + ρB2 . Remark 3.5. The intersection of two M-convex polyhedra is not necessarily an M-convex polyhedron, as shown in the example below. Let V = a b c d and deﬁne B1 B2 ⊆ R V as B1 = conv1 1 0 0 1 0 1 0 0 0 1 1 0 1 0 1 B2 = conv1 1 0 0 1 0 1 0 0 0 1 1 0 1 1 0 1 0 0 1 Then, B1 and B2 are M-convex polyhedra, and the submodular functions ρi = ρBi i = 1 2 deﬁned by (8) are ρi = 0

ρi v = 1 for v ∈ V i = 1 2

ρi X = 2

for X ⊆ V with X ≥ 3 i = 1 2

ρ1 a d = ρ1 b c = 1 ρ1 a b = ρ1 c d = ρ1 a c = ρ1 b d = 2 ρ2 b d = 1 ρ2 a b = ρ2 a c = ρ2 a d = ρ2 b c = ρ2 c d = 2 We see that B1 ∩ B2 = conv1 1 0 0 1 0 1 0 0 0 1 1 which is not an M-convex polyhedron. Note that the associated set function ρ = ρB1 ∩B2 in (8) is ρa d = ρb c = ρb d = 1 ρa b = ρc d = ρa c = 2 ρX = ρ1 X = ρ2 X

if X = 2

which is not submodular. Separation theorems of the following form hold for submodular functions and M-convex polyhedra. The latter half of Theorem 3.7 is already shown in [34, Theorem 3.6].

366

murota and shioura

Theorem 3.6 (Frank [12]). Let ρ 2V → R ∪ +∞, µ 2V → R ∪ −∞ be a pair of functions with ρ −µ ∈ . If µX ≤ ρX ∀X ⊆ V , then there exists x∗ ∈ R V such that µX ≤ x∗ X ≤ ρX

∀X ⊆ V

Moreover, if ρ and µ are integer-valued, then there exists such x∗ ∈ ZV . Theorem 3.7. Let B1 B2 ∈ 0 . If B1 ∩ B2 = , then there exists p∗ ∈ 0 1V ∪ −χV such that inf p∗ x − sup p∗ x > 0

x∈B1

x∈B2

(11)

Moreover, if both of B1 and B2 are integral polyhedra, i.e., Bi = Bi ∩ ZV i = 1 2, then “> 0” in (11) can be replaced by “≥ 1.” Proof. Put ρi = ρBi i = 1 2. If ρ1 V = ρ2 V , then we can choose p∗ = χV or −χV . Hence, we may assume that ρ1 V = ρ2 V . Deﬁne the functions ρ and µ by µX = ρ1 V − ρ1 V \ X

ρX = ρ2 X X ⊆ V V

Then ρ −µ ∈ and there is no x ∈ R with µX ≤ xX ≤ ρX ∀X ⊆ V . By Theorem 3.6, there exists some X0 ⊆ V such that µX0 > ρX0 . Since inf χX0 x = µX0 > sup χX0 x = ρX0

x∈B1

x∈B2

we can choose p∗ = χX0 . The second claim follows from the integrality of B1 B2 . 3.2.2. M- -Convex Polyhedra Theorem 3.2 shows that an M-convex polyhedron is contained in a hyperplane of the form x ∈ R V xV = r for some r ∈ R . Therefore, any Mconvex polyhedron loses no information by the projection onto a (V −1)dimensional space. We call a polyhedron Q ⊆ R V M- -convex if the set R V deﬁned by Q⊆ = x x ∈ R V x ∈ Q x = −xV Q 0 0

(12)

= v0 ∪ V . In fact, by the results is an M-convex polyhedron, where V of [15, 17], what we name M -convex polyhedron here is nothing but a generalized polymatroid in [13, 14], where a generalized polymatroid is a nonempty set Q ⊆ R V given by Q = x ∈ R V µX ≤ xX ≤ ρX X ⊆ V

(13)

polyhedral m/l-convex functions

367

with a pair of submodular/supermodular functions ρ 2V → R ∪ +∞, µ 2V → R ∪−∞ satisfying the inequality ρX − µY ≥ ρX \ Y − µY \ X

X Y ⊆ V

-

M -convex polyhedron is an essentially equivalent concept to M-convex polyhedron, while the class of M- -convex polyhedra properly contains that of M-convex polyhedra. Every property of M-convex polyhedra can be restated in terms of M- -convex polyhedra, and vice versa. M- -convex polyhedra can be characterized by various exchange properties, as follows (cf. [37, Remark 5.2; 45]). For Q ⊆ R V and u v ∈ V , we denote c˜Q x u = supα α ∈ R+ x − αχu ∈ Q c˜Q x v = supα α ∈ R+ x + αχv ∈ Q (G-EXC) ∀x y ∈ Q, ∀u ∈ supp+ x − y, either (i) or (ii) (or both) holds: (i) ∃v ∈ supp− x − y such that c˜Q x v u > 0 and c˜Q y u v > 0, (ii) c˜Q x u > 0 and c˜Q y u > 0. (G-EXC− ) ∀x y ∈ Q, ∀u ∈ supp+ x − y, either c˜Q x v u > 0 ∃v ∈ supp− x − y or c˜Q x u > 0. Theorem 3.8. Let Q ⊆ R V be a nonempty polyhedron. Then, Q is M- -convex ⇐⇒ Q satisﬁes (G-EXC) ⇐⇒ Q satisﬁes (G-EXC− ) Proof.

Proof is given later in Subsection 3.2.3.

The following property, which seems more natural when described in terms of M- -convex polyhedron, is obvious from (13). Theorem 3.9. Let Q ⊆ R V be an M- -convex polyhedron. Then "x y# ⊆ Q holds for any x y ∈ Q with x ≤ y. 3.2.3. Proofs This section provides the proofs of Theorems 3.1, 3.3, and 3.8. We use the following results on submodular set functions. A family ⊆ 2V is called a distributive lattice if X ∩ Y X ∪ Y ∈ holds for any X Y ∈ . Note that dom ρ of a submodular function ρ ∈ is a distributive lattice. Theorem 3.10. Let ρ ∈ . For any x ∈ Bρ, the family x = X ⊆ V xX = ρX is a distributive lattice with V ⊆ x. We ﬁrst prove a slightly stronger claim than Theorem 3.3 (i). A sequence Xi ki=0 k ≥ 0 of distinct elements in 2V with X0 ⊆ X1 ⊆ · · · ⊆ Xk is called a chain.

368

murota and shioura

Lemma 3.11. Let B ⊆ R V be a nonempty polyhedron with (B-EXC− ), and Xi ki=0 k ≥ 0 be a chain such that Xi ki=0 ⊆ dom ρB . Then, there exists x∗ ∈ B with x∗ Xi = ρB Xi i = 0 1 k. Proof. The claim is shown by induction on the integer k. Let xk ∈ B be a vector with xk Xk = ρB Xk . By the inductive hypothesis, there exists a vector x ∈ B satisfying xXi = ρB Xi i = 0 1 k − 1. We may assume that x minimizes the value !x − xk !1 of all such vectors. Suppose that xXk < ρB Xk . By (B-EXC− ), there exist u ∈ supp+ x − xk \ Xk and v ∈ supp− x − xk such that x = x − αχu − χv ∈ B with a sufﬁciently small α > 0. Here v ∈ V \ Xk−1 holds since u ∈ V \ Xk−1 and x Xk−1 ≤ ρB Xk−1 = xXk−1 . The vector x satisﬁes x Xi = ρB Xi i = 0 1 k − 1 and !x − xk !1 = !x − xk !1 − 2α, a contradiction. Hence xXk = ρB Xk . For any polyhedron S ⊆ R V , a vector x ∈ S is called an extreme point of S if there are no y1 y2 ∈ S \ x and α ∈ R with 0 < α < 1 such that x = αy1 + 1 − αy2 . Theorem 3.12 [17, Theorem 3.22]. Let ρ ∈ . Then, x ∈ R V is an V extreme point of Bρ if and only if there exists a chain Xi i=0 such that V

Xi i=0 ⊆ dom ρ xXi = ρXi

X0 =

XV = V

i = 0 1 V

Lemma 3.13. For any nonempty polyhedron B ⊆ R V with (B-EXC− ), we have ρB ∈ and B = BρB . Proof. The outline of the proof is similar to that for [4, Lemma 5.2]. We ﬁrst consider the case when B is bounded. Let X Y ∈ dom ρB . From Lemma 3.11, there exists x∗ ∈ B with x∗ X ∩ Y = ρB X ∩ Y and x∗ X ∪ Y = ρB X ∪ Y , which implies the inequality ρB X + ρB Y ≥ x∗ X + x∗ Y = x∗ X ∩ Y + x∗ X ∪ Y = ρB X ∩ Y + ρB X ∪ Y Therefore, ρB ∈ . Lemma 3.11 and Theorem 3.12 imply that any extreme point of BρB is contained in B, which shows that BρB ⊆ B. The reverse inclusion is obvious. Next, assume that B is unbounded. Let x0 be any vector in B. For each nonnegative integer k, put Bk = x ∈ B !x − x0 !∞ ≤ k and ρk = ρBk . Since each Bk is a bounded M-convex polyhedron, it holds that ρk ∈ and

polyhedral m/l-convex functions Bk = Bρk . Hence, we have BρB =

∞

k=1

Bρk =

∞

k=1

369 Bk = B, and

ρB X + ρB Y = lim ρk X + ρk Y k→∞

≥ lim ρk X ∩ Y + ρk X ∪ Y k→∞

= ρB X ∩ Y + ρB X ∪ Y for any X Y ∈ dom ρB , i.e., ρB ∈ . Next, we prove Theorem 3.3 (ii). Lemma 3.14 (cf. [17]). Let ρ ∈ , x ∈ Bρ, and u ∈ V . Put X0 = v ∈ V cx ˜ v u = 0

Y0 = v ∈ V cx ˜ u v > 0

Then we have u ∈ V \ X0 , xX0 = ρX0 , and u ∈ Y0 , xY0 = ρY0 . Note that the set Y0 in Lemma 3.14 is nothing but the dependence function depx u deﬁned in [17]. We consider another one-sided exchange property: (B-EXC+ ) ∀x y ∈ B, ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that c˜B y u v > 0. Lemma 3.15. For any ρ ∈ , the set Bρ satisﬁes (B-EXC− ) and (B-EXC+ ). Proof. We prove (B-EXC− ) only, since (B-EXC+ ) can be shown in the similar way. Let x y ∈ Bρ with x = y and u ∈ supp+ x − y. To the contrary assume cx ˜ v u = 0 for all v ∈ supp− x − y. Put X0 = v ∈ V cx ˜ v u = 0. By Lemma 3.14 we have xX0 = ρX0 ≥ yX0 , whereas xX0 < yX0 since u ∈ V \ X0 and supp− x − y ⊆ X0 , a contradiction. Lemma 3.16. Let B ⊆ R V be a nonempty polyhedron with (B-EXC− ) and (B-EXC+ ). For any x y ∈ B, and v ∈ supp− x − y, there exists y ∈ B such that y v = xv y w = yw

xw ≥ y w ≥ yw w ∈ supp+ x − y w ∈ v ∪ supp+ x − y

Proof. Put supp+ x − y = u1 u2 uk k ≥ 1, and y0 = y. For i = 1 2 k, we deﬁne αi ∈ R+ and yi ∈ B iteratively as αi = minyi−1 v − xv xui − yi−1 ui cy ˜ i−1 ui v yi = yi−1 + αi χui − χv

370

murota and shioura

Assume yk v > xv. By (B-EXC− ), there exists some ui ∈ supp− yk − x ⊆ supp+ x − y such that yˆ = yk − αχv − χui ∈ B for a sufﬁciently small α > 0. Since ui ∈ supp+ yˆ − yi and supp− yˆ − yi = v, (B-EXC+ ) implies yi + βχui − χv ∈ B for a sufﬁciently small β > 0, a contradiction to the fact that αi = cy ˜ i−1 ui v. Lemma 3.17.

For any ρ ∈ , the set Bρ satisﬁes (B-EXC).

Proof. First note that Bρ satisﬁes (B-EXC− ) and (B-EXC+ ) by Lemma 3.15. Let x y ∈ Bρ with x = y and u ∈ supp+ x − y. Put ˜ u v > 0. Applying Lemma 3.16 repeatedly, we have Y0 = v ∈ V cy some y ∈ Bρ such that y v = xv v ∈ supp− x − y \ Y0 xw ≥ y w ≥ yw w ∈ supp+ x − y y w = yw w ∈ supp− x − y \ Y0 ∪ supp+ x − y Moreover, y w = yw holds for w ∈ supp+ x − y ∩ Y0 since otherwise y Y0 > yY0 = ρY0 by Lemma 3.14, a contradiction. In particular, y u = yu < xu. Applying (B-EXC− ) to x, y , and u ∈ supp+ x − y , we have v0 ∈ supp− x − y with cx ˜ v0 u > 0. From supp− x − y = − supp x − y ∩ Y0 follows cy ˜ u v0 > 0. Theorem 3.18 [17]. For any ρ ∈ we have Bρ = and ρBρ = ρ. Theorem 3.3(iii) is an immediate corollary of Lemmas 3.13, 3.17, and Theorem 3.18. We then give the proof of Theorem 3.1. Proof of Theorem 3 1. The implication (B-EXC) ⇒ (B-EXC− ) is obvious. Let B ⊆ R V be a nonempty polyhedron with (B-EXC− ). Then, we have ρB ∈ and B = BρB , as shown in Lemma 3.13. Hence, Lemma 3.17 implies (B-EXC) for B. Finally, we prove Theorem 3.8. Since the implications “Q is M- -convex ⇒ (G-EXC) ⇒ (G-EXC− )” are easy to see, we show “(G-EXC− ) ⇒ Q is M- -convex” only. We ﬁrst note that (G-EXC− ) for Q is equivalent to the following property deﬁned by (12): for Q ∀u ∈ supp+ x − y \ v , ∃v ∈ supp− x − (G0 -EXC− ) ∀x y ∈ Q, 0 y ∪ v0 such that c˜Qx v u > 0. Using this property, we prove the submodularity of ρQ and the equation = Bρ , where ρ 2V → R ∪ +∞ and Bρ ⊆ R V are deﬁned Q Q Q Q by (8) and (9), respectively. The proof is similar to that of Theorem 3.3(i).

polyhedral m/l-convex functions

371

Lemma 3.19. Let Xi ki=0 k ≥ 0 be a chain such that Xi ki=0 ⊆ with x X = ρ X i = 0 1 k. dom ρQ. Then there exists x∗ ∈ Q ∗ i i Q Proof. The claim is shown by induction on the integer k. By the induc such tive hypothesis and the deﬁnition of ρQ, there exist vectors x y ∈ Q that xXi = ρQXi

i = 1 2 k

yX0 = ρQX0

(14) (15)

be a pair of vectors minimizing the value !x − y ! among Let x∗ y∗ ∈ Q ∗ ∗ 1 satisfying (14), (15), and xX = maxx X all pairs of vectors x y ∈ Q 0 k 14. We further assume that x and y maximize the value x ∈ Q ∗ ∗ y∗ v0 − x∗ v0 among all such vectors. Claim. y∗ w ≤ x∗ w ∀w ∈ V \ X0 ∪ v0 . Assume, to the contrary, that there exists some w0 ∈ V \ X0 ∪ v0 with y∗ w0 > x∗ w0 . Then, (G0 -EXC− ) implies either (a-1), (a-2), or (a-3): (a-1) ∃v ∈ X0 such that c˜Qy∗ v w0 > 0, (a-2) ∃v ∈ supp− y∗ − x∗ \ X0 such that c˜Qy∗ v w0 > 0, (a-3) v0 ∈ V \ X0 , y∗ v0 ≥ x∗ v0 , and c˜Qy∗ v w0 > 0 for v = v0 . where α is a sufﬁciently small positive Put y = y∗ − αχw0 − χv ∈ Q, number. In either case we have a contradiction: (a-1) ⇒ y X0 > ρQX0 (a-2) ⇒ y X0 = ρQX0 !y − x∗ !1 < !y∗ − x∗ !1 (a-3) ⇒ y X0 = ρQX0 !y − x∗ !1 = !y∗ − x∗ !1 y v0 − x∗ v0 > y∗ v0 − x∗ v0 This ends the proof of the claim. Suppose that either v0 ∈ V \ X0 and x∗ v0 ≥ y∗ v0 , or v0 ∈ X0 holds. Then, for i = 1 2 k, we have − y∗ V \ Xi y∗ Xi = y∗ V − x∗ V \ Xi = x∗ Xi = ρ Xi ≥ x∗ V Q implying y∗ Xi = ρQXi . It remains to consider the case where v0 ∈ V \ X0

x∗ v0 < y∗ v0

(16)

Assume, to the contrary, that x∗ X0 < ρQX0 = y∗ X0 . Then there exists some u ∈ supp+ x∗ − y∗ \ X0 . Hence, (G0 -EXC− ) implies either (b-1) or (b-2):

372

murota and shioura (b-1) ∃v ∈ supp− x∗ − y∗ ∩ X0 such that c˜Qx∗ v u > 0, (b-2) c˜Qx∗ v u > 0 for v = v0 .

where α is a sufﬁciently small positive numPut x = x∗ + αχv − χu ∈ Q, ber. In case of (b-1), we have u ∈ X1 \ X0 since x∗ satisﬁes (14). Therefore, x also satisﬁes (14) and x X0 > x∗ X0 , a contradiction. In case of (b-2), we have x X0 = x∗ X0 and !x − y∗ !1 < !x∗ − y∗ !1 by (16), a contradiction. In the same way as the proof of Lemma 3.13, we can show that ρQ ∈ = Bρ by using Lemma 3.19 and Theorem 3.12. and Q Q 3.3. L-Convex Polyhedra A polyhedron D ⊆ R V is called L-convex if it is nonempty and satisﬁes (LS1) and (LS2): (LS1) (LS2)

p q ∈ D ⇒ p ∧ q p ∨ q ∈ D, p ∈ D ⇒ p ± λ1 ∈ D ∀λ ∈ R .

We denote by 0 the family of L-convex polyhedra, i.e.,

0 = D ⊆ R V D L-convex polyhedron

3.3.1. Properties of L-Convex Polyhedra The properties (LS1) and (LS2) can be characterized by local properties: (LS1loc -a)

∀p ∈ D, ∃ε > 0 such that

p1 p2 ∈ D ∩ N∞ p ε ⇒ p1 ∧ p2 p1 ∨ p2 ∈ D (LS1loc -b)

∀p q ∈ D, 0 < ∃ε ≤ 1 such that

1 − εp + εp ∧ q ∈ D (LS2loc )

1 − εp + εp ∨ q ∈ D

∀p ∈ D, ∃ε > 0 such that p + λ1 ∈ D ∀λ ∈ "0 ε#.

Theorem 3.20. Let D ⊆ R V be a closed set. (i) If D is convex, then (LS1) ⇐⇒ (LS1loc -a) ⇐⇒ (LS1loc -b). (ii) (LS2) ⇐⇒ (LS2loc ). Proof. (i) Since (LS1) ⇒ (LS1loc -a) is obvious, we show (LS1loc -a) ⇒ (LS1loc -b) ⇒ (LS1). Assume (LS1loc -a). Let p q ∈ D, and ε be the value in (LS1loc -a) associated with p. The convexity of D implies 1 − ε p + ε q ∈ D ∩ N∞ p ε for a sufﬁciently small ε > 0. Hence, we have p ∧ 1 − ε p + ε q = 1 − ε p + ε p ∧ q ∈ D p ∨ 1 − ε p + ε q = 1 − ε p + ε p ∨ q ∈ D i.e., (LS1loc -b) holds.

polyhedral m/l-convex functions

373

Next assume (LS1loc -b) for D to prove (LS1). Let p q ∈ D. We show p ∧ q ∈ D only, since p ∨ q ∈ D can be shown similarly. Set α∗ = supα 0 ≤ α ≤ 1 1 − αp + αp ∧ q ∈ D p∗ = 1 − α∗ p + α∗ p ∧ q ∈ D Assume α∗ < 1. By (LS1loc -b), there exists some ε with 0 < ε ≤ 1 such that 1 − εp∗ + εp∗ ∧ q = 1 − ε − α∗ + εα∗ p + ε + α∗ − εα∗ p ∧ q ∈ D a contradiction to the deﬁnition of α∗ . Hence α∗ = 1 and p ∧ q ∈ D hold. (ii) We show the direction (LS2loc ) ⇒ (LS2) only. Let p ∈ D and λ∗ = supλ λ ∈ R+ p + λ1 ∈ D. To the contrary assume that λ∗ < +∞. Then we have p∗ = p + λ∗ 1 ∈ D. However, (LS2loc ) implies p∗ + ε1 ∈ D for some ε > 0, a contradiction to the deﬁnition of λ∗ . Thus we have p + λ1 ∈ D for all λ ≥ 0. Similarly, we have p + λ1 ∈ D for all λ ≤ 0. We show the system of inequalities which describes the polyhedral structure of L-convex polyhedra. A function γ V × V → R ∪ +∞ with γv v = 0 ∀v ∈ V is called a distance function. For a distance function γ, deﬁne the set Dγ ⊆ R V by V Dγ = p ∈ R pv − pu ≤ γu v u v ∈ V

(17)

V

Given a nonempty set D ⊆ R , the function γD V × V → R ∪+∞ is deﬁned by γD u v = suppv − pu p∈D

(18)

Note that γD is a distance function, and D ⊆ DγD holds in general. Given a distance function γ, we denote by Gγ the directed graph with the vertex set V and the arc set Aγ = u v u v ∈ V u = v γu v < +∞, where the value γu v is regarded as the length of the arc u v ∈ Aγ . For u v ∈ V , let $ γ u v be the length of a shortest path from u to v in Gγ , i.e., $ γ u v = inf γu v P ⊆ Aγ P path from u to v u v ∈P

It is well known that $ γ > −∞ if and only if Gγ has no negative cycle. We consider the triangle inequality γv1 v2 + γv2 v3 ≥ γv1 v3

∀v1 v2 v3 ∈ V

(19)

for a distance function, and let be the family of distance functions with triangle inequality, i.e., = γ γ V × V → R ∪ +∞ γv v = 0 v ∈ V γ satisﬁes (19) Note that for any distance function γ V × V → R ∪ +∞, if $ γ > −∞ then $ γ ∈ , and if γ ∈ then $ γ = γ.

374

murota and shioura

Lemma 3.21. Let γ V × V → R ∪ +∞ be a distance function. (i) (ii) (iii)

Dγ = ⇐⇒ Gγ has no negative cycle. γ . Dγ = ⇒ Dγ = D$ Dγ = ⇒ Dγ ∈ 0 .

(iv)

If γ ∈ , then Dγ = and γ = γDγ . (v) If γu v ∈ Z ∪ +∞ ∀u v ∈ V , then Dγ is an integral polyhedron, i.e., Dγ = Dγ ∩ ZV . Proof.

(i), (ii)

These are well-known facts. See, e.g., [20, 23].

(iii) Condition (LS2) for Dγ is obvious. To show (LS1) for Dγ, we take p q ∈ Dγ. Then, it is easy show that p ∧ qv − p ∧ qu ≤ γu v

p ∨ qv − p ∨ qu ≤ γu v

for any u v ∈ V , i.e., p ∧ q p ∨ q ∈ D. (iv) Since γ ∈ , the graph Gγ has no negative cycle and therefore Dγ = by (i). We have γu v = $ γ u v = sup pv − pu = γDγ u v p∈Dγ

where the second equality is by a fundamental result concerning the shortest path problem. (v) The claim is obvious from the total unimodularity of the coefﬁcient matrix of the system of inequalities in (17). Lemma 3.22. For any nonempty polyhedron D ⊆ R V , we have γD ∈ . Moreover, if D ∈ 0 , then DγD = D. Proof. The ﬁrst claim is clear from the following inequality: for any v1 v2 v3 ∈ V , we have γD v1 v2 + γD v2 v3 = suppv2 − pv1 + suppv3 − pv2 p∈D

p∈D

≥ suppv3 − pv1 = γD v1 v3 p∈D

Suppose D ∈ 0 . We show that DγD ⊆ D. Let q ∈ DγD . For any u v ∈ V there exists puv ∈ D with puv v − puv u ≥ qv − qu, where we may assume that puv u = qu and puv v ≥ qv by (LS2). For each u ∈ V , the vector pu = v∈V p uv ∈ D satisﬁes pu u = qu, pu v ≥ qv ∀v ∈ V . Therefore, q = u∈V pu ∈ D. Theorem 3.23. (i) (ii)

For D ∈ 0 , we have γD ∈ and DγD = D.

For γ ∈ , we have Dγ ∈ 0 and γDγ = γ.

polyhedral m/l-convex functions

375

(iii) The mappings D (→ γD D ∈ 0 and γ (→ Dγ γ ∈ provide a one-to-one correspondence between 0 and and are the inverse of each other. Proof.

This is immediate from Lemma 3.21(iii), (iv), and Lemma 3.22.

As a special case of L-convex polyhedra, we investigate the class of L-convex cones, which are conic L-convex polyhedra. This amounts to restricting the function values of γ to 0 +∞. We show below the relationship of L-convex cones with “transitive” directed graphs, and with distributive lattices. A directed graph G = V A is called transitive if for any u v v w ∈ A we have u w ∈ A. Given a set of vectors D ⊆ R V , we deﬁne the directed graph GD = V AD by AD = u v u v ∈ V pu ≥ pv ∀p ∈ D For a directed graph G = V A, the set DG ⊆ R V is deﬁned by DG = p ∈ R V pu ≥ pv ∀u v ∈ A The next theorem is an immediate consequence of Theorem 3.23. Theorem 3.24. (i) For any L-convex cone D ⊆ R V , GD = V AD is transitive and DGD = D.

(ii) For any transitive directed graph G = V A, DG ⊆ R V is an L-convex cone and GDG = G. (iii) The mappings D (→ GD and G (→ DG provide a one-to-one correspondence between L-convex cones D ⊆ R V and transitive directed graphs G = V A and are the inverse of each other. On the other hand, Birkhoff’s representation theorem [2] yields a oneto-one correspondence between transitive directed graphs G = V A and distributive lattices ⊆ 2V with V ⊆ . This fact, together with Theorem 3.24, provides a one-to-one correspondence between L-convex cones and distributive lattices, as follows. Given a set of vectors D ⊆ R V , we deﬁne D ⊆ 2V by D = X ⊆ V χX ∈ D For any family of subsets ⊆ 2V with V ⊆ , deﬁne D ⊆ R V by λX χX λX ≥ 0 X ∈ \ V D = X∈

Theorem 3.25. (i) For an L-convex cone D ⊆ R V , D is a distributive lattice with V ⊆ D , and DD = D.

376

murota and shioura

(ii) For a distributive lattice ⊆ 2V with V ⊆ , D is an L-convex cone and D = . Moreover, for any p ∈ D there exists a chain 0 with V ⊆ 0 ⊆ and λX ∈ R X ∈ 0 such that λX ≥ 0 ∀X ∈ 0 \ V and p = X∈0 λX χX . (iii) The mappings D (→ D and (→ D provide a one-to-one correspondence between L-convex cones D ⊆ R V and distributive lattices ⊆ 2V with V ⊆ and are the inverse of each other. The intersection of L-convex polyhedra is also L-convex. Theorem 3.26. Let D1 D2 ∈ 0 . Deﬁne γ V × V → R ∪ +∞ by γu v = minγD1 u v γD2 u v u v ∈ V . Then, (i)

D1 ∩ D2 = Dγ,

(ii)

D1 ∩ D2 = ⇐⇒ Gγ has no negative cycle,

(iii)

D1 ∩ D2 = ⇒ D1 ∩ D2 ∈ 0 .

Proof. The claim (i) follows from D1 ∩ D2 = DγD1 ∩ DγD2 = Dγ. The claims (ii) and (iii) are by (i) and (iii) of Lemma 3.21, respectively. Remark 3.27. The Minkowski-sum of two L-convex polyhedra is not necessarily an L-convex polyhedron, as shown below. Put V = a b c d. Let γ1 γ2 V × V → 0 1 be such that γ1 u v = 1 ⇐⇒ u v ∈ c a c b d a d b d c γ2 u v = 1 ⇐⇒ u v ∈ b a b c d a d b d c Since γ1 γ2 ∈ , we have Dγ1 Dγ2 ∈ 0 by Lemma 3.21. However, D = Dγ1 + Dγ2 is not L-convex. Let p1 = 1 1 0 0

p2 = q1 = 0 0 0 0

q2 = 1 0 1 0

Then, it holds that pi qi ∈ Dγi i = 1 2. Hence, we have p1 + p2 q1 + q2 ∈ D, but p1 + p2 ∧ q1 + q2 = 1 0 0 0 ∈ D. Note that p ∈ Dγi pd = 0 = convSi

i = 1 2

p ∈ D pd = 0 = convS1 + S2 where S1 = 0 0 0 0 1 1 0 0 1 1 1 0 S2 = 0 0 0 0 1 0 1 0 1 1 1 0 Separation theorems of the following form hold for distance functions with triangle inequality and for L-convex polyhedra.

polyhedral m/l-convex functions

377

Theorem 3.28. Let γ V × V → R ∪ +∞, ν V × V → R ∪−∞ be functions such that γ −ν ∈ . Suppose that k i=1

νv2i+1 v2i ≤

k i=1

γv2i−1 v2i

holds for any distinct elements vi 2k i=1 ⊆ V with v2k+1 = v1 . Then, there exists p∗ ∈ R V with νu v ≤ p∗ v − p∗ u ≤ γu v

∀u v ∈ V

(20)

Moreover, if γ and ν are integer-valued, then there exists such p∗ ∈ ZV . Proof. The existence of p∗ ∈ R V with (20) is equivalent to Dν 0 ∩ Dγ = , where ν 0 V × V → R ∪ +∞ is deﬁned by ν 0 u v = −νv u u v ∈ V . Deﬁne η V × V → R ∪ +∞ by ηu v = minγu v ν 0 u v u v ∈ V . By Theorem 3.26 (ii), it sufﬁces to show that Gη has no negative cycle. Suppose, to the contrary, that v1 v2 vh ⊆ V h ≥ 1 forms a negative cycle. Since γ and ν 0 satisfy the triangle inequality, we may assume that h = 2k k ≥ 1 and ηv2i−1 v2i = γv2i−1 v2i

ηv2i v2i+1 = −νv2i+1 v2i i = 1 2 k

where v2k+1 = v1 . This implies ki=1 νv2i+1 v2i > ki=1 γv2i−1 v2i , a contradiction. The integrality assertion follows from the argument above and Lemma 3.21 (v). Remark 3.29. The assumption in Theorem 3.28 cannot be replaced by the weaker condition νu v ≤ γu v

∀u v ∈ V

as shown in the following example. Put V = a b c d, and deﬁne γ V × V → R , ν V × V → R as γ a b c d

a 0 2 1 1

b c d 0 1 −1 0 1 1 −1 0 0 1 2 0

a b c ν a 0 0 −1 b 0 0 −1 c −1 −1 0 0 d −1 −1

d −1 −1 0 0

where it reads as γa d = −1, γb a = 2, etc. We can easily check that γ −ν ∈ and νu v ≤ γu v ∀u v ∈ V . However, there is no p∗ ∈ R V

378

murota and shioura

satisfying (20). Suppose, to the contrary, that such p∗ ∈ R V exists. Then we have p∗ a − p∗ b ≤ minγb a −νa b = 0 p∗ b − p∗ c ≤ minγc b −νb c = −1 p∗ c − p∗ d ≤ minγd c −νc d = 0 p∗ d − p∗ a ≤ minγa d −νd a = −1 Addition of these inequalities yields 0 ≤ −2, a contradiction. Note that the assumption of Theorem 3.28 is not satisﬁed since νa b + νc d = 0 > −2 = γc b + γa d. The latter part of the following theorem is already shown in [34, Theorem 5.6]. Theorem 3.30. Let D1 D2 ∈ 0 . If D1 ∩ D2 = , then there exists x∗ ∈ 0 ±1V such that inf p x∗ − sup p x∗ > 0

p∈D1

p∈D2

(21)

Moreover, if D1 and D2 are integral polyhedra, i.e., Di = Di ∩ ZV i = 1 2, then “> 0” in (21) can be replaced by “≥ 1.” Proof. Put γi = γDi i = 1 2. Since Dγ1 ∩ Dγ2 = , it follows from Theorem 3.28 that there distinct elements

kexists a sequence of k vi 2k ⊆ V k ≥ 1 such that γ v v + γ 2i i=1 1 2i−1 i=1 2 v2i v2i+1 < 0, i=1 where v2k+1 = v1 . Let x∗ ∈ 0 ±1V be a vector such that x∗ v2i = −1, x∗ v2i−1 = 1 i = 1 k, and x∗ w = 0 otherwise. Then we have inf p x∗ − sup p x∗

p∈D1

p∈D2

= inf

p∈D1

≥−

k i=1

k i=1

pv2i−1 − pv2i − sup

γ1 v2i−1 v2i −

k

p∈D2 i=1

k i=1

pv2i+1 − pv2i

γ2 v2i v2i+1 > 0

The integrality assertion is obvious from the inequality above.

polyhedral m/l-convex functions

379

3.3.2. L- -Convex Polyhedra Due to the property (LS2), an L-convex polyhedron loses no information when restricted to a hyperplane p ∈ R V pv = 0 for any v ∈ V . We call a polyhedron P ⊆ R V L- -convex if the set P˜ ⊆ R V deﬁned by P˜ = λ p + λ1 p ∈ P λ ∈ R = v0 ∪ V . We see that L- -convex is an L-convex polyhedron, where V polyhedra are essentially the same as L-convex polyhedra, while the class of L- -convex polyhedra properly contains that of L-convex polyhedra. Intervals are the only polyhedra that are both L- - and M- -convex. Theorem 3.31 (cf. [38, Lemma 5.7]). A polyhedron S ⊆ R V is both polyhedral M- -convex and polyhedral L- -convex if and only if S is an interval; i.e., it is represented as S = "a b# for some a V → R ∪−∞ and b V → R ∪+∞ with av ≤ bv v ∈ V . Proof. Consider the polyhedral description of S in terms of linear inequalities of the form p x ≤ α. For an L- -convex set, p = χv − χu or p = ±χv for some elements u v ∈ V by Theorem 3.23 (i), while for M- -convex set, p = ±χX for some subset X ⊆ V by (13). Therefore, S is an interval. Remark 3.32. There exists no polyhedron which is both M-convex and L-convex, i.e., 0 ∩ 0 = (Proof. If S ∈ 0 ∩ 0 , then Theorem 3.2 implies xV = yV for any x y ∈ S, whereas (LS2) implies that x + λ1 ∈ S for any λ ∈ R .) 4. POLYHEDRAL M-CONVEX AND L-CONVEX FUNCTIONS 4.1. Review of Fundamental Results on Polyhedral Convex Functions This section is devoted to a summary of the relevant results on polyhedral convex functions. See [41, 44] for more accounts. Let f R V → R ∪ ±∞ be a function. The epigraph of f , denoted by epi f , is the set epi f = x α x ∈ R V α ∈ R α ≥ f x

⊆ R V × R

The effective domain of f , denoted by dom f , is the set dom f = x ∈ R V −∞ < f x < +∞ We call f convex if epi f is a convex set in R V × R . A function f is called concave if −f is convex. When f > −∞, f is convex if and only if f αx + 1 − αy ≤ αf x + 1 − αf y

∀x y ∈ R V ∀α ∈ "0 1#

380

murota and shioura

A convex function is said to be polyhedral if its epigraph is polyhedral. Let f R V → R ∪±∞. The (convex) conjugate f • R V → R ∪±∞ of f is deﬁned by f • p = sup p x − f x x∈R V

p ∈ R V

(22)

Theorem 4.1 [41, Theorem 19.2]. (i) For a polyhedral convex function f R V → R ∪ +∞ with dom f = , its conjugate function f • is also polyhedral convex with dom f • = , f • > −∞, and f •• = f . (ii) The mapping f (→ f • induces a symmetric one-to-one correspondence in the class of polyhedral convex functions f R V → R ∪ +∞ with dom f = . A function f R V → R ∪ +∞ is called positively homogeneous if f αx = αf x for any x ∈ R V and α > 0. Note that f 0 = 0 if dom f = . is a posiTheorem 4.2 [41, Corollary 4.7.1]. If f R V → R ∪ +∞

tively homogeneous convex function, then f ki=1 αi xi ≤ ki=1 αi f xi for any xi ∈ R V and αi ≥ 0 i = 1 k. For any nonempty set S ⊆ R V , we deﬁne the support function δ∗S R V → R ∪ +∞ of S by δ∗S p = supp x x∈S

p ∈ R V

(23)

For any positively homogeneous function f R V → R ∪ +∞, we deﬁne the set Sf ⊆ R V by Sf = x ∈ R V p x ≤ f p ∀p ∈ R V

(24)

Theorem 4.3 [41, Sect. 13]. (i) For any nonempty polyhedron S ⊆ R V , δ∗S is a positively homogeneous polyhedral convex function with dom δ∗S = , and S = Sδ∗S . (ii) For any positively homogeneous polyhedral convex function f R V → R ∪ +∞ with dom f = , Sf is a nonempty polyhedron, and f = δ∗Sf . (iii) The mappings S (→ δ∗S and f (→ Sf provide a one-to-one correspondence between nonempty polyhedra S ⊆ R V and positively homogeneous polyhedral convex functions f R V → R ∪ +∞ with dom f = and are the inverse of each other.

polyhedral m/l-convex functions

381

Let f R V → R ∪±∞. For any x ∈ dom f and y ∈ R V , we deﬁne the (one-sided) directional derivative of f at x with respect to y by f x+ y = lim α↓0

f x + αy − f x α

if the limit exists. Note that if f is polyhedral convex, then f x+ y is well-deﬁned for any x ∈ dom f and y ∈ R V by f x+ y = f x + αy − f x/α with a sufﬁciently small α > 0. The subdifferential ∂f x of f at x is deﬁned by ∂f x = p ∈ R V f y ≥ f x + p y − x ∀y ∈ R V and each vector in ∂f x is called a subgradient of f at x. Theorem 4.4 [41, Theorem 23.10]. Let f R V → R ∪ +∞ be a polyhedral convex function with dom f = , and x ∈ dom f . Then, ∂f x is a nonempty polyhedral convex set, and f x+ · is a positively homogeneous polyhedral convex function such that dom f x+ · = , f x+ · > −∞, and f x+ · = δ∗∂f x . Let f R V → R ∪ +∞ be any function. We denote by arg min f the set of minimizers of f , i.e., arg min f = x ∈ R V f x ≤ f y ∀y ∈ R V Note that arg min f can be empty. For any p ∈ R V , the function f "p# R V → R ∪ +∞ is deﬁned by f "p#x = f x + p x

x ∈ R V

(25)

Theorem 4.5 [41, Theorem 23.5]. Let f R V → R ∪ +∞ be a polyhedral convex function with dom f = . For any x ∈ R V and p ∈ R V , we have p ∈ ∂f x ⇐⇒ x ∈ arg min f "−p# ⇐⇒ x ∈ ∂f • p ⇐⇒ p ∈ arg min f • "−x# For fi R V → R ∪ +∞ i = 1 2, the inﬁmum convolution (or simply the convolution) f1 f2 R V → R ∪ ±∞ is deﬁned by x1 x2 ∈ R V x ∈ R V (26) f1 f2 x = inf f1 x1 + f2 x2 x1 + x2 = x It is easy to verify that if f1 domf1

f2 does not take the value −∞, then f2 = dom f1 + dom f2

382

murota and shioura

Theorem 4.6 [41, Sects. 16, 19, and 20]. Let fi R V → R ∪ +∞ i = 1 2 be polyhedral convex functions with dom fi = . Then, f1 f2 is also polyhedral convex and f1

f2 • = f1• + f2•

Moreover, if dom f1 ∩ dom f2 = then it holds that f1 + f2 • = f1•

f2•

We consider the special case of one-dimensional functions. A function f R → R ∪ ±∞ is called piecewise-linear if dom f is a closed set and can be divided into a ﬁnite number of subintervals on each of which f is linear. We denote by 1 the class of piecewise-linear convex functions (cf. (5)). Note that a piecewise-linear convex function is nothing but a one-dimensional polyhedral convex function. For f R → R ∪ ±∞ and x ∈ dom f , we denote f+ x = f x+ 1, f− x = −f x+ −1. If f is piecewise-linear, then the values f+ x and f− x are well-deﬁned for any x ∈ dom f and lim f+ y = lim f− y = f+ x y↓x

y↓x

lim f+ y = lim f− y = f− x y↑x

y↑x

4.2. Polyhedral M-Convex Functions A polyhedral convex function f R V → R ∪+∞ is called M-convex if dom f = and f satisﬁes (M-EXC): (M-EXC) ∀x y ∈ dom f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y, ∃α0 > 0 such that f x + f y ≥ f x − αχu − χv + f y + αχu − χv

0 ≤ ∀α ≤ α0

We denote = f f R V → R ∪ +∞ polyhedral M-convex 0

= f f R V → R ∪ +∞ positively homogeneous polyhedral M-convex

It may be obvious from the deﬁnition that polyhedral M-convex functions are a quantitative extension of M-convex polyhedra. Theorem 4.7. (i) ⇐⇒ dom f ∈ 0 . (ii)

For a function f R V → 0 +∞, we have f ∈

For f ∈ , we have dom f ∈ 0 .

polyhedral m/l-convex functions

383

4.2.1. Axioms for Polyhedral M-Convex Functions We consider two slightly different exchange axioms, where the former is weaker and the latter is stronger than (M-EXC). (M-EXCw ) ∀x y ∈ dom f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y, ∃α > 0 such that f x + f y ≥ f x − αχu − χv + f y + αχu − χv (M-EXCs ) ∀x y ∈ dom f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that f x + f y ≥ f x − αχu − χv + f y + αχu − χv for any α ∈ R with 0 ≤ α ≤ xu − yu/2k, where k = supp− x − y. Theorem 4.8. For a polyhedral convex function f R V → R ∪ +∞ with dom f = , (M-EXC) ⇐⇒ (M-EXCw ). Proof. We show (M-EXCw ) ⇒ (M-EXC) only. Let x y ∈ dom f and u ∈ supp+ x − y. Then, there exist v ∈ supp− x − y, α0 > 0 such that f x + f y ≥ f x − αχu − χv + f y + αχu − χv

(27)

for α = α0 . The convexity of f implies (27) for any α ∈ "0 α0 #. Lemma 4.9. Let f ∈ , x y ∈ dom f be distinct vectors, u ∈ supp+ x − y, and supp− x − y = v1 v2 vk k = supp− x − y. Then, there exist sequences xi ki=0 yi ki=0 ⊆ dom f and αi ki=1 ⊆ R+ satisfying

k i=1 αi = xu − yu/2 and the conditions x0 = x y0 = y xi = xi−1 − αi χu − χvi yi = yi−1 + αi χu − χvi i = 1 k (28) f xi−1 + f yi−1 ≥ f xi + f yi

i = 1 k

(29)

Proof. Put x0 = x, y0 = y. For i = 1 2 k, we deﬁne αi ∈ R and xi yi ∈ R V iteratively by the equations αi = supα ∈ R+ α ≤ minxi−1 u − yi−1 u yi−1 vi − xi−1 vi /2 f xi−1 − αχu − χvi + f yi−1 + αχu − χvi ≤ f xi−1 + f yi−1 xi = xi−1 − αi χu − χvi

yi = yi−1 + αi χu − χvi

Then, the sequences xi ki=0 , yi ki=0 , and αi ki=1 satisfy the conditions (28)

and (29). In the following, we show ki=1 αi = xu − yu/2.

384

murota and shioura

Assume, to the contrary, that ki=1 αi < xu − yu/2. Since u ∈ supp+ xk − yk , there exist some vi ∈ supp− xk − yk ⊆ supp− x − y and α0 > 0 such that f xk + f yk ≥ f xk − αχu − χvi + f yk + αχu − χvi

(30)

for any α ∈ "0 α0 #. Here, vi = vk holds by the choice of xk and yk . By Theorem 4.14 to be shown later, we have f xi − αχu − χvi − f xi + f yi + αχu − χvi − f yi ≤ f xk − αχu − χvi − f xk + f yk + αχu − χvi − f yk ≤ 0 for any α ∈ "0 α0 #, where the second inequality is by (30). This contradicts the deﬁnitions of xi and yi . Theorem 4.10. For a polyhedral convex function f R V → R ∪ +∞ with dom f = , (M-EXC) ⇐⇒ (M-EXCs ). Proof. It sufﬁces to prove (M-EXC) ⇒ (M-EXCs ). Assume (M-EXC) for f . Let x y ∈ dom f be distinct vectors, and u ∈ supp+ x − y. By Lemma 4.9, there exist sequences xi ki=0 yi ki=0 ⊆ dom f and

αi ki=1 ⊆ R+ satisfying ki=1 αi = xu − yu/2, (28), and (29). Let i∗ be an integer with 1 ≤ i∗ ≤ k such that αi∗ = max αi ≥ xu − yu/2k 1≤i≤k

From Theorem 4.14 to be shown later, we obtain the inequalities f x − αi∗ χu − χvi − f x + f y + αi∗ χu − χvi − f y ∗

∗

≤ f xi∗ − f xi∗ −1 + f yi∗ − f yi∗ −1 ≤ 0 where the second inequality is by (29). From the convexity of f follows f x + f y ≥ f x − αχu − χvi ∗

+ f y + αχu − χvi ∗

∀α ∈ "0 αi∗ #

We can rewrite (M-EXC) in terms of directional derivatives. For a polyhedral convex function f R V → R ∪ +∞ with dom f = and x ∈ dom f , deﬁne f x+ · · V × V → R ∪ +∞ by f x+ v u = f x+ χv − χu

u v ∈ V

Note that f x − αχu − χv − f x = αf x+ v u for sufﬁciently small α > 0. We consider (M-EXC ):

polyhedral m/l-convex functions

385

(M-EXC ) ∀x y ∈ dom f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that f x+ v u + f y+ u v ≤ 0 Theorem 4.11. For a polyhedral convex function f R V → R ∪+∞ with dom f = , (M-EXC) ⇐⇒ (M-EXC ). 4.2.2. Fundamental Properties of Polyhedral M-Convex Functions In this section, we show various properties of polyhedral M-convex functions. Global optimality of a polyhedral M-convex function is characterized by local optimality. Theorem 4.12. Let f ∈ and x ∈ dom f . Then, f x ≤ f y ∀y ∈ R V ⇐⇒ f x+ v u ≥ 0 ∀u v ∈ V . Proof. We show the ⇐ direction only. Assume, to the contrary, that f x0 < f x holds for some x0 ∈ R V . Put S = y ∈ R V f y ≤ f x0 . Let x∗ ∈ S be a vector such that !x∗ − x!1 = inf y∈S !y − x!1 . By (M-EXC) applied to x x∗ and some u ∈ supp+ x − x∗ , there exists v ∈ supp− x − x∗ and a sufﬁciently small α > 0 such that f x∗ − f x∗ + αχu − χv ≥ f x − αχu − χv − f x ≥ αf x+ v u ≥ 0 Hence, we have f x∗ + αχu − χv ≤ f x∗ ≤ f x0 , which contradicts the choice of x∗ since !x∗ + αχu − χv − x!1 = !x∗ − x!1 − 2α. A polyhedral M-convex function has supermodularity when projected onto the hyperplane x ∈ R V xw0 = 0 w0 ∈ V , as stated in Theorem 4.14. Lemma 4.13. Let f ∈ , w0 ∈ V , and x ∈ R V . Then, for any u v ∈ V and λ µ ≥ 0 we have f x + λχu − χw0 + f x + µχv − χw0 ≤ f x + f x + λχu − χw0 + µχv − χw0

(31)

Proof. Fix u v ∈ V and λ µ ≥ 0. For λ µ ∈ R+ , we denote xλ µ = x + λ χu − χw0 + µ χv − χw0 . We may assume that x0 0 xλ µ ∈ dom f . Then, it follows from Theorem 3.9 that xλ 0 xλ µ ∈ dom f

0 ≤ ∀λ ≤ λ

386

murota and shioura

Deﬁne the functions f1 f2 "0 λ# → R by f1 λ = f xλ 0

f2 λ = f xλ µ 0 ≤ λ ≤ λ

Let γ be any value with 0 < γ ≤ λ. For any β ∈ "0 γ, (M-EXC ) yields f1 + β − f2 − γ ≤ 0 since supp+ xγ µ − xβ 0 = u v and supp− xγ µ − xβ 0 = w0 . This implies f2 − f1 − γ = f2 − γ − f1 − γ = f2 − γ − limf1 + β ≥ 0 β↑γ

for any γ with 0 < γ ≤ λ. Therefore, the piecewise-linear function f2 − f1 is nondecreasing on "0 λ#, from which we have f xλ µ − f xλ 0 = f2 λ − f1 λ ≥ f2 0 − f1 0 = f x0 µ − f x0 0 i.e., the inequality (31) holds. Theorem 4.14. Let f ∈ , w0 ∈ V , and x ∈ R V . For any X Y ⊆ V \ w0 with X ∩ Y = and λw ≥ 0 w ∈ X ∪ Y , we have

f x +

w∈X

λw χw − χw0 + f x +

≤ f x + f x +

w∈X∪Y

w∈Y

λw χw − χw0

λw χw − χw0

(32)

Proof. We show the inequality (32) by induction on the cardinality of the set X ∪ Y . It sufﬁces to consider the case when X = and Y = . Note that the case when X = Y = 1 is already shown in Lemma 4.13. Therefore, we may

assume that Y ≥ 2. Let v ∈ Y . We assume that x ∈ dom f and x + w∈X∪Y λw χw − χw0 ∈ dom f . Then we have x = x + λv χv − χw0 ∈ dom f by Theorem 3.9. Hence, the inductive assumption yields f x +

w∈X

λw χw − χw0 − f x

≤ f x +

w∈X∪v

≤ f x +

w∈X∪Y

λw χw − χw0 − f x + λv χv − χw0

λw χw − χw0 − f x +

w∈Y

λw χw − χw0

polyhedral m/l-convex functions

387

Directional derivative functions and subdifferentials of a polyhedral M-convex function have nice structures such as M/L-convexity, and they can be explicitly described by certain distance functions with triangle inequality (cf. Theorem 4.15(i)). For any distance function γ V × V → R ∪ +∞ (i.e., γv v = 0 for all v ∈ V ), we deﬁne fγ R V → R ∪ ±∞ by    λuv χv − χu = x  x ∈ R V (33) λ γu v uv∈V fγ x = inf  uv∈V uv λ ≥ 0 u v ∈ V uv

Theorem 4.15. Let f ∈ and x ∈ dom f . (i)

The function γx V × V → R ∪ +∞ deﬁned by γx u v = f x+ v u

u v ∈ V

(34)

satisﬁes γx v v = 0 ∀v ∈ V and the triangle inequality (19), i.e., γx ∈ . (ii) We have f x+ · = fγx and f x+ · ∈ 0 . Proof.

(i)

For u v w ∈ V and a sufﬁciently small α > 0, we have

αγx u v + αγx v w = f x + αχv − χu − f x + f x + αχw − χv − f x ≥ f x + αχw − χu − f x = αγx u w where the inequality is by Lemma 4.13. Obviously, γx is a distance function. (ii) The proof is postponed after Theorem 4.19. L-convexity appears in subdifferentials of a polyhedral M-convex function. Theorem 4.16. Let f ∈ and x ∈ dom f . (i)

∂f x ∈ 0 and ∂f x is represented as

∂f x = Dγx = p ∈ R V pv − pu ≤ f x+ v u u v ∈ V (35) where γx V × V → R ∪ +∞ is given by (34). (ii) For any y ∈ R V we have f y − f x ≥ supp∈∂f x p y − x = fγx y − x. Proof. Equation (35) follows from Theorems 4.5 and 4.12. The L-convexity of ∂f x is immediate from (35) and Theorem 4.15(i). The claim (ii) is immediate from (i) and the linear programming duality.

388

murota and shioura

The next theorem shows that each face of the epigraph of a polyhedral M-convex function is an M-convex polyhedron when it is projected to R V . The proof is obvious and therefore omitted. Theorem 4.17. inf f "−p# > −∞.

For f ∈ and p ∈ R V , we have arg min f "−p# ∈ 0 if

The class of polyhedral M-convex functions is closed under various fundamental operations. Let f R V → R ∪ +∞ be a function. For any subset U ⊆ V , we deﬁne fU R U → R ∪ +∞ by fU y = f y 0V \U

y ∈ R U

(36)

Theorem 4.18. Let f f1 f2 ∈ . (1) For p ∈ R V , the function f "−p# R V → R ∪ +∞ given by (25) is polyhedral M-convex. (2) For a ∈ R V and ν > 0, the functions ν · f a − x and ν · f a + x are polyhedral M-convex in x. (3) For any U ⊆ V , the function fU R U → R ∪ +∞ is polyhedral M-convex if dom fU = . (4) For ϕv ∈ 1 v ∈ V , the function f˜ R V → R ∪ +∞ deﬁned by f˜x = f x + ϕv xv x ∈ R V v∈V

is polyhedral M-convex if dom f˜ = . (5) For any a V → R ∪ −∞ and b V → R ∪ +∞ with a ≤ b, the restriction f"ab# R V → R ∪ +∞ of f deﬁned by f x x ∈ "a b#, f"ab# x = (37) +∞ x ∈ "a b# is polyhedral M-convex if dom f ∩ "a b# = . (6) The convolution f1 f2 R V → R ∪ ±∞ deﬁned by (26) is polyhedral M-convex, provided f1 f2 x0 is ﬁnite for some x0 ∈ R V . (7) For U ⊆ V , the function fˆ R × R U → R ∪ ±∞ deﬁned by fˆy0 y = inff x xv = yv v ∈ U y0 = xV \ U

(38)

is polyhedral M-convex if fˆy0 y is ﬁnite (i.e., −∞ < fˆy0 y < +∞) for some y0 y ∈ R × R U . Proof. Parts (1) to (5) are easy to prove. The proofs of (6) and (7) are given in Section 5 and subsection 7.3, respectively. A more general and stronger transformation, called “network induction,” is explained in subsection 7.3.

polyhedral m/l-convex functions

389

4.2.3. Positively Homogeneous Polyhedral M-Convex Functions This section clariﬁes the relationship of positively homogeneous polyhedral M-convex functions to distance functions with triangle inequalities, and also to L-convex polyhedra. For a positively homogeneous polyhedral convex function f R V → R ∪ +∞ with 0 ∈ dom f , deﬁne γf V × V → R ∪ +∞ by γf u v = f 0+ v u = f χv − χu

u v ∈ V

(39)

Recall the deﬁnition of fγ in (33). Theorem 4.19. (ii)

(i) For f ∈ 0 , we have γf ∈ and fγf = f .

For γ ∈ , we have fγ ∈ 0 and γfγ = γ.

(iii) The mappings f (→ γf f ∈ 0 and γ (→ fγ γ ∈ provide a one-to-one correspondence between 0 and and are the inverse of each other. Proof. (i) By Theorem 4.15 (i) and Theorem 4.16 (ii), we have γf ∈

and f ≥ fγf . For any x ∈ R V and λuv uv∈V satisfying uv∈V λuv χv − χu = x and λuv ≥ 0 u v ∈ V , Theorem 4.2 implies that λuv γf u v = λuv f χv − χu u v∈V

u v∈V

≥f

u v∈V

λuv χv − χu = f x

Hence, we have fγf = f . (ii) Polyhedral M-convexity of fγ can be shown as a special case of Example 2.4. To be more precise, we have fγ x = f x ∀x ∈ R V for the function f in Example 2.4, where T = V , A = u v u v ∈ V u = v, and γu vξ ξ ≥ 0 fu v ξ = u v ∈ A +∞ ξ < 0 For any u v ∈ V , we have γfγ u v = fγ χv − χu , which is equal to the shortest path distance from u to v in the directed graph G = V A, where A = V × V and the length of an arc u v ∈ A is given by γu v. Since γ ∈ , it holds γfγ u v = γu v ∀u v ∈ V . (iii)

This is clear from (i) and (ii).

We now prove the polyhedral M-convexity of the directional derivative function f x+ ·.

390

murota and shioura

Proof of Theorem 4.15(ii). By Theorems 4.15(i) and 4.19, it sufﬁces to show f x+ · = fγx , which can be done in the same way as the proof of Theorem 4.19(i). As immediate consequences of Theorems 3.23, 4.3, and 4.19, we obtain a one-to-one correspondence between positively homogeneous polyhedral M-convex functions and L-convex polyhedra. Recall the notation Sf and δ∗S in (23) and (24). Theorem 4.20. (i)

For f ∈ 0 , we have Sf ∈ 0 and δ∗Sf = f .

(ii) For D ∈ 0 , we have δ∗D ∈ 0 and Sδ∗D = D. (iii) The mappings f (→ Sf f ∈ 0 and D (→ δ∗D D ∈ 0 provide a one-to-one correspondence between 0 and 0 and are the inverse of each other. 4.2.4. Polyhedral M- -Convex Functions We introduce a variant of polyhedral M-convex functions, called polyhedral M- -convex functions. From Theorems 3.2 and 4.7(ii), we see that the effective domain of a polyhedral M-convex function is contained in a hyperplane x ∈ R V xV = r for some r ∈ R . Therefore, no information is lost when a polyhedral M-convex function is projected onto a (V −1)dimensional space. A function f R V → R ∪ +∞ is called polyhedral M- -convex if the function f˜ R V → R ∪ +∞ deﬁned by f x x0 = −xV f˜x0 x = x0 x ∈ R V +∞ x0 = −xV = v0 ∪ V . Polyhedral is a polyhedral M-convex function, where V M -convex functions are essentially equivalent to polyhedral M-convex functions, whereas the class of polyhedral M- -convex functions properly contains that of polyhedral M-convex functions. M- -convexity was originally introduced in [37] as a concept for functions deﬁned over the integer lattice (see also [18]). Polyhedral M- -convex functions can be characterized by the following exchange property (cf. [37]): (M- -EXC) ∀x y ∈ dom f , ∀u ∈ supp+ x − y, either (i) or (ii) (or both) holds: (i) ∃v ∈ supp− x − y, ∃α0 > 0 such that f x + f y ≥ f x − αχu − χv + f y + αχu − χv

0 ≤ ∀α ≤ α0

(ii) ∃α0 > 0 such that f x + f y ≥ f x − αχu + f y + αχu

0 ≤ ∀α ≤ α0

polyhedral m/l-convex functions

391

Theorem 4.21. Let f R V → R ∪ +∞ be a polyhedral convex function with dom f = . Then, f is M- -convex if and only if it satisﬁes (M- -EXC). Proof.

Proof is given in Section 5.

Example 4.22 (Separable-Convex Functions). Let ⊆ 2V be a laminar family, i.e., is a family of subsets of V such that for any X Y ∈ at least one of X \ Y , Y \ X and X ∩ Y is empty. For fX ∈ 1 X ∈ , the function f R V → R ∪ +∞ given by f x = fX xX x ∈ R V X∈

-

is polyhedral M -convex if dom f = [6]. This can be proved similarly to Example 2.4. Let B ⊆ R V be an M- -convex polyhedron. For f0 ∈ 1 and fv ∈ 1 v ∈ V , the function f R V → R ∪ +∞ deﬁned by fv xv x ∈ B, f0 xV + f x = v∈V +∞ x ∈ B is polyhedral M- -convex if dom f = . Another class of polyhedral M- -convex functions arises from the minimum cost ﬂow problem, as in Example 2.4. From the deﬁnition of polyhedral M- -convex functions, every property of polyhedral M-convex functions can be restated in terms of polyhedral M- -convex functions, and vice versa. We present some properties below, which seem more natural when restated in terms of polyhedral M- -convex functions. For any function f R V → R ∪ +∞ and any subset U ⊆ V , we deﬁne the projection f U R U → R ∪ ±∞ of f to U as f U y = inf f y z z∈R V \U

y ∈ R U

(40)

Theorem 4.23. For any polyhedral M- -convex function f R V → R ∪ +∞ and U ⊆ V , f U is polyhedral M- -convex if f U x0 is ﬁnite for some x0 ∈ R U . Proof.

This is obvious from Theorem 4.18(7).

Theorem 4.24. If f R V → R ∪ +∞ is polyhedral M- -convex, then it satisﬁes the supermodular inequality, f x + f y ≤ f x ∧ y + f x ∨ y Proof.

This is immediate from Theorem 4.14.

∀x y ∈ R V

392

murota and shioura 4.3. Polyhedral L-Convex Functions

A polyhedral convex function g R V → R ∪ +∞ is said to be L-convex if dom g = and g satisﬁes (LF1) and (LF2): (LF1) gp + gq ≥ gp ∧ q + gp ∨ q ∀p q ∈ dom g, (LF2) ∃r ∈ R such that gp + λ1 = gp + λr ∀p ∈ dom g ∀λ ∈ R . We denote = g g R V → R ∪ +∞ polyhedral L-convex 0

= g g R V → R ∪ +∞ positively homogeneous polyhedral L-convex

As is obvious from the deﬁnition, polyhedral L-convex functions are a quantitative generalization of L-convex polyhedra. Theorem 4.25. (i) For a function g R V → 0 +∞, g ∈ ⇐⇒ dom g ∈ 0 . (ii) For g ∈ , we have dom g ∈ 0 . 4.3.1. Axioms for Polyhedral L-Convex Functions We ﬁrst show that (LF1) and (LF2) are equivalent to local properties (LF1loc ) and (LF2loc ): (LF1loc )

∀p ∈ dom g, ∃ε > 0 such that

gq1 + gq2 ≥ gq1 ∧ q2 + gq1 ∨ q2 (LF2loc )

∀q1 q2 ∈ N∞ p ε

∀p ∈ dom g, ∃ε > 0, ∃r ∈ R such that

q ∈ N∞ p ε q + λ1 ∈ N∞ p ε λ ∈ R ⇒ gq + λ1 = gq + λr Theorem 4.26. Let g R V → R ∪ +∞ be a polyhedral convex function with dom g = . Then, (i) (LF1) ⇐⇒ (LF1loc ). (ii) (LF2) ⇐⇒ (LF2loc ). Proof. (i) We show (LF1loc ) ⇒ (LF1) only. Assume (LF1loc ) for g. Then, Theorem 3.20 (i) implies (LS1) for dom g. For p ∈ dom g, we denote by εp the value ε in (LF1loc ) associated with p. Let p p ∈ dom g and recall the deﬁnition of Box"p p # in subsection 3.1. Since Box"p p # ∩ dom g is a compact set, there exist q1 q2 qm ∈ Box"p p # ∩ dom g m ≥ 1 such that Box"p p # ∩ dom g ⊆ m i=1 N∞ qi εqi . We show the submodular inequality gp + gp ≥ gp ∧ p + gp ∨ p

(41)

polyhedral m/l-convex functions

393

by induction on the integer m. We may assume m ≥ 2, since otherwise (41) holds immediately. For λ µ ∈ "0 1#, put pλ µ = p + λp ∨ p − p + µp ∧ p − p. Then, for w ∈ V we have   1 − λpw + λp w w ∈ supp+ p − p, pλ µw = pw w ∈ V \ suppp − p, (42)  1 − µpw + µp w w ∈ supp− p − p. Furthermore, for any λ µ ∈ "0 1# we have pλ µ ∈ convp p p ∧ p p ∨ p ⊆ Box"p p # ∩ dom g We may assume that p ∈ N∞ q1 εq1 . Put p∗ = pλ∗ µ∗ , where λ∗ = supλ ∈ "0 1# pλ 0 ∈ N∞ q1 εq1 µ∗ = supµ ∈ "0 1# p0 µ ∈ N∞ q1 εq1 Then, we have p∗ ∈ N∞ q1 εq1 , implying gp + gp∗ ≥ gp ∧ p∗ + gp ∨ p∗

(43)

In the following, we assume λ∗ < 1 and µ∗ < 1, since the other cases can be shown similarly and more easily. Then, we have Box"p ∨ p∗ p ∨ p∗ # ∩ dom g ⊆ Box"p ∧ p∗ p ∧ p∗ # ∩ dom g ⊆ Box"p∗ p # ∩ dom g ⊆

m i=2

m i=2 m i=2

N∞ qi εqi N∞ qi εqi

N∞ qi εqi

Hence, the inductive hypothesis implies that gp ∨ p∗ + gp ∨ p∗ ≥ gp∗ + gp ∨ p gp ∧ p∗ + gp ∧ p∗ ≥ gp ∧ p + gp∗ gp∗ + gp ≥ gp∗ ∧ p + gp∗ ∨ p where we note that p ∨ p∗ ∧ p ∨ p∗ = p∗

p ∨ p∗ ∨ p ∨ p∗ = p ∨ p

p ∧ p∗ ∧ p ∧ p∗ = p ∧ p

p ∧ p∗ ∨ p ∧ p∗ = p∗

Combining the above inequalities with (43), we obtain the submodular inequality (41).

394

murota and shioura

(ii) We show (LF2loc ) ⇒ (LF2) only. Assume (LF2loc ) for g. Then, dom g satisﬁes (LS2) by Theorem 3.20(ii). Let xi ∈ R V and λi ∈ R i = 1 k be such that gp = max 1≤i≤k p xi + λi p ∈ dom g. As is shown later, the value 1 xi is the same for all i. Thus, g satisﬁes (LF2). We now prove that 1 xi = 1 xj for any i j. Let p0 be any vector in dom g and deﬁne ψ R → R by ψµ = gp0 + µ 1 µ ∈ R . Then, ψ is piecewise-linear convex. Moreover, we have ψ+ µ+ = max 1≤i≤k 1 xi for a sufﬁciently large value of µ+ , and ψ+ µ− = min1≤i≤k 1 xi for a sufﬁciently small value of µ− . By (LF2loc ), we see that ψ is a linear function. Hence, we have max 1≤i≤k 1 xi = ψ+ µ+ = ψ+ µ− = min1≤i≤k 1 xi . Theorem 4.27. Let a V → R ∪−∞ and b V → R ∪ +∞ be such that a ≤ b. Also, let g R V → R ∪ +∞ be a function with dom g = "a b#. have

(i)

g is submodular ⇐⇒ for any p ∈ R V , any u v ∈ V u = v, we gp + λχu + gp + µχv ≥ gp + gp + λχu + µχv

(ii)

∀λ µ ≥ 0

(44)

The condition (44) can be replaced by gp + λχu + gp + µχv ≥ gp + gp + λχu + µχv

(45)

for any λ ∈ "0 pl−1 − pl # and µ ∈ "0 pm−1 − pm #, where p1 > p2 > · · · > pk are distinct values in pvv∈V , and pu = pl , pv = pm . Proof. We show the “if” part of (i) only, and the other proofs are omitted. Let p q ∈ "a b#. Note that p ∧ q p ∨ q ∈ "a b#. We show the submodular inequality for p q by induction on the cardinality of suppp − q. Without loss of generality, we assume supp− p − q ≥ 2. Let v ∈ supp− p − q, λ = qv − pv. Since p ∧ q + λχv ∈ "a b#, the inductive assumption and (44) imply that gp − gp ∧ q ≥ gp + λχv − gp ∧ q + λχv ≥ gp ∨ q − gq

4.3.2. Fundamental Properties of Polyhedral L -Convex Functions In this section, we show various properties of polyhedral L-convex functions. Lemma 4.28. Let g ∈ . Then, for any p q ∈ dom g and λ ∈ R we have gp + gq ≥ gp + λ1 ∧ q + gp ∨ q − λ1 In particular, we have gp + gq ≥ gp + λχX + gq − λχX

(46)

polyhedral m/l-convex functions

395

for any p q ∈ dom g and λ ∈ "0 λ1 − λ2 #, where λ1 = max qv − pv

X = v ∈ V qv − pv = λ1 (47)

v∈V

λ2 = max qv − pv

(48)

v∈V \X

Proof.

The inequality (46) can be obtained as LHS of 46 = gp + gq − λ1 + λr ≥ gp ∧ q − λ1 + gp ∨ q − λ1 + λr = gp ∧ q − λ1 + λ1 + gp ∨ q − λ1 = RHS of 46

The second inequality follows from this inequality since p ∨ q − λ1 − λ 1 = p + λχX

p + λ1 − λ 1 ∧ q = q − λχX

for λ ∈ "0 λ1 − λ2 #. Global optimality of a polyhedral L-convex function is characterized by local optimality. Theorem 4.29. Let g ∈ and p ∈ dom g. Then, gp ≤ gq ∀q ∈ R ⇐⇒ g p+ χX ≥ 0 ∀X ⊆ V and g p+ χV = r = 0, where r is in (LF2). V

Proof. We show the sufﬁciency by contradiction. Suppose that there exists some q ∈ R V with gq < gp and assume that q minimizes the number of distinct values in pv − qvv∈V of all such vectors. Deﬁne λ1 λ2 , and X ⊆ V by (47) and (48). Since g p+ χV = r = 0, we have X = V and λ2 > −∞. By Lemma 4.28, we obtain gp + gq ≥ gp + λ1 − λ2 χX + gq − λ1 − λ2 χX

(49)

Put q = q − λ1 − λ2 χX . Since g p+ χX ≥ 0, (49) implies gq ≤ gq < gp. This inequality, however, is a contradiction since the number of distinct values in pv − q vv∈V is less than that of pv − qvv∈V . by

Given a set function ρ 2V → R ∪ +∞, deﬁne gρ R V → R ∪ +∞ gρ p =

k−1 j=1

pj − pj+1 ρVj + pk ρVk

(50)

where p1 > p2 > · · · > pk are distinct values in pvv∈V , and Vj = v ∈ V pv ≥ pj j = 1 k. The function gρ is called the Lov´ asz extension of ρ.

396

murota and shioura

Theorem 4.30. If ρ ∈ , then gρ p = sup p x x∈Bρ

Proof.

∀p ∈ R V

See [17].

Theorem 4.31 (Lov´asz [24]). Let ρ 2V → R ∪ +∞ be a function such that ρ = 0 and ρV < +∞. Then, ρ ∈ ⇐⇒ gρ is convex. Directional derivative functions and subdifferentials of a polyhedral L-convex function have nice structures such as M/L-convexity, and they can be explicitly described by certain submodular functions (cf. Theorem 4.32(i)). Theorem 4.32. (i)

Let g ∈ and p ∈ dom g.

The function ρp 2V → R ∪ +∞ deﬁned by ρp X = g p+ χX

X ⊆ V

(51)

satisﬁes ρp = 0, −∞ < ρp V < +∞, and the submodular inequality (7), i.e., ρp ∈ . (ii) We have g p+ · = gρp and g p+ · ∈ 0 . Proof. We show g p+ · ∈ 0 only. The claim (i) is an immediate corollary of (ii), and the equation g p+ · = gρp will be shown later in Theorem 4.36(i). Let p ∈ dom g. For q1 q2 ∈ R V and a sufﬁciently small µ > 0, we have g p+ q1 + g p+ q2 = gp + µq1 − gp/µ + gp + µq2 − gp/µ ≥ gp + µq1 ∧ q2 − gp/µ + gp + µq1 ∨ q2 − gp/µ = g p+ q1 ∧ q2 + g p+ q1 ∨ q2 Hence, (LF1) holds for g p+ ·. Condition (LF2) for g p+ · can be shown similarly. M-convexity appears in subdifferentials of a polyhedral L-convex function. Theorem 4.33. Let g ∈ and p ∈ dom g. (i)

∂gp ∈ 0 and ∂gp is represented as ∂gp = Bρp = x ∈ R V xX ≤ g p+ χX ∀X ⊆ V xV = g p+ χV

where ρp 2V → R ∪ +∞ is deﬁned by (51).

(52)

polyhedral m/l-convex functions (ii)

397

For any q ∈ R V we have gp + q − gp ≥ sup q x = gρp q x∈∂gp

Proof. From Theorems 4.5 and 4.29 follows (52), which, together with Theorem 4.32(i), implies ∂gp ∈ 0 . The claim (ii) is immediate from (i) and Theorem 4.30. The next theorem shows that each face of the epigraph of a polyhedral L-convex function is an L-convex polyhedron when it is projected to R V . The proof is obvious and therefore omitted. Theorem 4.34. inf g"−x# > −∞.

For g ∈ and x ∈ R V , we have arg min g"−x# ∈ 0 if

The class of polyhedral L-convex functions is closed under various fundamental operations. Theorem 4.35. Let g g1 g2 ∈ . (1) For x ∈ R V , the function g"−x# R V → R ∪ +∞ given by (25) is polyhedral L-convex. (2) For a ∈ R V , β ∈ R , and ν > 0, the function ν · ga + βp is polyhedral L-convex in p. (3) For any U ⊆ V , the function gU R U → R ∪ ±∞ given by (40) is polyhedral L-convex if gU p0 is ﬁnite for some p0 ∈ R V . (4) For ψv ∈ 1 v ∈ V , the function g˜ R V → R ∪ ±∞ deﬁned by gp ˜ = inf gq + ψv pv − qv p ∈ R V q∈R V

v∈V

is polyhedral L-convex if gp ˜ 0 is ﬁnite for some p0 ∈ R V . (5) For any γ ∈ , the restriction gDγ R V → R ∪ +∞ of g deﬁned by gp p ∈ Dγ, gDγ p = +∞ p ∈ Dγ is polyhedral L-convex if dom g ∩ Dγ = . (6)

g1 + g2 ∈ if dom g1 ∩ dom g2 = .

A more general and stronger transformation, called “network induction,” is explained in subsection 7.3.

398

murota and shioura

4.3.3. Positively Homogeneous Polyhedral L-Convex Functions This section clariﬁes the relationship of positively homogeneous polyhedral L-convex functions with submodular functions, and with M-convex polyhedra. For a positively homogeneous polyhedral convex function g R V → R ∪ +∞ with 0 ∈ dom g, deﬁne a set function ρg 2V → R ∪ +∞ by ρg X = g 0+ χX = gχX

X ⊆ V

(53)

Recall the deﬁnition of the Lov´asz extension gρ in (50). Theorem 4.36.

For g ∈ 0 , we have ρg ∈ and gρg = g.

(i)

(ii) For ρ ∈ , we have gρ ∈ 0 and ρgρ = ρ. (iii) The mappings g (→ ρg g ∈ 0 and ρ (→ gρ ρ ∈ provide a one-to-one correspondence between 0 and and are the inverse of each other. Proof. (i) Condition (LF1) for g yields ρg ∈ . We see from Theorem 4.33(ii) that g ≥ gρg . On the other hand, from Theorem 4.2 follows gρg p =

k−1 j=1

≥g

pj − pj+1 gχVj + pk gχVk

k−1 pj − pj+1 χVj + pk χVk = gp j=1

where pj and Vj j = 1 k are deﬁned as in the Lov´asz extension (50). (ii) We show (LF1) for gρ only. The other claims are obvious. First assume that ρ < +∞. By Theorem 4.27, it sufﬁces to show that (45) holds for any p ∈ R V and u v ∈ V u = v, where g = gρ . Deﬁne pj and Vj j = 1 k as in the Lov´asz extension (50). Let l, m be such that pu = pl , pv = pm , and λ ∈ "0 pl−1 − pl #, µ ∈ "0 pm−1 − pm #. From the deﬁnition of gρ we have gρ p + λχu = gρ p + λρVl−1 ∪ u − ρVl−1 gρ p + µχv = gρ p + µρVm−1 ∪ v − ρVm−1 (Case 1. l = m.) We may assume that l < m. Then, (45) holds with equality, gρ p + λχu + µχv = gρ p + λρVl−1 ∪ u − ρVl−1 + µρVm−1 ∪ v − ρVm−1 = gρ p + λχu + gρ p + µχv − gρ p

polyhedral m/l-convex functions

399

(Case 2. l = m.) We may assume that λ ≥ µ. Then gρ p + λχu + µχv = gρ p + λρVl−1 ∪ u − ρVl−1 + µρVl−1 ∪ u v − ρVl−1 ∪ u ≤ gρ p + λρVl−1 ∪ u − ρVl−1 + µρVl−1 ∪ v − ρVl−1 = gρ p + λχu + gρ p + µχv − gρ p where the inequality is by the submodularity of ρ. Hence, (45) follows. Next, we consider the general case where ρ may take the value +∞. Let x ∈ Bρ. For each positive integer k, we deﬁne ρk 2V → R by ρk X = min ρX \ Y + xY + kY Y ⊆X

X ⊆ V

Note that ρk is the submodular function associated with y ∈ Bρ yv ≤ xv + k v ∈ V , which is a bounded M-convex polyhedron. Hence, we have ρk < +∞ and gρk satisﬁes (LF1). Since ρX = limk→∞ ρk X ∀X ⊆ V , it holds that gρ p = limk→∞ gρk p p ∈ R V and therefore gρ also satisﬁes (LF1). (iii)

This is clear from (i) and (ii).

From Theorem 4.36, we see that a polyhedral convex function is positively homogeneous polyhedral L-convex if and only if it is the Lov´asz extension of a submodular set function. Corollary 4.37.

0

= gρ ρ ∈ .

As immediate consequences of Theorems 3.3, 4.3, and 4.36, we obtain a one-to-one correspondence between positively homogeneous polyhedral L-convex functions and M-convex polyhedra. Recall the notation Sg and δ∗S in (23) and (24). Theorem 4.38. (ii)

(i)

For g ∈ 0 , we have Sg ∈ 0 and δ∗Sg = g.

For B ∈ 0 , we have δ∗B ∈ 0 and Sδ∗B = B.

(iii) The mappings g (→ Sg g ∈ 0 and B (→ δ∗B B ∈ 0 provide a one-to-one correspondence between 0 and 0 , and are the inverse of each other.

400

murota and shioura

4.3.4. Polyhedral L- -Convex Functions Due to the property (LF2), a polyhedral L-convex function loses no information when restricted to a hyperplane p ∈ R V pv = 0 for any v ∈ V . We call a polyhedral convex function g R V → R ∪ +∞ L- -convex if the function g˜ R V → R ∪ +∞ deﬁned by gp ˜ 0 p = gp − p0 1

p0 p ∈ R V

(54)

= v0 ∪ V . We see that polyhedral L- is polyhedral L-convex, where V convex functions are essentially the same as polyhedral L-convex functions, while the class of polyhedral L- -convex functions properly contains that of polyhedral L-convex functions. The concept of L- -convexity was originally introduced in [18] as a concept for functions deﬁned over the integer lattice. Theorem 4.39. Let g R V → R ∪ +∞ be a polyhedral convex function with dom g = . Then g is L- -convex ⇐⇒ g satisﬁes (46) for any p q ∈ dom g and λ ≥ 0. Proof. The “only if” part is clear from the deﬁnition of L- -convex functions and Lemma 4.28. Hence, we show the “if” part only. It suf ﬁces to show that g˜ R V → R ∪ +∞ deﬁned by (54) fulﬁlls (LF1). Let p0 p q0 q ∈ dom g, ˜ and without loss of generality assume p0 ≥ q0 . Then gp ˜ 0 p + gq ˜ 0 q = gp − p0 1 + gq − q0 1 ≥ gp − p0 1 +p0 − q0 1 ∧ q − q0 1 + gp − p0 1 ∨ q − q0 1 −p0 − q0 1 = gp ∧ q − q0 1 + gp ∨ q − p0 1 = gp ˜ 0 ∧ q0 p ∧ q + gp ˜ 0 ∨ q0 p ∨ q where the inequality is by (46). Hence, (LF1) holds for g. ˜ Example 4.40 (Separable-Convex Functions). Let D ⊆ R V be an L- convex polyhedron. For any convex functions gv ∈ 1 v ∈ V , the function g R V → R ∪ +∞ deﬁned by gv pv p ∈ D, gp = v∈V +∞ p ∈ D is polyhedral L- -convex if dom g = . From the deﬁnition of polyhedral L- -convex functions, every property of polyhedral L-convex functions can be restated in terms of polyhedral L- convex functions, and vice versa. We show two properties below, which seem more natural when restated in terms of polyhedral L- -convex functions.

polyhedral m/l-convex functions

401

Theorem 4.41. Let g R V → R ∪ +∞ be a polyhedral L- -convex function. (i) For any a V → R ∪−∞ and b V → R ∪ +∞, the restriction g"a b# R V → R ∪ +∞ of g to "a b# deﬁned by (37) is polyhedral L- -convex if dom g ∩ "a b# = . (ii) For any U ⊆ V , the function gU R U → R ∪ +∞ given by (36) is polyhedral L- -convex if dom gU = . Proof. Part (i) is immediate from the translation of Theorem 4.35(5), and (ii) is a corollary of (i). Examples 4.22 and 4.40 show that any separable-convex function is polyhedral M- -convex as well as polyhedral L- -convex. The converse is also true, as shown in Theorem 4.42 below. Theorem 4.42 (cf. [38, Theorem 3.17]). Let f R V → R ∪ +∞ be a polyhedral convex function with dom f = . Then f is both M - -convex and L- -convex if and only if f is separable-convex. Proof. Let f be both M - -convex and L- -convex. Theorems 3.31, 4.7(ii), and 4.25 (ii) imply that dom f is an interval; i.e., there exist a V → R ∪−∞ and b V → R ∪+∞ with av ≤ bv v ∈ V such that dom f = "a b#. Similarly, Theorems 3.31, 4.17, and 4.34 imply that arg min f "−p# is an interval for any p ∈ R V with inf f "−p# > −∞. Hence, kv for each v ∈ V there exist a sequence ai vi=0 ⊆ R ∪ ±∞ kv ≥ 0 such that • a0 v = av < a1 v < · · · < akv −1 v < akv v = bv, • for any iv = 1 2 kv v ∈ V , the function f is linear over the interval x ∈ R V aiv −1 v ≤ xv ≤ aiv v v ∈ V . Due to these properties, we have f x + αχv − f x = f y + αχv − f y for any x y ∈ dom f with xv = yv and any α ∈ R . For all v ∈ V , put fv α = f x0 + α − x0 vχv − f x0 where x0 ∈ dom f . Then, we have f x = fv xv + f x0 v∈V

α ∈ R

x ∈ R V

Moreover, each fv is polyhedral convex since f itself is polyhedral convex. Therefore, f is a separable-convex function. Remark 4.43. There exists no function which is both polyhedral M-convex and L-convex, i.e., ∩ = (see Remark 3.32).

402

murota and shioura 5. CONJUGACY AND CHARACTERIZATIONS

Polyhedral M-convex and L-convex functions are conjugate to each other. Theorem 5.1. For f ∈ and g ∈ , we have f • ∈ and g• ∈ . Moreover, the mappings f (→ f • f ∈ and g (→ g• g ∈ provide a oneto-one correspondence between and and are the inverse of each other. Polyhedral M/L-convex functions are characterized by local polyhedral structures such as directional derivative functions, subdifferentials, and the sets of minimizers. Theorem 5.2. Let f R V → R ∪ +∞ be a polyhedral convex function with dom f = . Then i f ∈ ⇐⇒ ii f x+ · ∈ 0 ∀x ∈ dom f ⇐⇒ iii ∂f x ∈ 0 ∀x ∈ dom f ⇐⇒ iv arg min f "−p# ∈ 0 ∀p ∈ R V with inf f "−p# > −∞ Theorem 5.3. Let g R V → R ∪ +∞ be a polyhedral convex function with dom g = . Then i g ∈ ⇐⇒ ii g p+ · ∈ 0 ∀p ∈ dom g ⇐⇒ iii ∂gp ∈ 0 ∀p ∈ dom g ⇐⇒ iv arg min g"−x# ∈ 0 ∀x ∈ R V with inf g"−x# > −∞ In the following, we ﬁrst prove Theorem 5.3, then Theorem 5.1, and ﬁnally Theorem 5.2. Proof of Theorem 5 3. The implication (i) ⇒ (ii) is by Theorem 4.32(ii). To show the reverse implication, we assume (ii) for g. For any p ∈ dom g, there exists ε > 0 such that gq − gp = g p+ q − p

∀q ∈ N∞ p ε

Hence, g satisﬁes (LF1loc ) and (LF2loc ), yielding (i) by Theorem 4.26. The equivalence (ii) ⇐⇒ (iii) is by Theorems 4.4 and 4.38. The implication (i) ⇒ (iv) follows from Theorem 4.34. We conclude the proof of Theorem 5.3 by showing that (iv) ⇒ (ii) holds. Lemma 5.4. Let g R V → R ∪ +∞ be positively homogeneous polyhedral convex with dom g = . Suppose that arg min g"−x# is an L-convex cone for any x ∈ R V with inf g"−x# > −∞. Then g ∈ 0 .

polyhedral m/l-convex functions

403

Proof. Deﬁne ρg 2V → R ∪ +∞ by (53). Since 0 ∈ dom g, there exists some x ∈ R V with 0 ∈ arg min g"−x#. By (LS2) for arg min g"−x#, we have 1 ∈ arg min g"−x# ⊆ dom g, i.e., ρg V = g1 < +∞. As is shown below, we have g = gρg , where gρg is deﬁned by (50) with ρ = ρg . Thus, we have ρg ∈ by Theorem 4.31, which, together with Corollary 4.37, implies g ∈ 0 . We now prove g = gρg . Note that gp ≤ gρg p p ∈ R V by Theorem 4.2. This implies that gp = gρg p = +∞ if p ∈ dom g. Therefore, we may assume that p ∈ dom g. Then there exists some x ∈ R V such that p ∈ arg min g"−x#. By Theorem 3.25, there exists some chain 0 ⊆ 2V and λX X ∈ 0 ⊆ R such that p= λ X χX λX ≥ 0 ∀X ∈ 0 \ V X∈0

0 ⊆ X ⊆ V χX ∈ arg min g"−x# By the linearity of g over arg min g"−x# and by (50), we have gp = λX gχX = gρg p X∈0

We now prove (iv) ⇒ (ii). For any p ∈ dom g, we denote gp = g p+ · for notational simplicity. By Lemma 5.4, we have only to prove that arg min gp "−x# is an L-convex cone for any p ∈ dom g and any x ∈ R V with inf gp "−x# > −∞. Let p ∈ dom g, and x ∈ R V be such that inf gp "−x# > −∞. Then we have inf g"−x# > −∞ and arg min g"−x# ∈ 0 . Let γ ∈ be such that Dγ = arg min g"−x#. Since arg min gp "−x# is the tangent cone of arg min g"−x# at p, it can be represented as arg min gp "−x# = q ∈ R V qv − qu ≤ 0 ∀u v ∈ Ap where Ap = u v u v ∈ V pv − pu = γu v. Since Ap is transitive, arg min gp "−x# is an L-convex cone by Theorem 3.24. Thus, Theorem 5.3 is proven. The following lemma is needed for the proof of Theorem 5.1. Lemma 5.5. Let g ∈ , and x y ∈ R V be such that inf g"−x# > −∞ and inf g"−y# > −∞. Then for any u ∈ supp+ x − y, there exists v ∈ supp− x − y such that pv − pu ≤ qv − qu ∀p ∈ arg min g"−x# ∀q ∈ arg min g"−y#

(55)

404

murota and shioura

Proof. First we note that xV = yV = r, where r ∈ R is the value in (LF2) for g. Let u ∈ supp+ x − y. We have arg min g"−x# arg min g"−y# ∈ 0 by Theorem 4.34. Hence, it sufﬁces to show the following: there exists some v ∈ supp− x − y such that pv ≤ qv ∀p ∈ Dx ∀q ∈ Dy , where Dx = p p ∈ arg min g"−x# pu = 0 Dy = q q ∈ arg min g"−y# qu = 0 Assume, to the contrary, that for any v ∈ supp− x − y, there exists a pair of vectors pv ∈ Dx , qv ∈ Dy such that pv v > qv v. Set p∗ = pv v ∈ supp− x − y q∗ = qv v ∈ supp− x − y Then we have p∗ ∈ Dx , q∗ ∈ Dy , and p∗ v > q∗ v ∀v ∈ supp− x − y. Let λ > 0 be any value with λ ≤ p∗ v − q∗ v for all v ∈ supp+ p∗ − q∗ . Putting p∗ v − λ v ∈ supp+ p∗ − q∗ , p = p∗ − λ1 ∨ q∗ = q∗ v v ∈ V \ supp+ p∗ − q∗ q∗ v + λ v ∈ supp+ p∗ − q∗ , q = p∗ ∧ q∗ + λ1 = p∗ v v ∈ V \ supp+ p∗ − q∗ , we have gp∗ + gq∗ ≥ gp + gq

(56)

by Lemma 4.28. Since supp− x − y ⊆ supp+ p∗ − q∗ , we obtain p x + q y − p∗ x − q∗ y = λ yv − xv v ∈ supp+ p∗ − q∗ + q∗ v − p∗ vxv − yvv ∈ V \ supp+ p∗ − q∗ ≥ λ yv − xv v ∈ V \ u = λxu − yu > 0 Combining this inequality with (56), we have g"−x#p + g"−y#q < g"−x#p∗ + g"−y#q∗ which is a contradiction since p∗ ∈ arg min g"−x#, q∗ ∈ arg min g"−y#. Proof of Theorem 5 1. First recall Theorem 4.1. [g ∈ ⇒ g• ∈ ] Let x y ∈ dom g• . Then, inf g"−x# > −∞ and inf g"−y# > −∞. By Lemma 5.5, for any u ∈ supp+ x − y there exists v ∈ supp− x − y satisfying the inequality (55), which implies g• x+ v u + g• y+ u v = suppv − pu p ∈ arg min g"−x# + supqu − qv q ∈ arg min g"−y# ≤ 0

polyhedral m/l-convex functions

405

where the equality is by Theorems 4.4 and 4.5. Hence, we have (M-EXC ) for g• and therefore (M-EXC) by Theorem 4.11. [f ∈ ⇒ f • ∈ ] If f ∈ , then Theorems 4.5 and 4.17 imply ∂f • p = arg min f "−p# ∈ 0

∀p ∈ dom f •

which, together with Theorem 5.3, yields f • ∈ . Proof of Theorem 5 2. The equivalence (ii) ⇐⇒ (iii) is by Theorems 4.4 and 4.20. By Theorems 4.5 and 5.1, the equivalence (i) ⇐⇒ (iii) ⇐⇒ (iv) is rewritten as follows in terms of f • , f • ∈ ⇐⇒ arg min f • "−x# ∈ 0 ∀x ∈ R V with inf f • "−x# > −∞ ⇐⇒ ∂f • p ∈ 0 ∀p ∈ dom f • which is already shown in Theorem 5.3. Proof of Theorem 4 186. From the assumption and Theorem 4.6, f1 f2 is a polyhedral convex function with f1 f2 x > −∞ ∀x ∈ R V . Hence, it follows from Theorem 4.1 that f1 f2 • is also a polyhedral convex function such that f1 f2 • x > −∞ ∀x ∈ R V and domf1 f2 • = . Theorems 4.6, 4.35(6), and 5.1 imply f1 f2 • = f1• + f2• ∈ , which shows f1 f2 ∈ by Theorem 5.1. Proof of Theorem 4 21. Let f R V → R ∪ +∞ be a polyhedral convex function with dom f = satisfying (M- -EXC). Let p ∈ R V be any vector with inf f "−p# > −∞. It follows from (M- -EXC) that arg min f "−p# satisﬁes (G-EXC), i.e., arg min f "−p# is an M- -convex polyhedron by Theorem 3.8. From the deﬁnition of polyhedral M- -convex functions and Theorem 5.2, the function f is polyhedral M- -convex. 6. RELATIONSHIP WITH M/L-CONVEX FUNCTIONS OVER THE INTEGER LATTICE The concepts of M-convexity and L-convexity were originally introduced for functions deﬁned over the integer lattice [29, 30, 32]. In this section, we present the relationship of M/L-convexity over the integer lattice and polyhedral M/L-convexity. A function f ZV → R ∪ +∞ is called discrete M-convex if domZ f = and it satisﬁes (M-EXC[Z]) ∀x y ∈ domZ f , ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such that f x + f y ≥ f x − χu + χv + f y + χu − χv

406

murota and shioura

where domZ f = x ∈ ZV −∞ < f x < +∞. On the other hand, a function g ZV → R ∪ +∞ is called discrete L-convex if domZ g = and it satisﬁes (LF1[Z]) gp+gq ≥ gp∧q+gp∨q ∀pq ∈ domZ g, (LF2[Z]) ∃r ∈ R such that gp + λ1 = gp + λr ∀p ∈ domZ g ∀λ ∈ Z. We denote by "Z# and "Z#, respectively, the classes of discrete M-convex and L-convex functions: "Z# = f f ZV → R ∪ +∞ discrete M-convex "Z# = g g ZV → R ∪ +∞ discrete L-convex In the following, we sometimes regard f ZV → R ∪ +∞ as a function deﬁned over R V . In such a case, we have dom f ⊆ ZV . Remark 6.1. The concept of M-convex function over the integer lattice was ﬁrst deﬁned in [29, 30], where the effective domain is assumed to be bounded. The paper [32] discusses the case when the effective domain is not necessarily bounded, but assumes that functions take only integer values. The concept of L-convex function over the integer lattice appeared ﬁrst in [32], where function values are assumed be integral. Hence, the deﬁnition of L-convex functions above is a slight extension of the original one. We also deﬁne the set version of discrete M/L-convexity as follows. A set B ⊆ ZV is called discrete M-convex if B = and it satisﬁes that

(B-EXC[Z]) ∀x y ∈ B, ∀u ∈ supp+ x − y, ∃v ∈ supp− x − y such x − χu + χv ∈ B

y + χu − χv ∈ B

We call D ⊆ ZV discrete L-convex if D = and it satisﬁes (LS1[Z]) p q ∈ D ⇒ p ∧ q p ∨ q ∈ D, (LS2[Z]) p ∈ D ⇒ p + λ1 ∈ D ∀λ ∈ Z. We denote 0 "Z# = B B ⊆ ZV discrete M-convex 0 "Z# = D D ⊆ ZV discrete L-convex In this section, we use some known results on discrete M/L-convexity. See [29, 30, 32, 35] as references. Discrete M/L-convex functions can be extended to ordinary convex functions by taking their convex closure. Moreover, convex extensions of discrete M/L-convex functions coincide with “local” convex extensions. For any x ∈ R V and any integer k ≥ 0, we deﬁne the set HCk x = y ∈ ZV .xv/ − k ≤ yv ≤ 0xv1 + k v ∈ V

polyhedral m/l-convex functions

407

$ ∈ 0 and Theorem 6.2. For B ∈ 0 "Z#, we have B $ .xv/ ≤ yv ≤ 0xv1v ∈ V B ∩ HC0 x = y ∈ B

x ∈ R V (57)

Proof. Let ρ 2V → Z ∪+∞ be the submodular function such that B = Bρ ∩ ZV . Since Bρ is an integral polyhedron (cf. [17]), we have $ ∈ 0 . For any a b ∈ ZV the set B $ ∩ "a b# is also an integral Bρ = B polyhedron if it is nonempty. Thus (57) follows. $ ∈ 0 and Theorem 6.3. For D ∈ 0 "Z#, we have D $ .pv/ ≤ qv ≤ 0pv1 v ∈ V D ∩ HC0 p = q ∈ D

p ∈ R V (58)

Proof. Let γ V × V → Z ∪+∞ be the distance function with triangle inequality which is associated with D (see [32, Theorem 4.20]; [35, Theorem $= 2.28]), i.e., γ is such that D = Dγ ∩ ZV . Lemma 3.21 (v) implies D $ ∩ "a b# is also an integral Dγ ∈ 0 . Note that for any a b ∈ ZV the set D polyhedron if it is nonempty. Thus (58) follows. Let f ZV → R ∪ +∞. The convex closure f¯ R V → R ∪ ±∞ of f is given by x ∈ R V f¯x = sup p x + α p y + α ≤ f y y ∈ ZV p∈R V α∈R

We also deﬁne a function f˜ R V → R ∪ +∞ by f˜x = sup p x + α p y + α ≤ f y y ∈ HC0 x

(59)

p∈R V α∈R

x ∈ R V (60) Note that f˜ is the local convex extension of f , i.e., the convex closure of the restriction of f to the integral points around x. It admits an alternative expression λy y = x λy = 1 λy f y y∈HC0 x f˜x = inf y∈HC0 x λ ≥ 0 y ∈ HC x y∈HC x 0

y

0

x ∈ R V (61) by LP duality. From the deﬁnition, we have f˜x = f x ∀x ∈ ZV f˜x ≥ f¯x ∀x ∈ R V

(62)

Favati and Tardella [11] investigated the class of functions f ZV → R ∪ +∞ for which f¯ = f˜ holds. They call such a function integrally convex. It is easy to see that global optimality of integrally convex functions can be characterized by local optimality.

408

murota and shioura

Theorem 6.4 [11, Proposition 3.1]. Let f ZV → R ∪ +∞ be an integrally convex function and x ∈ dom f . Then f x ≤ f y for any y ∈ ZV if and only if f x ≤ f y for any y ∈ ZV with y − x∞ ≤ 1. Theorems 6.5 and 6.7 show that discrete M/L-convex functions are integrally convex. Theorem 6.5. For f ∈ "Z#, we have f¯x = f˜x ∀x ∈ R V and f¯x = f x ∀x ∈ ZV . Proof. We prove the former only. The latter follows from the former and the equation in (62). We ﬁrst consider the case when x ∈ domZ f . Deﬁne the conjugate f • of f by (22). Let ρ 2V → Z ∪ +∞ be the submodular function associated with the discrete M-convex set domZ f ∈ 0 "Z#, and p ∈ dom f • . If xV = ρV , then f¯x ≥ sup p + αχV x − sup p + αχV y − f y α∈R

y∈R V

= p x − f • p + sup αxV − ρV = +∞ α∈R

In the similar way, we can show that f¯x = +∞ when xX > ρX ∃X ⊂ V . By the inequality in (62), we have f˜x = f¯x = +∞ To show f¯x = f˜x for x ∈ domZ f , we consider the following dual pair of LP problems: LP1 maximize p x + α subject to p y + α ≤ f y y ∈ HC1 x p ∈ R V α ∈ R

LP2 minimize

y∈HC1 x

subject to

y∈HC1 x

λy f y λy y = x

y∈HC1 x

λy = 1 λy ≥ 0 y ∈ HC1 x

Problem (LP2) has a feasible solution since x ∈ HC1 x ∩ domZ f . Let p∗ α∗ ∈ R V × R and λ∗ = λ∗y y ∈ HC1 x be optimal solutions of (LP1) and (LP2), respectively. Then, it holds that f¯x ≤ p∗ x + α∗ = λ∗y f y ≤ f˜x (63) y∈HC1 x

We will show that both of the inequalities hold with equality. Put B = y ∈ HC1 x p∗ y + α∗ = f y = arg min f "−p∗ #y y∈HC1 x

∈ 0 "Z#

polyhedral m/l-convex functions

409

The complementary slackness condition implies y ∈ HC1 x λ∗y > $. In particular, we have x ∈ B ∩ HC0 x by 0 ⊆ B, which shows x ∈ B Theorem 6.2 since B ∈ 0 "Z#. Hence, there is another optimal solution λ˜ = λ˜ y y ∈ HC1 x of (LP2) such that if λ˜ y > 0 then y ∈ B ∩ HC0 x. Since λ˜ y f y = λ˜ y f y ≥ f˜x λ∗y f y = y∈HC1 x

y∈HC1 x

y∈HC0 x

the second inequality in (63) holds with equality. Let y0 ∈ B ∩ HC0 x. Then we have f "−p∗ #y0 − χu + χv ≥ f "−p∗ #y0 for any u v ∈ V . Since local optimality means global optimality for discrete M-convex functions [32, Theorem 4.6], we have α∗ = f "−p∗ #y0 ≤ f "−p∗ #y = −p∗ y + f y ∀y ∈ domZ f , i.e., p∗ y + α∗ ≤ f y ∀y ∈ domZ f . By (59), the ﬁrst inequality in (63) holds with equality. The following theorem claims that we can choose a common optimal λ in (61) for two discrete M-convex functions. An implication of this fact is discussed later in Corollary 6.8(i). Theorem 6.6. For any f g ∈ "Z# and x ∈ R V , there exists λ = λy y ∈ HC0 x such that λy y = x λy = 1 λy ≥ 0 y ∈ HC0 x (64) y∈HC0 x

f¯x = f˜x =

y∈HC0 x

y∈HC0 x

λy f y

gx ¯ = gx ˜ =

y∈HC0 x

λy gy (65)

Proof. We may assume that x ∈ domZ f ∩ domZ g, which implies both f˜x and gx ˜ are ﬁnite. By (60), there exist p α q β ∈ R V × R such that p y + α ≤ f y y ∈ HC0 x

p x + α = f˜x

q y + β ≤ gy y ∈ HC0 x

q x + β = gx ˜

Put Bf = y ∈ HC0 x p y + α = f y

∈ 0 "Z#

Bg = y ∈ HC0 x q y + β = gy

∈ 0 "Z#

Then we have x ∈ Bf ∩ Bg = Bf ∩ Bg . Therefore, there exists λ = λy y ∈ HC0 x satisfying (64) and λy = 0 y ∈ Bf ∩ Bg . Such λ satisﬁes (65) by LP-duality.

410

murota and shioura

Theorem 6.7 [32, Theorem 4.18]. Let g ∈ "Z#. (1)

For q ∈ domZ g and p ∈ "0 1#V , we have gq ¯ + p = gq ˜ + p = gq +

k

pj − pj+1 gq + χVj − gq

(66)

j=1

where pj and Vj j = 1 k are as in the deﬁnition of the Lov´ asz extension (50) and pk+1 = 0. (2) gp ¯ = gp ˜ = gp ∀p ∈ ZV . (3) gp ¯ + λ1 = gp ¯ + λr ∀p ∈ R V λ ∈ R , where r is in (LF2[Z]). (4) The equalities in (66) remain valid for q ∈ domZ g and p ∈ R V with max v∈V pv − minv∈V pv ≤ 1. As a corollary of Theorems 6.6 and 6.7, we obtain the following properties. For a function g ZV → R ∪ −∞, we deﬁne the concave closure g R V → R ∪ ±∞ of g by gx =

inf

p∈R V α∈R

p x + α p y + α ≥ gy y ∈ ZV

x ∈ R V

Note that g = −−g. Corollary 6.8. (i) Let f , g be functions such that f −g ∈ "Z#. If f x ≥ gx for any x ∈ ZV , then we have f¯x ≥ gx for any x ∈ R V . (ii) Let f , g be functions such that f −g ∈ "Z#. If f p ≥ gp for any p ∈ ZV , then we have f¯p ≥ gp for any p ∈ R V . The following theorems show that the convex extension of discrete M/L-convex functions are closely related to polyhedral M/L-convexity. Theorem 6.9. Let f ∈ "Z#. If f¯ is polyhedral convex, then f¯ ∈ . In particular, f¯ ∈ if domZ f is bounded. Proof.

By Theorems 6.2 and 6.5 (see also [32, Theorem 4.11]), we have arg min f¯"−p# = arg min f "−p# ∈ 0

∀p ∈ R V

which, together with Theorem 5.2, implies f¯ ∈ . Theorem 6.10. Let g ∈ "Z#. If g¯ is polyhedral convex, then g¯ ∈ . In particular, g¯ ∈ if p ∈ domZ g pv = 0 is bounded for some v ∈ V . Proof.

By Theorems 6.3 and 6.7 (1), we have arg min g"−x# ¯ = arg min g"−x# ∈ 0

which, together with Theorem 5.3, implies g¯ ∈ .

∀x ∈ R V

polyhedral m/l-convex functions

411

Remark 6.11. For f ZV → R ∪ +∞, if domZ f is bounded then its convex closure f¯ is polyhedral convex, but in general f¯ is not necessarily polyhedral convex, even if f is discrete M/L-convex. For example, consider the function f Z2 → Z ∪ +∞ deﬁned by 2 x x y ∈ Z2 x + y = 0, f x y = +∞ (otherwise). It is easy to see that f is discrete M-convex. However, the convex closure f¯ of f has inﬁnite number of linear-pieces and is not a polyhedral convex function. We now introduce two new classes of M/L-convexity. We call a function integral M-convex (resp. integral L-convex) if it is represented as the convex closure of some integer-valued discrete M-convex (resp. discrete L-convex) function. We denote by int and int , respectively, the classes of integral M-convex and L-convex functions: int = f f = fZ for some fZ ZV → Z ∪ +∞ fZ ∈ "Z# int = g g = gZ for some gZ ZV → Z ∪ +∞ gZ ∈ "Z# Note that integral M/L-convex functions may have inﬁnite number of linear pieces and therefore are not polyhedral convex in general if the effective domain is unbounded (see Remark 6.11). It was shown in [32] that integer-valued discrete M-convex and Lconvex functions are conjugate to each other under the integer Fenchel transformation. Theorem 6.12 [32, Theorem 4.24]. (i) For an integer-valued discrete M-convex function f ZV → Z ∪ +∞, the function g ZV → Z ∪ +∞ deﬁned by gp = sup p x − f x x∈ZV

p ∈ ZV

is integer-valued discrete L-convex. (ii) For an integer-valued discrete L-convex function g ZV → Z ∪ +∞, the function f ZV → Z ∪ +∞ deﬁned by f x = sup p x − gp p∈Z

V

x ∈ ZV

is integer-valued discrete M-convex. (iii) The mappings f (→ g and g (→ f deﬁned in (i) and (ii), respectively, provide a one-to-one correspondence between the classes of integer-valued discrete M-convex and L-convex functions and are the inverse of each other.

412

murota and shioura

Using this conjugacy relationship, we can show that integral M-convex and L-convex functions are conjugate to each other. Lemma 6.13. Let f ZV → Z ∪ +∞ be an integrally convex function. For any p ∈ R V , if inf f "−p# is ﬁnite, then arg min f "−p# = . Proof.

Put

ε0 = min pS − pT − 0pS − pT 1 + 1 ST ⊆V

> 0

We claim that for any x y ∈ domZ f with y − x∞ ≤ 1, if f "−p#y < f "−p#x then f "−p#y ≤ f "−p#x − ε0 . Since y − x∞ ≤ 1, there exist some S T ⊆ V such that y = x + χS − χT . If f "−p#y < f "−p#x then we have f y − f x < pS − pT ≤ 0pS − pT 1, implying f y − f x ≤ 0pS − pT 1 − 1 since f y − f x ∈ Z. Hence, we have the claim. Let x∗ ∈ domZ f satisfy f "−p#x∗ ≤ inf f "−p# + ε0 /2, and y ∈ domZ f be any vector with y − x∗ ∞ ≤ 1. If f "−p#y < f "−p#x∗ , then −ε0 ≥ f "−p#y − f "−p#x∗ ≥ inf f "−p# − inf f "−p# + ε0 /2 ≥ −ε0 /2 a contradiction. Hence, we have f "−p#y ≥ f "−p#x∗ for any y ∈ domZ f with y − x∗ ∞ ≤ 1. By Theorem 6.4 we have x∗ ∈ arg min f "−p#. For any f R V → R ∪ ±∞, we deﬁne fZ ZV → R ∪ ±∞ by f x x ∈ ZV , fZ x = +∞ x ∈ ZV . Theorem 6.14.

(67)

(i) For f ∈ int , it holds that f • ∈ int and

f • p = sup p x − f x x∈Z

V

p ∈ R V

(68)

In particular, for any p ∈ dom f • there exists some x ∈ dom f ∩ ZV such that f • p = p x − f x. (ii)

For g ∈ int , it holds that g• ∈ int and g• x = sup p x − gp p∈Z

V

x ∈ R V

In particular, for any x ∈ dom g• there exists some p ∈ dom g ∩ ZV such that g• x = p x − gp. (iii) The mappings f (→ f • f ∈ int and g (→ g• g ∈ int provide a one-to-one correspondence between int and int and are the inverse of each other.

polyhedral m/l-convex functions

413

Proof. We prove claim (i) only, since (ii) can be shown similarly to (i), and (iii) is an immediate corollary to (i), (ii), and Theorem 4.1. Equation (68) follows from the integral convexity of fZ shown in Theorem 6.5. For any p ∈ dom f • Lemma 6.13 assures the existence of x ∈ dom f ∩ ZV with f • p = p x − f x. We abbreviate f • Z to fZ• . Theorem 6.12 and Eq. (68) yield fZ• ∈ "Z#. Therefore, it sufﬁces to show f • p = fZ• p p ∈ R V . We have f • p = fZ• p for integral vectors p ∈ ZV and f • p ≤ fZ• p p ∈ R V . Hence, we may assume f • p < +∞ since otherwise f • p = fZ• p = +∞ holds. Then, there exists some x0 ∈ domZ fZ such that f • p = p x0 − fZ x0 . Put D = q ∈ R V f • q = q x0 − fZ x0 Then, we have D = ∂fZ x0 = ∂fZ x0 ∩ ZV

∂fZ x0 ∩ ZV ∈ 0 "Z#

(see, e.g., [32, Theorem 4.7; 35, Theorem 2.41]), i.e., D is an integral polyhedron. Thus, there exist some integral vectors qi ki=1 k ≥ 1 in ∂fZ x0 ∩ ZV such that p can be represented by a convex combination of qi ki=1 . Since f • is linear over D, f • qi = fZ• qi for i = 1 k, and f • p ≤ fZ• p, convexity of f • implies f • p = fZ• p. 7. DUALITY 7.1. Duality Theorems In this section, we discuss duality theorems for polyhedral M/L-convex functions. We ﬁrst state the Fenchel duality for convex/concave functions. For a convex function f R V → R ∪ +∞ and a concave function g R V → R ∪ −∞ with dom f = , dom g = , we have inf f x − gx ≥ inf f x − gx

x∈ZV

x∈R V

≥ sup g◦ p − f • p p∈R V

≥ sup g◦ p − f • p p∈ZV

where the concave conjugate g◦ R V → R ∪ ±∞ is deﬁned by g◦ p = inf p x − gx x∈R V

p ∈ R V

(69)

414

murota and shioura

The Fenchel duality (70) below shows that the second inequality in (69) is satisﬁed with equality under a certain assumption. This is just an application of the existing result in convex analysis to polyhedral convex/concave functions, and it stands independently of M-convexity/M-concavity or L-convexity/L-concavity of functions. On the other hand, the claim (ii) of Theorem 7.1 below shows that all inequalities in (69) are satisﬁed with equality if f and g are integral M-convex/M-concave. This integrality result is not obtained from the convexity/concavity alone, but from the combination of the convexity/concavity and the combinatorial properties of f and g. Theorem 7.1. (i) Let f R V → R ∪ +∞ and g R V → R ∪ −∞ be polyhedral convex/concave functions with dom f = , dom g = . If either of (a) dom f ∩ dom g = and (b) dom f • ∩ dom g◦ = holds, then we have inf f x − gx = sup g◦ p − f • p

x∈R V

p∈R V

(70)

The supremum is attained at some p∗ ∈ R V if (a) is satisﬁed, and the inﬁmum is attained at some x∗ ∈ R V if (b) is satisﬁed. (ii) If f −g ∈ int and either (a) or (b) is fulﬁlled, then we have inf f x − gx = inf f x − gx

x∈ZV

x∈R V

= sup g◦ p − f • p = sup g◦ p − f • p p∈R V

p∈ZV

The supremum is attained at some integral p∗ ∈ ZV if (a) is satisﬁed, and the inﬁmum is attained at some integral x∗ ∈ ZV if (b) is satisﬁed. Proof. The claim (i) is a standard result; see [41, 44]. We derive (ii) from the corresponding result for discrete M-convex/concave functions established in [32]. We deﬁne the function fZ ZV → Z ∪ +∞ by (67), and gZ ZV → Z ∪ −∞ by gx x ∈ ZV , gZ x = −∞ x ∈ ZV . Then we have fZ −gZ ∈ "Z#. We also deﬁne fZ• = f • Z ZV → Z ∪ +∞ and gZ◦ = g◦ Z ZV → Z ∪ −∞ in the similar way, where it is noted that (cf. (68)) fZ• p = sup p x − fZ x

p ∈ ZV

gZ◦ p

p ∈ ZV

x∈Z

V

= inf p x − gZ x x∈ZV

polyhedral m/l-convex functions

415

We have dom f = domZ fZ and dom g = domZ gZ , and from Theorem 6.14 follows dom f • = domZ fZ• and dom g◦ = domZ gZ◦ . These properties imply that the condition (a) is equivalent to domZ fZ ∩ domZ gZ = , and (b) is equivalent to domZ fZ• ∩ domZ gZ◦ = . Thus, the ﬁrst and the last term in (69) are equal by the Fenchel-type duality for discrete M-convex/Mconcave functions [32, Theorem 5.2], and each inequality in (69) holds with equality. When f and g are polyhedral M-convex and M-concave, respectively, in (70), then f • and g◦ are polyhedral L-convex and L-concave, respectively, by Theorem 5.1. Hence, Fenchel duality also reads that the minimization of the sum of polyhedral M-convex functions is equivalent to the minimization of the sum of their conjugate polyhedral L-convex functions. Note that the sum of polyhedral L-convex functions is also polyhedral Lconvex, whereas the sum of polyhedral M-convex functions is not necessarily polyhedral M-convex (cf. Remark 3.5). As a corollary of Theorem 7.1, we obtain an optimality criterion of the minimization of the sum of two polyhedral convex functions. Corollary 7.2. (i) Let fi R V → R ∪ +∞ i = 1 2 be a polyhedral convex function with dom f1 ∩ dom f2 = . Then there exists p∗ ∈ R V such that inf f1 x + f2 x = inf f1 "−p∗ #x + inf f2 "p∗ #x

x∈R V

x∈R V

x∈R V

(71)

(ii) If f1 f2 ∈ int and dom f1 ∩ dom f2 = , then there exists an integral p∗ ∈ ZV such that inf f1 x + f2 x = inf f1 x + f2 x

x∈ZV

x∈R V

= inf f1 "−p∗ #x + inf f2 "p∗ #x x∈R V

x∈R V

= inf f1 "−p∗ #x + inf f2 "p∗ #x x∈ZV

x∈ZV

Remark 7.3. The conditions (a) and (b) in Theorem 7.1 cannot be removed, as shown in the following example. Let f R 2 → R ∪ +∞ and g R 2 → R ∪ −∞ be deﬁned by x1 + x2 = 1, x1 f x1 x2 = +∞ x1 + x2 = 1, x1 + x2 = −1, −x1 gx1 x2 = −∞ x1 + x2 = −1. Then we have f −g ∈ ∩ int , dom f ∩ dom g = , and inf

x1 x2 ∈R 2

f x1 x2 − gx1 x2 = +∞

416

murota and shioura

On the other hand, it holds that p2 f • p1 p2 = +∞ −p2 g◦ p1 p2 = −∞

p1 − p2 = 1, p1 − p2 = 1 p1 − p2 = −1, p1 − p2 = −1

We have f • −g• ∈ ∩ int , dom f • ∩ dom g◦ = , and sup p1 p2 ∈R

2

g◦ p1 p2 − f • p1 p2 = −∞

Neither (a) nor (b) is fulﬁlled, and Eq. (70) does not hold. We state separation theorems for polyhedral convex/concave functions. Whereas the claim (i) of Theorems 7.4 below is just the standard separation theorem in convex analysis, (ii) and (iii) are based on the combinatorial properties of M/L-convexity and their essence lies in the discrete separation theorems shown in [29, 30, 32]. See also [18] for the proof of (ii) and (iii). Theorem 7.4. Let f R V → R ∪ +∞ and g R V → R ∪ −∞ be functions with dom f ∩ dom g = . Suppose f x ≥ gx ∀x ∈ R V holds. (i) If f and g are polyhedral convex and concave, respectively, then there exist p∗ ∈ R V and α∗ ∈ R such that f x ≥ p∗ x + α∗ ≥ gx

x ∈ R V V

(72)

(ii) (72).

If f −g ∈ int , then we can take integral p∗ ∈ Z

(iii) (72).

If f −g ∈ int , then we can take integral p∗ ∈ ZV and α∗ ∈ Z in

and α∗ ∈ Z in

7.2. Submodular Flow Problem with M-Convex Cost Function We consider a generalization of the ordinary submodular ﬂow problem and state a duality theorem for the problem. We also provide two optimality criteria, one of which is by negative cycles and the other by potentials. The generalization here is a polyhedral extension of the framework proposed in[29]. See, e.g., [3, 10, 12, 17, 40] for the ordinary submodular ﬂow problem. Let G = V A be a directed graph. A ﬂow is a vector ξ ∈ R A . Suppose that we are given a ﬂow cost function fa R → R ∪ +∞ for each a ∈ A and a boundary cost function f R V → R ∪ +∞. Then, the minimum cost ﬂow problem with a boundary cost function is formulated as (MFBC) Minimize Fξ = f ∂ξ + fa ξa subject to ξ ∈ R A a∈A

polyhedral m/l-convex functions

417

where ∂ξ ∈ R V is the boundary of ξ deﬁned by (1). The problem (MFBC) coincides with the ordinary (linear) submodular ﬂow problem if each fa is a linear function in a nonempty closed interval and f is the indicator function of some M-convex polyhedron; i.e., f is a polyhedral M-convex function with f R V → 0 +∞. We call a ﬂow ξ feasible if Fξ < +∞. We then consider the dual of the problem (MFBC). A potential is a vector p ∈ R V . Suppose that we are given a tension cost function ga R → R ∪ +∞ for each a ∈ A and a potential cost function g R V → R ∪ +∞. Then the minimum cost tension problem with a potential cost function is formulated as (MTPC) Minimize Gp = gp + ga −δpa subject to p ∈ R V a∈A

where δp ∈ R A is the coboundary of p deﬁned by (3). The problem (MTPC) coincides with the ordinary minimum (convex-)cost tension problem if gp = 0 for any p ∈ R V . We call a potential p feasible if Gp < +∞. Theorem 7.1 (i) yields the duality between (MFBC) and (MTPC). Note that this is independent of M/L-convexity. Theorem 7.5. Let f and g be polyhedral convex functions such that dom f = dom g = , and g = f • , and for each a ∈ A let fa ga ∈ 1 satisfy ga = fa• . Suppose that at least one of two problems (MFBC) and (MTPC) has a feasible solution. Then inf Fξ = − inf Gp

ξ∈R A

p∈R V

If (MFBC) is feasible, then Gp∗ = inf Gp for some p∗ ∈ R V , and if (MTPC) is feasible, then Fξ∗ = inf Fξ for some ξ∗ ∈ R A . Deﬁne functions f1 f2 R V → R ∪ ±∞ by fa ξa ∂ξ = −x f2 x = f −x x ∈ R V f1 x = inf

Proof.

ξ∈R A

a∈A

Similarly, we deﬁne functions g1 g2 R V → R ∪ ±∞ by g1 p = ga −δpa g2 p = gp p ∈ R V

(73)

a∈A

Note that f1 and g1 are special cases of functions f and g in Example 2.4, where T = V . Moreover, if (MFBC) is feasible, then f1 x0 is ﬁnite for some x0 ∈ R V , and if (MTPC) is feasible, then g1 p0 is ﬁnite for some p0 ∈ R V . Therefore, f1 and g1 are polyhedral convex functions such that f1 > −∞, g1 > −∞, dom f1 = , dom g1 = , and g1 = f1• . Moreover,

418

murota and shioura

we have −f2 ◦ p = −g2 p ∀p ∈ R V . Thus, Theorem 7.1(i) yields the equation inf Fξ = inf f1 x + f2 x

ξ∈R A

x∈R V

= − inf g2 p + g1 p = − inf Gp p∈R V

p∈R V

To state the ﬁrst optimality criterion, we deﬁne the auxiliary network Gξ = V Aξ wξ associated with a feasible ﬂow ξ ∈ R A . The underlying graph Gξ has the vertex set V and the arc set Aξ consisting of three disjoint parts, Aξ = Fξ ∪ Bξ ∪ Jξ , where Fξ = a a ∈ A fa ξa+ +1 < +∞ Bξ = a¯ a ∈ A fa ξa+ −1 < +∞

a¯ reorientation of a

Jξ = u v u v ∈ V u = v f ∂ξ+ v u < +∞ The weight function wξ Aξ → R is deﬁned by  a ∈ Fξ ,  fa ξa+ +1 ¯ −1 a ∈ Bξ , wξ a = fa¯ ξa+  f ∂ξ+ v u a ∈ Jξ For

any cycle C ⊆ Aξ in Gξ , we deﬁne the weight of C by wξ C = wξ a a ∈ C, and call C a negative cycle if wξ C < 0. Theorem 7.6. Let f be a polyhedral convex function with dom f = , and ξ ∈ R A be a feasible ﬂow. (i) There is no negative cycle in Gξ wξ if ξ is optimal for (MFBC). (ii) The converse is also true if f is M-convex. That is, for f ∈ : ξ is optimal for (MFBC) ⇐⇒ there is no negative cycle in Gξ wξ . Proof. (i) Suppose that there exists a negative cycle C ⊆ Aξ in Gξ . We show that Fξ∗ < Fξ holds for some ﬂow ξ∗ ∈ R A . Put CF = C ∩ Fξ CB = C ∩ Bξ CJ = C ∩ Jξ y= χv − χu u v∈CJ

Then, there exists a sufﬁciently small α > 0 such that f ∂ξ + αy − f ∂ξ = αf ∂ξ+ y ≤ α u v∈CJ

wξ u v

(74)

fa ξa + α − fa ξa = αwξ a

a ∈ CF

(75)

fa¯ ξa ¯ − α − fa¯ ξa ¯ = αwξ a ¯

a ∈ CB

(76)

polyhedral m/l-convex functions

419

where the inequality in (74) is from Theorems 4.2 and 4.4. We deﬁne a ﬂow ξ∗ ∈ R A by  a ∈ CF  ξa + α ¯ −α a ∈ CB ξ∗ a = ξa  ξa otherwise Combining the inequalities (74), (75), and (76), we obtain Fξ∗ − Fξ = f ∂ξ + αy − f ∂ξ + fa ξa + α − fa ξa +

a∈CB

a∈CF

fa¯ ξa ¯ − α − fa¯ ξa ¯

≤ αwξ C < 0 (ii) Suppose that there exists a ﬂow ξ∗ ∈ R V with Fξ∗ < Fξ. To prove the existence of a negative cycle in Gξ wξ , it sufﬁces to show the existence of a “circulation” in Gξ wξ with

negative cost w.r.t. wξ , i.e., the existence of ζ Aξ → R satisfying ζ ≥ 0, a∈Aξ wξ aζa < 0 and

ζa a∈ Aξ leaves v =

We have

ζa a∈ Aξ enters v v ∈ V (77)

fa ξ∗ a − fa ξa ≥ ξ∗ a − ξafa ξa+ +1 ∀a ∈ supp+ ξ∗ − ξ

(78)

fa ξ∗ a − fa ξa ≥ ξa − ξ∗ afa ξa+ −1 ∀a ∈ supp− ξ∗ − ξ f ∂ξ∗ − f ∂ξ ≥ f ∂ξ+ ∂ξ∗ − ∂ξ

(79) (80)

By Theorem 4.15 (ii), there exists some λ = λuv u v ∈ Jξ such that λuv ≥ 0 u v ∈ Jξ λuv χv − χu = ∂ξ∗ − ∂ξ u v∈Jξ

u v∈Jξ

λuv f ∂ξ+ v u = f ∂ξ+ ∂ξ∗ − ∂ξ

We now deﬁne a function ζ Aξ → R as   maxξ∗ a − ξa 0 ¯ − ξ∗ a ¯ 0 ζa = maxξa λ uv

a ∈ Fξ a ∈ Bξ a = u v ∈ Jξ .

(81)

420

murota and shioura

It is easy to see that ζ ≥ 0 and ζ satisﬁes (77). Moreover, (78), (79), and (81) imply wξ aζa = ξ∗ a − ξafa ξa+ +1 a∈supp+ ξ∗ −ξ

a∈Aξ

+

−

ξa − ξ∗ afa ξa+ −1

a∈supp ξ∗ −ξ

+

u v∈Jξ

λuv f ∂ξ+ v u

≤ Fξ∗ − Fξ < 0 Hence, ζ is a circulation in Gξ wξ with negative cost. Remark 7.7. The claim (ii) of Theorem 7.6 does not hold in general when f ∈ . Let G = V A be a directed graph with V = u v w and A = u w v w, fa R → R ∪ +∞ a ∈ A a family of ﬂow cost functions such that dom fa = "0 1#

fa α = 0 α ∈ dom fa

and f R V → R ∪ +∞ a boundary cost function such that − minxu xv xu ≥ 0 xv ≥ 0 xV = 0 f x = +∞ (otherwise). It is easy to see that f is polyhedral convex, but not polyhedral M-convex. For a feasible ﬂow ξ = 0 0 ∈ R A , the auxiliary network Gξ = V Aξ wξ is such that Aξ = u w v w w u w v

wξ a = 0 a ∈ Aξ

There is no negative cycle in Gξ wξ , but ξ is not optimal; a feasible ﬂow ξ∗ ∈ R A is optimal if and only if either ξ∗ u w = 1 or ξ∗ v w = 1 holds. We next state an optimality criterion by potentials. Recall the deﬁnition of the coboundary δp of a potential p ∈ R V in (3). Theorem 7.8. Let ξ∗ ∈ R A be a feasible ﬂow. (i)

If the condition

∂ξ∗ ∈ arg min f "−p∗ #

ξ∗ a ∈ arg min fa "−ηa#

∀a ∈ A

(82)

holds for some p∗ ∈ R V and η = −δp∗ , then ξ∗ is optimal for (MFBC). (ii) Suppose that f is polyhedral convex with dom f = , and fa ∈ 1 a ∈ A. Then, ξ∗ is optimal for (MFBC) if and only if (82) holds for some p∗ ∈ R V and η = −δp∗ .

polyhedral m/l-convex functions (iii)

421

If p∗ ∈ R V and η = −δp∗ satisfy (82), then we have p∗ v − p∗ u ≤ wξ u v

∀u v ∈ Aξ∗

(83)

(iv) Suppose f ∈ and fa ∈ 1 a ∈ A. Then ξ∗ is optimal for (MFBC) if and only if (83) holds for some p∗ ∈ R V . Proof.

Note that Fξ = f "−p∗ #∂ξ +

a∈A

fa "−ηa#ξa

for any p∗ ∈ R V and η = −δp∗ . Hence, the claim (i) follows immediately. The claim (iii) is also immediate from the deﬁnition of wξ . (ii) Suppose ξ∗ is optimal for (MFBC). Deﬁne fi R V → R ∪ ±∞ i = 1 2 by f1 x = f x f2 x = inf fa ξa ∂ξ = x x ∈ R V ξ∈R A

a∈A

Then Corollary 7.2 implies the existence of p∗ ∈ R V with (71). Note that inf f2 "p∗ #x = inf p∗ ∂ξV + fa ξa x∈R V

ξ∈R A

=

a∈A

a∈A

inf δp∗ aα + fa α

α∈R

where the second equality is by p∗ ∂ξV = δp∗ ξA . Hence, we have (82). (iv) By Theorem 4.12 for f ∈ and the convexity of fa a ∈ A, the condition (82) is equivalent to (83). Hence, the claim follows from (ii). Remark 7.9. The following claim is easy to show: for a feasible ﬂow ξ ∈ R A , if there exists p∗ ∈ R V with (83), then there is no negative cycle in the auxiliary network Gξ wξ . Indeed, for any cycle C in Gξ wξ wξ C = wξ u v ≥ p∗ v − p∗ u = 0 u v∈C

u v∈C

follows from (83). The converse is also a well-known fact in graph theory. Summarizing the above results for f ∈ , we obtain the following statement. Corollary 7.10. Suppose f ∈ . For any feasible ﬂow ξ ∈ R A , the following three conditions are equivalent: (OPT) ξ is optimal for (MFBC). (NNC) There is no negative cycle in the auxiliary network Gξ wξ . (POT) There exists a potential p ∈ R V with (83).

422

murota and shioura 7.3. Network Induction

As shown in Section 4 (Theorems 4.18 and 4.35 in particular), there are various operations for polyhedral M/L-convex functions. In this section, we explain a more powerful operation called network induction. Network induction is a transformation by using networks and includes many operations such as translation, restriction, projection, etc., as its special cases. We show that polyhedral M/L-convex functions can be transformed into polyhedral M/L-convex functions by network induction. Let G = V A+ S T be a directed graph with vertex set V , arc set A, and two disjoint vertex subsets S and T called entrance set and exit set, respectively. For f R S → R ∪ +∞ and fa R → R ∪ +∞ a ∈ A, deﬁne f˜ R T → R ∪ ±∞ by fa ξa ∂ξ = x −y 0 ξ ∈ R A f˜y = inf f x + ξx

a∈A

x −y 0 ∈ R S × R T × R V \S∪T

y ∈ R T

(84)

For g R S → R ∪ +∞ and ga R → R ∪ +∞ a ∈ A, deﬁne g˜ R T → R ∪ ±∞ by ga ηa η = −δp q r ∈ R A gq ˜ = inf gp + ηpr

a∈A

S

p q r ∈ R × R T × R V \S∪T

q ∈ R T

Theorem 7.11. Let f , g be polyhedral convex functions such that dom f = , dom g = , and g = f • , and fa ga ∈ 1 with ga = fa• a ∈ A. Suppose that at least one of the following three conditions holds: (a)

f˜y0 is ﬁnite for some y0 ∈ R T ,

gq ˜ 0 is ﬁnite for some q0 ∈ R T , ˜ 0 < +∞ for some y0 ∈ R T and q0 ∈ R T . (c) f˜y0 < +∞ and gq

(b)

Then, the following statements hold: (i) both f˜ and g˜ are polyhedral convex functions such that dom f˜ = , dom g˜ = , f˜ > −∞, g˜ > −∞, and g˜ = f˜• , (ii) if f is polyhedral M-convex, then f˜ is also polyhedral M-convex, (iii) Proof.

if g is polyhedral L-convex, then g˜ is also polyhedral L-convex. (i)

We ﬁrst prove two claims.

Claim 1. For y ∈ R T , we have f˜y = g ˜ • y if either f˜y < +∞ or gq ˜ < +∞ for some q ∈ R T .

polyhedral m/l-convex functions

423

Deﬁne a function fˆ R S × R T × R V \S∪T → R ∪ +∞ by y = −y z = 0 f x fˆx y z = +∞ (otherwise). Then its conjugate fˆ• R S × R T × R V \S∪T → R ∪ +∞ is given by fˆ• p q r = gp − q yT

p q r ∈ R S × R T × R V \S∪T

By Theorem 7.5, we have V ˜ ˆ fa ξa ξ ∈ R f y = inf f ∂ξ + a∈A

= − inf fˆ• p q r +

a∈A

ga ηa η = −δp q r ∈ R A

p q r ∈ R S × R T × R V \S∪T

= − inf −q yT + gq ˜ q ∈ R T = g ˜ • y This ends the proof for Claim 1. ˜ = f˜• q if either gq ˜ < +∞ or Claim 2. For q ∈ R T , we have gq T ˜ f y < +∞ for some y ∈ R . Deﬁne a function gˆ R S × R T × R V \S∪T → R ∪ +∞ by q = q, gp gp ˆ q r = +∞ (otherwise). Then its conjugate g ˆ • R S × R T × R V \S∪T → R ∪ +∞ is given by z = 0, f x + q y g ˆ • x y z = +∞ (otherwise). By Theorem 7.5, we have ga ηa η = −δp q r ∈ R A gq ˜ = inf gp ˆ q r + a∈A

p q r ∈ R S × R T × R V \S∪T

= − inf g ˆ • x y z +

a∈A

S

fa ξa ∂ξ = x y z ξ ∈ R A T

x y z ∈ R × R × R

V \S∪T

= − inf q y T + f˜−y y ∈ R T = f˜• q This ends the proof for Claim 2.

424

murota and shioura

We see from Claim 1 (resp. Claim 2) that the conditions (a) and (c) (resp. (b) and (c)) are equivalent. Hence, the statement (i) follows. (ii) Suppose that g is polyhedral L-convex. Deﬁne r0 = gp + λ1 − gp/λ, which is independent of p ∈ R S and λ ∈ R . Since δp + λ1 q + λ1 r + λ1 = δp q r we have

gq ˜ + λ1 = inf

ηp r

gp +

a∈A

= inf gp + λ1 + ηpr

∀λ ∈ R

ga ηa η = −δp q + λ1 r a∈A

ga ηa η = −δp q r

= gq ˜ + λr0 For q1 q2 ∈ dom g, ˜ we show gq ˜ 1 + gq ˜ 2 ≥ gq ˜ 1 ∧ q ˜ 1 ∨ q2 . 2 + gq For i = 1 2, let pi qi ri ∈ R V satisfy gq ˜ i = gpi + a∈A ga ηi a, where ηi = −δpi qi ri . Putting η∧ = −δp1 ∧ p2 q1 ∧ q2 r1 ∧ r2

η∨ = −δp1 ∨ p2 q1 ∨ q2 r1 ∨ r2

we obtain ga η1 a + ga η2 a ≥ ga η∧ a + ga η∨ a

a ∈ A

by the convexity of ga , while gp1 + gp2 ≥ gp1 ∧ p2 + gp1 ∨ p2 holds by the submodularity of g. Hence it follows gq ˜ 1 + gq ˜ 2 ≥ gp1 ∧ p2 + gp1 ∨ p2 + ga η∧ a + ga η∨ a a∈A

˜ 1 ∨ q2 ≥ gq ˜ 1 ∧ q2 + gq (iii)

This is immediate from (i), (iii), and Theorem 5.1.

Proof of Theorem 4 18 7. Consider the bipartite graph G = V ∪ v0 ∪ U A+ S T with S = V , T = U ∪ v0 , and A = v v0 v ∈ V \ U ∪ u u u ∈ U, where u ∈ U is the copy of u ∈ U. Then we have fˆy0 y = f˜−y0 −y for y0 y ∈ R × R U , where fˆ is deﬁned by (38) and f˜ is the induced function deﬁned by (84). Thus, fˆ is polyhedral M-convex by Theorem 7.11(ii).

polyhedral m/l-convex functions

425

Remark 7.12. The convolution of two functions can be represented as a special case of network induction, and vice versa. We ﬁrst show how to represent the convolution of two functions as a network induction. Let fi R V → R ∪ +∞ i = 1 2. Let V1 and V2 be disjoint copies of V and consider a bipartite graph G = V1 ∪ V2 ∪ V A+ S T with S = V1 ∪ V2 , T = V , and A = v1 v v ∈ V ∪ v2 v v ∈ V , where vi ∈ Vi is the copy of v ∈ V i = 1 2. Deﬁne f R V1 ∪V2 → R ∪ +∞ and fa R → R ∪ +∞ a ∈ A by f x1 x2 = f1 x1 + f2 x2 xi ∈ R Vi

fa α = 0 α ∈ R

Then f1 f2 x = f˜−x x ∈ R V , where f˜ is the induced function deﬁned by (84). We next show that the function f˜ obtained by the network induction (84) can be represented as the convolution of two functions. Suppose we are given a directed graph G = V A+ S T with an entrance set S and an exit set T , and functions f R S → R ∪ +∞, fa R → R ∪ +∞ a ∈ A. We deﬁne fˆi R S∪T → R ∪ ±∞ i = 1 2 as

f x y = 0T , x ∈ R S y ∈ R T +∞ (otherwise) ξ ∈ RA f ξa ˆ a f2 x y = inf x ∈ R S y ∈ R T ∂ξ = −x −y 0 a∈A fˆ1 x y =

Then the convolution fˆ1

fˆ2 R S∪T → R ∪ ±∞ is written as

x1 + x2 = x xi ∈ R S y1 + y2 = y yi ∈ R T = inf fˆ1 x1 0T + fˆ2 x2 y x1 + x2 = x xi ∈ R S = inf f x + fˆ2 x − x y x ∈ R S ξ ∈ R A x ∈ R S = inf f x + fa ξa ∂ξ = x − x −y 0 a∈A

fˆ1 fˆ2 x y = inf fˆ1 x1 y1 + fˆ2 x2 y2

for x ∈ R S and y ∈ R T . Thus, fˆ1

fˆ2 0S y = f˜y holds for y ∈ R T .

ACKNOWLEDGMENT The authors thank Satoru Fujishige for his helpful comments.

426

murota and shioura REFERENCES

1. R. K. Ahuja, T. L. Magnanti, and J. B. Orlin, “Network Flows—Theory, Algorithms, and Applications,” Prentice Hall, Englewood Cliffs, NJ, 1993. 2. G. Birkhoff, Rings of sets, Duke Math. J. 3 (1937), 443–454. 3. R. E. Bixby and W. H. Cunningham, Matroid optimization and algorithms, in “Handbook of Combinatorics” (R. L. Graham, M. Gr¨ otschel, and L. Lov´asz, Eds.), Vol. I, Chap. 11, pp. 551–609, Elsevier, Amsterdam, 1995. 4. A. Bouchet and W. H. Cunningham, Delta-matroids, jump systems, and bisubmodular polyhedra, SIAM J. Discrete Math. 8 (1995), 17–32. 5. W. J. Cook, W. H. Cunningham, W. R. Pulleyblank, and A. Schrijver, “Combinatorial Optimization,” Wiley, New York, 1998. 6. V. Danilov, G. Koshevoy, and K. Murota, Equilibria in economies with indivisible goods and money, RIMS preprint, No. 1204, Kyoto University, 1998. 7. A. W. M. Dress and W. Wenzel, Valuated matroid: A new look at the greedy algorithm, Appl. Math. Lett. 3 (1990), 33–35. 8. A. W. M. Dress and W. Wenzel, Valuated matroids, Adv. Math. 93 (1992), 214–250. 9. J. Edmonds, Submodular functions, matroids and certain polyhedra, in “Combinatorial Structures and Their Applications” (R. Guy, H. Hanani, N. Sauer, and J. Sch¨ onheim, Eds.), pp. 69–87, Gordon & Breach, New York, 1970. 10. U. Faigle, Matroids in combinatorial optimization, in “Combinatorial Geometries” (N. White, Ed.), pp. 161–210, Cambridge Univ. Press, London, 1987. 11. P. Favati and F. Tardella, Convexity in nonlinear integer programming, Ricerca Operativa 53 (1990), 3–44. 12. A. Frank, An algorithm for submodular functions on graphs, Ann. Discrete Math. 16 (1982), 97–120. 13. A. Frank, Generalized polymatroids, in “Finite and Inﬁnite Sets” (A. Hajnal, L. Lov´asz, and V. T. S´ os, Eds.), pp. 285–294, North-Holland, Amsterdam, 1984. ´ Tardos, Generalized polymatroids and submodular ﬂows, Math. Program14. A. Frank and E. ming 42 (1988), 489–563. 15. S. Fujishige, A note on Frank’s generalized polymatroids, Discrete Appl. Math. 7 (1984), 105–109. 16. S. Fujishige, Submodular systems and related topics, Math. Programming Study 22 (1984), 113–131. 17. S. Fujishige, “Submodular Functions and Optimization,” Annals of Discrete Mathematics, Vol. 47, North-Holland, Amsterdam, 1991. 18. S. Fujishige and K. Murota, Notes on L-/M-convex functions and the separation theorems, Math. Programming 88 (2000), 129–146. 19. T. Ibaraki and N. Katoh, “Resource Allocation Problems: Algorithmic Approaches,” MIT Press, Cambridge, MA, 1988. 20. M. Iri, “Network Flow, Transportation and Scheduling—Theory and Algorithms,” Academic Press, New York, 1969. 21. S. Iwata, A capacity scaling algorithm for convex cost submodular ﬂows, Math. Programming 76 (1997), 299–308. 22. N. Katoh and T. Ibaraki, Resource allocation problems, in “Handbook of Combinatorial Optimization II” (D.-Z. Du and P. M. Pardalos, Eds.), pp. 159–260, Kluwer Academic, Boston, MA, 1998. 23. E. L. Lawler, “Combinatorial Optimization: Networks and Matroids,” Holt, Rinehart & Winston, New York, 1976.

polyhedral m/l-convex functions

427

24. L. Lov´asz, Submodular functions and convexity, in “Mathematical Programming—The State of the Art” (A. Bachem, M. Gr¨ otschel, and B. Korte, Eds.), pp. 235–257, Springer-Verlag, Berlin, 1983. 25. K. Murota, Finding optimal minors of valuated bimatroids, Appl. Math. Lett. 8 (1995), 37–42. 26. K. Murota, Valuated matroid intersection. I. Optimality criteria, SIAM J. Discrete Math. 9 (1996), 545–561. 27. K. Murota, Valuated matroid intersection. II. Algorithms, SIAM J. Discrete Math. 9 (1996), 562–576. 28. K. Murota, Matroid valuation on independent sets, J. Combin. Theory Ser. B 69 (1997), 59–78. 29. K. Murota, Submodular ﬂow problem with a nonseparable cost function, Combinatorica 19 (1999), 87–109. 30. K. Murota, Convexity and Steinitz’s exchange property, Adv. Math. 124 (1996), 272–311. 31. K. Murota, Structural approach in systems analysis by mixed matrices—An exposition for index of DAE, in “ICIAM 95” (K. Kirchg¨assner, O. Mahrenholtz, and R. Mennicken, Eds.), pp. 257–279, Mathematical Research, Vol. 87, Akademie Verlag, Berlin, 1996. 32. K. Murota, Discrete convex analysis, Math. Programming 83 (1998), 313–371. 33. K. Murota, On the degree of mixed polynomial matrices, SIAM J. Matrix Anal. Appl. 20 (1999), 196–227. 34. K. Murota, Discrete convex analysis—Exposition on conjugacy and duality, in “Graph Theory and Combinatorial Biology” (L. Lov´asz et al., Eds.), pp. 253–278, The J´anos Bolyai Mathematical Society, 1999. 35. K. Murota, Discrete convex analysis, in “Discrete Structures and Algorithms V” (S. Fujishige, Ed.), pp. 51–100, Kindai-Kagakusha, Tokyo, 1998. [In Japanese] 36. K. Murota, “Matrices and Matroids for Systems Analysis,” Springer-Verlag, Berlin, 2000. 37. K. Murota and A. Shioura, M-convex function on generalized polymatroid, Math. Oper. Res. 24 (1999), 95–105. 38. K. Murota and A. Shioura, Relationship of M-/L-convex functions with discrete convex functions by Miller and by Favati–Tardella, Discrete Appl. Math., in press. 39. C. H. Papadimitriou and K. Steiglitz, “Combinatorial Optimization: Algorithms and Complexity,” Prentice-Hall, Englewood Cliffs, NJ, 1982. 40. W. R. Pulleyblank, Polyhedral combinatorics, in “Optimization” (G. L. Nemhauser, A. H. G. Rinnooy Kan, and M. J. Todd, Eds.), Handbooks in Operations Research and Management Science, Vol. 1, Chap. V, pp. 371–446, Elsevier, Amsterdam, 1989. 41. R. T. Rockafellar, “Convex Analysis,” Princeton Univ. Press, Princeton, 1970. 42. R. T. Rockafellar, “Network Flows and Monotropic Optimization,” Wiley, New York, 1984. 43. A. Shioura, Minimization of an M-convex function, Discrete Appl. Math. 84 (1998), 215–220. 44. J. Stoer and C. Witzgall, “Convexity and Optimization in Finite Dimension, I,” SpringerVerlag, Berlin, 1970. ´ Tardos, Generalized matroids and supermodular colourings, in “Matroid Theory” 45. E. (A. Recski and L. Lov´asz, Eds.), Colloquia Mathematica Societatis J´anos Bolyai, Vol. 40, pp. 359–382, North-Holland, Amsterdam, 1985. 46. D. Welsh, “Matroid Theory,” Academic Press, New York, 1976. 47. H. Whitney, On the abstract properties of linear dependence, Amer. J. Math. 57 (1935), 509–533. 48. N. White (Ed.), “Theory of Matroids,” Cambridge Univ. Press, London, 1986. 49. N. White (Ed.), “Combinatorial Geometries,” Cambridge Univ. Press, London, 1987. 50. N. White (Ed.), “Matroid Applications,” Cambridge Univ. Press, London, 1992.

Extension of M-Convexity and L-Convexity to Polyhedral Convex Functions

Extension of M-Convexity and L-Convexity to Polyhedral Convex Functions

Recommend Documents