Accepted Manuscript Title: Security Design and Capital Structure of Business Groups Author: Alexandre Messa PII: DOI: Reference:
S1062-9769(15)00036-8 http://dx.doi.org/doi:10.1016/j.qref.2015.03.005 QUAECO 846
To appear in:
The
Received date: Revised date: Accepted date:
24-7-2014 27-1-2015 11-3-2015
Quarterly
Review
of
Economics
and
Finance
Please cite this article as: Messa, A.,Security Design and Capital Structure of Business Groups, Quarterly Review of Economics and Finance (2015), http://dx.doi.org/10.1016/j.qref.2015.03.005 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
“Business Groups and Risk Sharing Among Firms” highlights: This paper investigates the optimal contract between a principal and an agent that manages a business group and diverts funds among its projects.
ip t
A pyramidal ownership structure may arise endogenously. The paper provides explanations for the cross-holding of equity between firms in business groups and the contagion between the asset prices of such firms.
cr
It is shown that a tax on intercorporate dividends may render the organization of
Ac ce p
te
d
M
an
us
such groups infeasible and lead to the creation of conglomerates.
1 Page 1 of 55
Alexandre Messa1
an
January 2015
us
cr
Instituto de Pesquisa Econômica Aplicada
ip t
Security Design and Capital Structure of Business Groups
Abstract
This paper investigates the optimal contract between a principal and an agent that
M
manages a business group and diverts funds among its projects. The optimal contract can be implemented by limited liability financial securities and results in a capital
d
structure that provides risk sharing among the group firms. The paper provides
te
explanations for the cross-holding of equity between firms in business groups, the contagion between the asset prices of such firms, and shows that a tax on intercorporate
Ac ce p
dividends may render the organization of such groups infeasible and lead to the creation of conglomerates.
JEL classification: D82, G30, G32, L20. Keywords: business groups; corporate governance; capital structure.
1
E-mail:
[email protected]. Telephone/fax number: 55-61-3315-5189/3315-5341.
2 Page 2 of 55
1
Introduction In several countries, firms with a diffuse investor basis and professional
management are more of an exception than the rule. In fact, in emerging countries and
ip t
in parts of Europe, it is much more common to find firms inserted in business groups controlled by a family or the state. In a stylized setting, this control is usually exerted
which, in turn, controls
cr
through a pyramidal structure: a certain family controls firm
firm , making the latter indirectly controlled by the family in question.
us
Firms in such groups are often managed by individuals designated to make decisions according to the controlling family interests, or even by own members of this family.2 Thus, firms belonging to business groups, as those with a diffuse investor basis,
an
feature an agency conflict triggered by the divergence of interests between the manager who maximizes his utility (or, in the case of business groups, the utility of the
M
controlling family) and the stockholders and creditors interested in the maximization of their securities value.
However, the ownership chain of a business group introduces an additional
d
dimension to the agency conflict. This arises from the possibility that the manager of a
te
certain firm will make decisions that benefit another firm of the same group, to the detriment of the one managed by him. For instance, Cheung, Qi, Rau and Stouraitis
Ac ce p
(2009), investigating Hong Kong publicly listed firms, show that a common practice among them is to buy and sell assets to their controlling shareholders at, respectively, above and below market prices. Also, Berkman, Cole and Fu (2009) show that another common practice among Chinese business groups concerns the issuance of a guarantee, given to a third party, by a firm for a loan taken by another firm of the same group. Although these transactions take place within the group, the stockholders and
creditors of each firm might be different. Therefore, these relationships involve wealth transfers from the stockholders and creditors of a firm to those of another one. In the corporate finance literature, these transfers of value across firms of the same business group, to the detriment of minor stockholders of one of the firms, are often referred to as tunneling.3
2 3
See, for instance, Weindenbaum (1996) for a description of this practice in Chinese business groups. For a relevant exposition on this concept, see Johnson, La Porta, Lopez-de-Silanes and Shleifer (2000).
3 Page 3 of 55
In order to assess the incentives behind business groups and potential frictions that may give rise to pyramidal structures, this paper introduces a principal-agent model in which the agent manages the projects of a business group. Initially, it’s assumed a firm managed by an agent with limited liability, under a contract with an investor, the principal, with unlimited liability. Afterwards, the agent is endowed with a new project.
ip t
To implement it, he4 needs to renegotiate the previous contract with the principal.
The agent then proceeds to manage two projects, the parent (the old project) and
cr
the subsidiary (the new one), each one of them has its own cash reserves5. As functions of the history of the projects cash flows, the contract between the parties specifies the
us
principal’s remuneration and investment in the projects. In turn, the agent has discretionary power to establish his own remuneration and observable transfers between the projects’ cash reserves6. Each project is liquidated once it is unable to make the
an
payments to the principal established by the contract7.
However, the profits of both projects are random, and the principal is neither
M
capable of observing their realizations nor their use by the agent. Consequently, the agent can transfer cash across projects, so as to maximize his own utility, without the principal being able to identify such practice.
d
It is assumed that the agent is not able to divert cash to himself from the
te
subsidiary. This assumption creates an asymmetry between the parent and the subsidiary, once the agent is compensated only through funds from the parent (by
Ac ce p
means of an equity share in the original firm)8. Therefore, once the agent can still distribute himself dividends if the subsidiary has been liquidated, but not if the parent did, this asymmetry gives the agent a direct incentive to transfer funds from the subsidiary to the parent. On the other hand, there is also an indirect incentive for the agent to transfer funds to the subsidiary, in case the latter is going through financial distress, in order to preserve the option of drawing funds from it later, on more 4 Throughout this paper, masculine pronouns refer to the agent, whereas feminine pronouns apply to the principal. 5 The assumption of the existence of cash reserves is made without loss of generality. Indeed, the optimal contract could establish no accumulation of them. 6 The assumption of separate cash reserves between the parent and the subsidiary is also made without loss of generality. To see this, note that the principal could determine a payment to herself of one monetary unit by one of the projects, and, at the same time, invest this unit in the other project, establishing, therefore, a simple transfer of cash between them. That being said, one should note that it is a common practice among firms to maintain separate budget accounts and controls for different projects. 7 This condition is, again, without loss of generality, once that the contract can always establish an investment, to be made by the principal in the project, equivalent to the payment to the principal. 8 This would be the case, for example, in which, for legal or regulatory reasons, the subsidiary was subject to tighter auditing procedures than the parent.
4 Page 4 of 55
favorable states of nature for the subsidiary’s activities. 9 This asymmetry between the projects precludes the first-best outcome from being attained10. By dynamic programming, the agent’s problem is reduced to a system of partial differential equations that allows for the identification of the strategy to be adopted by
ip t
him. This strategy imposes the constraints that the principal must consider in her maximization problem, which is then solved using the same procedure applied in the agent’s one.
cr
Thereafter, the paper demonstrates that the optimal contract can be implemented
by means of limited liability securities and, under some circumstances, the creation of
us
two firms. In this way, a pyramidal ownership structure may arise endogenously, when the unobservable cash transfers between the projects would involve a deadweight cost11.
an
On the other hand, when there is no deadweight cost incurred by these cash transfers, the only feasible alternative is the organization of the two projects as two divisions of a single firm.
M
To understand the intuition behind this condition, suppose that the subsidiary is a supplier of the parent. In this case, the agent can, for instance, transfer cash from the subsidiary to the parent through purchases, by the latter, at a price below the production
d
cost for the former. As the subsidiary is a supplier of the parent, this transference may
te
involve no deadweight cost. On the other hand, if the parent’s and subsidiary’s economic activities are very different from each other, this kind of trade between them
Ac ce p
may then lead to deadweight costs.
Therefore, as an implication of the present model, when the projects have similar
economic activities or are on the same value chain, they tend to be organized as two divisions of a single firm. Conversely, when the projects’ economic activities are very different from each other, they may be organized as two distinct firms in a business group.
The issuance of common stocks by each firm is performed allocating a given
fraction to the other one, and granting the agent full discretionary power over the 9 As a result of his maximization problem, the agent will be interested, under some states of nature, to divert cash to the subsidiary in order to smooth the effects of shocks to the parent’s activities. Indeed, as a consequence of Lemma 2 (b), even though the agent is risk-neutral, he will behave risk-averse whenever the parent firm holds low levels of cash reserves – or, in other words, whenever the probability of its liquidation is high enough. That risk aversion then induces the agent to tend to smooth the effects of shocks and, to that end, make use of the subsidiary's cash reserve. 10 The first-best outcome would be the management of the cash reserves as if the projects in question constituted a single one, turning the present problem into the one analyzed, for example, in Radner and Shepp (1996). 11 Using the notation to be introduced in subsection 2.1, when .
5 Page 5 of 55
dividend policies of both firms. Thus, by means of the dividends to be distributed, he will be able to determine the resource allocation across them – and in such a way that the cash transfers between the firms do not result in deadweight costs. In turn, the common stocks issued by the parent firm to be retained by the
ip t
subsidiary must be from a distinct class of those shares held by the agent (also issued by the parent). This distinction is made so as to allow the dividend policies of each of these classes to differ between them. So, as the agent has discretionary power over the
cr
dividend policies of both stock classes, he can both allocate additional funds to the
subsidiary and determine his own remuneration; however, since those stocks belong to
us
different classes, these two flows are determined independently.
The dividends of the stocks issued by the parent and held by the agent will be
an
distributed only after both firms succeed in reducing their respective bankruptcy risks to satisfactory levels. This result somehow leads to a similar conclusion to that drawn by Biais, Mariotti, Plantin and Rochet (2007) for a firm with a diffuse investor basis.
M
Nevertheless, the result shown herein differ in that the dividend policy will take into account the bankruptcy risk not only of the firm in question (in this case, the parent), but also of the other firm in the group (in this case, the subsidiary).
d
On the other hand, the subsidiary’s dividend policy assumes distinct
te
characteristics. In this case, the dividends can be distributed at any time, depending on the state of nature: when this is relatively more favorable to the business activities of the
Ac ce p
subsidiary than to those of the parent, dividends are distributed in order to make up the parent’s cash reserves. The dividend policy regarding the class of parent’s stocks held by the subsidiary features an analogous behavior. Therefore, risk sharing among the group firms occurs through two mechanisms:
cross-holding of equity between them – i.e., the holding of the parent’s equity by the subsidiary, and vice versa –, and distinct dividend policies across the firms. In fact, cross-holding allows the agent, by means of the distribution of dividends, to allocate resources across the firms of the group. In turn, distinct dividend policies allow these distributions to smooth the effects of shocks and the consequent risk sharing among the firms. In this way, this paper proposes an alternative explanation for the practice of cross-holding of equity between firms. This phenomenon is often seen by the literature as a mechanism of separation of ownership and control, differing from the strict pyramidal structure as the voting rights for control purposes are distributed across 6 Page 6 of 55
several firms, instead of being concentrated in a single holding company.12 The present paper, however, highlights the role of this cross-holding in the risk sharing among firms, enabling the resource allocation between them and the consequent smoothing of shocks.
ip t
In addition, ever since La Porta, Lopez-de-Silanes, Shleifer and Vishny (2000) and Faccio, Lang and Young (2001), the dividend policy of business groups has been
regarded as a tool for the reduction of funds available to the controllers, thereby
cr
restricting their capacity to expropriate the wealth of investors. The present paper
introduces an additional function to the dividend policy, which consists in the allocation
us
of losses and gains across firms within the group. As a result, the distribution of dividends by a certain firm would be subjected not exactly to its own performance, but
an
more precisely to the difference between its performance and those of the other firms in the group.
This paper also shows that the market value of securities issued by a given firm
M
are sensitive to shocks on the activity of another firm in the group, what helps explain the contagion between asset prices on different markets. According to the findings of this paper, this contagion would occur due to the risk sharing among the group firms, would directly affect the business
. In this regard, Johnson and Soenen (2002), investigating some
te
activities of firm
d
and not necessarily because the shock to firm
countries with close economic ties to Japan, show a positive correlation between foreign
Ac ce p
direct investment flows received from that country, and comovement between the stock markets of these economies and that of Japan. Finally, it is demonstrated that the tax on intercorporate dividends may render
the pyramidal structure infeasible, and makes the organization of the two projects as two divisions of a single firm, creating a conglomerate, the only feasible alternative. The adoption of this kind of tax by the United States in the 1930s helps explain the low frequency of these groups in that country, although they were quite common there until then. This idea is presented by Morck (2005) and is corroborated by the findings of this paper. This work is related to the literature on financial contracts in a continuous-time setting. In this literature, Holmstrom and Milgrom (1987) is a seminal work, while
DeMarzo and Sannikov (2006), Biais, Mariotti, Plantin and Rochet (2007), Cadenillas, 12
See, for instance, Bebchuk, Kraakman and Triantis (2000).
7 Page 7 of 55
Cvitanic and Zapatero (2007), Sannikov (2008), Biais, Mariotti, Rochet and Villeneuve (2010) and Williams (2011), among others, may be cited as recent contributions. A line of research in this literature investigates the capital structure and security design as a result of the optimal contract between an agent and a principal. DeMarzo and Sannikov
ip t
(2006) and Biais, Mariotti, Plantin and Rochet (2007) are relevant works in this line. The present paper uses this perspective as a starting point, but considering a group of projects and investigating how the capital structure and the financial securities issued
cr
and purchased by the group are used for risk sharing.
Moreover, this paper is part of an effort to understand the ownership structure of
us
business groups. This is usually perceived as a mechanism of separation of control from cash flow rights – see Morck, Wolfenzon and Yeung (2005) and Khanna and Yafeh
an
(2007) for relevant reviews of this literature. Almeida and Wolfenzon (2006) move away from this perspective, showing that a pyramidal structure may arise for other reasons than the separation of ownership and control – in their case, due to financing
M
advantages of pyramidal structures that arise as a result of certain frictions in the capital market. In the present paper, conversely, the ownership structure of the business group arises as a risk sharing mechanism between firms, providing the proper incentives to the
d
manager to allocate the gains and losses in a predictable way.
te
This paper is organized into four sections. Following this introduction, Section 2 introduces the principal-agent model in order to investigate the incentives related to the
Ac ce p
management of the business group, the constraints necessary to the alignment of interests between the parties, and the resulting optimal contract. Section 3 analyzes the consequent capital structure and the price dynamics of the issued securities. Finally, Section 4 concludes.
2
The Principal-Agent Problem and the Optimal Contract This paper assumes a firm managed by an agent with limited liability under a
contract with a principal with unlimited liability, responsible for the necessary financing. Admittedly, both are risk-neutral, and the contract between them provides the agent with a share of the firm’s equity, and requires the firm to make certain payments to the principal at each instant, in what comprises the remuneration of its debt. Thus, the agent’s compensation consists of the percentage of the dividends distributed by the firm he is entitled to. 8 Page 8 of 55
As this firm is operating normally, the agent receives another cash flow generating project. To implement it requires an irreversible investment equal to
.
This investment can, at least partially, be made by the said firm. If that happens, it represents a deviation from its activities and, consequently, the previous contract must
ip t
be renegotiated with the principal. However, even if the implementation of the project does not involve any financial supply by the firm, the activities will imply a change in
the incentives to which the agent will be subjected. Therefore, anyway, the previous
cr
contract must be renegotiated between the parties before implementing the project. This renegotiation can include both changes in the terms of the original contract and
invest in each of the projects at any time she desires.
us
additional financial supply by the principal. Also, using her wealth, the principal may
an
The agent then proceeds to manage two projects, the parent and the subsidiary. The parent’s realized profit can initially have four allocations: the distribution of dividends, the principal’s remuneration13, cash transfers to the subsidiary, or the
M
accumulation of the project’s cash reserves.
In turn, the subsidiary’s realized profit may be allocated as follows: cash
project’s cash reserves.
d
transfers to the parent, the principal’s remuneration14, and the accumulation of the
te
The contract between the parties specifies the principal’s remuneration as a function of the cash flows history. In turn, the agent has discretionary power to establish
Ac ce p
his own remuneration and cash transfers between projects. It is taken as given that each project is liquidated once it is unable to make the payments to the principal established by the contract.
Supposedly, the principal knows the probability distribution of profits and
observes the cash reserves of both projects. However, she does not observe the realized profit of any of them, allowing the agent to divert some of the profit from one of the projects to the cash reserves of the other project, without the principal being able to identify such practice. Thus, there are two types of transfers of funds between projects: those that are observed by the principal (the parent’s both remuneration and additional investments made in the subsidiary) and those that are not observed by her. Furthermore, given the 13
This remuneration can be either positive or negative. If it is negative, it represents additional investments in the project, to be made by the principal. 14 Which can also be either positive or negative.
9 Page 9 of 55
difficulty faced by the principal in monitoring the funds of both projects, the agent is assumed to have discretionary power to determine the distribution of dividends of the parent (and, therefore, his own remuneration) and the observable payments across projects.
ip t
Note that, since the agent is remunerated by the parent (through a percentage of the dividends distributed to shareholders) and he has discretionary power to allocate resources between projects, he will always prevent the parent from being liquidated
cr
before the subsidiary. Hence, it is assumed, during the entire term of the contract between the parties, that either the subsidiary is liquidated before or that both are
us
liquidated simultaneously.
an
Remark 1: The assumption that the agent is not paid by the subsidiary is an implicit consequence of the supposition that he is not able to divert funds from it. Alternatively, it could be admitted that the agent is able to make such diversions. However, this would
M
simply reduce the problem to a case in which the agent manages a single project15, and is able to divert cash from it – an agency conflict analyzed, for example, in DeMarzo and Sannikov (2006) and Biais, Mariotti, Plantin and Rochet (2007).
d
Differently, the assumption that the agent is not able to divert resources to
te
himself from the subsidiary introduces a distinct agency problem, as a consequence of the resulting asymmetry between the projects. This would be the case, for example, in
Ac ce p
which, for legal reasons, the subsidiary was subject to tighter auditing procedures than the parent.
Considering any instant , suppose the cash reserves of both projects are given.
The scenario described above can be summarized in the following timeline:
(T1) The agent observes the realized profits on the interval
.
(T2) Given the cash reserves at , the contract determines the remuneration the principal will receive from each one of the projects on the interval
.
15
Using the notation that will be introduced in the next subsection, the operating profit of this project, on the interval , would be given by .
10 Page 10 of 55
(T3) Given the information at (T1) and (T2), the agent decides on his own remuneration, the payments between the projects and the amount to be diverted across them on the interval
.
(T4) The variations in cash reserves on the interval
are determined
ip t
residually by the difference between the profits observed at (T1) and the cash flows .
cr
stipulated at (T2) and (T3). These variations will determine the cash reserves at
As a result of the above succession of events, the principal, when faced with , is not able to know whether
us
certain levels of cash reserves of both projects at
these levels result from the allocation of resources by the agent at (T3) or from the profits realized at (T1). Her problem, therefore, consists in providing the agent with the
an
appropriate incentives so that he can determine a dividend policy for the parent and
2.1
M
transfers of funds between the projects according to her interests.
Formalization of the Problem In what follows, let
for the terms referring to the parent, and
d
subsidiary. The operating profit of each one of them on the interval
Ac ce p
te
by
in which
and
are positive constants, and
probability space
for the is given
(1)
is a Brownian motion on a filtered
satisfying the usual conditions.16 Nevertheless, it
is assumed that only the agent observes
(and, consequently, the realization of
).
On each interval
profit,
, the parent has two cash flow sources: its operating
, and the payments received from the subsidiary,
.17 This cash flow
16
By usual conditions, it is meant a right-continuous filtration, and containing all sets with zero probability. 17 In what follows, any flow , represented with the right-to-left arrow, refers to resources of the , refer to flows from the parent to subsidiary allocated to the parent; those with the left-to-right arrow, the subsidiary. Thus, one should not mistake the notation introduced here for that of the vector calculus, where represents a vector. In the present paper, and will always be scalars, and the arrow serves only to help visualize the direction of cash flows within the group.
11 Page 11 of 55
;18 the principal’s
can be allocated as follows: the payment of dividends, remuneration,
;19 the payments made to the subsidiary,
; or the
. Thus, one has the identity
accumulation of cash reserves,
In turn, the subsidiary has the following cash flow sources: its operating profit,
,
cr
ip t
(2)
; or
and the investments made by the parent,
; the principal’s remuneration,
us
allocations: the parent’s remuneration,
. This cash flow can initially have three
the accumulation of cash reserves,
an
. Hence, one obtains the identity
M
(3)
can take on any real values. When
By definition, the flows
will be remunerated by project ; when
, she will provide additional financial
d
supply to the said project.
, the principal
te
However, since the principal does not observe the realization of operating
be the amount of
unobservable payments, on the interval
, from the parent, to make up the cash
profits, the agent is able to divert some part of these profits to the cash reserves of the
Ac ce p
other project without the principal being aware of it. Let
reserves of the subsidiary, and
as the unobservable amount diverted in the
opposite direction. It is admitted that, of these unobservable payments, the project of each monetary unit. Thus, the share of the amount
receives a fraction
diverted from the subsidiary actually received by the parent is equal to
, while the
share of the amount diverted from the parent and received by the subsidiary is given by . In turn,
and
represent the deadweight costs incurred by
the respective diversions.
18
Let be the parent’s equity share held by the agent. Then, from the total of dividends paid a part equivalent to will go to the agent, and the other part , to the principal. 19 Besides the part of the dividends she is entitled to (given by ).
,
12 Page 12 of 55
Therefore, the parent obtains two payment flows from the subsidiary,
and
, on each interval, the first of which is observed by the principal, while the second one is unobservable by her. Likewise, the payments made by the parent to the and
, the first of which is observed by
the principal, and the second one is not.
2.1.1 The contract between the parties
(4)
(5)
an
us
cr
In this way, the identities in (2) and (3) can be rewritten as
ip t
subsidiary may be split into two parts,
One assumes that the agent has discretionary power to establish the parent’s
M
dividend policy – and, therefore, his own remuneration –, the parent’s remuneration process and the additional financial supplies made by the parent to the subsidiary. Thus, ,
, is only suggested by the contract, and it is up to the
d
the process
agent to accept or not this suggestion.
te
It is also admitted that, to match the agent’s actions with the principal’s interests, – this assumption is made without
Ac ce p
the principal may use a pair of amplifiers
. The pair determines that: (i)
loss of generality, as it is possible to have
for each monetary unit the parent supplies to the subsidiary (by way of principal will receive
the parent (by way of
), the
units; (ii) for each monetary unit received as remuneration by ), the principal will receive
units. The purpose of these
parameters is to change the impact of cash flows between projects on their respective cash reserves, altering the incentives to which the agent will be subjected. and
Given the processes one process
, for any
that satisfies, for . Hence,
and
there exists only ,
and
are payments received by the principal
directly from the parent and from the subsidiary, respectively. Then, letting the parent’s equity share held by the principal,
be
the contract establishes 13 Page 13 of 55
as
the
principal’s instantaneous remuneration. The subsidiary project is established by a contract contract) stipulating the pair ,
and
.20 Regarding the
, the agent can divert them from the path suggested by
and
ip t
processes
and the process
(replacing the original
choose a strategy that maximizes his expected utility throughout the contract period. The principal’s remuneration
cr
is determined exclusively by the
contract, and these payments must be made by the respective projects, regardless of
us
their cash reserves. When these are insufficient for that, the said project is liquidated. As argued earlier, as the agent is remunerated by the parent and he can allocate resources
an
between the projects, the parent shall not be liquidated before the subsidiary. Hence, the contract is assumed to be terminated when the parent’s cash reserves are insufficient to meet its obligations. The term of the contract
, in which
M
.21
is then given by interval
The principal’s problem consists in determining the incentive-compatible
d
contract that maximizes her expected utility. By incentive-compatible contract, it is meant any contract under which the strategy adopted by the agent is actually the one
te
suggested by the contract.
To find this optimal contract, it is necessary to obtain the constraints that must
Ac ce p
be included so that the agent’s actions correspond to those required by the principal. With that purpose, the following section will analyze the agent’s maximization problem under a given contract
2.2
.
The agent’s problem Let
and
be the discount rates, respectively, of the agent and of the principal.
Assuming that the agent is more impatient than the principal, one has
. The
following analysis admits, without any damage to the conclusions reached, that any income flow received by the agent is immediately consumed by him. 22 20
Note that, given the terms related to , the processes and are determined residually, according to identities (4) and (5). 21 Equivalently, the subsidiary is liquidated at the instant , in which . 22 Alternatively, suppose that the agent can consume some of this income flow and save up the rest. However, supposing that these savings grow at a rate , any contract that implies positive savings by the agent at some will be weakly dominated by another one in which the principal saves in his place and
14 Page 14 of 55
Just for this subsection, make the following assumption about the process :
Assumption 1: For
(c) for
;
; such that
.
cr
,
ip t
23
(b)
: (a)
Remark 2: The assumption above assumes a Markovian structure for
, which
us
significantly simplifies the agent’s problem. Anyway, in the context of the principal’s problem, Remark 3 further ahead will discuss the appropriateness of this assumption, to .
an
the detriment of a history-dependent process
In this way, besides the Markovian structure mentioned in the remark above, depends on time solely by means of the
M
Assumption 1(a) assumes that the process
levels of cash reserves at each instant, but not explicitly on time itself. This assumption is made in order to guarantee the stationarity of the present problem, given its infinite
d
time horizon.
te
Assumption 1(b) is intended to guarantee that the stochastic differential equations in (6) and (7) have a unique solution. Finally, Assumption 1(c) will be used in
Ac ce p
the proof of Lemma 2. Intuitively, it guarantees, by means of Dynkin’s formula, that for
Let
.
. Inserting (1) into (4) and (5), one sees that the cash reserves
of the parent and of the subsidiary will follow the respective processes
(6)
(7)
makes payments to the agent, through the parent, at the instants he would consume his savings. This result is formally established in DeMarzo and Sannikov (2006) and in DeMarzo and Fishman (2007). 23 represents the set of all functions times continually differentiable in .
15 Page 15 of 55
in which were used the definitions
and .
Define
as the set of feasible strategies available to the agent, i.e., the set of all 24
non-decreasing process
adapted to
, with values in
, and
25
Then, by letting
(8)
us
cr
ip t
with bounded variation. The agent’s criterion functional can be defined as
, the value function
of the present problem
an
is such that:
M
(9)
Since the contract is terminated when the parent runs out of cash reserves, the function for
.
d
must satisfy
Lemma 1 below implies that the subsequent analysis can be restricted to
te
26
contracts in which
. Thus, when subjected to these contracts, the agent’s
Ac ce p
optimal strategy will not include any unobservable (by the principal) allocation of resources between the projects. Its proof, as well as those of the other lemmas and propositions, is shown in the Appendix.
Lemma 1: (a) Let
be the set of optimal contracts, i.e., those incentive-compatible that
maximize the principal’s expected utility. Suppose that this set is non-empty. Then, there is a contract
24
under which
For any process ,
. Since, for
.
,
,
,
,
and
take on
nonnegative values, the processes , , , and shall be nondecreasing. 25 According to the notation used in the present paper, whenever a certain variable is denoted by an uppercase letter, , it will refer to the process in question. When the variable is denoted by a lower-case letter, , it represents a value assumed by the process at a specific, and clear in the context, instant (in (8), the integration parameters indicate that , i.e., ), or whenever the specific instant is not relevant. 26 Note that, being and , implies for . Obviously, the same applies to
.
16 Page 16 of 55
(b) Under an incentive-compatible contract, in order for necessary that
. Likewise, in order for
and and
, it is
, it is necessary
.
that
ip t
Item (b) of the lemma above establishes the incentive-compatibility constraints, which impose the necessary conditions so that the agent will prefer to allocate resources
cr
between the projects using processes that can be observed by the principal, to the detriment of those unobservable by her. To understand the intuition behind this result, 27
paid by the subsidiary; on the other hand, the parent
monetary unit of remuneration
for each monetary unit of
. Therefore, in order for the agent to
an
receives the fraction
for each
us
note that, on the one hand, the parent actually receives a fraction
choose to transfer resources using an observable process
, it is necessary that .
M
. The same intuition applies to the condition
As a result of Lemma 1 above, the following analysis can be restricted, without
necessary conditions for that,
. Thus, implicitly, it is assumed that the
d
any loss, to contracts that result in
and
. That is,
Ac ce p
,
te
In what follows, for any function
relative to the parent’s cash reserves ( ),
, are satisfied.
, let
,
and
represents the partial derivative of function , in relation to the subsidiary’s cash reserves
( ), and so on and so forth.
After that, define the differential operator , where
as, for any function
is a countable set of points,
(10)
in which
27
From the amount
represents the covariance between the Brownian motions
paid by the subsidiary, the parent receives the equivalent to , whereas the principal receives
and
.
.
17 Page 17 of 55
Lemma 2 below characterizes the agent’s value function. Item (a) defines the Hamilton-Jacobi-Bellman (HJB) equation to be satisfied by that function. However, since, in the present problem, the control exerts a linear influence on both the dynamics of the state trajectories and the criterion functional, one has a singular stochastic control
ip t
problem. In this case, the HJB equation involves not only a partial differential equation (represented by the first term of (11)), but actually a system of partial differential
cr
inequalities, as pointed out by (11).
Lemma 2: (a) The value function defined in (9) is a viscosity solution to the equation
.
(c)
an
;
M
is a concave function on domain
(11)
d
(b)
us
below:
To understand the intuition behind Lemma 2, first observe that the agent for each monetary unit of dividend paid by the parent. On deciding
te
receives a fraction
Ac ce p
for an additional monetary unit of cash reserve in this project, the agent incurs an opportunity cost, since this unit could alternatively be partially allocated to his own consumption.
Therefore, on the one hand,
represents the opportunity cost incurred by the
agent when he postpones his consumption (deciding on an additional monetary unit of cash reserve in the parent). On the other hand,
denotes his expected gain in terms of
future utility, obtained from that postponement. Then, by postponing his consumption, the agent’s expected gain cannot be lower than his opportunity cost. Naturally, if that were the case, the postponement would not occur and the agent would consume the parent’s cash reserves until
is satisfied, leading to the term
in (11).
Likewise, recall that, for each monetary unit of remuneration established for the parent (and, indirectly, for the principal) – in the form of actually receives a fraction
–, this project
. So, when deciding on an additional monetary 18 Page 18 of 55
unit of cash reserve in the subsidiary, the agent incurs an opportunity cost, since this unit could alternatively be allocated to the parent’s cash reserves. This opportunity cost is given by
. represents the opportunity cost
In this way, on the one hand,
ip t
incurred by the agent by postponing the parent’s remuneration (deciding on an additional monetary unit of cash reserve in the subsidiary). On the other hand,
cr
denotes his expected gain from this postponement. Hence, by postponing the parent’s remuneration, the agent’s expected gain cannot not be lower than his opportunity cost.
us
Again, if that were the case, this postponement would not occur and the agent would is satisfied, leading to the third term of
remunerate the parent until
an
(11).
Analogously, for each monetary unit of investment by the parent to the subsidiary – by way of
–, the latter receives a fraction
. When
M
deciding on an additional monetary unit of cash reserve for the parent, the agent incurs an opportunity cost, since this unit could alternatively be allocated to the subsidiary’s . Likewise, the agent’s
d
cash reserve. This opportunity cost is given by
te
expected gain from postponing this investment,
, cannot not be lower than his
, as expressed by the fourth term of (11).
opportunity cost, leading to
to represent, for a given level
Ac ce p
Therefore, define function
of
cash reserves of the subsidiary, the lowest level of the parent’s cash reserves from which the agent opts for the distribution of dividends:28
(12)
From the function above, one may obtain the set
defined below and illustrated by
Figure 1.
(13)
28
In what follows, adopt the convention that
.
19 Page 19 of 55
ip t
2
cr
1
0
us
Figure 1: The set
1
. The agent would not establish any remuneration to himself
an
whenever the cash reserves are located in this set.
Thereafter, define function
of the
M
to represent, for a given level
parent’s cash reserves, the lowest level of the subsidiary’s cash reserves from which the
Ac ce p
te
d
agent decides to remunerate the parent, i.e.,
From this function, one gets the set
defined below and illustrated by Figure 2.
(14)
20 Page 20 of 55
ip t
2
cr
2
0
us
Figure 2: The set
1
. The agent would not establish any remuneration to the parent
an
project whenever the levels of cash reserves are located in this set.
Finally, define
M
to represent, for a given level
of the subsidiary’s
cash reserves, the lowest level of the parent’s cash reserves from which the agent opts to :
Ac ce p
te
d
allocate resources to the subsidiary, under the constraint that
Let then the set
be defined as below and illustrated by Figure 3.
(15)
21 Page 21 of 55
ip t
2
cr
3
0
us
1
Figure 3: The set
. The agent would not establish any supply of funds from the
,
and
, it is possible to summarize a possible strategy to be
M
From
an
parent to the subsidiary whenever the levels of cash reserves are located in this set.
followed by the agent. First, the agent would not establish any remuneration to himself whenever the cash reserves are located in
.
, making
te
–, the agent will determine a distribution of dividends equal to these levels return to
, the
– i.e., the levels of cash reserves lie outside
d
parent’s cash reserves reach
. However, if, for a given level
Ac ce p
Likewise, the agent would not establish any remuneration to the parent
whenever the levels of cash reserves are located in
. Nonetheless, if they leave
, the
agent would establish a remuneration to the parent, making the levels of cash reserves return to
.
Finally, the agent would not establish any supply of funds from the parent to the
subsidiary whenever the levels of cash reserves are located in leave
. However, if they
, the agent will determine new financial supplies to the subsidiary, making these
levels return to
.
As a result of this policy, the cash reserves of both projects would remain in the region
29
illustrated in Figure 4. Proposition 1 below shows that this
policy actually dominates any other feasible strategy to be adopted by the agent.
29
Note that, by construction,
and, therefore,
.
22 Page 22 of 55
ip t
2
cr
4
1
. The cash reserves of both projects would remain in the region
an
Figure 4: The set
us
0
M
.
For the existence of a solution to the problem described in Proposition 1, it is still necessary to establish certain conditions necessary so that, when the process
. These necessary conditions are analogous to the ones pointed out by
te
returns to
and the agent adopts the proposed strategy, the process
d
reaches the boundary of
Remark 3.1 in Dupuis and Ishii (1993), and are described by Assumption 2 in Appendix
Ac ce p
B.
These conditions, intuitively, require that, in the first place, if the boundaries of
and
have decreasing regions, they must have, in absolute terms, a maximum slope
(i.e., they should not be excessively vertical). In the second place, in the corners formed by30
and
(points
and
in Figure 4), the set , and
extremely long horns. More formally, denote equidistant points in
from
(but with
in
, and
in
and
as
). Assumption 2 then
implies a minimum necessary distance (dependent upon ) between
30
should not have
and
.
represents the boundary of the set .
23 Page 23 of 55
Proposition 1: Let the sets
,
and
as defined by (13), (14) and (15), and the set , it is known31 that there exists a
. Given an initial state process
that results in a trajectory
(i)
, almost surely with respect to ; ,
and
are nondecreasing;
ip t
(ii)
, such that:
cr
(iii) one has
us
(16)
an
(17)
M
(18)
which
denotes
of
the
set
in
question.
Then,
.
The principal’s problem
Ac ce p
2.3
interior
te
,
the
d
in
For this problem, define the agent’s continuation value as the present value of
the promised flows of utility by the contract, from to the end of its term, i.e.,
in which
Lemma 3: For
. Lemma 3 below establishes the dynamics of the process
(19)
.
,
31
Items (i), (ii) and (iii) describe a Skorokhod problem in a domain with corners. The existence of a solution to this problem was shown by Dupuis and Ishii (1993). The problem of the statement corresponds to Case 2 of the referenced paper.
24 Page 24 of 55
(20)
,
Let
, satisfies
.
be the set of all nondecreasing stochastic process , with values in
adapted to
ip t
in which
and with bounded variation. In her maximization
cr
problem, the principal will be subject to three state variables, namely, the cash reserves of both projects and the agent’s continuation value. Then, the principal’s criterion
us
functional is given by
M
an
(21)
So, it is possible to define the value function
of the principal’s problem:
te
d
(22)
Ac ce p
Define the differential operator where
as, for any function
,
is a countable set of points,
(23)
Lemma 4 below, analogously to Lemma 2 of the agent’s problem, characterizes the principal’s value function. Again, respect to, respectively,
and
and
, and
represent the partial derivatives of
, the partial derivative relative to
with
.
Lemma 4: (a) The value function defined in (22) is a viscosity solution to the system of equations below:
25 Page 25 of 55
(24)
(c)
.
;
ip t
is a concave function on domain
cr
(b)
Figure 5 shows the implications of Lemma 4, assuming, for the sake of .
us
and
M
an
illustration only, fixed values for
te 0
Ac ce p
0
d
> −1
= −1
Figure 5: The value function , assuming fixed values for larger than
Then, define
whenever
and
, and equal to that value when
. Its slope is .
as the lowest level of the agent’s continuation value in which the
principal’s expected gain from making a direct transfer to the agent is equal to its cost, i.e.:
(25)
The following strategy summarizes the possible terms of the optimal contract. By the definition of
in (25), for every
, the reduction of one unit in the agent’s
26 Page 26 of 55
continuation value will cause
to increase by less than one. Therefore, the principal
will not establish any remuneration to the agent whenever his continuation value is . However, at
value, the increase in
, for the decrease by one unit in the agent’s continuation
will have the same magnitude. Thus, when the agent’s , the principal will determine an instantaneous
continuation value reaches
ip t
smaller than
, making that value returns immediately to
remuneration to the agent equal to
.
cr
Proposition 2 below shows that the strategy above dominates any other that
Proposition 2: Assume process
and parameters
as given. Let the set
is taken as defined in (25). Given
an
, in which an initial state
, there is a process
that leads to a trajectory
M
such that:
, almost surely with respect to ;
(i)
is nondecreasing;
Ac ce p
te
(iii) one has
d
(ii)
us
could be adopted by the principal.
Then,
Remark 3: Given
.
, the principal will be indifferent to the possible
specifications that the processes
and
may assume. This result has two
implications. Firstly, this indifference allows assuming that these processes are Markovian, as proceeded in the previous subsection (from Assumption 1). In fact, by assuming a cost associated with the verification and enforceability related to the establishment of
from a history of realizations of
, and being the principal
indifferent to that process, it is natural to imagine that she accepts to minimize that cost through a Markovian structure (in which she needs to take account of only the current
27 Page 27 of 55
value of
). Secondly, the optimal contract will not be necessarily unique, i.e., the one
derived in this paper, as resulting from that assumption, can be just one among others possible from different specifications (which not necessarily Markovian) for the
2.4
and
.
ip t
processes
The optimal contract
In order for the contract considered here to be incentive-compatible, the
cr
promised flows of utility by the contract to the agent must match those that result from
the contract value from the cash reserves (11),
,
it
is
an
and
, i.e.,
must be equal to
us
his maximization. In other words, the agent’s continuation value at
. By Ito’s lemma
known
that
. Comparing this result
with (20), it may be concluded that, given an incentive-compatible contract, one must and
.
M
have
By Proposition 2, the principal will remunerate the agent only from
.
Therefore, in an incentive-compatible contract, the cash reserves from which the .
In other words, the
curve in Figure 4 must belong to the level curve
.
Then, the terms
of the contract must be such that they satisfy the conditions:
Ac ce p
te
d
principal would be willing to remunerate the agent are those that satisfy
(C1)
;
(C2)
;
(C3)
, in which
is the function defined in (12),
in addition to Assumption 1. Conditions (C1) and (C2) result from Lemma 1. There may be several contracts that satisfy the indicated conditions. Anyway, the proposition below demonstrates the existence of at least one optimal contract for each feasible value assumed by
Proposition 3: Suppose that (C2). Given
.
takes on values that satisfy conditions (C1) and
, there exists a nonnegative function
, such that 28 Page 28 of 55
, which causes Assumption 1 to be satisfied, and implies (C3).
For the full characterization of the optimal contract, it remains to determine the and
, which will depend on the bargaining power of the
ip t
values of
parties. To match this choice with the analysis in the subsequent section, suppose that
represents the irreversible
cr
the agent has full bargaining power. As defined earlier,
investment needed for the implementation of the new project. Since
is the
us
subsidiary’s cash reserves in the initial period, the total amount necessary for the . This amount can be, at least partially,
establishment of this project is given by
, whereas the remaining
amount,
an
covered by an initial financial supply by the principal,
, is provided by the parent’s internal resources (in the present case, by its
cash reserves).32 Let
be the parent’s cash reserves at the instant immediately before the value of the original contract for
M
the establishment of the subsidiary, and
, the agent
te
solves the problem below:
and
d
the principal. Then, to determine the values of
Ac ce p
(26)
subject to:
32
Anyway,
can be negative. This will be the case when is allocated to the parent’s cash reserves.
(27)
, and the amount
29 Page 29 of 55
in which the last constraint is given by Lemma 1 and by the definition that . In first place, since in the initial instant the agent is able to allocate resources between the projects without deadweight cost, the cash reserves of each project shall be such that their respective marginal values are equivalent, i.e.,
and
, with at least one strict inequality. Then, for .
us
,
cr
Lemma 5: Let
ip t
. Consider then the lemma below:
According to the lemma above, the smaller the value of
, for the same level of
larger value implies a higher influence
an
cash reserve, the larger the value of the contract for the agent. Notwithstanding, this (by the fourth constraint in (27)) that, in turn, will
and
(second constraint). Looking at Figure 5, it can be , that larger
will yield a smaller value of
and,
consequently, a smaller
. This will have to be compensated for by smaller
M
perceived that, if
. Hence, the levels of
d
which will lead to a reduction in
and
or
,
to be chosen shall be
te
those in which the gain for the agent from a smaller value for them exactly offsets the
Ac ce p
loss caused by a smaller resulting and
. In other words, suppose that
are continuous functions. Then, the values of
shall be such that noted that
, since, if
and increase
3
or
33
and
. Also, by means of Figure 5, it can be , the agent could concomitantly reduce
.
Capital market
The optimal contract derived in the previous section can be implemented through the issuance of financial securities to be negotiated in a competitive capital market. For this purpose, the design of these securities reproduces the ownership
33
Except, evidently, in cases of corner solutions, in which the optimal values for .
or
will be zero or
30 Page 30 of 55
structures of firms resulting from the optimal contract, as well as the remuneration of the parties.
3.1
Implementing the optimal contract
ip t
Define the financial securities below:
Common stocks: securities whose instantaneous payoff is not determined beforehand,
cr
but recurrently at each instant. It is assumed that (i) the stocks issued to the public are held by a diffuse investor base, what makes the company decisions be controlled by the
us
restricted stockholders – in the present model, only the agent; and (ii) there is one or more classes of stock, and these classes may differ from one another with respect to the
an
dividends to be received;
Fixed-coupon debt securities: perpetuities that pay their holders at an instantaneous
M
fixed coupon rate.
Variable-coupon debt securities: perpetuities that pay their holders at an instantaneous
te
d
variable coupon rate.
of
For simplification, suppose that the parent is a public firm, with a fraction
Ac ce p
its common stocks held by the agent, and the remaining fraction, diffuse investor base.34 Define
, held by a
as the total payments to be made by the parent for
remuneration of its outstanding debt. Then, the optimal contract derived in the previous section can be implemented as follows:
Optimal Financing Strategy. If the group issues financial securities in the capital market in order to finance the establishment of the subsidiary firm, a financing strategy may be adopted such that:
Parent firm:
34
This assumption is made only to refrain from any possibility of control over the firm by some investor.
31 Page 31 of 55
Common Stocks: issuance of a second class of stocks, of which a fraction
is
, is held by the subsidiary.
sold to investors, and the remaining,
Fixed-coupon debt securities: issuance of perpetuities that pay an instantaneous coupon
perpetuities, each of them being
cr
Variable-coupon debt securities: purchase of
).
ip t
(or their purchase, if
equal to
remunerated by an instantaneous coupon equivalent to
.
us
,
Subsidiary firm:
, held by the controlling firm.
M
sold to investors, and the remaining fraction,
an
Common stocks: issuance of a single class of stocks, with a fraction equal to
Fixed-coupon debt securities: issuance of perpetuities that pay an instantaneous coupon .
d
equal to
te
The financing strategy above refers to the implementation of the contract identified by Proposition 3 and by the maximization in (26). Although this contract is
Ac ce p
not necessarily the unique optimal one, the financing strategy above allows drawing some conclusions that can be generalized. In this regard, the cross-ownership of firms – i.e., the holding of the parent’s equity by the subsidiary, and vice versa – is targeted at reproducing the incentives provided for by the optimal contract. Each of the firms will be remunerated by the dividends distributed by the other firm to its stockholders. Thus, through the respective dividend policies, the agent can determine the resource allocation between them. Nevertheless, as each firm holds a fraction stocks, each monetary unit received by the firm will represent
of the other firm’s units of the total
dividends to be distributed. Likewise, through the distribution of dividends by the parent, the agent can determine his own remuneration. However, if the parent’s stocks owned by him belong to a different class from those held by the subsidiary, both payment flows made by the parent (to the agent and to the subsidiary) are independent of each other.
32 Page 32 of 55
Finally, the debt securities of both firms are targeted at reproducing the flows resulting from functions payments
and
introduced in Proposition 3, after considering
to be made by the parent, due to its pre-existing debt.
Therefore, two remarks can be made about the group’s capital structure. The
ip t
first one is concerned with the different functions attributed to each security. In fact, the cross-holding of equity between the firms allows the agent to transfer resources between them. Debt issues, however, aim to provide appropriate incentives so that the agent
cr
postpones his remuneration (and, therefore, cash outflows) until both firms succeed in accumulating certain levels of cash reserve and, consequently, reducing their respective
us
bankruptcy risks to satisfactory levels.
Secondly, it shall be noted that the dividend policies differ among the firms of
an
the business group. On the one hand, the dividends of the parent’s stocks owned by the agent (and by the public) will be distributed only when both firms succeed in reducing their respective bankruptcy risks to satisfactory levels. On the other hand, the
M
subsidiary’s dividend policy behaves differently. In this case, the dividends can be distributed at any time, depending on the state of nature: when the state of nature is
d
relatively more favorable to the subsidiary’s activities than to those of the parent, the distribution of dividends is determined. The class of stocks issued by the parent and
Market price dynamics
Ac ce p
3.2
te
held by the subsidiary has a similar dividend policy.
Based on the financing strategy pointed out above, the present problem can have
three types of common stocks:
(S1) common stocks issued by the parent and held by the agent; (S2) common stocks issued by the parent and held by the subsidiary; (S3) common stocks issued by the subsidiary and held by the parent.
Let
be the market values of all securities, such that ,
and . Likewise, let
be the market
values of the parent’s and subsidiary’s net debts, respectively, such that
33 Page 33 of 55
and
. The
proposition below determines the behavior of these functions over time.35
and
, one has
ip t
Proposition 4: For
in
which
us
cr
(28)
,
an
.
and
sources of random shocks (
and
M
According to the proposition above, each market value is influenced by two ). In this way, the prices of securities of each
firm are subject not only to uncertainty over the activity of the firm in question, but also
d
to that related to the other firm in the business group. In other words, regardless of the
te
geographical or sectoral proximity between the markets of the parent and of the subsidiary, shocks to the activity of either of them shall have an impact on the prices of
Ac ce p
securities of the other firm, leading to a contagion effect between the markets.
3.3
Tax on intercorporate dividends Suppose that, in the economy in question, a tax at a rate
is levied on the flows
of dividends between the firms. Therefore, for each monetary unit of dividend distributed by the stocks (S2), the subsidiary actually receives a fraction . Likewise, for each monetary unit of dividend distributed by the . The last
stocks (S3), the parent actually receives the fraction constraint in (27) must be rewritten as follows:
35
Again, applies to
and and
represent the partial derivatives of .
relative to
and
, respectively. The same
34 Page 34 of 55
By the constraint above, note that, if
, the only feasible value for . Hence, if
the parameters in question is
, the new project
must necessarily be financed only by debt issues, without new issuances of common
ip t
stocks for investors. In this case, the parent and the subsidiary will constitute two divisions of a single firm, forming a conglomerate.
cr
It may then be concluded that, if the tax rate is high enough, it renders the
organization of the business group unviable, making the organization of the projects
us
under a conglomerate the only feasible option. This result goes in line with the exceptional situation of the US economy, which has a low frequency of those groups. In fact, in the US, intercorporate dividends have been taxed since 1935. For Morck (2005),
an
the levy of this tax helps explain the fact that, while business groups were quite common in the US economy until the 1930s, after then, the importance of such groups
4
M
significantly decreased.36
Conclusions
d
This paper sought to investigate the risk sharing mechanisms between firms of a
te
business group and the resulting capital structure. In fact, it was shown that, even with the agency conflicts involved, the optimal contract can be implemented by the issuance
Ac ce p
of financial securities for a diffuse investor base, situation in which stockholders and creditors of each firm might be different. The issuance of these securities is possible due to the fact that the resulting capital structure generates incentives so that the agent allocates losses and gains between the firms in a predictable way, allowing investors to adequately assess their securities. The results of this paper confirm Morck’s (2005) idea that the taxes levied on
intercorporate dividends can help explain the fact that business groups are less frequent in the US economy. In this sense, Morck (2005) shows that, considering 33 countries for which data are available, the US, in 1997, was the only country that taxed
36
Other measures implemented in the same time period which, according to Morck (2005), also proved to be relevant for the breakup of business groups were: elimination of consolidated group corporate income tax filings, changes to the capital gains tax rules regarding the elimination of controlled subsidiaries, and restrictions on pyramidal groups in utilities industries.
35 Page 35 of 55
intercorporate dividends.37 At the same time, the US and the UK were the only ones where none of their top ten firms were controlled by a business group.38 In the model of this article, when the agent diverts cash between the firms, it may result in a deadweight cost. Hence, the group's capital structure must be such that
ip t
makes the agent prefer to transfer resources between firms through the distribution of dividends than through unobservable payments. For that purpose, the percentage of the subsidiary dividends to be received by the parent (and vice versa) shall be greater than
cr
the receivable percentage of the unobservable payments (that is, net of deadweight costs). In this way, if the tax rate on intercorporate dividends is high enough, there will
us
be no capital structure with two separate firms satisfying this condition. In this case, the organization of the two projects as two divisions of a single firm would be the only
an
capital structure able to induce the agent to not divert cash between the firms. Exclusively under the model proposed in this paper, the conglomerate which is formed in consequence of this tax is a suboptimal result. One must bear in mind,
M
however, that this suboptimality is due only to the controller’s preferences. In fact, the level of frequency of business groups in a certain economy gives rise to different distributions of corporate control in society. These distributions, in turn, can exert
d
significant influence on the public policies to be implemented and on the investments in
te
innovation. Therefore, the final outcome for social welfare from the presence of business groups in the economy might even be negative. The insertion of the framework
Ac ce p
proposed in this paper in a general equilibrium model is a possible way to investigate these issues.
Appendix A: Proofs of Lemmas 1-2 Proof of Lemma 1: (a) Consider initially process
contract
may have
that establishes ,
and any process or
. Suppose that there is a . Under such contract, one . Below, it will be
37
See Table 2 in Morck (2005). Regarding the UK, as Morck (2005) reports, Franks, Mayer and Rossi (2005) highlight the London Stock Exchange Takeover Rule of 1968 as the main determinant for the low frequency of business groups in that country. According to that rule, any acquisition of 30% or more of a listed company must be an acquisition of 100%. In this way, the parent firm is required to own either 100% or less than 30% of the subsidiary, creating a barrier to the development of pyramidal ownership. 38
36 Page 36 of 55
shown that, in any of these three cases, there is a contract
in which
. In
any of the cases, the proof consists in demonstrating that every optimal contract in which there are unobservable (by the principal) transfers between projects is weakly dominated by another one in which such transfers do not occur.
ip t
By convenience, equations (6) and (7) are rewritten below, but only highlighting
(A.1)
us
cr
the terms that will be relevant in the following demonstration.
refers to the terms of the contract
an
In what follows, the superscript
. Let
be the contract with the same terms as
differences: (i)
; and, (ii)
. Suppose
. Considering (ii) in (A.2), one realizes that
d
such that
.
, but with two
M
Case 1:
Also,
multiplying
te
.
(A.2)
(ii)
by
, leading, by way of (A.1), to
In this way, one concludes by induction that
Ac ce p
,
for
. , yielding
.
Case 2:
. Let
; (ii)
(i)
be the contract with the same terms as
, except for:
; (iii)
; and (iv) .
Suppose
such
. Considering (iii) in (A.2), one notes that multiplying (iii) by
. Also,
,
, yielding
(when considering (iv)) and,
by
(A.1),
that
, .
Again,
by
induction,
it
is
concluded
37 Page 37 of 55
for
, yielding
. The inequality is for
due to the fact that, according to (iv),
Case 3:
. Let ;
(i)
.
be the contract with the same terms as
(ii)
; .
Suppose
,
(ii)
(iii)
such
that
. Also,
one
has
us
multiplying
and
cr
. Considering (ii) in (A.2), one realizes that
, except for:
ip t
that
, yielding (when considering (i) and , and, by (A.1),
an
(iii)) . Again, by induction, it is concluded that
M
.
yielding
, assuming there is a contract
Consider now process
,
that establishes
. Rewriting again (6) and (7) and highlighting only the terms
d
and any process
for
te
that will be relevant for the demonstration:
Ac ce p
(A.3)
Case 1´: and
into
Case 2´: (i)
(A.4)
. The proof is similar to that of Case 1, only substituting
,
,
and , respectively.
. Let
be the contract with the same terms as ;
(ii)
; ;
.
, except for:
Suppose
. Considering (iii) in (A.4), one notes that
(iii) and such
(iv) that . Also, 38 Page 38 of 55
multiplying (iii) by
(=
),
yielding , and, by (A.3),
(when considering (iv))
yielding
.
Case 3´:
. Let
for
be the contract with the same terms as
, except for:
cr
; (ii)
(i)
,
ip t
. Again, by induction, one concludes that
.39 Suppose
such that
us
(iii)
. Considering (ii) in (A.4), one realizes that
an
,
multiplying (ii) by (when considering (iii))
. Also, yielding , and, by (A.3), for
,
M
. Again, by induction, it is concluded that .
yielding
; and
and
, but
(contradicting the first assertion of the
te
such that
d
(b) Step 1: Suppose that an incentive-compatible contract determines a control
Ac ce p
statement). Let then the alternative control
in which, for
(ii)
; . Denote
obtained under control
.
Suppose
such that
.
and
Multiplying
By
(ii)
by
induction,
and,
for
for
by ,
when
In this case, it shall be noted that, being, by (ii), . Thus, by (iii),
,
, yielding (when considering (i) and
. However, since
one has
(iii)
. Considering (ii) in (A.2),
, .
;
as the state trajectory
(iii))
39
: (i)
(A.1), yielding , this ,
.
39 Page 39 of 55
inequality is strict. So, strategy
is dominated by
, implying that the
contract in question cannot be incentive-compatible.
Step 2: Suppose that an incentive-compatible contract determines a control and
, but
(contradicting the second assertion of
ip t
such that
the statement). Let then the alternative control
in which, for
cr
; (ii) . Denote .
Suppose
such that
.
as the state trajectory
. Considering (ii) in (A.4),
Multiplying
(ii)
an
obtained under control
;
us
and (iii)
: (i)
by
,
one
obtains
M
, yielding (when considering
(i) and (iii)) induction,
for
yielding
when
is dominated by
, implying that the contract in
te
. Again, since is strict. So, strategy
,
.
d
By
, and, by (A.3),
, this inequality
Ac ce p
question cannot be incentive-compatible.
Proof of Lemma 2: (a) Consider the auxiliary lemma below:
Lemma A.1: Suppose that constants
is a polynomial growth function, i.e., that there are
such that
(A.5)
in which
represents the Euclidean norm. Admit also that
programming principle such that, for any stopping time
satisfies the dynamic
,
40 Page 40 of 55
(A.6)
is a viscosity solution to (11).
Proof: See Theorem 5.1, chapter VIII, on Fleming and Soner (2006).
ip t
Then,
cr
To use the lemma above, it is necessary to prove (A.5) and (A.6). The auxiliary
Lemma A.2: The function
is continuous in
and satisfies the dynamic
an
programming principle.
us
lemma below proves the second condition.
It remains then to show that ,
d
such that, for
has polynomial growth. With this goal, define . From (6) and (7),
Ac ce p
te
process
M
Proof: See Lemma 2.1, chapter V, on Fleming and Soner (2006).
Applying the generalized Ito’s formula to
in which
,
and
,
represent the continuous parts of the respective processes.
Taking the conditional expectation to the above equation, 41 Page 41 of 55
By the definitions of
and
, the last summation of the above equation takes on and
. Note that Assumption 1(c) implies that
us
nonnegative values, just as such
cr
ip t
(A.7)
that
.
since
M
an
,
Thus,
the
above
equation
and
Ac ce p
,
letting
. Since
, , and given
, one has, after doing
the arbitrariness of the control As
and
te
d
Rearranging
,
.
, one concludes that
(A.8)
satisfying condition (A.5).
(b) Let
and
, such that . Define also
from initial conditions
,
and, for ,
and
are optimal controls under trajectories
and
as the state trajectories
. Suppose then that and
, let
and
, respectively.
42 Page 42 of 55
In turn, let
. is given by
us
cr
ip t
From this control, the state trajectory
one
notes
and of the initial state
an
By the definitions of the control
for
that (with
and
in
which
defined analogously). Also by the definition of
d
M
, one has40
the control
,
,
Ac ce p
te
(A.9)
in which the third row used the definition of controls
and
. However, by . Substituting (A.9)
the definition of the value function,
, which proves
into this inequality, it is concluded that the concavity of .
(c)
By
the
concavity
of
,
for
,
one
must
have
. 40
To obtain the second equality, define respectively. By the definition of , conditional expectation, definition of
, one also has
and
as the filtrations generated by trajectories and , . Since process is -measurable, by the properties of . Naturally, from an analogous .
43 Page 43 of 55
Suppose
for a given
One notes that
is a local maximum of
,
, with
and
.
, by the viscosity subsolution
cr
Since
, consider the smooth test function
ip t
. For
, and let
For
a
sufficiently
small
an
us
property of , one obtains
,
there
is
a
M
. Identically, one proves that
contradiction.
Therefore,
is continuous.
te
d
Appendix B: Assumption 2 One notes that, in ,
and
and
Ac ce p
vectors
,
, the process
is reflected according to
, respectively. Let
,
and
be
the unit vectors with the same direction of such vectors, i.e.,
44 Page 44 of 55
In addition, for and
, define
as the inward normal in relation to
defined analogously. Finally, for
in , with
, let
, in
represents the interior of the given set. Consider then Assumption 2 below:
, there are scalars
,
, such that
in which
us
cr
Assumption 2: For each
ip t
which
represents the scalar product.
an
Appendix C: Proofs of Lemmas 3-5 and Propositions 1-4
M
Proof of Proposition 1: Step 1: Suppose stopping time formula to
and define the
. Applying the generalized Ito’s
Ac ce p
te
d
, one gets
in which notes
,
and
(C.1)
represent the continuous parts of the respective processes. One that
45 Page 45 of 55
, and an analogous reasoning applies to
. Substituting these
ip t
results in (C.1),
us
cr
(C.2)
, and analogously for
such that
. Subtracting
concavity of ,
, and, by (11),
. By (A.8),
on both sides of the
, one obtains
. Then, by the
M
and letting
inequality, dividing it by
and
an
in which
. Thus, taking the conditional expectation
Ac ce p
te
d
to the equation above and rearranging the resulting expression,
(C.3)
By definition of the control
, one knows that
Suppose that the optimal control
implies
, with strict
inequality in at least one dimension. By (11) and by the construction of that,
for
, . Then, letting
,
.
,
, one knows and
in (C.3),
46 Page 46 of 55
which violates the dynamic programming principle in (A.6). Hence, optimal control in
Suppose
ip t
2:
.
,
and
define
cr
Step
is the
. Following the same procedures used in the
us
previous step, one obtains the same equation (C.3).
, one knows that
By definition of the control
implies
, with strict
an
Suppose that the optimal control
.
inequality in at least one dimension. In the present case, by construction, one has , letting again
,
and
te
d
M
in (C.3),
. Then,
Ac ce p
once more violating the dynamic programming principle in (A.6). Therefore, the . Moreover, by the concavity of
optimal contract implies knows that, for
which implies the optimality of optimal control in
and
, one
,
,
. It may then be concluded that
is the
.
Step 3: Performing an analogous procedure to that used in Step 2, one concludes that is the optimal control also in
41
and in
41
.
As done in Step 2, one shall note that, by the concavity of , for
, which implies the optimality of
in
. In turn, for
and
and
,
,
47 Page 47 of 55
Since
and in
one obtains the optimality of the control
Proof of Lemma 3: Define
,
.
as the contract value for the agent
us
cr
ip t
throughout the term of the contract, such that
is a martingale. Then, by the martingale representation
M
, which allows concluding that
and
such that
te
d
theorem, one knows that there are two predictable processes
. Calculating the differential
Ac ce p
in which
,
an
For
(C.4)
(C.5)
from (C.4) and (C.5), equating the
resulting expressions and considering the homogeneity with respect to time of the present problem, one gets (20).
Proof of Lemma 4: (a) This proof is quite similar to that of Lemma 2. Likewise,
one uses Lemmas A.1 and A.2. It is then necessary to show that
has a polynomial
growth.
To do that, consider the same process
used in the proof of Lemma 2, and
rewrite (A.7) taking into account that, in the present problem, the process
, leading to the optimality of
in
is a control:
.
48 Page 48 of 55
ip t cr
Since
,
.
given
that
the
above
equation
Ac ce p
Rearranging
te
d
M
an
us
,
Also,
. Once
has, after letting
and
letting
,
, and given the arbitrariness of the control
, one
. Since
, one
,
and
concludes that
(C.8)
proving that
has a polynomial growth.
49 Page 49 of 55
The proofs for items (b) and (c) are similar to those of the respective items of Lemma 2.
Proof of Proposition 2: Suppose
and define the stopping time
ip t
. Applying the generalized Ito’s formula to
(C.9)
M
an
us
cr
, one gets
that
and
an
analogous reasoning
applies
to
. In turn,
Ac ce p
,
te
d
Note
. Substituting these results in (C.9),
By (C.8),
such that
. Since . Dividing this inequality by
, and letting
50 Page 50 of 55
, one obtains
. Then, by the concavity of ,
. Therefore,
taking the conditional expectation to the above equation and rearranging the resulting
ip t
expression,
By the definition of the control
. Consider then a control
. By (24) and by the construction of
, one knows that, for
an
that implies
, one knows that
us
cr
(C.10)
and
. Then, by (C.10),
d
M
,
te
which violates the dynamic programming principle. Thus,
Ac ce p
.
In addition, by the concavity of is the optimal control in
Hence,
is the optimal control in
, one knows that
.
Proof of Proposition 3: Consider a nonnegative function ; and (ii) for
that: (i)
functions
and
.
,
such that
such . From the
proposed by the statement, one notes that Assumption 1 is satisfied.
Admit also that, for
By (11), one knows that, for
,
,
51 Page 51 of 55
and
. Since, according
ip t
By construction, , one has
to Lemma 2(c),
. One then concludes that
Proof of Lemma 5: Suppose and define
and
and
for
, and
, yielding
By the definition of
.
, and, by (8),
M
,
and
be defined analogously. By
. Since
. Given the arbitrariness of
,
d
,
be the optimal
as the state trajectories under
, respectively. Let (6) and (7),
. Let
an
control under
us
cr
.
.
Ac ce p
te
Still by (8), one knows that
If
, then
(C.11)
, yielding, by the result shown earlier,
(C.12)
If
, by (6) and (7),
and ,
and,
, one concludes that, if
. Since, according to (9), by ,
the
previous
result, . Inserting 52 Page 52 of 55
this result and (C.12) into (C.11) and considering that .
for
Letting
one
. The proof for the case in which
, obtains
and
is
ip t
analogous.
Proof of Proposition 4: Consider initially the function
cr
lemma below:
that satisfies the ordinary
us
Lemma C.1: (a) Consider a function
an
differential equation
; (ii) for
M
subject to conditions: (i)
,
.
te
for
; and (iv)
(C.13)
,
and
; (iii) for bounded on
. Then,
d
,
and the auxiliary
,
Ac ce p
Proof: Applying the generalized Ito’s formula to
Considering (C.13) and the conditions (ii), (iii) and (iv) in the above equation, one obtains, after taking the conditional expectation,
53 Page 53 of 55
in the above equation, after considering condition (i) and rearranging the .
resulting equation, one obtains
,
in (28) is immediate. The proofs for ,
however,
the
equations
are similar. In the case of in
(C.13)
are
us
and
and
and
for
given
by
, respectively, .
an
and conditions (ii) and (iii) are replaced by
the result
cr
From the lemma above, by applying the generalized Ito’s formula to
ip t
Letting
M
References
d
ALMEIDA, H. V., and D. A. WOLFENZON (2006): “A Theory of Pyramidal Ownership and Family Business Groups”. Journal of Finance, 61, 2637-2680.
Ac ce p
te
BEBCHUK, L., R. KRAAKMAN, and G. TRIANTIS (2000): “Stock pyramids, crossownership, and dual class equity”, in Concentrated Corporate Ownership, ed. by R. K. MORCK. University of Chicago Press, Chicago, IL. BERKMAN, H., R. COLE, and L. FU (2009): “Expropriation through loan guarantees to related parties: Evidence from China”. Journal of Banking and Finance, 33, 141-156. BIAIS, B., T. MARIOTTI, G. PLANTIN, and J. ROCHET (2007): “Dynamic Security Design: Convergence to Continuous Time and Asset Pricing Implications”. Review of Economic Studies, 74, 345-390. BIAIS, B., T. MARIOTTI, J. ROCHET, and S. VILLENEUVE (2010): “Large Risks, Limited Liability, and Dynamic Moral Hazard”. Econometrica, 78, 73-118. CADENILLAS, A., J. CVITANIC, and F. ZAPATERO (2007): “Optimal risk-sharing with effort and project choice”. Journal of Economic Theory, 133, 403-440. CHEUNG, Y. L., Y. QI, P. R. RAU, and A. STOURAITIS (2009): “Buy high, sell low: How listed firms price asset transfers in related party transactions”. Journal of Banking and Finance, 33, 914-924.
54 Page 54 of 55
DeMARZO, P., and Y. SANNIKOV (2006): “Optimal security design and dynamic capital structure in a continuous-time agency model”. Journal of Finance, 61, 2681-2724. DUPUIS, P., and H. ISHII (1993): “SDEs with oblique reflection on nonsmooth domains”. The Annals of Probability, 21, 554-580.
ip t
FACCIO, M., L.H.P. LANG, and L. YOUNG (2001): “Dividends and expropriation”. American Economic Review, 91, 54-78.
cr
FLEMING, W. H., and H. M. SONER (2006): Controlled Markov Processes and Viscosity Solutions, Springer, NY.
us
HOLMSTROM, B., and P. MILGROM (1987): “Aggregation and Linearity in the Provision of Intertemporal Incentives”. Econometrica, 55, 303-328.
an
JOHNSON, S., R. LA PORTA, F. LOPEZ-DE-SILANES, and A. SHLEIFER (2000): “Tunneling”. American Economic Review Papers and Proceedings, 90, 22-27.
M
JOHNSON, R. and L. SOENEN (2002): “Asian Economic Integration and Stock Market Comovement”. The Journal of Financial Research, 25, 141-157. KHANNA, T. and Y. YAFEH (2007): “Business groups in emerging markets: paragons or parasites?” Journal of Economic Literature, 45, 331-372.
te
d
LA PORTA, R., F. LOPEZ-DE-SILANES, A. SHLEIFER, and R. VISHNY (2000): “Investor protection and corporate governance”. Journal of Financial Economics, 58, 3-27.
Ac ce p
MORCK, R. (2005): "How to Eliminate Pyramidal Business Groups: The Double Taxation of Intercorporate Dividends and other Incisive Uses of Tax Policy", in Tax Policy and the Economy, 19, ed. by J. POTERBA. Cambridge and London: MIT Press. MORCK, R., D. WOLFENZON, and B. YEUNG (2005): "Corporate Governance, Economic Entrenchment, and Growth". Journal of Economic Literature, 43, 655-720. RADNER, R., SHEPP, L. (1996): “Risk vs. profit potential: a model for corporate strategy”. Journal of Economic Dynamics and Control, 20, 1373-1393. SANNIKOV, Y. (2008): “A Continuous-Time Version of the Principal-Agent Problem”. Review of Economic Studies, 75, 957-984. WEINDENBAUM, M. L. (1996): “Greater China: the next economic superpower?”, in The Dynamic American Firm, ed. by R. A. BATTERSON, K. W. CHILTON, and M. L. WEINDENBAUM. Springer US. WILLIAMS, N. (2011): “Persistent Private Information”. Econometrica, 79, 1233‒ 1275. 55 Page 55 of 55