Robust optimization of the 0–1 knapsack problem: Balancing risk and return in assortment optimization

Accepted Manuscript Robust Optimization of the 0-1 Knapsack Problem: Balancing Risk and Return in Assortment Optimization Robert P. Rooderkerk , Hara...

Download PDF

1MB Sizes 1 Downloads 91 Views

Report

PDF Reader
Full Text

Accepted Manuscript

Robust Optimization of the 0-1 Knapsack Problem: Balancing Risk and Return in Assortment Optimization Robert P. Rooderkerk , Harald J. van Heerde PII: DOI: Reference:

S0377-2217(15)00921-2 10.1016/j.ejor.2015.10.014 EOR 13298

To appear in:

European Journal of Operational Research

Received date: Revised date: Accepted date:

4 September 2014 24 April 2015 8 October 2015

Please cite this article as: Robert P. Rooderkerk , Harald J. van Heerde , Robust Optimization of the 0-1 Knapsack Problem: Balancing Risk and Return in Assortment Optimization, European Journal of Operational Research (2015), doi: 10.1016/j.ejor.2015.10.014

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Highlights Demand for products in retail assortments is stochastic. Our robust retail assortment optimization problem balances risk and return. We propose a novel and efficient heuristic to solve the robust assortment problem. The heuristic provides solutions in a matter of seconds. The solutions offer substantial risk reduction, yet only small reductions in return.

AC

CE

PT

ED

M

AN US

CR IP T

    

ACCEPTED MANUSCRIPT

Robust Optimization of the 0-1 Knapsack Problem:

Robert P. Rooderkerk

AN US

Harald J. van Heerde

CR IP T

Balancing Risk and Return in Assortment Optimization

AC

CE

PT

ED

M

October 2015



Robert P Rooderkerk is Assistant Professor of Empirical Research Methods, Rotterdam School of Management, Erasmus University Rotterdam, The Netherlands; [email protected]. Harald J. van Heerde is Research Professor of Marketing at Massey University, New Zealand and Extramural Fellow at CentER, Tilburg University, the Netherlands; [email protected]. Robert P. Rooderkerk is the corresponding author. The authors thank Tammo Bijmolt, Marnik Dekimpe, Els Gijsbrechts, Dick den Hertog, Aurélie Lemmens, Anja de Waegenaere and seminar participants at the Rotterdam School of Management for helpful comments and suggestions. The authors are grateful to IRI France for sharing the data. The authors gratefully acknowledge the Netherlands Organization for Scientific Research (NWO) and the New Zealand Royal Society Marsden Fund (MAU1012) for research support.

1

ACCEPTED MANUSCRIPT

Robust Optimization of the 0-1 Knapsack Problem: Balancing Risk and Return in Assortment Optimization ABSTRACT Retailers face the important but challenging task of optimizing their product assortments.

CR IP T

The challenge is to find, for every category in every store, the assortment that maximizes (expected) category profit. Adding to the complexity of this 0-1 knapsack problem, retailers should also consider the risk associated with every assortment. While every product in the assortment offers an expected return, there is also uncertainty around its expected demand and profit contribution. Therefore, retailers face the difficult task of designing a portfolio of products

AN US

that balances risk and return. In this paper, we develop a robust approach to optimize retail assortments that offers this balance. Since the dimensionality of this robust 0-1 knapsack problem in practice often precludes full enumeration, we propose a novel, efficient and real-time heuristic that solves this problem. The heuristic constructs an approximation of the risk-return Efficient Frontier of assortments. We find that the robust solutions offer the retailer a

M

considerable reduction in risk (variance), yet only imply a small reduction in expected return. The constructed approximations contain assortments that are optimal solutions to the robust assortment optimization problem. Moreover, they represent insightful visualizations of the

PT

time (matter of seconds).

ED

solution space, allowing for interactivity (“what risk premium should the retailer pay?”) in real-

Keywords: Retailing, Assortments, Risk-Return, Efficient Frontier, Robust Optimization,

AC

CE

Knapsack Problem

2

ACCEPTED MANUSCRIPT

1. Introduction One of the most challenging decisions for retailers is which assortment to carry (Rooderkerk, Van Heerde, & Bijmolt, 2013). The Retail Assortment Optimization problem is the problem of selecting an assortment of products such that category profit is maximized and the total amount of shelf space that is used does not exceed the available amount (see e.g., Chong,

CR IP T

Ho, & Tang, 2001, Kök & Fisher, 2007) 1. While this is already a complicated optimization problem, it is compounded by the fact that consumer demand cannot be assumed to be fixed. Rather, demand of individual products is stochastic (Cachon, Randall, & Schmidt, 2007; Fischer, Shin, & Hanssens, 2013; Lee, Padmanabhan, & Whang, 1997), and it may show positive or negative correlations with the demand of other products in the assortment, depending on whether

AN US

products are complements or substitutes. Consequently, the total risk associated with an assortment is not just related to the variances of the individual items in the assortment, but also to the nature and extent of covariance between every pair of items in the assortment. Fisher et al. (2013) note that increased demand volatility at the retail level has led to many undesirable consequences such as the bullwhip effect (Cachon et al., 2007; Lee et al.,

M

1997). For the retailer the unpredictability of consumer demand results in a high variance of the associated cash flows. Such volatility increases a retailer’s need for working capital (Rhao &

ED

Bharadwaj, 2008)2. Hence, retailers would benefit from hedging the risks arising from demand uncertainty of product assortments. Consequently, the domain of assortment optimization could benefit greatly from adopting a risk-return approach. Despite the importance of this challenge,

PT

research has been restricted to theoretical approaches (Cardozo & Smith Jr., 1985; Devinney & Stewart, 1988) and numerical experiments (e.g., Rusmevichientong & Topaloglu, 2012).

CE

However, an empirical approach is lacking (Kouvelis, Chambers, & Wang, 2006). This study aims to fill this gap by developing an empirical risk-return approach to assortment optimization.

AC

This is a special case of the robust 0-1 knapsack problem that aims to maximize the expected total benefit of all items in a knapsack while accounting for the uncertainty in each item’s benefit. 1

This can be seen as a 0-1 knapsack problem where the decision per item is whether to include it in the assortment (knapsack) or not, representing a binary decision. Every item has a certain profit (benefit), while taking up a given amount of shelf space (weight). The knapsack’s capacity constraint is the consequence of the limited amount of available shelf space for the category as a whole. 2 Walmart uses its cash flows to fund its operations and global expansion plans (Walmart 2014). To deal with the volatility in cash flows Walmart resorts to short-term borrowings to meet capital requirements. The most recent annual report (Walmart 2014) reveals that these short-term borrowings are quite sizeable ($7.67 billion).

3

ACCEPTED MANUSCRIPT

An additional complicating factor of this retail challenge is the high dimensionality of the knapsack problem: retailers carry more products and categories than before (Rooderkerk et al., 2013), and increasingly customize their assortment at the store level (Rigby & Vishwanath, 2006). Consequently, the potential of a risk-return approach for assortment optimization should be assessed with the dimensionality challenges in mind. In other words, an empirical method for

CR IP T

risk-return assortment optimization should work efficiently on large data sets. Therefore, the goal of this paper is to twofold: (1) to develop an empirical risk-return approach to assortment optimization, and (2) to develop an efficient heuristic that is able to deal with the large data dimensions and provides real-time (near-)optimal solutions that optimally balance risk and return.

AN US

We derive a robust counterpart of the Retail Assortment Problem that balances risk (profit uncertainty) and return (expected profit) of assortments. The objective function of the robust multi-objective counterpart favors returns while simultaneously penalizing risk. The premise is that the retailer may prefer an assortment that is associated with slightly less return (expected profit) yet with a lot less risk (uncertainty). Using full enumeration, we test the robust

M

approach on both synthetic and empirical data concerning retail assortment optimization problems. We find that many of the robust solutions provide retailers with a considerable

ED

reduction in uncertainty (variance), with only a small reduction in expected profit. In practice, the sizeable dimensions of assortment planning problems preclude the use of full enumeration. Therefore, we propose an Efficient Frontier heuristic that provides assortments

PT

that (near-) optimally balance return and risk in real-time. The heuristic does so by quickly constructing a subset of the risk-return Efficient Frontier. For an assortment on this frontier, a so-

CE

called efficient (or Pareto-optimal) assortment, no other assortment exists with a higher expected profit without increasing variance, and conversely no other assortment exists that has a reduced

AC

variance without decreasing expected profit.3 These efficient assortments form a very small subset of the total solution space, effectively reducing the complexity of the large problem. An additional benefit of this approach is that the retailer obtains a visualization of a set of efficient assortments that trade-off risk (profit variance) and return (expected profit). The visualized Efficient Frontier aids the retailer in deciding on how risk-averse s/he wants to be (or conversely what risk-premium is required). This makes the heuristic an example of an a posteriori method 3

The term Efficient Frontier originates in the domain of modern portfolio theory (Markowitz 1952) and was originally used to indicate stock portfolios that are efficient with respect to the associated risk and return.

4

ACCEPTED MANUSCRIPT

of solving a multi-objective function (Hwang & Masud, 1979), in which a subset of the Pareto optimal solutions is generated, allowing the decision maker to determine his/her weight between the objective functions after seeing the trade-off between them. That is useful as it is unlikely that the retailer exactly knows how risk-averse s/he is before seeing the trade-off between risk and return. Finally, we derive an upper bound on the optimal robust profit.

CR IP T

Our study contributes to the marketing, operations management, and operations research literature in a number of ways. First, we introduce an empirical risk-return approach to the important domain of assortment optimization, a central problem in the fields of marketing and operations management. Current practice in the assortment optimization literature assumes profits are deterministic, and there is no risk aversion. We allow for situations in which profit is

AN US

stochastic and the decision maker is risk averse. In doing so, we broaden the scope by taking risk into account during the optimization of the assortment composition as opposed to just afterwards through sensitivity analysis.

Second, we introduce a state-of-the art heuristic for solving a robust 0-1 knapsack problem. The heuristic is able to solve the multi-objective function balancing expected return and risk,

M

where the latter is highly non-linear in the binary decision variables in a real-time manner. The heuristic finds the optimal or near-optimal solution, alongside a subset of the efficient frontier of

ED

solutions. The latter allows the retailer to set the desired level of risk aversion a posteriori; after seeing the risk-return trade-off in the set of generated efficient assortments. This allows for much more flexibility than a scalarized approach in which the level of risk aversion is fixed beforehand

PT

and only one corresponding best solution is generated. Finally, the empirical application provides substantive insights to retailers and the literature

CE

on robust optimization. We find that on aggregate, a large gain (risk reduction) only requires a small sacrifice (decrease in expected return). However, the findings also show that the benefits

AC

(risk reduction) and costs (sacrificing expected profit) differ substantially across stores. For a substantial number of stores, the optimal nominal (risk-neutral) assortment is also the optimal robust one. Among the stores for which the two optimal assortments deviate, there are those for which the risk premium is quite small (a small decrease in expected profit achieves a strong risk shelter), while for others the price to pay for more certainty is quite steep. This suggests that retailers should take a store-level, a posteriori approach to risk-return assortment optimization;

5

ACCEPTED MANUSCRIPT

based on the store-specific efficient assortments retailers should decide how risk averse they want to be when constructing the focal store’s assortment. The rest of the paper is organized as follows. In §2, we formally introduce the Retail Assortment Optimization problem, and illustrate that assortments that solely maximize return (so-called nominal assortments) are vulnerable to the uncertainty in the estimated profit

CR IP T

contributions of the products. We next formulate the Robust Retail Assortment Optimization problem in §3. Section 4 describes the new heuristic for constructing a subset of the Efficient Frontier assortments and a (near-) optimal solution to the Robust Retail Assortment Optimization problem. In §5, we determine the usefulness of the heuristic by contrasting the optimal nominal and optimal robust assortments in an empirical application. We end with conclusions and

AN US

recommendations for future research in §6.

2. Optimizing Retail Assortments to Maximize Returns 2.1. Retail Assortment Optimization Problem

Suppose a retailer is considering n products that can be included in the assortment with the available amount of shelf space equal to c. Every product k (k = 1,...,n) occupies an amount

M

of shelf space wk and contributes pk to profit when it is included in the assortment. We define binary decision variables {xk}k=1,...,n, which are equal to 1 if product k is included in the

ED

assortment and 0 otherwise. The Retail Assortment Optimization problem can now be defined as the following 0-1 knapsack problem:

PT

(RAO)

maximize

CE

x

subject to

n

p x k 1 n

k k

w x k 1

k k



xk {0,1}

c

(1) k  1,..., n.

(2)

AC

Restriction (1) is the capacity constraint whereas (2) forces every product to be in or out.

The occupied shelf space per item (the wk parameters) and the available shelf space for the category (the c parameter) are fixed quantities. The binary integer linear program defined by (RAO) is an example of a 0-1 knapsack problem. The knapsack problem is NP-hard, that is, it is unlikely that the problem can be solved in polynomial time (Papadimitriou & Steiglitz, 1982). Several algorithms and heuristics exist to provide (near-) optimal solutions to the ordinary knapsack problem (e.g., Kellerer, Pferschy, & Pisinger, 2004; Martello & Toth, 1990). 6

ACCEPTED MANUSCRIPT

We focus on the core of the retail assortment optimization problem, abstracting from decisions on product pricing, number of facings, and inventory levels. The product selection decision typically precedes pricing and other more tactical decisions. Future research on robust retail assortment optimization could seek to integrate these extensions. 2.2. Risk Sensitivity of Optimal Retail Assortments T p   p1  pn  in the assortment

CR IP T

If demand is uncertain, the profit parameters

optimization problem may be subject to uncertainty. A naive approach to deal with profit parameter uncertainty is to use expected profit values in the optimization. The problem of this approach is that the optimal assortment becomes sub-optimal under small parameter

AN US

perturbations, as illustrated by the problem instance in Table 1.

Table 1: Example Assortment Optimization Problem Shelf space needed (wk) 1 1 1

Available shelf space (c ) 2

M

Item (k) 1 2 3

Profit Covariance matrix () Expected profit Item 1 Item 2 Item 3 ( pk ) 8 1 1.5 1 9 4 5 10 9

ED

Suppose three products are available that require the same amount of shelf space when included (=1), and the available shelf space for the category allows for at most two items. Next, suppose that the profit contributions follow a multivariate normal distribution with mean equal to

PT

the expected profit contributions p   p1  pn  T and covariance matrix equal to . Items 2 and 3 have a large positive covariance, which may be driven by similar sensitivity to demand shocks.

CE

For instance, if these are outdoor items, items 2 and 3 may be for good weather (e.g., a barbecue and a surf board), which may be simultaneously in high demand (when the weather is good) or in

AC

low demand (when the weather is bad). Item 1 has a negative covariance with items 2 and 3, which indicates an opposite sensitivity to demand shocks. Following the example, item 1 may be an item for bad weather (e.g., an umbrella or a board game). Table 2 shows that the solution that maximizes expected profit, i.e., the optimal nominal assortment, {2, 3}, has a large variance (23) relative to the other solutions. This implies that, even though the nominal profit is maximal, there is a lot of uncertainty regarding the actual profit realization. The assortments {1, 2} and {1, 3} are associated with a substantially lower risk

7

ACCEPTED MANUSCRIPT

(variance) while only entailing a small decrease in return (expected profit) compared to assortment {2, 3}. This illustrates the need to construct assortments that balance risk and return. To this end, we formulate the Robust Assortment Optimization problem in the next section. Table 2: Feasible Solutions to the Retail Assortment Optimization described by Table 1. Nominal Profit

Variance (= x T  x )

{1} {2} {3} {1, 2} {1, 3} {2, 3}

8.00 9.00 10.00 17.00 18.00 19.00

1.00 4.00 9.00 2.00 8.00 23.00

CR IP T

Solution

AN US

3. Optimizing Retail Assortments to Balance Risk and Return 3.1. Robust Retail Assortment Optimization Problem

Many optimization problems experience uncertainty in the parameter estimates. Researchers have defined robust versions of such optimization problems in which the parameters lie within some “uncertainty set”, a convex set of possible parameter values. The resulting

M

optimization problems are referred to as the robust counterparts (Ben-Tal & Nemirovski, 1998, 1999). Such a robust counterpart can be formulated for a knapsack problem as well, i.c., the

ED

Retail Assortment Optimization problem. Defining U as the uncertainty region in which the true profit parameters in (RAO) lie, the robust counterpart of the Retail Assortment Optimization

(RAORobust)

x

AC

subject to

n

p

minimize

CE

maximize

PT

problem (RAORobust) is:

pU

k

xk

k 1

n

w

k



xk

k 1

x k {0,1}

c

(3) k  1,..., n.

(4)

The goal of (RAORobust) is to maximize, using decision variables x, the worst possible

profit outcome among parameters p in the uncertainty set U. We adopt a commonly applied type in the robust optimization literature, the ellipsoidal uncertainty region. Ellipsoidal uncertainty sets are flexible approximations to various uncertainty sets and typically result in computational

8

ACCEPTED MANUSCRIPT

tractable robust optimization problems (Ben-Tal & Nemirowski, 1999). This type of uncertainty region is defined as: U  { p| ( p  p) T Θ 1 ( p  p)  r 2 },

(5)

where p is the vector with expected profit values. It is assumed that  is a symmetric positive

CR IP T

definite matrix. We use  kk ' to denote element (k, k’) of . Note that r relates to the amount of confidence we have that the true profit parameter values are included in the uncertainty set. A larger r will increase the confidence.

Based on Ben-Tal and Nemirovski (1998, 1999), El-Ghaoui et al. (1998), we can rewrite the (RAORobust) objective function with U given by (5), as the following Robust Retail

(RRAO) n

maximize x

p x k 1

k k

r

n

AN US

Assortment Optimization (RRAO) problem:

n

 k 1 k ' 1

n

w x

subject to

k 1

xx

kk ' k k '



k k

M

xk  {0,1}

c

(6) k  1,..., n.

(7 )

Parameter r can be interpreted as the risk penalty coefficient in the objective function. If

ED

r>0, the objective function of RRAO penalizes the standard deviation in profit (i.e., risk), which is consistent with the notion of risk aversion (see Rabin & Thaler, 2001) and aversion to post

PT

decision disappointment (Bell 1982, 1985). In line with portfolio theory (Markowitz 19524) an assortment becomes more risky (has a higher standard deviation 5) when items are included that

CE

have highly uncertain profit contributions (large  kk values) and/or when pairs of items are included to have a (high) positive covariance (positive  kk ' if k ≠ k’). If the decision maker requires

AC

more confidence (i.e., a larger r), s/he penalizes risk more heavily.

4

We refer to the recent special edition of the European Journal of Operational Research commemorating Markowitz’s seminal work on portfolio optimization based on a risk-return (mean-variance) analysis (e.g., the editorial by Zopounidis, Doumpos, & Fabozzi, 2014; reflections and clarifications on the applicability of modern portfolio theory by Markowitz himself 2014). 5 Our robust formulation implies the standard deviation of profit as proxy for risk rather than the variance, which was suggested in portfolio theory (Markowitz, 1952). However, Markowitz (p. 89) notes that an investor may just as well be concerned with the standard deviation as proxy for risk. Moreover, the Efficient Frontier of assortments, a notion key to the heuristic solution approach (to be presented later), is invariant to the actual choice out of these two risk proxies. The optimal robust assortment would always lie on the Efficient Frontier, irrespective of the choice for standard error vs. variance as proxy for risk. In addition, the heuristic can easily be adjusted to consider variance as risk proxy instead of the standard error.

9

ACCEPTED MANUSCRIPT

We construct an ellipsoidal uncertainty set by assuming a multivariate normal distribution for the profit parameters, i.e., p ~ N ( p, ). Consequently, the total profit associated with a given assortment x follows a normal distribution, i.e., p T x ~ N ( p T x, x T x). We use this expression to set a value for r as follows. We set the probability that the true (unobserved) profit is lower than the robust profit equal to a specified confidence level  (01). That is, we set

density function of the standard normal distribution.

CR IP T

P( p T x  p T x  r xT x )   . This implies that r   1 (1   ), where () is the cumulative

We now define the nominal versus robust profit of an assortment x: Nominalprofit   NOM ( x) 

n

p x . k k

k 1

n

n

n

AN US

Robust profit   ROB ( x) 



p k xk  r

k 1

(8)



kk ' xk xk '

.

(9)

k 1 k '1

The nominal profit is equivalent to the expected return of the selected assortment. The robust profit, on the other hand, provides a balance between return and risk. We define the

M

 “optimal nominal assortment” (= xNOM ) as the assortment that optimizes the nominal profit  function in (RAO). The “optimal robust assortment” (= xROB ) is the assortment that optimizes the

ED

robust profit function in (RRAO). The next section illustrates the differences between the Nominal and Robust Retail Assortment Optimization problem using the example presented

PT

earlier.

3.2. Robust Retail Assortment Optimization Example Table 3 extends the overview in Table 2 with the robust profit for each feasible solution

CE

for three values of r, corresponding with an increasingly risk-averse decision maker: r = 1.28 ( = 10%), r = 1.64 ( = 5%), and r = 2.33 ( = 1%). We find that the optimal nominal solution {2,

AC

3} is not the optimal solution to the Robust Retail Assortment Optimization problem for any of these three (commonly used) values of r. This is due to the high variance associated with this solution. The optimal solution to each of the three robust optimization problems is the same: {1, 2}. Since items 1 and 2 have a strong negative covariance (see Table 1), including both items in the assortment provides a hedging mechanism for the retailer.

10

ACCEPTED MANUSCRIPT

Table 3: Nominal vs. Robust Profit of the Feasible Assortments in Table 1. Nominal Profit

Variance ( x  x)

r = 1.28 ( = 10%)

1.00 4.00 9.00 2.00 8.00 23.00

6.72 6.44 6.16 15.19 14.38 12.86

T

{1} {2} {3} {1, 2} {1, 3} {2, 3}

8.00 9.00 10.00 17.00 18.00 19.00

Robust Profit r = 1.64 r = 2.33 ( = 5%) ( = 1%) 6.36 5.72 5.08 14.68 13.36 11.13

5.67 4.34 3.01 13.70 11.41 7.83

% of times highest nominal profit based on simulation# 0.00 0.00 0.00 15.00 25.05 59.95

CR IP T

Solution

Note: We have simulated 10,000 draws from p ~ N ( p, ). Per draw we have computed the profit p T x of every assortment x. The percentages in the last column indicate how often each assortment had the highest nominal profit.

Comparing the optimal robust solution {1, 2} to the optimal nominal solution {1, 3}, we see that the robust set of items provides a much lower variance (2) for profit than the nominal set

AN US

of items (23). This is illustrated by Figure 1, which compares the probability distribution of the total profit generated by the optimal nominal and robust solution.

Figure 1 shows that the uncertainty in the profit realization of an assortment can be reduced substantially by choosing an assortment that is only slightly worse in terms of expected profit. In fact, the expected profit (17) of the optimal robust assortment is only 10.5% less than

M

that of the optimal nominal solution (19), but its robust profit (13.70, using r = 2.33) is almost twice the robust profit of the optimal nominal solution (7.83). This example shows the

ED

fundamentals of robust optimization to provide solutions to the Retail Assortment Optimization problem: trade off a relatively small loss in return (expected profit) against a large reduction in

PT

risk (uncertainty) surrounding the true realization.

CE

Figure 1: Probability Distribution of the Profit of the Optimal Nominal and Robust Solution.

AC

Robust

Nominal

0

5

10

15

20 Total Profit

11

25

30

35

ACCEPTED MANUSCRIPT

At this point, we would like to caution against another approach to deal with the uncertainty in profit parameters. This method is based on simulating profit parameters from its estimated distribution, e.g., using the posterior draws from a Bayesian estimation scheme. A researcher may be inclined to solve the optimization problem for multiple realizations of the profit parameter vector, leading to multiple optimal assortments. Next, we can compute the

CR IP T

fraction of times each assortment outperforms the others in terms of profit, which is what we did for the example in the last column of Table 3. We note that solution {2, 3} wins 59.95% of the times, which is more than any other solution, and hence {2, 3} is seemingly “robust” against profit parameter permutations. However, one has to be aware that this approach is risk neutral in the sense it does not provide a probabilistically guaranteed lower bound on the profit that will be

AN US

achieved, which is something the (RAORobust) approach does.

We also note that some studies perform sensitivity analysis with respect to parameter perturbations after assortments have been optimized (e.g., Miller, Smith, McIntyre, & Achabal 2010; Rooderkerk et al. 2013). These studies still optimize expected returns without the consideration of risk. Our study is fundamentally different from this approach in that we account

M

for risk, in addition to return, during the optimization of the assortment. Hence, in our study risk co-determines the composition of the optimal assortment.

ED

In the next section, we describe a new heuristic that can be used when complete enumeration is infeasible. The heuristic constructs a set of assortments that trade off expected profit versus profit risk, and it also results in a (near-optimal) solution to the Robust Retail

PT

Assortment Optimization problem.

CE

4. Efficient Solution Method for the Robust Assortment Optimization Problem 4.1 Motivation for Solution Method The Robust Retail Assortment Optimization (RRAO) problem is essentially a multi-

AC

objective integer program6, in which the decision maker aims to maximize expected profit while minimizing risk at the same time. Fixing the value of balancing parameter r results in a singleobjective optimization problem. Optimal solutions to this single-objective problem are part of the set of Pareto-optimal solutions7 to the multi-objective function. For a given value of r the

6

See Rasmussen (1986) for an overview of this class of problems. A Pareto-optimal (efficient) assortment is an assortment for which neither of the objective functions (return and risk) can be improved in value by another feasible assortment without degrading the other in value. In other words, it is not dominated. 7

12

ACCEPTED MANUSCRIPT

optimization problem (RRAO) is an example of a non-linear binary program, a class of optimization problems that is hard to solve. For small instances (such as the example discussed in section 2 and 3), we can solve this problem through complete enumeration. However, in practice most realistic assortment optimization problems are too large to solve in this way. Consequently, we need to resort to heuristics to solve this optimization problem. Heuristics that

CR IP T

aim at finding the best solution to (RRAO) for a given value of r are referred to as a priori methods (Hwang & Masud, 1979). Hence, these methods assume that the decision maker sets his relative preference beforehand (before seeing any data).

However, retailers likely find it very hard to elicit their amount of risk aversion before seeing the trade-off between risk and return in some actual assortment solutions. That is why we

AN US

opt for a so-called a posteriori heuristic (Hwang & Masud, 1979), which aims to construct a representative subset of all Pareto-optimal assortments. This approximation of the Pareto curve or Efficient Frontier represents the trade-off between risk and return, allowing the decision maker to determine/adjust his risk aversion a posteriori. Consequently, the next section presents an efficient solution method that rapidly constructs an approximation of the Efficient Frontier of

M

assortments. It will simultaneously determine the solution on this approximated Efficient Frontier that bests fits the risk preference elicited a priori (i.e., the value of r), while providing

ED

the retailers with the (visual) means to adapt this preference a posteriori. The heuristic is specifically designed to deal with the fact that one of the objective functions, risk, is a non-linear

PT

function of the binary decision function.

4.2 Efficient Frontier of Assortments for the Empirical Example Figure 2 shows the set of efficient assortments for the example from §2. The horizontal

CE

axis plots each assortment’s profit variance and the vertical axis its expected or nominal profit. Note that the assortments {2} and {3} are not part of the Efficient Frontier as these two

AC

assortments do not satisfy efficiency: they lead to less expected profit than assortment {1, 3} while at the same time they have larger variances.

13

ACCEPTED MANUSCRIPT

Figure 2: Set of Efficient Frontier Assortments for the Example Problem in Table 1. 20

optimal nominal profit

18

{2, 3} optimal nominal assortment

{1, 3} {1, 2} optimal robust assortment

16

12 10 8

{1}

6 4 2

ø 0 0

10

Variance

15

20

25

AN US

5

CR IP T

Nominal Profit

14

Note: the numbers between the brackets refer to the items included in the corresponding efficient assortment.

If the retailer disapproves the loss in expected (nominal) profit of the optimal robust assortment relative to the optimal nominal assortment (dashed line in Figure 2), s/he could use the Efficient Frontier to find an assortment that provides a better balance between expected profit

M

and uncertainty, such as {1, 3}. This means that the decision maker implicitly adjusts the value

ED

of r, the amount of uncertainty s/he is willing to take into consideration. 4.3. Efficient Frontier Heuristic

In Figure 2, the Efficient Frontier consists of five assortments only. In empirical

PT

applications, this number is much higher. In the empirical application in §5, the number of efficient assortments per store ranges between 104 and 408, with a median of 196. To deal with

CE

realistic problems, we develop the “Efficient Frontier heuristic”. The idea behind this heuristic is to approximate the Efficient Frontier set by rapidly constructing a subset of efficient assortments.

AC

The heuristic also provides a (near-) optimal solution to the Robust Retail Assortment Optimization problem. Intuition behind the heuristic. The heuristic capitalizes on the property of the Efficient Frontier that there is no efficient

max 8 . assortment with a variance that exceeds the variance of the optimal nominal assortment qNOM

8

max If there are multiple optimal solutions to the nominal assortment optimization problem q NOM refers to the minimum variance among these solutions. In this way we secure that the corresponding optimal nominal assortment is also efficient.

14

ACCEPTED MANUSCRIPT

This property can be seen as follows. Suppose there is an assortment with variance exceeding max qNOM . In order for the assortment to satisfy efficiency (being on the Efficient Frontier), its

nominal profit should exceed the nominal profit of the optimal nominal assortment  NOM , which is not possible by definition. Exploiting this property, the heuristic starts by solving the nominal assortment optimization problem. After this, it solves a reformulated version of the nominal

CR IP T

assortment optimization problem by limiting the profit variance from above (to an upper bound q), starting from the variance of the optimal nominal assortment.9 By systematically lowering this upper bound, the heuristic constructs a set of assortments that approximates the Efficient Frontier. Maximizing returns while limiting variance

AN US

Limiting the profit variance introduces a non-linear restriction to the nominal assortment optimization problem (RAO) formulated in Section 2.1. That is because the expression for the n

profit variance,

n

 k 1 k '1

kk'

xk xk  , is non-linear (quadratic) in the decision variables. However, we

can replace the non-linear expression for the profit variance by a linear combination of binary

this,

we

define

 kk

as

the

M

variables through the separation of the on-diagonal and off-diagonal elements of . To achieve kth

diagonal

element

of ,

and

 kk '   kk '   k 'k ,

ED

k  1,.., n  1, k '  k  1,.., n. Since  is symmetric, we can restate this as  kk '  2 kk ' . Furthermore,

PT

we define the following auxiliary binary variables vkk '  xk xk ' , i.e., vkk '  1 if xk  1 and xk '  1, 0 otherwise. This leads to the following reformulation of the profit variance:

CE

n n1 n  n  n n var  pk xk    kk' xk xk '   kk xk    kk 'vkk ' . (10) k 1 k 1 k 'k 1  k 1  k 1 k '1 The benefit of this reformulation is that it is linear in the decision variables. However, we

AC

now have auxiliary decision variables (vkk’) that are non-linear functions of the original decision variables (xk). However, we will show below how we can relate the auxiliary variables to the original decision variables through a set of linear constraints. We now define the reformulated version (RAOLimited) of the nominal assortment

optimization problem that restricts the variance by an upper bound q: 9

As stated in footnote 5, the Efficient Frontier is invariant to the use of variance or standard deviation as proxy for risk (Markowitz, 1952). Limiting the variance from above leads to an optimization problem that is equivalent to one in which we limit the standard deviation from above. However, limiting the variance leads to an optimization problem that is easier to solve.

15

ACCEPTED MANUSCRIPT

(RAOLimited) n

xk    

x , v ,h

k 1

mh 

k

contribution due to variance slack

nominal profit n

w x

subject to



c



q

v kk '



xk

v kk '



xk'

v kk '



xk  xk' 1

k 1 n 1

n

 k xk   k 1

k

k

n



k 1 k ' k 1

 kk ' kk '

v

h

AN US

x k  {0, 1}

[capacity constraint ]

CR IP T

p

maximize

v kk '  [0, 1] h  R with

(11)

[variance constraint ]

(12)

( k , k ' )  V

(13)

( k , k ' )  V

(14)

( k , k ' )  V

(15)

k  1,..., n

(16)

(k , k ' )  V

(17) (18)

V  {k  1,.., n  1, k '  k  1,.., n |  kk '  0}

M

(19) (20)

V  V  V

(21)

ED

V  {k  1,.., n  1, k '  k  1,.., n |  kk '  0}

The objective function of (RAOLimited) consists of the nominal profit (return) and the 

n



contribution of a newly introduced slack variable h times m. Slack variable h  q  var  p k x k  is

PT

 k 1



the difference between the imposed variance upper bound, q, and the actual profit variance given

CE

by (10). By adding the slack variable in the objective function with a small positive profit contribution m, we impose that the optimal assortment is efficient while obeying the variance

AC

upper bound. That is, first expected profit is maximized, and if multiple assortments maximize expected profit, the assortment with minimum variance will be selected. A theoretically optimal choice for m is m = /q, with  = the minimum profit difference between any two feasible assortments that display a trade-off between expected profit and variance. In this way, 0  mh < (/q)q = . Consequently, every possible increase in profit will always be preferred to a decrease in variance, since the former will always contribute more to profit than the latter. Only when no profit difference is possible, i.e., maximum profit is attained, the term mh forces to choose the

16

ACCEPTED MANUSCRIPT

assortment with minimum variance among them. To derive  we would have to solve a separate discrete optimization problem, which resembles a multiple knapsack problem and is very hard to solve. For practical reasons we use a value of  = .0001 in the empirical application. We verified that this value was sufficiently small, in the sense that smaller values did not affect the solutions.10

CR IP T

Equation (11) again represents the capacity constraint. Equations (12) – (21) restrict the profit variance to be equal to or below the variance upper bound. Although the vkk’ variables are binary by definition, we can relax them to assume values within the [0,1] domain because of the structure of the formulation above. This is attractive since that the use of continuous decision variables typically leads to a simpler optimization procedure relative to having binary decision

objective

function

and

restrictions

AN US

variables.11 To explain why the decision variables are effectively binary, we note that the (12)

–

(21)

together

require

that

vkk '  xk xk '  1 if xk  1 and xk '  1 , 0 otherwise. We enforce this by distinguishing the situation in which  kk '  0 from  kk '  0 for every pair (k, k’| k < k’) of Stock-Keeping Units (SKUs, i.e.,

M

individual products that are considered to be part of the assortment). When  kk '  0 holds, the objective function and Equation (12) will result in variable vkk ' being set to its maximum value

ED

(1). However, when xk and/or xk’ are equal to 0, Equations (13) and (14) force vkk ' to be equal to 0. In case  kk '  0 holds, the objective function and Equation (12) will result in the variable vkk '

PT

being set to its minimum value (0). However, when both xk and xk’ are equal to 1, Equation (15) forces vkk ' to be equal to 1. Thus, while we allow vkk ' to move freely in the domain [0,1],

CE

effectively they will be 0 or 1. We now describe the Efficient Frontier heuristic in more detail: Efficient Frontier Heuristic

AC

Initialization 1. Solve the nominal Retail Assortment Optimization problem (RAO). This provides the optimal nominal profit  NOM .

2. 10

Find the assortment that results in the optimal nominal profit  NOM with minimum risk.

We refer to Online Appendix A for more details on the derivation of m and the chosen value for .

11

Formally, the use of the auxiliary variables reduced a quadratic binary program to a linear binary program. Next, through the relaxation of the auxiliary variables (continuous versus integer) the optimization problem became a mixed binary program rather than a pure binary program. In the empirical application the mixed binary formulation proved much easier to solve than the original pure binary formulation.

17

ACCEPTED MANUSCRIPT

If there are multiple optimal nominal assortments, we want to obtain the optimal nominal assortment with minimum variance. Hence we solve an optimization problem in which the objective is to minimize variance while enforcing the assortment to be feasible and result in the optimal nominal profit.

3.

Initialize the following variables: max  Set the maximum variance qNOM equal to the variance of the assortment found in step 2.

 Set the best robust profit found so far,  best ROB , equal to the robust profit associated with

CR IP T

the assortment found in step 2. max  Set the variance of the efficient assortment most recently found, qrecent, equal to q NOM .  Set the step size, s (in %), for the variance grid; the implied variance grid is now max described by the following set: {(1 – (n*s) /100) q NOM , n  1,..., 100 s }.

AN US

Repeat until termination 4. Set the variance upper bound q in optimization problem (RAOLimited) equal to the nearest point on the variance grid below qrecent and solve (RAOLimited).

In every iteration of the heuristic, the upper bound on the variance is set to the nearest point on the grid that has a variance below the actual variance of the efficient assortment most recently found. This ensures that we find a different assortment in every step of the heuristic.

Set qrecent equal to the variance of the assortment found in the most recent step 4.

6.

If the robust profit of the assortment found in the most recent step 4 is higher than the best

M

5.

ED

robust profit found so far, replace  best ROB by this value. The assortment with maximum robust profit among those constructed by the heuristic is the heuristic’s solution to the Robust Assortment Optimization problem.

Terminate when the end of the grid is reached or when it is certain that no better robust

PT

7.

solution can be found.

CE

The Efficient Frontier heuristic will continue until the end of the grid is reached. However, if we are only interested in finding a (near-) optimal solution to the Robust Assortment Optimization problem, we let the heuristic terminate when the nominal profit of the efficient assortment most recently obtained is

AC

less than or equal to the best robust profit found so far. When this is the case it is certain that no better robust assortment will be found, reason for the heuristic to terminate.

Figure 3 illustrates the heuristic for the example in §2 with r equal to 2.33, consistent with

a confidence level of 99%, and step size s equal to 5%, while using the underlined termination criterion. A given assortment x has nominal profit  NOM ( x) based on the profit (objective) function of (RAO); similarly, assortment x has robust profit  ROB ( x) corresponding to the robust

18

ACCEPTED MANUSCRIPT

profit function in (RRAO). After three iterations in which the upper bound on the variance, q, is systematically lowered and (RAOLimited) is solved, the Efficient Frontier heuristic terminates. The top panel of Figure 3 shows the assortments found by the heuristic (red dots), which in this case corresponds to the full efficient set (blue squares). The bottom panel depicts the robust profit corresponding to these assortments. (a) Nominal Profit 20

q = 1.15

q = 6.90

q = 21.85

18

{2, 3}

{1, 3}

16

{1, 2}

14  best ROB 12

Initialization:  NOM = 19, qmax = 23; NOM

10

Iteration 1: q = 21.85   NOM(xRAO-Limited(21.85)) = 18; Iteration 2: q = 6.90   NOM(xRAO-Limited(6.90)) = 17;

8

Iteration 3: q = 1.15   NOM(xRAO-Limited(1.15)) = 8;

{1}

6

  NOM(xRAO-Limited(1.15)) <  best = 13.70; ROB

4

 Heuristic TERMINATES

2

ø 5

10

Variance

15

20

Efficient set Heuristic

25

M

0 0

AN US

Nominal Profit

CR IP T

Figure 3: Graphical illustration of the Efficient Frontier heuristic (r = 2.33, step size s = 5)

(b) Robust Profit

18 16 14  best ROB

{1, 2}

PT

10 8

CE

Robust Profit

12

ED

20

6

{1, 3}

Initialization:  best = 7.83; ROB Iteration 1:  ROB(xRAO-Limited(21.85)) = 11.41   best = 11.41; ROB

4

Iteration 2:  ROB(xRAO-Limited(6.90)) = 13.70   best = 13.70; ROB

2

Iteration 3:  ROB(xRAO-Limited(1.15)) = 5.67   best = 13.70; ROB

{1}

ø

AC 0 0

{2, 3}

5

10

15

20

25

Variance

Note: the numbers between the brackets refer to the items included in the corresponding efficient assortment.

Note that a benefit of the heuristic is that, upon termination, the robust profit plot (panel b in Figure 3) can be constructed for any given degree of risk aversion (i.e., for any value of r); even when termination depended on a different value for r (using the underlined termination

19

ACCEPTED MANUSCRIPT

criterion in step 7 of the heuristic). This allows finding a robust solution for any given value of r after the Efficient Frontier heuristic is terminated.12 This enables the retailer to decide how risk averse s/he wants to be based on the risk-return tradeoff displayed by the problem at hand. Hence, the heuristic falls in the class of a posteriori methods to solving a multi-objective optimization problem (Hwang & Masud, 1979).

CR IP T

5. Empirical Application: Store-Level Category Sales Maximization

The empirical application considers the product category of paper towel rolls. The objective is to maximize category profit per store by optimizing the store-level assortment composition. Information Resources, Inc. (France) has provided weekly store-level scanner data for 21 SKUs from 54 stores from a large national French retail chain for the 156-week period

AN US

between September 2002 and September 2005. The dimensionality of the problem is not excessively large, which allows us to contrast the approach to full enumeration. In the absence of margin data, we assume an equal profit margin (of 1) per unit of volume (i.e., paper towel roll). 5.1. SKU Sales Model

M

We need an estimate for an SKU’s mean and variance in sales, as well as the covariance in sales with other SKUs. To represent typical, average conditions, these estimates need to be

ED

corrected for marketing activities such as in-store promotions. To achieve this, we use an SKU sales response model, regressing SKU sales on a set of SKU dummies, a relative price index variable (relative to competition to capture own and competitive effects parsimoniously),

PT

dummies representing feature and display activity, and a relative shelf space index variable (again relative to competition to allow for cross effects). We use a linear model to ensure that the

CE

model predictions can be directly interpreted as sales levels (rather than, e.g., log sales levels). To allow for heterogeneity in the marketing-mix effects, we include brand-specific parameters

AC

for the marketing-mix variables. Finally, all parameters are allowed to be store specific. The resulting sales model is given by the following expression:

12

Note that, when the underlined criterion is used, termination depends on the level of risk aversion (r) as it affects robust profit. Because higher values of r (higher risk aversion) imply lower robust profits it will be harder for the termination criterion (nominal profit of most recent solution < best robust profit found so far) to be met. Consequently, the heuristic will start to terminate later for higher values of r. More specifically, suppose we have two levels of risk aversion r1 and r2, such that r1< r2, then the set of efficient assortments generated under r2 will contain all those generated under r1, and potentially more. As the generated set of efficient assortments for a given value for r is always contained in the set generated under a larger value for r, it makes most sense to use a large value for r in the termination criterion and (also) evaluate the resulting robust profit for lower values of r rather than the other way around.

20

ACCEPTED MANUSCRIPT

(Pricekti  Pricek i )   2b ( k )i  Feature onlykti   3b ( k )i  Display onlykti   1  K   Pricek 'ti  zk 'ti  N ti  k ' 1  (22) (Shelf spacekti  Shelf spacek i )   4b ( k )  Feature & Displaykti   5b ( k )i    kti ,  1  K   Shelf spacek 'ti  zk 'ti  N ti  k ' 1 

S kti   0 ki  1b ( k )i 





CR IP T

where k is an index for SKU, k  {1,..,K}, K denotes the number of SKUs; t is a week index, t  {1,...,T}, T denotes the number of weeks, and i is a store index, i  {1,...,I}, and I denotes the number of stores. The other symbols are defined in Table 4.

Table 4: Definitions of the symbols in the sales response model (Eq. 22) volume sales of SKU k in week t at store i in number or rolls (x 100)

 0 ki

intercept for SKU k in store i

b(k )

brand index of the brand SKU k belongs to, b(k)  1,…,B (= the number of brands)

 gb( k )i

coefficient belonging b(k) in store i

Pricekti

unit price per paper towel roll for SKU k in week t at store i

Priceki

average unit price per paper towel roll for SKU k at store i (price is mean-centered to remove between-store differences)

z k 'ti

1 when SKU k’ is present in week t at store i, 0 otherwise

N ti

the number of SKUs present in week t in store i

Feature onlykti

1 when SKU k was on feature in week t at store i while not being on display, 0 otherwise

Display onlykti

1 when SKU k was on display in week t at store i while not being on feature, 0 otherwise

Feature&Displaykti

1 when SKU k was both on feature and display in week t at store i, 0 otherwise

Shelf spacekti

amount of shelf space in meters for SKU k in week t at store i average amount of shelf space in meters for SKU k at store i (shelf space is meancentered to remove between-store differences).

AN US

S kti

marketing

mix

instrument

g

(1,...,5)

for

brand

PT

CE

Shelf spaceki

ED

M

to

We assume the store-specific parameters to be independently, identically distributed

AC

according to a normal population distribution:

 i   01i   0 Ki 11i  1Bi   51i   5 Bi T ~ N ( , V ).

(23)

Further, we assume store-specific contemporaneous correlations between the error terms: εti  1ti  Kti  iid ~ N(0, i ), i a full error variance-covariance matrix for store i.

(24)

Of course, we could use a different specification for the sales response model. The point is that any model will entail uncertainty in the sales predictions and hence in the profit predictions. Online Appendix B describes the Bayesian estimation of the model. Visual 21

ACCEPTED MANUSCRIPT

inspection and formal tests confirm the convergence of the Gibbs chain. Online Appendix C shows that the median hyper parameters have the expected signs and the posterior intervals mostly exclude zero. 5.2. Nominal versus Robust Assortment Optimization: Full Enumeration Constructing the Optimization Input

CR IP T

Next, we optimize the assortments for each store separately. We compute the expected profit parameters and the covariance of the profit parameters based on the posterior parameter distribution. We optimize for a regular situation with regular price and shelf space, and no marketing support such as feature or display. Consequently, the only relevant part of the model is formed by the store-specific SKU intercepts and error covariance matrix. For a given posterior

following posterior distribution:



AN US

draw j=1,…,J, the SKU sales vector in a given week t in store i can be sampled from the

( j) Sti( j )  S1(tij )  S Kti







T

~ N ( 0(ij ) , i( j ) ),

(25)





( j) ( j) ( j) where  0i   01i   0 Ki denotes posterior draw j of vector  0i   01i   0 Ki , and  i( j ) T

T

M

posterior draw j of the error covariance matrix i . For every posterior draw we sample T weeks of posterior data, allowing us to compute the expected sales vector and corresponding covariance

ED

matrix of the sales vector. Next we take the average across all posterior draws of the expected sales vector and covariance matrix, and use these as inputs for the nominal and robust

Full Enumeration

PT

optimization.

CE

To assess the full potential of the robust approach, we first solve the nominal and robust store-level Retail Assortment Optimization problem using full enumeration in Matlab 2015a. An

AC

average store has space for 15 SKUs, in which case full enumeration entails

   21 k  15

= 2,069,255

k 1

possible assortments. We perform the robust store-level optimizations for three different values for r: 1.28 (i.e., 90% certainty), 1.64 (i.e., 95% certainty), and 2.33 (i.e., 99% certainty). The average computation time per store was respectively 28.1, 28.0, and 28.0 seconds on a laptop PC equipped with a quad-core Intel i7 2.30 GHz processor and 16 GB memory capacity. Consequently, more than 25 minutes were required to solve the robust optimizations across all

22

ACCEPTED MANUSCRIPT

54 stores for a given value of r. While this amount of time is not excessive, the size of the problem grows very fast when the assortment dimensions increase. For instance, if we double the size of the assortment (15 to 30) and the number of available SKUs (42 vs. 21), the number of potential assortments increases by a factor 2.1 million. We now compare the optimized assortments based on the nominal and robust profit.

CR IP T

When moving from the nominal optimal assortment to the robust optimal assortment, we naturally experience a decrease in nominal profit and increase in robust profit. But how large are these changes? Table 5 illustrates the comparison for one of the stores.

Table 5: Robust vs. Nominal Optimal Assortment for One Store (r = 1.64, 95% certainty). Robust profit 11,927 (=c)

AN US

Optimal nominal assortment

Nominal profit 16,861 (=a) 16,829 (=b)

Optimal robust assortment

12,446 (=d)

 The relative change in nominal profit =  .19% ( 100%  ba-a )  The relative change in robust profit =

4.35% ( 100%  dc-c )

Table 5 shows that, when comparing the robust optimal to the nominal optimal

M

assortments, the relative decrease in nominal profit is quite small (.19%). At the same time the relative increase in robust profit is quite substantial (4.35%). Hence, a large gain (risk reduction)

ED

only requires a small sacrifice (decrease in expected return). Figure 4 summarizes the comparison made in Table 5 across all stores for different values of r. For each store, Figure 4 contrasts the relative difference in robust profit between the robust

PT

and nominal assortment to the relative difference in nominal profit. The stores in each plot are sorted in ascending order of relative gain in robust profit. In Figure 4a (r = 1.28), 29 out of 54

CE

stores have the same nominal and robust optimal assortments, which is represented by the absence of bars at the left-hand side of the figure. For a considerable number of the other 25

AC

stores the optimal robust assortment results in a large increase in robust profit (large white bar) yet a small decrease in nominal profit (small black bar). This holds in particular for the stores at the right-hand side of the plots. In these stores, the robust assortments trade off a small decrease in expected profit for a large reduction in uncertainty. However, it is interesting to note that the opposite situation also occurs. For some stores a small increase in robust profit is accompanied by a large decrease in nominal profit. Thus, for these stores the reduction in uncertainty of the outcome comes at a large cost, i.e., a significant decrease in expected profit.

23

ACCEPTED MANUSCRIPT

Figure 4: Relative Profit Comparison of Robust vs. Nominal Optimal Store Assortments

Robust Nominal

10

White bars:

100%(ROB(xROB) - ROB(xNOM))/ROB(xNOM)

Black bars:

100%(NOM(xROB) - NOM(xNOM))/NOM(xNOM)

Change in Profit (in %)

8

6

Avg. change in robust profit

=

.72%

4

Avg. change in nominal profit =

.42%

0

-2

-4

-6 0

5

10

15

20

25

30

35

40

45

50

Store

10

AN US

(a) r = 1.28

Robust Nominal

Example of Table 5

8

6

Change in Profit (in %)

CR IP T

2

Avg. change in robust profit

4

=

1.23%

Avg. change in nominal profit = .81%

2

M

0

-2

-4

-6 5

10

15

20

25

30

35

40

ED

0

45

50

Store

10

PT

(b) r = 1.64

Robust Nominal

4

2

0

Avg. change in robust profit

-2

-4

-6

0

5

10

15

20

=

2.86%

Avg. change in nominal profit = 1.40%

AC

Change in Profit (in %)

6

CE

8

25

30

35

40

45

50

Store

(c) r = 2.33

24

ACCEPTED MANUSCRIPT

When we move from Figure 4a to 4b and 4c, r increases and hence, the uncertainty set increases. Consequently, for an increasing number of stores (25, 30, and 40 out of 54) the nominal and robust assortments deviate. As expected, the increases in robust profit become larger when the value of r increases (r = 1.28, avg. change = .72%; r = 1.64, avg. change = 1.23%; r = 2.33, avg. change = 2.86%). Interestingly, the corresponding decrease in nominal

CR IP T

profit does not grow as progressively (r = 1.28, avg. change = .42%; r = 1.64, avg. change = .81%; r = 2.33, avg. change = 1.40%). 5.3. Application of the Efficient Frontier Heuristic

The results in the preceding section were obtained with full enumeration. We now compare them to the Efficient Frontier heuristic, which we implemented in the optimization

AN US

package AIMMS 3.13. The software code is available upon request. All optimization problems in the heuristic were solved with the state-of-the-art solver CPLEX 15.0. Sequentially, in an automated fashion, for each store we obtained the Efficient Frontier heuristic for different values of r (i.e., 1.28, 1.64, 2.33). To investigate the role of the step size, the heuristic was run for different step size values (i.e., s = 1%, 2.5%, and 5%).

M

Table 6: Performance of the Efficient Frontier heuristic. Step size (s) 2.5 %

5.0 %

54 54 54

54 53 53

52 52 49

100.00 (100.00) 100.00 (100.00) 100.00 (100.00) 100.00 ( 99.95) 100.00 (100.00) 100.00 ( 99.98)

AC

CE

PT

ED

Number of stores with optimal assortment (of 54) r = 1.28 r = 1.64 r = 2.33 Average percentage of the optimal robust profit r = 1.28 r = 1.64 r = 2.33 Average number of steps in the heuristic r = 1.28 r = 1.64 r = 2.33 Average running time (in seconds) r = 1.28 r = 1.64 r = 2.33 Average reduction in running time compared to full enumeration (in %) r = 1.28 r = 1.64 r = 2.33

1.0 %

99.99 (99.70) 99.99 (99.53) 99.98 (99.48)

22.04 (41.00) 26.98 (47.00) 34.80 (59.00)

14.17 (24.00) 17.09 (26.00) 21.13 (30.00)

9.26 (15.00) 10.89 (16.00) 13.04 (18.00)

.88 (1.29) 1.02 (1.49) 1.28 (1.88)

.56 ( .95) .63 ( .97) .77 (1.06)

.35 (.51) .42 (.61) .47 (.68)

96.80 (95.15) 96.31 (94.50) 95.33 (92.90)

97.96 (96.63) 97.70 (96.20) 97.21 (95.24)

98.71 (97.76) 98.47 (97.21) 98.26 (96.89)

Notes: Between brackets the worst-case numbers, i.e., respectively the minimum fraction of optimal robust profit, the maximum number of steps, the maximum running time, and the minimum reduction in running time. \

25

ACCEPTED MANUSCRIPT

Table 6 shows that the heuristic performs very well compared to full enumeration. For almost all stores (between 49 and 54 out of 54) the heuristic finds the optimal robust assortment, even for the largest step size. In fact, for the smallest step size of 1% the heuristic finds the optimal robust assortment for every store, irrespective of the value of r. Moreover, even in the worst-case scenario (which happens for r = 2.33 with step size 5%), the resulting assortment is

CR IP T

very near to the optimum (99.48% of the optimum). Another observation is that the running time for the heuristic is very short, in the worst-case only 1.88 seconds. Obviously, a smaller step size results in a more detailed heuristic and more stores for which the heuristic finds the optimal robust assortment. However, this also results in an increase in average running time from .49 seconds (step size s = 5%) to1.33 seconds (step size s = 1%) for a value of 2.33 for r. Finally, we

AN US

note that the reduction in average running time compared to full enumeration is 92.9% or better across all cases, which is very substantial. This indicates the potential of the Efficient Frontier heuristic for larger problem instances when full enumeration is not feasible. 5.4. Upper Bound on the Optimal Robust Profit Using the Efficient Frontier Heuristic In the empirical application, we could use complete enumeration to assess the best robust

M

assortments found with the Efficient Frontier heuristic. However, typically we would apply the heuristic in cases where the optimal solution is not known or hard to find. In those cases, we

ED

would also like to have some idea of the quality of the heuristic solution. To this end, we present an upper bound on the actual optimal robust profit, which we can compare with the best

PT

assortment found by the heuristic. As Online Appendix D shows, we can derive the following

CE

upper bound (UB) on the optimal robust profit after running the Efficient Frontier Heuristic:  ROB  max   NOM ( x f )  r q f  , f 0,..,F  1   

(26)

UB

AC

where xf indicates the assortment found in step f (=0,..,F) of the heuristic, and q f the next variance upper bound used in (RAOLimited) after finding xf. The assortment in step 0, x0, corresponds to the optimal nominal assortment that is efficient with corresponding variance, q0, max equal to q NOM .

Online Appendix shows that, ceteris paribus, the upper bound given by (26) decreases when step size s decreases, thus leading to a tighter upper bound. So, not only does decreasing the step size in the heuristic lead to an assortment with a higher robust profit (see Table 6), it is

26

ACCEPTED MANUSCRIPT

also expected to tighten the upper bound. Figure 5 summarizes the average and worst-case relative gap between the best robust profit found by the heuristic and the upper bound. The results are displayed for different values of r and step size s. Figure 5: Relative Gap between the Heuristic and the Upper Bound across Stores. r = 1.28

3.0%

r = 2.33

r = 1.64

5

2.6%

CR IP T

2.8%

Maximum Average

2.4%

2.4

2.2%

2.2

2.0%

2.5

2.0

1.8%

1.8

1.6%

1.6

1.4%

1.4

1.2%

1.2

1.0%

1

1.0

0.8%

0.8

0.6%

0.6

0.4%

0.4

0.2% 0.0%

AN US

Relative gap between heuristic and upper bound

3.2%

0.2

1

2.5

5

1

2.5 Step size

5

0

1

2.5

5

M

Note: The relative gap is defined as 100%(UB – Best robust profit found by the heuristic)/UB.

We observe that the upper bounds are quite tight: the average gap across different values

ED

of r and step size s does not exceed 1.17%; the maximum (i.e., worst-case) gap is 2.97% at most across different values of r and step size s. For a given value of r both the average and maximum

PT

gap increase when the step size s increases, as expected. In addition, we see that the relative gaps increase when r becomes larger. This is consistent with the upper bound in Equation (26), which

CE

shows that the robust profit of the assortment xf found in step f of the heuristic can possibly be

AC

improved by at most r  ( q f  q f ) . Thus, this possible improvement increases in r.

6. Conclusions and Directions for Future Research In the era of accountability of business activities, it is of key importance that managers do

not only consider expected profit outcomes but also risk (Srinivasan & Hanssens, 2009). The business literature has started to recognize the importance of managing risk and return in domains such as advertising and R&D (McAlister, Srinivasan, & Kim, 2007), customer satisfaction (Tuli & Bharadwaj, 2009), and customer portfolio management (Tarasi et al. 2011).

27

ACCEPTED MANUSCRIPT

This paper is the first to empirically study the risk-return balance for the Retail Assortment Optimization problem, a central yet challenging knapsack problem for retailers. While finding the set of SKUs that maximize profit is a hard-to-solve problem (e.g., Rooderkerk et al., 2013), it is further compounded by the fact that its solutions are not necessary robust to the uncertainty in the profit contributions. This means that possible deviations from the expected profit

CR IP T

contributions can render the optimal nominal assortment sub-optimal in terms of the actual profit realization. Therefore, this paper proposes to solve the robust 0-1 knapsack problem, i.c., the Robust Retail Assortment Optimization problem, where return is balanced with a penalty term for risk (i.e., uncertainty in profit).

To deal with the real-life dimensionality of this problem, it is key that we can solve the

AN US

robust knapsack problem in a timely manner. In a typical supermarket, a retailer may have space to stock a few hundred SKUs in a category, but has to choose among thousands of potential SKUs available in the market. The corresponding selection problem will have many potential assortments, for which full enumeration is computationally unpractical. Therefore, we have developed a heuristic to construct (near-) optimal solutions to the robust retail assortment

M

optimization problem. The heuristic constructs a subset of the Efficient Frontier assortments. The decision maker can decide how comprehensive the subset needs to be by tuning the grid

ED

construction process. When the grid construction is more detailed, more assortments that are part of the Efficient Frontier will be constructed; the resulting set will approximate the Efficient Frontier closer and therefore increase the odds of finding the real optimal robust assortment

PT

(which is part of the Efficient Frontier).

In the empirical application, the heuristic proved to be much faster (i.e., a 92.9% or

CE

stronger reduction in running time) than complete enumeration. The heuristic leads to solutions that are always optimal for the smallest step size (1%) and very close to the optima for the largest

AC

step size of 5% (.52% difference or less). We also provide an upper bound for the gap between the heuristic and the true optimum, which is essential in case the problem dimensionality prohibits full enumeration. A key feature of the Efficient Frontier heuristic is that it does not just result in a single

solution but in an accurate subset of the assortments that make up the Efficient Frontier of the robust knapsack problem. This approximation visualizes the trade-off between risk and return, allowing for a posteriori decision making (Hwang & Masud, 1979). That is, the retailer can

28

ACCEPTED MANUSCRIPT

decide how risk-averse s/he wants to be based on the amount of return that needs to be sacrificed to reduce risk to acceptable levels, or conversely if the additional return of an assortment warrants the higher risk (if the risk-premium is high enough). As an alternative to the heuristic we could have also opted for population-based metaheuristics (see Talbi, 2009 for a comprehensive overview of this class of heuristics). Most of the

CR IP T

heuristics in this class are nature or biology-based, allowing for some form of coordination among solutions. For an overview of recent advances in this domain, notably the emergence of a new technique called the Cohort Intelligence (CI) technique, we refer to Kulkarni and Shabir (2014). The CI technique allows candidate solutions to adapt/improve themselves in the direction of other, better and more feasible, candidate solutions. These types of heuristics should be able to

AN US

deal with the multi- and non-linear objective optimization nature of the problem. However, an important goal in designing our heuristic was to allow for a posteriori decision making with respect to the desirable level of risk aversion by the retailer. Consequently, we require a heuristic that quickly approximates the Efficient Frontier of assortments. It is not clear how this can be done using the CI heuristic. In CI, there is coordination among the candidate solutions to ensure

M

that candidate solutions are stochastically adapted in the direction of the best solutions (best objective solution, closest to a binding constraint). However, there is no straightforward way to

ED

ensure that the solutions in each step of a nature- and biology-inspired meta-heuristic, such as CI, are Pareto-optimal (Jaskiewicz, 2004). In addition, our method has the benefit of providing an upper bound on the optimal robust profit for a given level of risk aversion, something that is not

PT

standard in meta-heuristics.

Our results show that solutions to the Robust Retail Assortment Optimization problem

CE

lead to assortments that are more robust to the uncertainty in the profit contributions. In the empirical application for a consumer packaged goods category we find that for many retail

AC

stores, optimal robust assortments trade off a small decrease in return (expected profit) for a large reduction in risk (uncertainty). At the same time, we find that the robust approach also leads to large sacrifices for some of the stores. For these stores, a marginal increase in robust profit requires a large decrease in expected profit. This potential disadvantage of robust optimization, i.e., sometimes being conservative at a large cost, is not stressed in the robust optimization literature. Future research should investigate if we can distinguish conditions or characteristics of stores that will likely lead to a large contribution of the robust approach.

29

ACCEPTED MANUSCRIPT

In sum, when constructing optimal assortments, retailers should not only focus on the contribution of each SKU in terms of expected profit but also in terms of its contribution to the overall risk of the assortment. The latter not only results from the variability in the item’s own sales but also from the way its sales covary with those of the remaining items. Items whose sales substantially and positively covary with those of other items will greatly enhance the total risk of

CR IP T

the assortment’s profit. Hence, oftentimes a retailer may be better off, in terms of the risk-return trade-off, by sacrificing an item for one with a lower expected profit but that (more) negatively covaries with most of the assortment. For example, it may be wise for a fashion retailer to substitute one SKU with a color that is expected to do very well for one that is expected to do slightly less but whose sales may move in opposite directions (a color that will do well when the

AN US

others will not). That is, it is advised not to put all eggs in the same (colored) basket.

Our results provide a number of takeaways for retailers and (local) shop managers. First, risk aversion is important in retail operations but it has its price. However, this sacrifice can differ substantially across stores. For some it may be minimal, for others quite substantial. Consequently, in line with the increasing trend of store-level customisation of assortments

M

(Rigby & Vishwanath, 2006), risk-aversion should also be dealt with at the individual store level. Instead of choosing how risk averse to be in general it would be better for a retailer to investigate

ED

how much different levels of risk aversion “cost” per individual store. Just like insurances, different levels of risk aversion (assurance) are associated with different costs (risk premiums in the shape of sacrificing some expected profit). Ultimately, the cost structure of these insurances

PT

will determine how risk averse the retailer wants to be at an individual store. Future research should also verify how the heuristic performs on assortment optimization

CE

problems involving categories with larger number of products. The estimation of the optimization input will be quite a challenge in these large settings. We could alleviate the need

AC

for SKU-specific parameters by adopting a more parsimonious attribute-based modelling approach (Rooderkerk, Van Heerde and Bijmolt 2013). Finally, it would also be interesting to apply the heuristic to the robust counterparts of

other examples of knapsack problems. One example is the challenge of allocating an advertising budget across different media to maximize the predicted sales response. Since the sales response of each medium is stochastic, robust optimization is called for. Similarly, channel decisions also

30

ACCEPTED MANUSCRIPT

involve returns with stochastic and correlated profitability outcomes. We hope that this study

AC

CE

PT

ED

M

AN US

CR IP T

will stimulate more research on the robust optimization of knapsack problems.

31

ACCEPTED MANUSCRIPT

References Bell, D. E. (1982). Regret in decision-making under uncertainty. Operations Research, 30(5), 961-981. Bell, D. E. (1985). Disappointment in decision-making under uncertainty. Operations Research, 33(1), 1-27.

CR IP T

Ben-Tal, A., & Nemirovski, A. (1998). Robust convex optimization. Mathematics of Operations Research, 23(4), 769-805.

Ben-Tal, A., & Nemirovski, A. (1999). Robust solutions of uncertain linear programs. Operations Research Letters, 25(1), 1-13.

Cachon, G. P., Randall, T., & Schmidt, G. M. (2007). In search of the bullwhip effect.

AN US

Manufacturing & Service Operations Management, 9(4), 457-479.

Cardozo, R. N., & Smith, D. K. (1983). Applying financial portfolio theory to product portfolio decisions - an empirical-study. Journal of Marketing, 47(2), 110-119. Chong, J.-K., Ho T.-H, & Tang, C.S. (2001). A modeling framework for category assortment planning. Manufacturing and Service Operation Management, 3(3), 191-210.

M

Devinney, T. M., & Stewart, D. W. (1988). Rethinking the product portfolio: A generalized investment model. Management Science, 34(9), 1080-1095.

ED

Florios, K., G. Mavrotas, & D. Diakoulaki (2013). Solving multiobjective, multiconstraint knapsack problems using mathematical programming and evolutionary algorithms. European Journal of Operational Research, 203, 14-21.

PT

Fischer, M., Shin, H., & Hanssens, D. M. (2013). Brand performance volatility from marketing spending. Working paper, University of Cologne, 1-36.

CE

Hwang, C.-L. & Masud, A. S. M. (1979). Multiple objective decision making – Methods and applications: A state-of-the-art survey, Lecture Notes in Economics and Mathematical

AC

Systems, 164 (1), Springer-Verlag Berlin Heidelberg. Jaszkiewicz, A. (2004). On the computational efficiency of multiple objective metaheuristics. The knapsack problem case study. European Journal of Operational Research, 158, 418-433.

Kellerer, H., Pferschy, U., & Pisinger, D. (2004). Knapsack problems, Berlin; Springer. Kok, A. G., & Fisher, M. L. (2007). Demand estimation and assortment optimization under substitution: Methodology and application. Operations Research, 55(6), 1001-1021. Kouvelis, P., Chambers, C., & Wang, H. (2006). Supply chain management research and

32

ACCEPTED MANUSCRIPT

Production and Operations Management: Review, Trends, and Opportunities. Production and Operations Management, 15(3), 449-469. Kulkarni, A. J. & Shabir, H. (2014). Solving 0-1 knapsack problem using cohort intelligence algorithm. International Journal of Machine Learning and Cybernetics. Accessible at http://link.springer.com/article/10.1007/s13042-014-0272-y#.

The bullwhip effect. Management Science, 43(4), 546-558.

CR IP T

Lee, H. L., Padmanabhan, V., & Whang, S. J. (1997). Information distortion in a supply chain:

Markowitz, H. (1952). Portfolio selection. The Journal of Finance, 7(1), 77-91.

Markowitz, H. (2014). Mean-variance approximations to expected utility. European Journal of Operational Research, 234, 346-355.

Chichester, John Wiley & Sons Ltd.

AN US

Martello, S., & Toth, P. (1990). Knapsack problems: Algorithms and computer implementations,

McAlister, L., Srinivasan, R., & Kim, M. (2007). Advertising, research and development, and systematic risk of the firm. Journal of Marketing, 71(1), 35-48.

Papadimitriou, C. H., & Steiglitz, K. (1982). Combinatorial optimization: Algorithms and

M

complexity, Englewood Cliffs, N.J.: Prentice-Hall Inc.

Rabin, M., & Thaler, R. H. (2001). Anomalies - risk aversion. Journal of Economic Perspectives,

ED

15(1), 219-232.

Rao, R. K. S., & Bharadwaj, N. (2008). Marketing initiatives, expected cash flows, and shareholders’ wealth. Journal of Marketing, 72(1), 16-26.

PT

Rasmussen, L. M. (1986). Zero-one programming with multiple criteria. European Journal of Operational Research, 26, 83-95.

CE

Rigby, D. K., & Vishwanath, V. (2006). Localization: The revolution in consumer markets. Harvard Business Review, 84(4), 82-92.

AC

Rooderkerk, R. P., van Heerde, H. J., & Bijmolt, T. H. A. (2013). Optimizing retail assortments. Marketing Science, 32(5), 699-715.

Rusmevichientong, P., & Topaloglu, H. (2012). Robust assortment optimization in revenue management under the multinomial logit choice model. Operations Research, 60 (4), 865-882.

Srinivasan, S., & Hanssens, D. M. (2009). Marketing and firm value: Metrics, methods, findings, and future directions. Journal of Marketing Research, 46(3), 293-312. Talbi, E.-G. (2009). Metaheuristics: From design to implementation, Wiley.

33

ACCEPTED MANUSCRIPT

Tarasi, C. O., Bolton, R. N., Hutt, M. D., & Walker, B. A. (2011). Balancing risk and return in a customer portfolio. Journal of Marketing, 75(3), 1-17. Tuli, K. R., & Bharadwaj, S. G. (2009). Customer satisfaction and stock returns risk. Journal of Marketing, 73(6), 184-197. Zopounidis, C., Doumpos, M., & Fabozzi, F. J. (2014). Editorial – Preface to the special issue:

CR IP T

60 years following Harry Markowitz’s contributions in portfolio theory and operations research. European Journal of Operational Research, 234, 343-345.

Walmart (2014). Annual Report. Accessed on April 4, 2015 at http://stock.walmart.com/annualreports.

Zhang, C. W., & H. L. Ong (2004). Solving the biobjective zero-one knapsack problem by an

AC

CE

PT

ED

M

AN US

efficient LP-based heuristic. European Journal of Operational Research, 159, 545-557.

34

ACCEPTED MANUSCRIPT

Online Appendices Appendix A: Derivation of slack contribution m In this Appendix we show how the value of parameter m in the objective function of optimization problem (RAOLimited) should be set to ensure that first expected profit is maximized, subject to constraints (11) – (21), and second (in case of multiple candidate assortments that

CR IP T

maximize expected profit) that the amount of corresponding profit variance is minimized (i.e., the profit-maximizing candidate with minimum variance will be picked). From now on we refer to this as the sequential optimization principle (first expected profit maximization than variance minimization). We start with the observation that when the sequential optimization principle holds for any pair of feasible assortments (i.e., first, of the two feasible assortments the one with

AN US

maximum expected profit is preferred and, only, in case of equal profit variance, the feasible assortment with minimum profit variance is preferred) by mere transitivity the sequential optimization principle will be ensured across all feasible assortments in optimization problem (RAOLimited).

Suppose we have a pair of feasible assortments i and j for a given upper bound on the

M

variance q, which are subscribed by their corresponding solution vectors xi and xj, with associated expected profit NOM(xi) and NOM(xj), abbreviated as i and j, and corresponding profit

ED

variance slack for the given value of q of hi and hj. Using this notation Table A.1 lists all possible combinations of feasible assortment pairs that could occur in terms of their expected profit and profit variance.

CE

Values of the expected profit

 =

j

AC

i

 >

j

 <

j

i

i

PT

Table A.1: Possible scenarios for every pair (i, j) of feasible assortments under (RAOLimited) Values of the profit variance slack variable hi = hj

hi > hj i

hi < hj

preferred assortment

indifferent

x

xj

required value of m

m+

m+

m+

preferred assortment

xi

xi

xi

required value of m

m+

m+

0 < m < (i - j)/(hj -hi)

preferred assortment

xj

xj

xj

required value of m

m+

0 < m < (j - i)/(hi –hj)

m+

Note: The top entry for every possible scenario indicates which assortment is preferred under the sequential optimization principle. The corresponding bottom entry indicates what values of m in combination with the objective function in (RAOLimited) will ensure that the preferred assortment is selected as optimal one.

W1

ACCEPTED MANUSCRIPT

The top of every cell (every scenario) lists what assortment should be preferred when following the sequential optimization principle, the bottom part of the same cell will list the value of parameter m that ensures the result in the top part of the cell when applying the objective function of (RAOLimited). Note, that since we assume risk aversion, profit slack, h, benefits the objective function (c.p. lower variance is better) and hence m+. Consequently, the

holds.

CR IP T

bottom of every cell will indicate the subset of + for which the result in the top part of the cell

Note that we can summarize the nine scenarios in Table A.1 by the following Equation: when  i   j  hi  h j  sign(  i   j )  sign( hi  h j ) otherwise

,

(A.1)

AN US

 ( i   j )   0,  (h j  hi )  m      

where sign(x) is 1 if x>0, -1 if x<0, 0 otherwise. The conditions for which m is restricted to a subdomain of + (top row of Equation (A.1)) boil down to the scenarios in which assortment i has a higher (lower) expected profit than assortment j but a lower (higher) profit variance. In

M

other words when there is a tradeoff between expected profit and variance. To allow for all possible scenarios across all possible pairs of feasible assortments the

ED

value of m needs to lie in the intersection of all regions defined by (A.1) for an individual pair of feasible assortments. That is m should satisfy the following constraint: ( i   j ) , ( i , j ) ( h  h ) j i

PT

0  m  min

(A.2)

where  is the set of assortment pairs (k, k’) where both assortments k and k’ are feasible

CE

solutions to (RAOLimited) with k ≠ k’, hk ≠ hk’, and sign(k - k’) ≠ sign(hk - hk’,). In other words where  is the set of feasible assortment pairs (k, k’) for which there is a tradeoff between

AC

expected profit and profit variance. Next, we note that the following inequality holds ( i   j ) min ( i , j ) ( h  h ) j i



min ( i   j )

( i , j )

max (h j  hi ) ( i , j )    



min ( i   j )

( i , j )

(A.3)

q

hi ,h j [0, q )

Following the main text we define   min (i   j ) as the minimum profit difference ( i , j )

between any pair of feasible assortments that display a tradeoff between expected profit and

W2

ACCEPTED MANUSCRIPT

profit variance. Combining Equations (A.2) and (A.3), the value m should lie in the following region to ensure that the sequential optimization principle holds: m





(A.4)

q

The value of q is known, whereas the value of  is not. We would have to carry out a

CR IP T

complicated optimization problem to find it. However, we can set  to a very small value that is likely to be lower than the minimum difference in expected profit between any pair of feasible assortments that displays an expected profit-profit variance trade-off. In the study we have set  equal to .0001, which is very small compared to the average expected profit across items and stores, which is 830.

AN US

We have conducted two checks to verify that this number is small enough. First, through sensitivity analysis we have lowered the number even further and noticed that the optimal solutions found did not change. Second, but this is something that normally cannot be done, we have looked at the results of full enumeration to see that the true value of delta well exceeds the chosen value for all stores. This was indeed the case.

M

An alternative approach would be to truly carry out a sequential (two-stage) optimization. In the first step the expected profit would be maximized subject to constraints (11) – (21). In

ED

other words, the term mh would be dropped from the objective function. In the second stage an optimization would be run in which the profit variance would be minimized subject to the

AC

CE

PT

expected profit exceeding that of the optimal solution found in the first stage.

W3

ACCEPTED MANUSCRIPT

Appendix B: Estimation Algorithm We estimate the sales model with Hierarchical Bayes using the Gibbs sampler (Boatwright et al. 1999; Rossi and Allenby 1993). The Gibbs sampler for the Hierarchical Bayesian model is based on the Gibbs samplers proposed by Rossi and Allenby (1993) and Boatwright et al. (1999). First, we summarize the model and the notation. S kti  Z kti  i   kti , εti  1ti  Kti  iid ~ N(0, i )

CR IP T

Model:

 i   01i   0 Ki 11i  1Bi   51i   5 Bi T ~ N ( , V ). Priors:

 i ~ N ( ,V ) , with

(B.1) (B.2)

 i ~ Inverse Wishart (w,W ).

Gibbs Sampler: 1.

Generate  i :

3.

V,i  ( X it  i1 X it  V1 ) 1 , T

T

ED

t 1

bi  V,i ( X it i1Zit  V1 ). T

CE

PT

t 1

Generate  :

AC

2.

M

 i | X i , Z i ,  ,V ~ N m (bi ,V,i ) , where T

AN US

 ~ N C (  ,V ) , and V  diag ( 2 ,1 ,...,  2 ,m ) , where  2 , j ~ Inv Gamma( v / 2, s 2 / 2),

 

Generate  2 , j

j 1,..,m.



s 2 , j  , 2   2  

 2 , j | {ij },  j ~ InverseGamma v 

,

where v  v  I , I

s2 , j  s2   ( ij   j ) 2 i 1

, where:  ij = the jth element of column vector  i .

j

= the jth element of column vector  .

4. Generate  i i 1,..,I .

 | { βi },V ~ N m (b, V ) , where

 i | Z i ,  i ,  ~ Inverse Wishart ( w ,W  ) ,

V  ( I V1  V1 ) 1 ,

where

b  V (V1



H

 h 1

h

 V1   ).

w  w  T , T

W   W   ( S ti  Z ti  i )(S ti  Z ti  i ) T . t 1

W4

ACCEPTED MANUSCRIPT

All prior distributions are chosen to be uninformative. For all model parameters other than variances we assume independently, identically distributed normal distributions N(0, 100). Furthermore, we model V as a diagonal matrix with the prior of the jth diagonal element 1 {V } jj ~ InverseGamma(  / 2, s2 / 2) , where    3 and s2  100 . Finally, for every store-specific

CR IP T

error variance-covariance matrix i we assume an Inverse Wishart distribution with m + 2 1 degrees of freedom and covariance matrix equal to 1000 ·Im, where Im is an m by m identity matrix,

and m is the number of independent variables in the model.

We estimate the model with Hierarchical Bayes that we implemented in the software package Matlab. We run the Gibbs sampler for 50,000 draws and retained each 50th draw of the

AN US

last 25,000 draws. Visual inspection and formal tests confirmed convergence of the Gibbs chain.

AC

CE

PT

ED

M

The procedure results in 500 draws used for inference of the posterior distribution.

W5

ACCEPTED MANUSCRIPT

Appendix C: Posterior Parameter Estimates of the Sales Model* Parameter

2.5%

50%

97.5%

SKU 1 SKU 2

1 2

18.88 19.72

22.13 23.04

25.17 26.08

SKU 3 SKU 4 SKU 5 SKU 6 SKU 7 SKU 8 SKU 9

3 4 5 6 7 8 9

1.71 11.22 19.36 11.62 7.68 6.67 5.19

2.07 12.83 23.14 13.60 8.65 7.61 6.20

2.45 14.58 26.93 15.62 9.59 8.76 7.21

SKU 10

10

17.77

20.52

23.32

SKU 11 SKU 12 SKU 13 SKU 14 SKU 15 SKU 16 SKU 17

11 12 13 14 15 16 17

2.03 1.84 5.86 3.41 2.26 2.96 3.13

2.37 2.10 7.77 3.88 2.54 3.38 3.56

2.75 2.48 9.56 4.36 2.81 3.76 4.05

18 19 20 21

2.58 1.28 0.82 .43

3.20 1.47 1.08 .54

3.89 1.70 1.34 .66

11 12 13 14 15

51.45 .45 2.31 3.00 8.05

37.65 .42 1.47 1.79 4.26

24.24 1.27 .47 .40 .50

21 22 23 24 25

2.12 4.81 1.26 2.01 .58

2.64 6.32 4.25 2.73 .78

3.25 7.77 8.04 3.59 .97

31 32 33 34 35

9.39 11.38 9.00 1.70 2.10

11.69 13.89 13.39 3.47 3.01

14.29 16.38 18.23 5.52 3.97

41 42 43 44 45

13.62 22.39 8.71 8.25 4.42

16.82 27.10 18.19 10.52 5.27

20.16 31.54 24.24 12.53 6.13

51 52 53 54 55

2.79 .18 4.71 .09 .25

.99 .03 .44 .71 1.00

.83 .25 4.34 1.49 1.90

Private Label 2

Private Label 3

AN US

National Brand 1

National Brand 2 SKU 18 SKU 19 SKU 20 SKU 21

AC

CE

PT

ED

M

Price Private Label 1 Private Label 2 Private Label 3 National Brand 1 National Brand 2 Feature only Private Label 1 Private Label 2 Private Label 3 National Brand 1 National Brand 2 Display only Private Label 1 Private Label 2 Private Label 3 National Brand 1 National Brand 2 Feature & Display Private Label 1 Private Label 2 Private Label 3 National Brand 1 National Brand 2 Shelf space Private Label 1 Private Label 2 Private Label 3 National Brand 1 National Brand 2 *

CR IP T

Variable Private Label 1

In bold the 95% posterior intervals that exclude 0. Since the model is linear, the coefficients are not elasticities.

W6

ACCEPTED MANUSCRIPT

Appendix D: Upper Bound on the Optimal Robust Profit The efficient assortments not constructed by the heuristic could result in a higher robust profit than that of the best assortment found with the Efficient Frontier heuristic. These overlooked efficient assortments can only be located on the intervals between the variance q f corresponding to any assortment xf found in step f (=0,...,F) of the heuristic and the upper bound

CR IP T

q f constructed by the heuristic after finding xf. Suppose there would be such an efficient xq with variance q on the interval (q f , q f ) . The robust profit associated with this assortment is bounded by the expression in (D.1), which is illustrated in Figure D.1.

 ROB ( xq )   NOM ( xq )  r q   NOM ( x f )  r q , q  (q f , q f ).

(D.1)

 ROB(xf ) ~ ~

~ ~

Variance (q)

qf

PT

qf

ED

M

Robust Profit

AN US

Figure D.1: Upper Bound Robust Profit on the Interval q  (q f , q f ).

Figure D1 shows that the maximum value of the upper bound is obtained at the minimum

CE

variance on the interval (q f , q f ) . Consequently, the maximum value of the upper bound is limited

AC

from above by:

max  ROB ( xq )   NOM ( x f )  r q f .

q( q f , q f )

(D.2)

Finally, the upper bound on the optimal robust profit is equal to the maximum of the

interval-specific bounds.1 ROB  max ( max  ROB ( xq ))  max   NOM ( x f )  r q f  . f  0,..,F  1 q  ( q f , q f ) f  0,..,F  1  1

(D.3)

We do not consider the assortment constructed in the last step since its nominal profit is below the best robust profit found by the heuristic.

W7

ACCEPTED MANUSCRIPT

References Boatwright, P., McCulloch, R., & Rossi, P. (1999). Account-level modeling for trade promotion: An application of a constrained parameter hierarchical model. Journal of the American Statistical Association, 94(448), 1063-1073.

AC

CE

PT

ED

M

AN US

Journal of Marketing Research, 30(2), 171-182.

CR IP T

Rossi, P. E., & Allenby, G. M. (1993). A Bayesian-approach to estimating household parameters.

W8

Robust optimization of the 0–1 knapsack problem: Balancing risk and return in assortment optimization

Robust optimization of the 0–1 knapsack problem: Balancing risk and return in assortment optimization

Recommend Documents