Spatially heterogeneous effects of a public works program

Spatially heterogeneous effects of a public works program

Accepted Manuscript Spatially heterogeneous effects of a public works program Joshua D. Merfeld PII: S0304-3878(18)30478-4 DOI: https://doi.org/10...

4MB Sizes 0 Downloads 27 Views

Accepted Manuscript Spatially heterogeneous effects of a public works program Joshua D. Merfeld PII:

S0304-3878(18)30478-4

DOI:

https://doi.org/10.1016/j.jdeveco.2018.10.007

Reference:

DEVEC 2299

To appear in:

Journal of Development Economics

Received Date: 3 August 2017 Revised Date:

8 September 2018

Accepted Date: 25 October 2018

Please cite this article as: Merfeld, J.D., Spatially heterogeneous effects of a public works program, Journal of Development Economics (2018), doi: https://doi.org/10.1016/j.jdeveco.2018.10.007. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

M AN U

October 26, 2018

SC

Joshua D. Merfeld*

RI PT

Spatially Heterogeneous Effects of a Public Works Program

Abstract

EP

TE D

Most research on labor market effects of the Mahatma Gandhi National Rural Employment Guarantee Scheme focuses on outcomes at the district level. This paper shows that such a focus masks substantial spatial heterogeneity: treated villages located near untreated areas see smaller increases in casual wages than treated villages located farther from untreated areas. Spatial differences in implementation or program leakages do not appear to drive this spatial heterogeneity. The effects of the program on private-sector employment display similar intra-district heterogeneity and these effects on employment are highly correlated with the effect on wages. Overall, these results suggest that worker mobility leads a district-level focus to underestimate the true effect of the program on wages. Quantifying this underestimate using two separate methods produces very similar results; the overall effect on wages appears to be approximately twice as large as district-level estimates suggest.

AC C

Keywords: India, Public Works, Labor, Wages, Spillovers JEL Codes: D50, H53, I38, J38, J46

* Robert F. Wagner Graduate School of Public Service, New York University; [email protected]. Conversations with and comments from Leigh Anderson, Justin Downs, James Long, Mark Long, Karthik Muralidharan, Katya Roshchina, and Jagori Saha were extremely helpful. I also thank participants at various seminars and conferences for helpful comments and suggestions. I especially thank two anonymous referees for greatly improving this paper. Any remaining errors are my own.

1

ACCEPTED MANUSCRIPT Merfeld

1

Spatially Heterogeneous Effects of a Public Works Program

Introduction

India’s Mahatma Gandhi National Rural Employment Guarantee Scheme (NREGS)1 , which

RI PT

was launched in 2006 and is now rolled out throughout the country, is the largest public works program in the world. In fiscal year 2015/16, the program generated more than two billion persondays of employment and amounted to more than two percent of the fed-

SC

eral government budget.2 Given the program’s size, it is perhaps unsurprising that it has spawned a vast body of literature.

M AN U

The program was phased in over three rounds, starting in 2006. However, this rollout was not randomized; the poorest districts were generally the first to receive the program. As such, while the phased rollout makes differences-in-differences attractive, the systematic differences in district characteristics make the identification assumption of parallel trends somewhat tenuous (Zimmermann, 2012; Sukhtankar, 2016). In addition, it is not clear what

TE D

treatment effect estimates averaged over the entire country measure, as implementation of the program varies substantially from state to state and even from district to district (Liu and Barrett, 2012; Ravallion et al., 2013; Dutta et al., 2014; Banerjee et al., 2015; Imbert and Papp, 2015). In reality, these estimates are the net effect of the program, inclusive of

EP

implementation efficacy (Sukhtankar, 2016).

Despite these difficulties, a growing body of literature has analyzed the performance

AC C

and far-reaching effects of the program. These include human capital investments (Afridi et al., 2012; Li and Sekhri, 2013; Shah and Steinberg, 2015; Merfeld and Saha, 2018), nutrition and consumption/income (Liu and Deininger, 2010; Jha et al., 2011; Ravi and Engler, 2015), and corruption (Imbert and Papp, 2011; Niehaus and Sukhtankar, 2013; 1

The program was originally passed as the National Rural Employment Guarantee Act in 2005 and implemented as the scheme starting in 2006. The program was renamed in 2009, but I use the old acronym throughout this paper. 2 Figures are from http://nrega.nic.in/, the official program website.

2

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

Banerjee et al., 2015; Muralidharan et al., 2016). However, given the aims and scope of the program, much of the literature has focused on its labor market effects. While findings

RI PT

vary, there is broad agreement with one basic fact: the program led to a modest increase in prevailing casual wages in rural India (Azam, 2012; Berg et al., 2014; Imbert and Papp, 2015; Muralidharan et al., 2018; Sukhtankar, 2016).3

In addition to a common finding, much of the literature examining the effects of the

SC

program’s rollout on labor market outcomes shares another characteristic: it analyzes the effects of the program at the district level. This level of analysis is attractive for two rea-

M AN U

sons. First, the program was rolled out at the district level. Second, given low levels of rural-to-rural migration in India (Rosenzweig, 1988; Behrman, 1999; Anant et al., 2006; Munshi and Rosenzweig, 2009), districts are often considered to be relatively distinct labor markets. Empirical work in India has long focused on districts, especially for labor-market outcomes; for non-NREGS examples, see, for example, Rosenzweig (1978, 1984), Jay-

TE D

achandran (2006), Topalova (2010), Kaur (2018), and Shah and Steinberg (2017). In this paper, I show that a district-level focus obscures substantial heterogeneity, with important policy implications. I use the Additional Rural Incomes Survey/Rural Economic

EP

Demographic Survey (ARIS/REDS), collected by the National Council of Applied Economics Research, to show that wage changes due to the program are spatially heteroge-

AC C

neous. The first wave of the survey was collected prior to the implementation of NREGS and the second wave was collected between the second and third phases of the rollout. Importantly, unlike other datasets, I am able to identify the location of each village in the survey using GPS data.4 For each district, I construct two geospatial variables: whether phase one or phase two districts (“treated” districts) share a border with phase three dis3

In this context, “casual wages” refer to wages in daily markets for labor. The duration of this type of contract is generally only a day and the contract carries no expectation or promise of future work. 4 Throughout this paper, I refer to the location of a household and the location of a village interchangeably, though the GPS measurement is technically at the village level.

3

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

tricts (“untreated” districts), and vice versa; and how long that border is. For districts that border a district of the opposite treatment status (“border districts”), I construct a variable

RI PT

for surveyed villages equal to their distance to the nearest district of the opposite treatment status.

In line with previous NREGS studies, I first show that phase one and two districts had much lower wage rates prior to implementation of the program than phase three districts

SC

(Zimmermann, 2012; Imbert and Papp, 2015). I then estimate the effects of NREGS by slowly restricting estimation to villages closer and closer to the border. While there are

M AN U

large effects on casual wages when all households are included in the estimation, the estimated effect starts to attenuate around 30-40 kilometers from the border between a treated and untreated district and the estimated effect completely disappears when looking only at households located within 15 kilometers of the border. I show that this general pattern holds for three separate specifications, including simple specifications as well as specifi-

TE D

cations with a number of controls, including variables that determined the roll-out of the program. These patterns also hold when an alternative measure of labor market exposure – percentage of the area around an observation belonging to districts of the opposite treatment

EP

status – is used.

A preponderance of evidence suggests that treated areas closer to untreated areas have

AC C

lower estimated wage increases than treated areas farther from untreated areas. In theory, if “wage arbitrage” is responsible, then these results could be indicative of bias in previous estimates for two reasons. First, district-level estimates may be underestimating the wage increase in treated districts. Second, district-level estimates may be overestimating the counterfactual wage increase in untreated districts. Interestingly, I find evidence of only the first possibility. I attempt to quantify the bias in previous estimates in two separate ways. First, I estimate the effect for households located more than 20 kilometers from a

4

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

district of the opposite treatment status. Second, I estimate the effect of NREGS on wage increases on districts that do not border districts of the opposite treatment status at all.

RI PT

These two different methods produce almost identical estimates. Both suggest previous estimates of the wage increase are about half as large as the true effect of the program on prevailing rural wages.

A number of additional results suggest that this effect is not an artifact of pre-existing

SC

trends. First, I find insignificant overall effects of the program on private-sector employment. However, this finding again masks substantial heterogeneity: the effects on private

M AN U

employment appear to mirror the effects on wages, with interior areas seeing decreases in private employment relative to border areas. Moreover, the spatial patterns themselves are almost identical for both wages and employment, as the effects have a correlation of −0.792. I show that there were no differential pre-program trends in employment across treated and untreated districts, suggesting the spatial heterogeneity is driven by the program

TE D

itself. Further evidence suggests better program implementation is not responsible for spatial heterogeneity. Finally, changes in individual characteristics are indicative of decreased migration of men, more educated individuals, and older individuals. These changes are

2018).

EP

consistent with the estimated wage heterogeneity caused by the program (Imbert and Papp,

AC C

This paper contributes to three separate bodies of literature. First, I provide further evidence of the broad labor-market impacts of NREGS (Azam, 2012; Zimmermann, 2012; Berg et al., 2014; Imbert and Papp, 2015; Muralidharan et al., 2018) as well as public works programs, more generally (see, e.g., Subbarao et al. (2003) and Beegle et al. (2017)). However, unlike most previous studies, I am able to explore the spatially heterogeneous impacts of the rollout of the program. Consistent with the literature, I find overall increases in the wage rate. However, I also find that the estimated increase disappears at the border with

5

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

untreated districts. In addition, some previous research finds that NREGS had a larger impact on female wages than male wages (Azam, 2012). This study presents evidence that

RI PT

men are more likely to move towards wage increase than women. If true, district-level difference-in-differences estimates of the effect on male wages may be underestimating the true impact of the program more than the estimates of the effect on female wages, explaining part of the gender effects. Unfortunately, I do not have the power to test this explicitly,

SC

as I have relatively few female wage observations.

Second, this study sheds some light on the functioning of labor markets in India (Jay-

M AN U

achandran, 2006; Kaur, 2018). While spot markets for casual labor in India are generally assumed to work relatively well (Rosenzweig, 1980), it is unlikely that markets are complete (Rosenzweig, 1988; Behrman, 1999). These results suggest that labor is somewhat mobile. In addition, while the minimum wage is unlikely to bind or be enforced in many developing country contexts (Behrman, 1999), NREGS appears to operate as a de facto

TE D

minimum wage, increasing the bargaining power of casual laborers (Basu et al., 2009). Moreover, recent research finds that NREGS decreases rural-to-urban migration in three states and that certain individuals are particularly likely to reduce migration in response to

EP

the program (Imbert and Papp, 2018). The results in this paper support that argument. Finally, I contribute to the literature on treatment effects and spillovers of public policy

AC C

interventions in developing countries (Duflo, 2001; Miguel and Kremer, 2004; Angelucci and De Giorgi, 2009; Cunha et al., 2011). In particular, this study reinforces the importance of capturing general equilibrium effects when evaluating large public programs and that failure to do so may result in biased estimates of the program’s impact (Muralidharan et al., 2018). In addition, the focus on the district-level effects of NREGS theoretically underestimates the true effect of the program in two ways: the wage in untreated districts increases more in areas closer to treated districts than in areas farther from treated districts,

6

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

while the wage in treated districts increases less in areas closer to untreated districts than in areas farther from untreated districts. The treatment effect we are interested in is the effect

RI PT

of the program on wages absent spillovers since the program will be expanded throughout the country. Yet, a district-level focus overestimates the counterfactual wage in untreated districts and underestimates the wage in treated districts, although the latter effect apparently dominates in the present study. This leads to an underestimate of the true wage effect

SC

of the program. With this in mind and insofar as increasing rural wages is an explicit goal of policymakers, NREGS may have been more effective than previously estimated.

M AN U

The closest paper to this one is Muralidharan et al. (2018), who study the randomized rollout of an improvement in the implementation of NREGS. They find similar spatial effects. Their paper is different in two important respects. First, their randomized intervention provides a cleaner identification strategy and cleaner estimates of the intervention’s impacts. Indeed, the biggest disadvantage of my approach is the inability to test for pre-program

TE D

trends using a specification identical to that in this paper. However, they randomize improvements only in Andhra Pradesh. Since Andhra Pradesh consistently performs better than other states in implementation of NREGS (Banerjee et al., 2015; Imbert and Papp,

EP

2015), it is not clear whether their findings would translate to other Indian states. Second, they do not study the implementation of the program itself. Rather, they estimate the effects

AC C

of a technological reform that improved implementation of the program. This study, on the other hand, examines the effects of the program when it is first rolled out. Given these differences, one might expect findings to differ. However, the patterns overall findings from both studies are largely similar, suggesting that the effects found in both studies are driven by mechanisms common to all of India and not specific to Andhra Pradesh.

7

ACCEPTED MANUSCRIPT Merfeld

2

Spatially Heterogeneous Effects of a Public Works Program

The Program

The Mahatma Gandhi National Rural Employment Guarantee Act was passed in 2005 after

RI PT

“more than a decade of sustained high growth in GDP... was perceived not to have made a sufficient dent in poverty in the rural India” (Azam, 2012, p. 1). The program guarantees up to 100 days of employment to any rural household that requests it. Employment is varied,

SC

but is generally focused on improving productivity in rural areas by building infrastructure, including irrigation, roads, and land improvements (Liu and Deininger, 2010). Unlike previous workfare programs, NREGS is explicitly designed as an employment guarantee;

M AN U

a household has the right to demand – and receive – employment. However, in practice, supply-side constraints are often binding, resulting in substantial unmet demand (Ravallion et al., 2013; Dutta et al., 2014; Mukhopadhyay et al., 2015).

In order to demand employment, households must first apply for a job card. The house-

TE D

hold applies for the card at the local Gram Panchayat (GP)5 and the card includes a picture of every household member eligible for the program. With this card in hand, household members can request employment. According to the design of the program, individuals request work from their local GP, and the legislation stipulates that work be assigned within

EP

five kilometers of the individual’s place of residence, otherwise a travel stipend is to be provided (Azam, 2012). If individuals do not receive employment within 15 days of re-

AC C

questing it, they are eligible for unemployment allowance. However, underreporting of unemployment is common and unemployment compensation is rare (Sharma, 2009). The program’s design begets a relatively complicated funding structure. The federal government is responsible for all unskilled wages and 75% of skilled wages and materials, while states are responsible for the remaining 25% (Azam, 2012). As such, funding flows 5

The Gram Panchayat is the lowest level of administration in the Indian federal structure and generally consists of several villages.

8

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

from the federal government to the state before being dispersed through lower administrative levels and eventually to the GP’s account, while funding requests from GPs to the state

RI PT

also flow through two separate administrative levels (Banerjee et al., 2015).6 Several specific features of the program are worth noting. First, the minimum wages set by the program7 are self-targeting and are relatively low (and the labor relatively difficult) so that only needy households will apply for employment. Nonetheless, program wages

SC

vary from state to state, and some wages were set above prevailing wage rates at the start of the program. In some cases, program wages were more than double the prevailing wage rate

M AN U

at the time, especially for women.8 Second, the program guarantees equal wages to both men and women and also mandates that a certain number of beneficiaries be women.9 In addition, childcare at worksites is required so that mothers with young children can participate in the program. However, many worksites do not adhere to this childcare requirement (Khawlneikim and Mital, 2016; Khera and Nayak, 2009; Reddy and Upendranadh, 2010;

TE D

Sharma, 2009).

Given the demand-driven nature of the program, implementation is relatively decentralized, with employment and worksites started and administered by the GP. While the GP

EP

is the basis for employment generation, the program was rolled out at the district level, in three phases. Districts in the first phase received the program in 2006, districts in the

AC C

second phase received the program in 2007, and districts in the third phase received the program in 2008. Given the phased rollout, the majority of research on the labor-market effects of NREGS utilize differences-in-differences. However, the rollout was not random6

GPs request funding from the block, which is one administrative level above the GP. Subsequently, the block forwards requests to the district, which then requests funding from the state pool. Banerjee et al. (2015) provide an excellent overview of this process. 7 The original legislation allows the states to define the program wage, as long as it is above the legislated minimum. 8 http://www.levyinstitute.org/pubs/EFFE/Mehrotra_Rio_May9_08.pdf 9 The legislation state that “priority shall be given to women in such a way that at least one-third of the beneficiaries shall be women who have registered and requested for work under this Act” (Ministry of Law and Justice, 2005, p. 14).

9

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

ized. Rather, the Indian government assigned districts to phases based on a governmentconstructed rating, which resulted in the poorest districts generally receiving the program

RI PT

first (Zimmermann, 2012). While the rating used to assign districts to phases is not publicly available, underlying data used in the construction of this rating are available in the 2003 report from the Planning Commission (Planning Commission, 2003; cited in Zimmerman, 2012). These ratings show that earlier phase districts tended to be poorer and have more

SC

scheduled castes/tribes, lower agricultural productivity, and lower wages. As such, there are concerns regarding the identification assumption of differences-in-differences, that treated

of the program.

3

Data and Methods

M AN U

districts and untreated districts had identical labor-market trends prior to implementation

TE D

To analyze the effects of NREGS, I mainly use the Additional Rural Incomes Survey/Rural Economic Demographic Survey (ARIS/REDS), collected by the National Council of Applied Economic Research. The ARIS/REDS is a panel survey that has been conducted periodically since 1969. When first collected, the data was nationally representative, although

EP

it oversampled high-income households and areas suitable to green-revolution crops. However, with recent changes in demographics and administration, the data are no longer rep-

AC C

resentative of the entire country (Foster and Rosenzweig, 2010). As such, I interpret the results as representative of overall spatial patterns – including ratios of magnitudes – but not as representative of the actual magnitude of effects. I use the fifth and sixth rounds of the survey – collected in 1999 and 2008, respectively – for my main analyses. Additionally, I use the Indian National Sample Survey to analyze pre-program trends. I make use of the 1999/2000 and 2004/05 waves of the data. Since the second wave of ARIS/REDS data (2008) was collected after implementation of 10

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

the first two NREGS phases but before implementation of the third phase, I refer to phase one and phase two districts as “treated” districts and phase three districts as “untreated”

RI PT

districts. Figure 1 provides an overview of the phased rollout of NREGS along with the location of ARIS/REDS villages. There is a lot of geographic variation in the location of surveyed villages, with all of the major Indian states covered except Jammu and Kashmir. Since the main strategy of this paper is to compare villages near borders of treated

SC

and untreated districts, it is instructive to look at the geographic variation in this specific characteristic. In particular, there appears to be a concentration of villages located near

M AN U

the border between treated and untreated districts in the southern and central regions. On the other hand, treated villages far from untreated districts tend to be located in the eastern half of the country, while untreated villages far from treated districts tend to be in the northwestern region of the country.

Methods

TE D

3.1

This paper employs differences-in-differences estimates to examine the effects of NREGS

EP

on wages. I estimate regressions of the form

yidt = αh + α1 N REGSd + α2 P ostt + α3 N REGSd × P ostt (1)

AC C

+ Xidt + Zvdt + Dd × P ostt + States × P ostt + εidt ,

where yidt is the outcome of interest for individual i in district d in wave t; αh is household fixed effects; N REGSd is an indicator variable equal to one if the district is a treated (phase-one or phase-two) district; P ostt is an indicator variable for observations in the second wave, 2008; Xidt is a vector of time-variant individual controls, which in practice include just age, age squared, and gender; Zvdt is village characteristics, which include distance to district headquarters, distance to nearest road, distance to nearest town, and 11

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

population density; Dd × P ostt is a vector of district controls – which include the preprogram wage, total border, total area, population, percent rural, percent scheduled caste,

RI PT

percent scheduled tribe, percent literate, labor force percentage, employment characteristics, and agricultural productivity – interacted with the post indicator to allow these characteristics to differentially affect trends in y; States × P ostt is state by wave fixed effects; and εidt is a conditional mean-zero error term. The inclusion of state by wave fixed effects

SC

means the post dummy is generally not interpretable (without significant context) and the dummy is thus omitted from tables. If households never moved, the household fixed ef-

M AN U

fects would absorb the NREGS dummy. However, there are some movers in the data. As such, the NREGS dummy is technically identified. However, there are very few households that move and these are clearly a very select subset of households. As such, the NREGS dummy is also difficult to interpret and is thus also omitted from tables for simplicity. I only present the main, district-level effects using the full specification. In the rest of the

TE D

analyses, I include only age, age squared, gender, and state by wave fixed effects (along with household fixed effects).

The main goal of this paper is to estimate the effects of NREGS at different locations

EP

in a district. To isolate these difference, I restrict estimation to individuals living within a certain distance of the border. For example, after presenting results for all individuals,

AC C

I then restrict estimation to only individuals within 50 kilometers of a border between a treated and untreated district. I then restrict estimation to only individuals within 40 kilometers, and so on. In general, I am not able to estimate effects at any distance less than 15 kilometers from the border; I discuss this more in the results section. An alternative strategy would be to estimate the effects of the program using different windows of distances from the border – for example, comparing households between 40 and 50 kilometers of the border, then households between 35 and 45 kilometers of the border, etc., all the way down to within six kilometers of the border. However, this strategy suffers from severe power 12

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

issues and the sample size is simply not large enough to provide meaningful estimates. The coefficient of interest is α3 , which gives the difference-in-differences estimate of the

RI PT

effect of NREGS on outcomes. This coefficient captures the difference, across treatment status, in changes of each outcome at the district level. This implies a very specific assumption required for unbiased estimates of the effect of NREGS: the trends, across time, of outcomes in treated and untreated districts are identical in the absence of the program.

SC

In other words, if treated districts had not received the program, then treated and untreated districts would have had the same change, from 1999 to 2008, in each of the outcomes. I

M AN U

discuss possible failure of this assumption of difference-in-differences in the results section.

I employ two separate distance variables and present results using a triple interaction:10 yidt = αh + α1 N REGSd + α2 P ostt + α3 Distancev + α4 N REGSd × P ostt (2)

TE D

+ α5 N REGSd × Distancev + α6 Distancev × P ostt + α7 Distancev × P ostt × N REGSd + εidt ,

where Distancev is the village-specific distance variable. These regressions allow an

EP

empirical estimate of how the effect of NREGS on the wage differs by distance.11 One of the distance variables I use is the distance of the border shared with districts of the

AC C

opposite treatment status. In other words, for treated districts, the length of border shared with untreated districts is the key variable, while for untreated districts it is the length of border shared with treated districts. The assumption here is that treated districts surrounded by untreated districts should see a smaller wage effect than treated districts surrounded by other treated districts. This specification is used in two ways. First, it allows me to quantify 10 11

The non-interaction variables are omitted for simplicity of presentation. The triple interaction and all interactions with Post are included in tables for the same reasons listed above.

13

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

the bias in previous estimates. Second, I am not able to empirically estimate pre-program trends with the ARIS/REDS data. In order to explore pre-program trends, I turn instead to

RI PT

data from the National Sample Survey (NSS), which is a nationally representative survey conducted by the Ministry of Statistics and Programme Implementation.12 Since these data are only aggregated to the district level, I am unable to measure distances to border for each village. Instead, I use this border length variable to explore pre-program trends.

SC

I construct several geovariables for use in this analysis. For each district in the country, I first assign it a value of treated (if in the first or second phase) or untreated (if in the third

M AN U

phase). I then construct a variable, which I call border district, that equals one if a treated district borders an untreated district or if an untreated district borders a treated district and zero otherwise. In other words, border district identifies borders across which there is a difference in treatment status in 2008. I then calculate the total distance of such borders for each district in the country.

TE D

In order to estimate treatment heterogeneity, I estimate a series of regressions, restricting estimation based on distance to the nearest border. I begin by estimating regressions using all individuals. Then, I restrict estimation to only individuals living within a certain distance

EP

of the border (e.g. 50, 40, 30, etc.). I define this distance variable as follow: For all villages in treated border districts, I construct another variable equal to the distance from each

AC C

village to the closest border with an untreated district. Similarly, for villages in untreated border districts, I construct this variable as the distance from each village to the closest border with a treated district. To analyze spatial heterogeneity in effects of the program, I will essentially be comparing differences-in-differences estimates of all villages compared to villages close to borders. Figure 2 presents a histogram of this variable across treated (NREGS) and untreated (non-NREGS) districts. 12

See http://mospi.nic.in/national-sample-survey-office-nsso.

14

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

I define labor market exposure in a second way, as well. For each village in the data, I construct a circle with a radius of 20 kilometers whose center is the village. I then calculate

RI PT

the percentage of the area of the resulting circle that belongs to treated districts and the percentage of the area that belongs to untreated districts. For each village/individual, the resulting variable is the percent of the total area belonging to a district of the opposite treatment status. While we expect the wage increase to be smaller as the distance variable

be larger as the percentage of area decreases.

SC

gets smaller with the previous variable, with this variable, we expect the wage increase to

M AN U

I use two main variables to measure labor-market outcomes. The first is the wage. I construct this variable by including both casual agricultural and casual non-agricultural wages at the individual level. For individuals that worked more than one type of job, I sum hours worked and total remuneration (cash and in-kind) to create the wage variable. The other main labor-market outcome I use is private-sector employment. I construct this variable

TE D

by including all days worked in casual labor, own-account agricultural labor, own-account non-farm labor, own-account livestock labor, construction/maintenance labor, household (non-leisure) labor, and other economic activities.

EP

Finally, I cluster standard errors at the district level in all regressions, despite the inclusion of household fixed effects. The program itself was implemented at the district level, so

AC C

district-clustered standard errors provide a conservative estimate for the standard errors (Donald and Lang, 2007).

3.2

Summary Statistics

Summary statistics from 1999, prior to the program, are presented in Table 1. In Panel A, statistics are at the district level. There are 39 treated (phase one or two) districts and 33 untreated (phase three) districts in the ARIS/REDS data. Treated districts tend to be larger – 15

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

geographically – and more rural than untreated districts. They also have a higher percentage of scheduled castes and scheduled tribes living in the district, suggesting treated districts

RI PT

may be worse off relative to untreated districts. This is confirmed by the literacy rate and wage rate, both of which are lower in treated districts. This is, of course, unsurprising, given that rollout was designed to give the poorest districts access to the program first.

Panel B presents summary statistics at the individual level for any individual with a valid

SC

wage observation in 1999. The first four variables are defined at the village level, but are presented at the individual level. Consistent with the rural nature of treated districts,

M AN U

individuals living in treated districts tend to be farther from the district headquarters, farther from the nearest road, and farther from the nearest town than individuals living in untreated districts. Treated individuals with a wage observation are slightly younger but are less likely to be male. Finally, again consistent with the district-level statistics, individual wages are

4

Results

TE D

lower in treated districts than in untreated districts.

EP

I begin by estimating a simple differences-in-differences of the district-level effect of the program on wages in Table 2. This specification is similar to previous estimates of the

AC C

impact of NREGS on wages. The first three columns present estimates using household fixed effects, while the last three columns present estimates using district fixed effects. In the first and fourth columns, only age (and age squared) and gender are included in the regression. In the second and fifth columns, state by wave fixed effects are also included. Finally, in the third and sixth columns, an extended set of controls is added. These controls are detailed in the methods section above. The estimated effect of NREGS on wages is relatively similar, regardless of specifica-

16

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

tion. Across all six columns, the estimated effect is between approximately 7.6 and 12.1 percent.13 This is slightly larger than previous estimates using the NSS (Imbert and Papp,

RI PT

2015) but slightly smaller than previous estimates using the same dataset as this paper (Liu and Deininger, 2010). Despite the fact that the estimated effect is smallest using the simplest specification, previous research has found that this specification may be biased due to violations of the parallel trends assumption. However, the inclusion of household

SC

fixed effects – instead of district fixed effects – should help alleviate some of these concerns. Moreover, the third column includes a number of controls that have been found

M AN U

to attenuate the pre-program trend (Imbert and Papp, 2015); since the estimated effect of NREGS does not change following the inclusion of these controls (in the household fixed effects specifications), this is suggestive evidence that pre-program trends may not be driving household-fixed-effect estimates of the effect of NREGS on wages.14 To explore whether the district-level estimate of the effect obscures spatial heterogeneity,

TE D

I first present a graphical representation of wage changes from 1999 to 2008 in Figure 3. Only individuals located in border districts are included in the figure. The x-axis denotes the distance to the border between treated and untreated districts, with negative values indi-

EP

cating movements towards the interior of untreated districts and positive values indicating movements towards the interior of treated districts. A value of zero indicates the location

AC C

of the border. The wage in 1999 is monotonically decreasing as we move from the interior of untreated districts – presumably the location of better-off households – towards the border and then towards the interior of of treated districts. It is well documented that wages 13

The Post dummies have been omitted for simplicity. In columns one and three, the post dummy is identified. However, in columns two and five, the post dummy takes on the value of one of the state/wave fixed effects, complicating its interpretation. Despite the inclusion of household fixed effects, the NREGS dummy is technically identified in all columns. However, this identification comes from just a handful of households that appear to have moved districts. Thus, the coefficient is imprecisely estimated and its interpretation is not clear. 14 If the estimate in the first column is indeed biased upwards and if we are able to control for the variables responsible for this bias, then the estimates in column two and three should actually be smaller, not larger.

17

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

in treated districts were markedly higher before implementation of NREGS than wages in untreated districts. If the means are different, on average, across the treatment phases, then

RI PT

there is likely to be some type of gradient across districts, which is what we see in Figure 3. In 2008, we see a similar pattern in untreated districts; the wage is decreasing as we move towards the border. However, there is a clear difference in the pattern as we move towards the interior of treated districts; the wage seems to be increasing relative to 1999.

SC

Table 3 presents empirical estimates of this effect, using the same specification as column two of Table 2. The first column estimates the effect of NREGS when using all individuals

M AN U

with a wage observation. The estimate indicates that NREGS increased the wage in treated districts by around 9.5 percent relative to untreated districts. Each subsequent column restricts the sample to individuals located closer and closer to the border. The second column restricts estimation to only individuals located within 55 kilometers of the border, the third column to only individuals located with 45 kilometers of the board, and so on. The

TE D

effect appears to attenuate as we restrict attention to households located closer and closer to the border. However, due to imprecision in the estimates and the fact that estimates jump around slightly as we restrict our sample, the actual empirical estimates are very sensitive

EP

to the exact cut-off used. While round numbers may be preferred for presentation, there is no empirical reason that 50 is a more appropriate choice than 49, for example.

AC C

To help make the pattern clearer, Figure 4 presents the results graphically. The regressions in Table 3 were re-estimated at every cut-off between 15 and 50 kilometers. The estimated effect size at each cut-off is plotted against the distance. The first panel presents results using the simple specification (column one of Table 2); the second panel presents results using the state/wave fixed effects specification (column two of Table 2); and the third panel presents results using the extended controls specification (column three of Table 2). In all three cases, the spatial heterogeneity is clear: the estimated effect of the program at-

18

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

tenuates as we restrict attention to individuals located closer to the border between treated and untreated districts.15 Moreover, consistent with the results in Table 2, the overall ef-

RI PT

fect appears to be smallest with the simple specification and the most imprecise (and also the largest) – as evidenced by substantial jumps in the estimated coefficients – with the extended controls specification. This motivates the use of the state by wave fixed effects specification for results in the remainder of this paper.

SC

The appendix presents three additional graphs to help contextualize these results. Figure A1 presents the effect when not restricting attention to only individuals living in border

M AN U

districts; the results are unchanged. Figure A2 shows the estimated effects of the program by phase, comparing phases one and three (first panel) and phases two and three (second panel). This substantially reduces power, but we see the same general spatial trend in both graphs. Finally, Figure A3 presents results within 15 kilometers to help clarify why the three graphs in Figure 4 stop at 15 kilometers. We start to see large jumps in the estimated

TE D

effect based on sample restrictions. For example, restricting the sample from 13 to 12 kilometers causes the estimated effect to increase from just 0.03 to 0.2. Similarly, seven to six leads to a drop of more than 0.2 and going from six to five leads to an even larger drop. It

EP

appears that the ARIS/REDS does not have enough observations to reliably estimate effects within approximately 15 kilometers of the border.

AC C

The previous results use the distance to border as a proxy for labor market exposure. Another possible proxy is the area around a village that is made up of a different district. In other words, if we construct a circle with a radius of 20 kilometers around each village, what percent of that circle belongs to a district of the opposite treatment status? In this case, a value of zero would indicate a village that is completely surrounded by districts of the same treatment status, while a value of one would indicate a village that is completely 15

In the first two panels, the effect is approximately zero within 15 kilometers. In the third panel, the coefficient is slightly larger than zero, but the standard error is larger than the coefficient, suggesting any differences from zero may be driven by imprecision.

19

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

surrounded by districts of the opposite treatment status.16 Table 4 presents results based on restrictions using this radius variable. The first column again uses all observations. Each

RI PT

additional column restricts the sample to individuals with less and less of the surrounding area belonging to a district of the opposite treatment status. While we expected the effect to attenuate with the sample restriction in Table 3, we now expect the effect to increase as we restrict the sample. Indeed, this is exactly what we see. Similar to the previous

SC

results, however, the exact cut-off is important. Thus, Figure 5 again presents the results graphically. The first graph uses a radius of 20 kilometers while the second graph uses

M AN U

a radius of 30 kilometers. The pattern is again clear: more isolated villages (with values closer to zero) see larger estimated effects of NREGS on prevailing casual wages. To help quantify the effect of distance on the estimated impact of NREGS on wages, Table 5 presents results using two separate triple interactions. The first three columns include a triple interaction of Post×NREGS×Distance to other. The estimates in column

TE D

one suggest that an increase of one kilometer is correlated with an increase in the estimated effect of NREGS on wages of approximately 0.3 percent. The mean distance is 26.2, which would suggest a wage increase approximately 7.9 percent higher at the mean distance than

EP

at the border. Importantly, the Post×NREGS dummy is the estimated effect of NREGS on individuals located at a distance of zero kilometers from the border. As we would expect,

AC C

this coefficient is not significantly different from zero. The last three columns use a triple interaction of the Post×NREGS dummy with a new variable, which is the length of the district border shared with districts of the opposite treatment status. The intuition behind this variable is that treated districts surrounded by untreated districts should, ceteris paribus, see smaller wage increases than treated districts surrounded by other treated districts. This is meant to act as a third proxy of labor market exposure. Column four suggests that as the length of the border shared with dis16

The latter is of course not a realistic possibility, though the former is.

20

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

tricts of the opposite treatment status increases – conditional on length of total border – the estimated effect of NREGS on casual wages decreases. The coefficient is marginally

RI PT

significant (p=0.113) with the conservative district-clustered standard errors. The mean length of the length variable is 1.86, which corresponds to 186 kilometers. For the mean treated district, the effect of NREGS on wages would be approximately 9.7 percent smaller (−0.052 × 1.86) than for a treated district without any exposure to untreated districts. This

SC

estimate is surprisingly close to the estimate using a completely different specification in column one. The Post×NREGS dummy is again the estimated effect of NREGS on wages

M AN U

for districts with a value of zero on the border shared variable. In contrast to column one, however, this is now the estimate of the effect of NREGS on wages for the most isolated districts. This is arguably the best estimate we have of the true effect of NREGS on wages, a point to which I return below.

Interestingly, columns two, three, five, and six suggest that the heterogeneity is driven

TE D

completely by dynamics within treated districts. Column two shows the estimated wage change by distance to border for treated districts while column three shows the estimated change by distance to border for untreated districts only. The coefficient in column two

EP

is positive and significant, as expected, but the coefficient in column three is a preciselyestimated zero. The same dynamic is seen in columns five and six, though the coefficient in

AC C

column six is at least in the predicted direction. It is not clear why this heterogeneity is only seen within treated districts and this finding deserves further analysis in future research. The results in columns two and three of Table 5 suggest that previous estimates might be underestimating the wage change due to the implementation of NREGS in treated districts. If true, then previous estimates of the impact of NREGS on wages are biased downwards. To quantify this bias, we need to isolate the effect of NREGS on wages in more “isolated” areas. It is not clear the best measure of isolation, but Table 6 presents two possibilities. The

21

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

first column presents the overall, district-level effect of NREGS. As before, the estimated effect is around 9.5 percent. Column two estimates the effect only for individuals that

RI PT

are not located within 20 kilometers of a district of the opposite treatment status; in other words, a circle with a radius of 20 kilometers around that individuals does not contain a district of the opposite treatment status. The estimated effect is approximately twice as large in these areas as the district-level average suggests. The third-column estimates are

SC

identical to the triple interaction in column four of Table 5. The assumption here is that treated districts that do not share a border with untreated districts feel the full effect of

M AN U

NREGS on wages. As previously indicated, this effect is the Post×NREGS coefficient. Reassuringly, the estimated “true” effect is almost identical to the estimate in column two. To put bounds on the underestimate, the bottom row shows the difference in estimated effects along with a percentile 95-percent confidence interval constructed by bootstrapping the estimates 1,000 times. The bootstrap draws households within districts to be consis-

TE D

tent with the inclusion of household fixed effects and the clustering of standard errors at the district level. In both columns two and three, the estimated “true” effect is larger – approximately twice as large – than the average effect in column one. In column two, the

EP

95-percent confidence interval suggests the underestimate could be as small as 0.018 but as large as 0.161. Given that the overall effects are slightly larger than previous estimates, the

AC C

upper end of the confidence interval seems unlikely. Though the point estimate in column three is almost identical, the confidence interval is slightly larger due to larger standard errors (and almost includes zero at the 95-percent level).

4.1

Is This Finding Driven by Pre-Program Trends?

The fundamental issue in this paper is the inability to test for pre-program trends. Though the ARIS/REDS has another wave before 1999, it was collected in the early 80s and, as 22

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

such, is unlikely to provide a valid estimate of any wage trends just before NREGS implementation. Therefore, in this section I present several pieces of circumstantial evidence

RI PT

that suggest pre-program trends are not responsible for the results. First, unlike previous studies, this paper uses household fixed effects in all specifications, which should help with identification. Second, Figure 4 shows similar heterogeneity, regardless of specification. In particular, the third column (“Extended Controls”) shows the

SC

results when a number of district- and household-level controls are included, including the variables that determined roll-out. Imbert and Papp (2015) show that these variables help

M AN U

mitigate differential pre-program trends. Using the National Sample Survey, I confirm this in Table A1. The first column shows the simple difference-in-differences estimator from the period 1999/2000 to 2004/05. As in Imbert and Papp (2015), the wage appears to be trending upwards in treated districts relative to untreated districts prior to the program. However, the inclusion of the district-level “Extended Controls” mitigates this problem, as

TE D

show in column two. As such, the fact that the heterogeneity remains even conditional on these variables is suggestive evidence that trends are not responsible. Third, though there were differential pre-program trends in wages, that was not the case

EP

for days worked in casual employment. This is confirmed in columns three and four of Table A1 in the Appendix. Thus, if we see spatial heterogeneity in private-sector employ-

AC C

ment, as well, we might attribute this to true effects instead of trends. Table 7 shows the effects of NREGS on private-sector employment at different distances to the border, similar to the estimated effects on wages in Table 2. Again, however, the exact choice of cut-off is important. Therefore, Figure 6 presents the effects graphically, at every possible cut-off. To facilitate comparison, Figure 6 also includes the effects on wages shown in Figure 4. The heterogeneity in private-sector employment is exactly as we would expect, with a more negative effect towards the interior of districts, where the wage increase was the largest.

23

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

Moreover, just like wages, the estimated effect near the border is approximately zero. The effects on wages and days are also strongly correlated, with a correlation of −0.792. This

RI PT

is further evidence that the spatial heterogeneity is due to the program and not trends. Fourth, Table 6 presents two ways to quantify the bias in the district-level estimates of wage impacts. In particular, the third column reports estimates using the ARIS/REDS and the distance of shared borders between districts. Though I cannot test pre-program

SC

trends using distance to border with the NSS (since the dataset does not have geographic information available below the district level), I am able to estimate pre-program trends

M AN U

using the distance of shared border. I present these results in Appendix Table A2. Column one shows the triple interaction from 1999/2000 to 2004/05 using all districts available in the NSS. The second column restricts attention to just the districts also in the ARIS/REDS, while the third column restricts attention to just border districts in the ARIS/REDS. In all three cases, the triple interaction is positive and insignificant, the opposite of what we

TE D

would expect if a violation of the parallel trends assumption were driving the results. This is again inconsistent with pre-program trends being responsible for the current results. Finally, another possibility is that the wage increase is largest on the interior of treated

EP

districts because NREGS may be better implemented there. In this case, the differences in wage effects would be true program effects, but they would not be due to spillovers of any

AC C

kind. To test this possibility, Table 8 presents regressions of days in public works in 2008 on different distance variables. These regressions include district fixed effects instead of household fixed effects, since the variable is available only for 2008.17 The first column confirms that public works employment in 2008 was significantly higher in treated districts than in non-treated districts, as would be expected since NREGS was not yet implemented in the non-treated districts. 17

Recall that in a single wave, there is variation within the district in the distance variable but not within the household.

24

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

The second and third columns present results using only NREGS districts. Interestingly, being located farther from an untreated district is negatively correlated with days in public

RI PT

works. If the spatial heterogeneity in wage changes were driven by differences in program implementation, we would likely see more days in public works, not less.18 Rather, this is more consistent with the wage increase itself. If NREGS wages are constant within a district, then, all else equal, we should see fewer days in public works where the prevailing

SC

private-sector wage is higher than where the prevailing wage is lower. The third column adds distance to the district headquarters. Though the coefficient is negative, it is small and

M AN U

insignificant. The mean distance to headquarters in treated districts is 53 kilometers, which is associated with a decrease in days of public works of just 2.2 percent. On the whole, these results are not consistent with differences in implementation being responsible for the observed spatial heterogeneity.

As expected, when we restrict estimation to untreated districts, the coefficients them-

TE D

selves are very small in magnitude, though the coefficient on distance to headquarters is marginally significant in column three. Though there are other public works programs in India, none is as large as NREGS. As such, we would not expect to see any significant

Do We Observe Labor Movements?

AC C

4.2

EP

correlations between distance variables and days in public works in untreated districts.

It appears that this spatial heterogeneity is driven at least partially by movements of labor. Figure 7 presents some individual characteristics (in 2008) and how they vary with distance to border in an attempt to identify some of these movements. The first three columns are personal characteristics that should be pre-determined – relative to the program, that is – for most adults. First, we see that the number of males (as a proportion of all adults) is 18

Note that workers do not need to work in the program for it to have a significant impact on wages (Basu et al., 2009). Rather, the program need only provide workers with a credible outside option.

25

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

increasing as we move towards the interior of treated districts. One can construct a story in which this is consistent with a wage increase, but that story is not so clear for either

RI PT

age or primary education, neither of which shows a consistent spatial pattern. Age is first decreasing then increasing then decreasing, while primary education is increasing then decreasing. The last figure presents travel time to employment, which is available only for non-farm employment. Interestingly, travel time increases exactly where we see the

SC

gradient in the wage effect, around 30-40 kilometers from the border. Moreover, it then decreases towards the very interior of treated districts, which we might expect if the wage

M AN U

increase is largest there and residents do not need to travel to take advantage of the largest wage increases.

However, it is not necessarily the levels in 2008 that are of interest to the estimated wage effects, but rather the changes in these characteristics from 1999 to 2008. While this is not possible for non-farm travel time – which is only available in 2008 – Figure 8 presents

TE D

difference-in-differences estimates of changes for the other three individual characteristics. There are clearer trends here than in the previous figure. In particular, it appears that the changes in male and primary education follow very similar patterns as the wage increase.

EP

Age also follows a similar pattern, though it is not as evident as for male or primary education. This is consistent with men and more educated individuals being more likely to

AC C

remain behind to take advantage of the larger wage increase. It should be noted, however, that the patterns presented here are not consistent with the movement of labor causing the wage increase. Rather, we might expect an influx of men and/or educated workers, all else equal, to suppress prevailing wages rather than increase them. Though travel time to non-farm employment may be indicative of daily travel, demographic changes are not. Rather, demographic changes are presumably due to either changes in migration or changes in the types of people who are working for wages. The present

26

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

results mirror recent research that finds NREGS reduces seasonal migration in three Indian states (Imbert and Papp, 2018).19 The authors, using a survey designed specifically

RI PT

to answer questions related to seasonal migration, find non-monetary costs of migration are especially important for individuals with more education and for older individuals. In other words, the authors argue NREGS may be especially likely to reduce migration of these individuals, which is exactly what I find. Despite historical low labor mobility in

SC

India (Munshi and Rosenzweig, 2009), these results nonetheless suggest wage increases may decrease out-migration. In addition, they further supports the contention that the wage

5

M AN U

heterogeneity documented in this paper is indeed caused by the program.

Conclusion

In this paper, I document the effects of the Mahatma Gandhi National Rural Employment

TE D

Guarantee Scheme (NREGS) in India. In particular, I show that the effects of the program are spatially heterogeneous: treated areas located near untreated areas see smaller wages increases than treated areas located farther from untreated areas. In addition, this hetero-

EP

geneity does not appear to be driven by pre-program trends or program leakage. However, while the pattern is clear across all of the results presented, a small sample size and demanding empirical strategy suggest caution in interpreting the magnitude of the estimated

AC C

effects.

These results suggest that previous studies of the effect of the program on wages underestimate the true effect of the program. This bias theoretically operates through two separate channels: an overestimate of the counterfactual wage in untreated districts and an underestimate of the counterfactual wage in treated districts, although the present results suggest the latter is more important. Quantifying the bias suggests that the true effect of NREGS 19

Those three states are Rajasthan, Gujarat, and Madhya Pradesh.

27

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

on wages is approximately twice as large as previous estimates indicate. It appears that the

AC C

EP

TE D

M AN U

SC

RI PT

effects of NREGS on wages may have been greater than previously believed.

28

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

References

RI PT

Afridi, F., Mukhopadhyay, A., and Sahoo, S. (2012). Female labour force participation and child education in India: The effect of the National Rural Employment Guarantee Scheme. Working Paper. Anant, T., Hasan, R., Mohapatra, P., Nagaraj, R., and Sasikumar, S. (2006). Labor markets in India: issues and perspectives. In Felipe, J. and Hasan, R., editors, Labor Markets in Asia, pages 205–300. Palgrave Macmillan.

SC

Angelucci, M. and De Giorgi, G. (2009). Indirect effects of an aid program: How do cash transfers affect ineligibles’ consumption? The American Economic Review, 99(1):486– 508.

M AN U

Azam, M. (2012). The impact of Indian job guarantee scheme on labor market outcomes: Evidence from a natural experiment. IZA Discussion Paper. Banerjee, A., Duflo, E., Imbert, C., Mathew, S., and Pande, R. (2015). Can e-governance reduce capture of public programs? Experimental evidence from a financial reform of India’s employment guarantee. New Delhi: International Initiative for Impact Evaluation (3ie).

TE D

Basu, A. K., Chau, N. H., and Kanbur, R. (2009). A theory of employment guarantees: Contestability, credibility and distributional concerns. Journal of Public Economics, 93(3):482–497. Beegle, K., Galasso, E., and Goldberg, J. (2017). Direct and indirect effects of Malawi’s public works program on food security. Journal of Development Economics.

EP

Behrman, J. R. (1999). Labor markets in developing countries. Handbook of Labor Economics, 3:2859–2939.

AC C

Berg, E., Bhattacharyya, S., Rajasekhar, D., and Manjula, R. (2014). Can Public Employment Schemes Increase Equilibrium Wages? Evidence from a natural experiment in India. The Centre for Market and Public Organisation. Cunha, J. M., De Giorgi, G., and Jayachandran, S. (2011). The price effects of cash versus in-kind transfers. National Bureau of Economic Research Working Paper. Donald, S. G. and Lang, K. (2007). Inference with difference-in-differences and other panel data. The Review of Economics and Statistics, 89(2):221–233. Duflo, E. (2001). Schooling and labor market consequences of school construction in Indonesia: Evidence from an unusual policy experiment. American Economic Review, 91(4):795–813.

29

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

Dutta, P., Murgai, R., Ravallion, M., and Van de Walle, D. (2014). Does India’s employment guarantee scheme guarantee employment? World Bank Policy Research Working Paper.

RI PT

Foster, A. D. and Rosenzweig, M. R. (2010). Is there surplus labor in rural India? Working Paper. Imbert, C. and Papp, J. (2011). Estimating leakages in India’s employment guarantee. In Khera, R., editor, Battle for Employment Guarantee, pages 269–78. Oxford University Press.

SC

Imbert, C. and Papp, J. (2015). Labor market effects of social programs: Evidence from India’s employment guarantee. American Economic Journal: Applied Economics, 7(2):233–263.

M AN U

Imbert, C. and Papp, J. (2018). Costs and Benefits of Rural-Urban Migration: Evidence from India. Working Paper. Jayachandran, S. (2006). Selling labor low: Wage responses to productivity shocks in developing countries. Journal of Political Economy, 114(3):538–575.

TE D

Jha, R., Bhattacharyya, S., and Gaiha, R. (2011). Social safety nets and nutrient deprivation: An analysis of the National Rural Employment Guarantee Program and the Public Distribution System in India. Journal of Asian Economics, 22(2):189–201. Kaur, S. (2018). Nominal wage rigidity in village labor markets. American Economic Review, Forthcoming.

EP

Khawlneikim, S. and Mital, M. (2016). Socio-economic impact of Rural Employment Guarantee Scheme in Churachandpur District, Manipur: A beneficiary perspective. International Journal of Scientific Research, 4(12).

AC C

Khera, R. and Nayak, N. (2009). Women workers and perceptions of the national rural employment guarantee act. Economic and Political Weekly, pages 49–57. Li, T. and Sekhri, S. (2013). The unintended consequences of employment based safety net programs. Working Paper. Liu, Y. and Barrett, C. B. (2012). Heterogeneous pro-poor targeting in India’s Mahatma Gandhi National Rural Employment Guarantee Scheme. Working Paper. Liu, Y. and Deininger, K. (2010). Poverty impacts of India’s National Rural Employment Guarantee Scheme: Evidence from Andhra Pradesh. Selected paper prepared for presentation at the Agricultural & Applied Economics Association. Merfeld, J. and Saha, J. (2018). Insuring girls’ lives against drought. Working Paper. 30

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

Miguel, E. and Kremer, M. (2004). Worms: Identifying impacts on education and health in the presence of treatment externalities. Econometrica, 72(1):159–217.

RI PT

Ministry of Law and Justice (2005). The National Rural Employment Guarantee Act. http://nrega.nic.in/rajaswa.pdf. Mukhopadhyay, A., Sharan, M., et al. (2015). The National Rural Employment Guarantee Scheme in Rajasthan: Rationed funds and their allocation across villages. Working Paper.

SC

Munshi, K. and Rosenzweig, M. (2009). Why is mobility in India so low? Social insurance, inequality, and growth. National Bureau of Economic Research Working Paper.

M AN U

Muralidharan, K., Niehaus, P., and Sukhtankar, S. (2016). Building state capacity: Evidence from biometric smartcards in India. The American Economic Review, 106(10):2895–2929. Muralidharan, K., Niehaus, P., and Sukhtankar, S. (2018). General Equilibrium Effects of (Improving) Public Employment Programs: Experimental Evidence from India. National Bureau of Economic Research Working Paper 23838. Niehaus, P. and Sukhtankar, S. (2013). The marginal rate of corruption in public programs: Evidence from India. Journal of Public Economics, 104:52–64.

TE D

Planning Commission (2003). Report of the Task Force: Identification of Districts for Wage and Self Employment Programmes. Ravallion, M., Van de Walle, D. P., Dutta, P., and Murgai, R. (2013). Testing information constraints on India’s largest antipoverty program. Working Paper.

EP

Ravi, S. and Engler, M. (2015). Workfare as an effective way to fight poverty: The case of India’s NREGS. World Development, 67:57–71.

AC C

Reddy, D. N. and Upendranadh, C. (2010). National rural employment guarantee: Issues, concerns and prospects. Oxfam India Working Papers Series. Rosenzweig, M. R. (1978). Rural wages, labor supply, and land reform: A theoretical and empirical analysis. American Economic Review, 68(5):847–861. Rosenzweig, M. R. (1980). Neoclassical theory and the optimizing peasant: An econometric analysis of market family labor supply in a developing country. The Quarterly Journal of Economics, 94(1):31–55. Rosenzweig, M. R. (1984). Determinants of wage rates and labor supply behavior in the rural sector of a developing country. In Binswanger, H. and Rosenzweig, M., editors, Contractual Arrangements, Employment and Wages in Rural Labour Markets in Asia, pages 1–41. Yale University Press, New Haven. 31

ACCEPTED MANUSCRIPT Merfeld

Spatially Heterogeneous Effects of a Public Works Program

Rosenzweig, M. R. (1988). Labor markets in low-income countries. Handbook of Development Economics, 1:713–762.

RI PT

Shah, M. and Steinberg, B. M. (2015). Workfare and human capital investment: Evidence from India. National Bureau of Economic Research Working Paper. Shah, M. and Steinberg, B. M. (2017). Drought of opportunities: Contemporaneous and long-term impacts of rainfall shocks on human capital. Journal of Political Economy, 125(2):000–000.

SC

Sharma, A. (2009). NCAER-PIF Study on Evaluating performance of National Rural Employment Guarantee Act. National Council of Applied Economic Research.

M AN U

Subbarao, K. et al. (2003). Systemic shocks and social protection: Role and effectiveness of public works programs. Working Paper. Sukhtankar, S. (2016). India’s National Rural Employment Guarantee Scheme: What Do We Really Know about the World’s Largest Workfare Program? India Policy Forum, July, 13:2009–10. Topalova, P. (2010). Factor immobility and regional impacts of trade liberalization: Evidence on poverty from India. American Economic Journal: Applied Economics, 2(4):1– 41.

AC C

EP

TE D

Zimmermann, L. (2012). Labor market impacts of a large-scale public works program: Evidence from the Indian Employment Guarantee Scheme. IZA Discussion Paper.

32

ACCEPTED MANUSCRIPT

Tables

M AN U

SC

RI PT

Figure 1: NREGS Districts and ARIS/REDS Villages

Legend

ARIS/REDS Villages

NREGA Districts Phase

EP

TE D

N/A

AC C

6

Phase 1 Phase 2 Phase 3

ACCEPTED MANUSCRIPT 34

Table 1: Summary Statistics (1999)

Panel A: District Level Total border Area

SC

Population (log)

Percent SC Percent ST Literacy rate Labor force participation rate Median wage (log rupees)

TE D

Observations

Panel B: Individual Level Distance to district HQ (log km)

Distance to nearest road (log km)

EP

Distance to nearest town (log km) Village population density

AC C

Male (yes = 1)

Wage (log rupees) Observations

M AN U

Percent rural

Age

5.42 (2.30) 8.88 (0.54) 14.67 (0.49) 0.80 (0.11) 0.18 (0.07) 0.10 (0.13) 0.54 (0.11) 0.41 (0.07) 4.17 (0.30) 39

(2) Non-NREGS Districts mean/sd

RI PT

(1) NREGS Districts mean/sd

3.72 (0.96) 0.52 (0.87) 2.44 (0.97) 0.94 (1.43) 33.72 (13.26) 0.61 (0.49) 4.09 (0.43) 2031

5.12 (2.06) 8.72 (0.72) 14.81 (0.48) 0.65 (0.17) 0.15 (0.06) 0.03 (0.06) 0.60 (0.10) 0.39 (0.06) 4.40 (0.33) 33

3.57 (0.93) 0.23 (0.62) 2.08 (1.06) 0.97 (1.28) 34.48 (13.15) 0.68 (0.47) 4.37 (0.40) 2043

All statistics are from prior to implementation of NREGS. Column one includes phase one and two districts while column two includes phase three districts. Panel A includes district-level variables: total border and area are constructed from GIS files, population through labor force participation are from the 2000 census, and median wage is constructed using the ARIS/REDS 1999 wave. Panel B includes individual-level variables in the 1999 ARIS/REDS.

ACCEPTED MANUSCRIPT 35

TE D

M AN U

SC

RI PT

Figure 2: Histogram of Distance to Border

Distance to border is defined as the distance from the enumerated village to the closest border between a treated and untreated district.

EP

Table 2: Effect of NREGS on Wages

AC C

Post×NREGS

(1) Model 1 0.076 (0.059) yes no no 9649

Age and Gender State/Wave FE Extended Controls Observations

Household FE (2) (3) Model 2 Model 3 0.095** 0.095* (0.045) (0.057) yes yes yes no no yes 9649 9649

(4) Model 4 0.075 (0.061) yes no no 9649

District FE (5) (6) Model 5 Model 6 0.121** 0.090 (0.052) (0.059) yes yes yes no no yes 9649 9649

Standard errors are in parentheses and are clustered at the district level in all columns. Post is defined as a dummy equal to one for the 2008 wave of the ARIS/REDS, between phases two and three of NREGS implementation. NREGS is defined as an individual that lives in a phase one or two district. Extended controls include both district-level variables and village-level variables. District-level variables are defined using either the 2000 census or the 1999 ARIS/REDS. These variables are constant across both waves and are thus interacted with the Post dummy to allow them to differentially affect trends. Village-level variables use the values from their respective waves and thus are not interacted with the Post dummy. * p<0.1 ** p<0.05 *** p<0.01

ACCEPTED MANUSCRIPT 36

TE D

M AN U

SC

RI PT

Figure 3: Distance to Border and Wages - 1999 and 2008

EP

Distance to border is defined as the distance from the enumerated village to the closest border between a treated and untreated district. Negative values indicate untreated (phase three) districts and positive values indicate treated districts. In other words, more negative values indicate the interior of untreated districts, values close to zero indicate areas located near the border between a treated and untreated district, and more positive values indicate the interior of treated districts.

AC C

Table 3: Effect of NREGS on Wages by Distance to Border

Post×NREGS

Age and Gender State/Wave FE Observations

(1) (2) All < 55 km 0.095** 0.058 (0.045) (0.042) yes yes yes yes 9649 7296

(3) < 45 km 0.056 (0.043) yes yes 6929

(4) < 35 km 0.037 (0.039) yes yes 6452

(5) < 25 km 0.017 (0.042) yes yes 5217

(6) < 15 km 0.001 (0.045) yes yes 3584

Standard errors are in parentheses and are clustered at the district level in all columns. Post is defined as a dummy equal to one for the 2008 wave of the ARIS/REDS, between phases two and three of NREGS implementation. NREGS is defined as an individual that lives in a phase one or two district. Distance to border is defined as the distance from the enumerated village to the nearest border between a treated and untreated district for border districts. * p<0.1 ** p<0.05 *** p<0.01

37

EP TE D M AN U

SC

RI PT

Each point is the a difference-in-differences estimate of the effect of NREGS on wages for all households no farther than x kilometers from the border between a treated and untreated district. As an example, a point at 30 means that the regression was estimated for all households less than or equal to 30 kilometers from the border. The first figure reports results from column one in , the second figure reports results from column two in , and the third figure reports results from column three in .

AC C

Figure 4: Effect of NREGS on Wages by Distance to Border

ACCEPTED MANUSCRIPT

ACCEPTED MANUSCRIPT 38

Table 4: Effect of NREGS on Wages - Circles of Radius 20km

Post×NREGS Age and Gender State/Wave FE Observations

(3) < 0.2 pct 0.113* (0.061) yes yes 7336

(4) < 0.15 pct 0.143** (0.063) yes yes 6901

(5) < 0.1 pct 0.122* (0.061) yes yes 6604

(6) < 0.05 pct 0.132* (0.074) yes yes 6005

RI PT

(1) (2) All < 0.25 pct 0.095** 0.122** (0.045) (0.053) yes yes yes yes 9649 7633

M AN U

SC

Standard errors are in parentheses and are clustered at the district level in all columns. Post is defined as a dummy equal to one for the 2008 wave of the ARIS/REDS, between phases two and three of NREGS implementation. NREGS is defined as an individual that lives in a phase one or two district. For each household, a circle of radius 20 kilometers was constructed with the household as the center and the percent of the area of the circle belonging to a district of the opposite treatment status was calculated. In each column, the sample is restricted using this variable. * p<0.1 ** p<0.05 *** p<0.01

AC C

EP

TE D

Figure 5: Effect of NREGS on Wages by Distance to Border - Circles

Each point is the a difference-in-differences estimate of the effect of NREGS on wages. The x variable is the percent of a circle of radius 20 kilometers (first figure) or 30 kilometers (second figure) – whose center is each household – that belongs to a district of the opposite treatment status. As an example, a point at 0.2 in the first figure means that the regression was estimated for all households with less than 20 percent of the area within 20 kilometers of the household belonging to a district of the opposite treatment status.

ACCEPTED MANUSCRIPT 39

Table 5: Effect of NREGS on Wages - Triple Interactions

Post×NREGS×Border with other

-0.052 (0.033)

0.001 (0.001)

0.005*** (0.001)

Post×Border with other Post×NREGS

0.024 (0.053) yes yes 8693

yes yes 4606

yes yes 4087

M AN U

Age and Gender State/Wave FE Observations

-0.000 (0.001)

0.006 (0.019) 0.185** (0.085) yes yes 9649

SC

Post×Distance to other

Border Shared w/ Other (km) (4) (5) (6) All NREGS Non-NREGS

RI PT

Post×NREGS×Distance to other

Distance to Other (km) (1) (2) (3) All NREGS Non-NREGS 0.003* (0.002)

-0.067* (0.037)

0.008 (0.016)

yes yes 5562

yes yes 4087

TE D

Standard errors are in parentheses and are clustered at the district level in all columns. Post is defined as a dummy equal to one for the 2008 wave of the ARIS/REDS, between phases two and three of NREGS implementation. NREGS is defined as an individual that lives in a phase one or two district. Distance to Other (columns one through three) is defined as the distance from the household to the nearest border between a treated and untreated district, for border districts. Border Shared with Other (columns four through six) is a variable that gives the percent of each district’s total border distance that is shared with a border of the opposite treatment status. As an example, a treated district that shares 200 kilometers of 50 border with treated districts and 50 kilometers of border with untreated districts would have a value of 0.2 ( 200+50 ). * p<0.1 ** p<0.05 *** p<0.01

Post×NREGS

EP

Table 6: Quantifying Bias in District-Level Wage Estimates (1) All 0.095** (0.045)

(2) Radius Area = 0 0.188** (0.082)

Post×NREGS×Border with other

AC C

Post×Border with other Observations

9649 Bias: Difference: 95-percent Bootstrap CI:

4605 Col(2) - Col(1) 0.093 (0.018, 0.161)

(3) Shared Border 0.185** (0.085) −0.052 (0.033) 0.006 (0.019) 9649 Col(3) - Col(1) 0.090 (0.001, 0.187)

Standard errors are in parentheses and are clustered at the district level in the top panel. In the bottom panel (Bias), confidence intervals are calculated by bootstrapping the estimator, with the bootstrap set up to draw households from within districts, consistent with household fixed effects and district-clustered standard errors. Column one is the simple difference-in-differences estimate (column two of Table 2), column two is the difference-in-differences estimate when restricting the sample to only households more than 20 kilometers from any border with a district of the opposite treatment status, and column three is column four of Table 5. * p<0.1 ** p<0.05 *** p<0.01

ACCEPTED MANUSCRIPT 40

Table 7: Effect of NREGS on Employment

Post×NREGS Age and Gender State/Wave FE Observations

(3) < 45 km −0.124 (0.092) yes yes 29279

(4) < 35 km −0.155 (0.095) yes yes 26828

(5) < 25 km −0.098 (0.115) yes yes 22010

(6) < 15 km 0.027 (0.122) yes yes 14941

RI PT

(1) (2) All < 55 km −0.066 −0.116 (0.096) (0.092) yes yes yes yes 38160 31026

M AN U

SC

Standard errors are in parentheses and are clustered at the district level in all columns. The dependent variable is total days in private-sector employment. The specification is otherwise identical to column two of Table 2. * p<0.1 ** p<0.05 *** p<0.01

AC C

EP

TE D

Figure 6: Effect of NREGS on Wages and Employment

The figure presents the wage results from the middle figure in Figure 4 (dotted line) as well as results from identical specifications using (log of) days in private-sector employment as the dependent variable. The relevant sample includes all adults over 18 years of age. The restrictions in each column are based on the distance to border variable used in previous specifications.

ACCEPTED MANUSCRIPT 41

Table 8: Days in Public Works in 2008 and Distance to Border and HQ

Distance to other district (hundreds of km) Distance to district HQ (hundreds of km) Observations

Non-NREGS Districts (4) (5) Model 4 Model 5

-0.539** (0.249)

-0.021 (0.013)

17100

-0.513** (0.229) 0.137* (0.074) 17100

18212

-0.019 (0.012) -0.007* (0.004) 18212

SC

38160

NREGS Districts (2) (3) Model 2 Model 3

RI PT

NREGS district

All (1) Model 1 0.071*** (0.015)

M AN U

Standard errors are in parentheses and are clustered at the district level. All regressions include district fixed effects and only the 2008 wave of the ARIS/REDS. The dependent variable is (log of) days in public works and includes the sample of adults over 18 years of age. * p<0.1 ** p<0.05 *** p<0.01

AC C

EP

TE D

Figure 7: Distance to Border and Individual Characteristics

Distance to border is defined as the distance from the enumerated village to the closest border between a treated and untreated district. Negative values indicate untreated (phase three) districts and positive values indicate treated districts. In other words, more negative values indicate the interior of untreated districts, values close to zero indicate areas located near the border between a treated and untreated district, and more positive values indicate the interior of treated districts.

ACCEPTED MANUSCRIPT

RI PT

42

AC C

EP

TE D

M AN U

SC

Figure 8: Distance to Border and Changes in Individual Characteristics

Each figure presents difference-in-differences estimates using the same methods as Figure 4 and Figure 6. The dependent variable (DV) in the top-left graph is whether the wage earner is male; the DV in the top-right graph is the age of the wage earner; and the DV in the bottom graph is whether the wage earner has a primary education or above.

ACCEPTED MANUSCRIPT

A

Appendix

State/Wave FE Extended Controls Observations

Days (log) (3) (4) Model 3 Model 4 0.011 -0.006 (0.016) (0.019) yes yes no yes 396604 336495

SC

Post×NREGS

Wage (log) (1) (2) Model 1 Model 2 0.032* -0.010 (0.019) (0.019) yes yes no yes 62269 57133

RI PT

Table A1: Simple Pre-Program Wage and Private Employment Trends (NSS)

M AN U

Standard errors are in parentheses and are clustered at the district level in all columns. All regressions include district fixed effects and use the National Sample Survey (NSS). The NSS waves used in this analysis come from 1999/2000 and 2004/05, prior to the implementation of the program. Post times NREGS is thus the placebo test for pre-program trends in treated districts relative to untreated districts. The dependent variable (DV) in columns one and two is (log of) wage and the DV in columns three and four is (log of) days in private-sector employment in the previous seven days. NSS sample weights are used in all regressions. * p<0.1 ** p<0.05 *** p<0.01

Table A2: Border Distance and Pre-Program Trends

Post×NREGS Post×Border with other

EP

Observations

TE D

Post×NREGS×Border with other

(1) All Districts

(2) ARIS Districts

0.011 (0.009) 0.007 (0.029) −0.009 (0.007) 62269

0.026 (0.018) −0.028 (0.053) −0.015 (0.014) 16155

(3) ARIS Border Districts 0.020 (0.021) −0.008 (0.067) −0.008 (0.018) 13682

AC C

Standard errors are in parentheses and are clustered at the district level in all columns. All regressions include district fixed effects and use the National Sample Survey (NSS). The NSS waves used in this analysis come from 1999/2000 and 2004/05, prior to the implementation of the program. Border with Other is defined as the percentage of the district border shared with a district of the opposite treatment status, similar to the specification in column four of Table 5. Column one includes all NSS districts while column two restricts the sample to only districts also available in the ARIS/REDS and column three further restricts the sample to border districts. NSS sample weights are used in all regressions. * p<0.1 ** p<0.05 *** p<0.01

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

Figure A1: Effect of NREGS on Wages by Distance to Border - All Districts

The figure is identical to the middle figure in Figure 4 but does not restrict the sample to only border districts.

AC C

EP

TE D

M AN U

SC

Figure A2: Heteogeneity by Phase

RI PT

ACCEPTED MANUSCRIPT

The specifications are identical to the middle figure in Figure 4. However, the left figure restricts the sample to phase one (treated) and phase three (untreated) districts, while the right figure restricts the sample to phase two (treated) and phase three (untreated) districts.

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

Figure A3: Effects inside 15 Kilometers

The figure is identical to the middle figure in Figure 4 presents results within 15 kilometers.

ACCEPTED MANUSCRIPT

Highlights

EP

TE D

M AN U

SC

RI PT

Wage effects of Indian public works program are spatially heterogeneous Treated areas near untreated areas see smaller wage increases Treated areas far from untreated areas see larger wage increases Suggests previous estimates are underestimates of true effect Results indicate true effect may be twice as large as previously estimated

AC C

• • • • •