Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP

Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP

Journal Pre-proofs Research papers Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP Adnan Rajib, ...

3MB Sizes 0 Downloads 17 Views

Journal Pre-proofs Research papers Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP Adnan Rajib, Zhu Liu, Venkatesh Merwade, Ahmad A. Tavakoly, Michael L. Follum PII: DOI: Reference:

S0022-1694(19)31141-2 https://doi.org/10.1016/j.jhydrol.2019.124406 HYDROL 124406

To appear in:

Journal of Hydrology

Received Date: Revised Date: Accepted Date:

16 October 2019 23 November 2019 25 November 2019

Please cite this article as: Rajib, A., Liu, Z., Merwade, V., Tavakoly, A.A., Follum, M.L., Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP, Journal of Hydrology (2019), doi: https://doi.org/10.1016/j.jhydrol.2019.124406

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

© 2019 Published by Elsevier B.V.

Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP Adnan Rajib1*#, Zhu Liu2*, Venkatesh Merwade3, Ahmad A. Tavakoly4,5, and Michael L. Follum5 1Department 2Department 3Lyles 4Earth 5Coastal

of Environmental Engineering, Texas A&M University, Kingsville, Texas, USA

of Land, Air and Water Resource, University of California, Davis, California, USA

School of Civil Engineering, Purdue University, West Lafayette, Indiana, USA

System Science Interdisciplinary Center, University of Maryland, College Park, Maryland, USA

and Hydraulics Laboratory, US Army Engineer Research and Development Center, Vicksburg, Mississippi, USA

*Shared lead authorship #Corresponding

author: Frank H. Dotterweich College of Engineering, 700 University Blvd, MSC 213, Kingsville, TX 78363, USA; email: [email protected]

Highlights 

Degree of local relevance is defined by the spatial resolution of stream network.



Simulation of real-life flood events across 26,000 streams in the Ohio River Basin.



Satellite images and another similar framework verify flood mapping accuracy.



Incorporating streamflow uncertainty in flood maps minimizes prediction bias.



Streamflow input in lower order streams is essential for accurate flood mapping.

ABSTRACT

1

Lack of geospecificity or local relevance is a major limitation in contemporary large-scale flood modeling frameworks. There is a little practical value for configuring a large-scale model if the model produces streamflow and/or inundation maps only along the large rivers while numerous lower order streams remain overlooked. This study fills the gap through a new flood prediction framework based on the loose coupling of a hydrologic model Soil and Water Assessment Tool (SWAT) and a 1D/2D hydrodynamic model LISFLOOD-FP (hence, SWAT-LISFP). The prototype SWAT-LISFP framework was configured with ~26,000 stream reaches across the ~500,000 km2 Ohio River Basin, United States. After being calibrated against 50 gauge stations across the basin, SWAT simulated streamflow outputs were fed as upstream boundary conditions in LISFLOOD-FP. The resultant flood inundation extents consistently captured 70-80% of the remotely sensed inundation, irrespective of the flood events or locations within the basin. This was also confirmed via cross-validation with an existing flood modeling framework AutoRAPID (Follum et al., 2017). Additional modeling experiments were conducted to facilitate two critical discussions – how simulated inundation extent is affected by the uncertainty in streamflow prediction and the density of streamflow boundary conditions. Taking into account the uncertainties in SWAT streamflow, LISFLOOD-FP showed a remarkable improvement with more than 95% of remotely sensed inundation captured within the simulated extent. While this approach produces a variable-area flood map (i.e., a range of areas likely to be inundated at a particular point of time), inundation in the lower order streams can still remain undetected. A solution to this problem was demonstrated by setting up streamflow boundary conditions across further lower order streams, which subsequently justified the need for high-resolution stream network, and hence, the essence of locally relevant flood inundation modeling. The new contributions of his study, particularly through introducing SWAT as a functional hydrologic alternative to supplement a

2

hydrodynamic model such as LISFLOOD-FP and the series of experiments to draw insights on addressing lack of accuracy and local relevance, will enhance the global flood modeling initiatives.

Keywords: Flood mapping; floodplain; global flood hazards, hydrologic modeling; hydrodynamic modeling; Ohio River Basin

1. INTRODUCTION With frequently occurring high magnitude floods and associated socio-economic damages in many regions across the world, researchers and decision makers are actively seeking suitable approaches for flood inundation mapping. Traditionally, flood inundation maps are created by driving a hydrodynamic model with available stage or streamflow data (e.g., Cook and Merwade, 2009; Liu et al., 2019a; Liu and Merwade, 2019; Musser and Dyar, 2007). However, this approach cannot represent the propagation, retention, and attenuation of flooding across the entire stream network. This is because gauges that record flow and stage data are mostly installed along higher order streams, thereby leaving many lower order streams unmonitored. Even the data available at higher order streams are of too short length or too poor quality to accurately capture floodplain hydrodynamics (Di Baldassarre and Montanari, 2009). An effective way to address this information gap, at both gauged and ungauged streams, is through an integrated hydrologichydrodynamic modeling framework. In such a framework, one or more hydrologic models can simulate streamflow, which can provide upstream boundary conditions (or forcing data) to one or more hydrodynamic models for producing flood inundation maps. The coupling of hydrologic and hydrodynamic models is a well-recognized approach, but common only to small watersheds (e.g., Huang and Hattermann, 2018; Kim et al., 2012; Komi et

3

al., 2017; Nguyen et al., 2016). Development of “large-scale” hydrologic-hydrodynamic modeling frameworks covering the world’s major river basin(s) (e.g., Biancamaria et al., 2009; Hoch et al., 2017; Paiva et al., 2013; Paiva et al., 2011; Schuman et al., 2013) are not yet full-fledged. We identified three persistent limitations in the current state-of-the-art large-scale flood modeling initiatives: (i) lack of geospecificity or “local relevance” in predicted flood information, (ii) applications restricted only to identify a zone of potential food hazards (so-called floodplain maps), not the actual flooding, and (iii) acute inaccuracy problem. To justify the rationale of our study, these limitations are elaborately discussed below. Large-scale predictions of streamflow and inundation maps are immensely helpful for operational disaster information systems. What seriously impedes such potential is the lack of “local relevance”. Spatial resolution of topography data and computational mesh configuration in a hydrologic and/or hydrodynamic model are frequently used to define the local relevance of its outputs (e.g., Chen et al., 2017; Gallegos et al., 2009; Hartnett and Nash, 2017; Wood et al., 2011). We suggest that this definition is a misnomer for (fluvial) flood prediction and mitigation planning. For instance, there is little practical value for configuring a model with meter-scale computational mesh if the so-called high- or hyper-resolution model produces streamflow and/or inundation maps only along the main river channels (see, e.g., Biancamaria et al., 2009; Chen et al., 2017; Komi et al., 2017; Logah et al., 2017; Schuman et al., 2013; Wilson et al., 2007). On the contrary, it is physically more meaningful and thus practically informative to construct flood models so that the degree of local relevance is dictated by the resolution (spatial density) of stream network. Considering as many lower order streams as possible via a high-resolution stream network would ensure the maximum local relevance of streamflow and inundation maps across large spatial scales

4

– a fundamental aspect of flood prediction previously ignored in research and operational management. In line with efforts to build “large-scale locally relevant” flood prediction frameworks, global flood hazard models have received considerable attention in recent times (e.g., Alfieri et al., 2014; Dottori et al., 2016; Pappenberger et al., 2012; Sampson et al., 2015; Wing et al., 2017). However, these initiatives have only produced static maps with a specific probability of flood occurrence (e.g., 100-year return period). While these maps stimulate new-generation of spatially explicit floodplain ecohydrologic and socio-economic management (Kiedrzyńska et al., 2015; Smith et al., 2019), they cannot be considered as actual flood maps. Near real-time mapping of flood inundation extents (with corresponding streamflow data) is the main requirement for a decision-support system that can inform real-life flood management. To facilitate near-real time predictions, National Oceanic and Atmospheric Administration in the United States developed the National Water Model (NWM) framework (Maidment, 2017). NWM predicts streamflow and potential flood inundation extents across ~2.7 million streams in the continental US – an unprecedented combination considering both spatial scale and resolution of stream network (Maidment et al., 2016). Follum et al. (2017) introduced the AutoRAPID framework, demonstrating its capability for fast, parsimonious flood map simulation across millions of streams in the Midwest US and Mississippi Delta. There is a concomitant trend of similar initiatives for other parts of the world as well (e.g., GLOFRIM by Hoch et al., 2017). While these emergent frameworks offer increased flood information capacity in public domain, their accuracy during real-life flood events and means to efficiently address their prediction uncertainties have not been thoroughly investigated (see, e.g., Afshari et al., 2018; Salas et al., 2018; Zheng et al., 2018a). What is also being overlooked in the previous studies is the possibility 5

that two such frameworks, independently developed for a particular basin, may produce largely inconsistent flood information. Accordingly, there remains the urgency to continue developing robust hydrologic-hydrodynamic model combinations and ensure reliable flood information via real-life accuracy assessments and inter-framework comparisons. This study aims to fill these gaps with a newly developed large-scale locally relevant flood prediction framework. At its core, the framework loosely couples hydrologic model SWAT (Soil and Water Assessment Tool; Arnold et al., 2012; Neitsch et al., 2011) with 1D/2D hydrodynamic model LISFLOOD-FP (Bates and De Roo, 2000; Horritt and Bates, 2001; Bates et al., 2013; Neal et al., 2012). With a prototype daily-simulation setup involving ~26,000 stream reaches across the Ohio River Basin, a ~500,000 km2 region within the Mississippi River system (Figure 1), we addressed the following questions to infer a comprehensive understanding on the framework’s predictability and representativeness: 1. How accurate is the new framework during real-life flood events? Does the existing

framework(s) corroborate the new framework’s predicted flood inundation extents? 2. Does knowing the level of hydrologic uncertainty help increase the predictability of inundation

extents? 3. How does local relevance (i.e., spatial resolution of stream network) influence models’ ability to capture inundation extents over large spatial scales?
2. MODEL SELECTION Many 1D/2D hydrodynamic models have been developed as stand-alone software packages to support reach- or watershed-scale simulations (Nelson et al., 2003; Musser and Dyar,

6

2007; Vanderkimpen et al., 2009; Ballesteros et al., 2011; Nguyen et al. 2016; USACE, 2016). For a large-scale flood modeling framework involving numerous streams, choice of a hydrodynamic model depends primarily on its easy interoperability with a driver hydrologic model, opportunities for cyber-enabled high performance computation, and flexibility for data assimilation (Kauffeldt et al., 2016; Merwade et al., 2018). This is why “low-complexity” parsimonious inundation mapping tools relying on topography, roughness parameters, and simple hydraulic equations, such as those used in the NWM or AutoRAPID frameworks (Follum et al., 2017; Zheng et al., 2016, 2018a,b), have gained great community attention. However, because these tools lack physical process descriptions on floodplain mechanisms, their application involves a trade-off between fast computation power and accuracy. As such, they may underperform compared to 1D/2D hydrodynamic models (Afshari et al., 2018). To this end, LISFLOOD-FP has emerged as a suitable hydrodynamic model. Although nearly all of the large-scale frameworks using LISFLOOD-FP are designed for mapping a static areal extent with a specific probability of inundation (e.g., the “floodplain maps” by Alfieri et al., 2014; Dottori et al., 2016; Sampson et al., 2015; Wing et al., 2017), it is possible to configure LISFLOOD-FP for near real-time prediction of potential flood events (Wing et al., 2019). Hoch et al. (2017) and Schuman et al. (2013) demonstrated this by coupling LISFLOOD-FP respectively with Variable Infiltration Capacity (VIC; Liang et al., 2003) and PCRaster GLOBal Water Balance (PCR-GLOBWB; van Beek and Bierkens, 2009) models, however, using coarse-resolution stream networks. Therefore, how LISFLOOD-FP would perform in a much complex setup (e.g., 26,000 stream reaches across the 500,000 km2 Ohio River Basin; Figure 1) is still an outstanding question. For large-scale simulation of streamflow across high-resolution stream networks, coupled atmospheric-land surface-river routing models are being increasingly used in recent times (e.g.,

7

Lin et al., 2019, 2018a,b; Tavakoly et al., 2017). For instance, the NOAH-MP model, under the auspices of WRF-Hydro modeling system, is adopted in the NWM flood prediction framework (Salas et al., 2018). There have been several large-scale applications of VIC and PCR-GLOBWB as well, but these applications often involve coarse-resolution stream network (e.g., Hoch et al., 2017; Schuman et al., 2013; as noted above) or monthly time-scale simulation (e.g., Oubeidillah et al., 2014; Safeeq et al., 2014; van Beek et al., 2011). SWAT is another process-based model frequently applied on the world’s major river basins (e.g., Abbaspour et al., 2015; Daggupati et al., 2015; Du et al., 2018; Pervez and Henebry, 2015; Rajib and Merwade, 2017; Schuol et al., 2008). In addition to the well-documented evaluations that approve SWAT’s effectiveness (Arnold et al., 2012), there are web-based platforms such as SWATShare that can be used to publish and share SWAT models with a broader community, and perform extensive calibration using high performance computational resource (Rajib et al., 2016a). The companion tools of SWATShare such as HydroGlobe can assimilate multi-source earth observations (e.g., Rajib et al., 2018a,b) and SWATFlow can perform dynamic visualization of hydrographs and flood inundation extents. These are the emergent opportunities which led to SWAT’s selection for our flood prediction framework. To the best of our knowledge, this is the first study to show the application of SWAT model as the hydrologic component of a large-scale flood prediction framework. Our framework also involves the most complex assembly of LISFLOOD-FP till date, specifically considering the combined overburden of large basin area, high-resolution stream network and topography, and continuous daily simulation. While this study demonstrates a prototype framework using recent flood events in the Ohio River Basin as test cases, efforts are underway to extend the models over a much larger domain and perform near real-time predictions.

8

3. METHODOLOGY 3.1 Ohio River Basin as the flood modeling testbed The ~500,000 km2 Ohio River Basin (ORB) (Figure 1) is the largest sub-basin of the Mississippi River system in terms of water volume, thus carrying a great socio-economic and ecological significance for the North American continent. An intriguing factor to consider ORB as a flood modeling testbed is the increasing frequency of flood events over the past decades (Holmes et al., 2010; Holmes and Dinicola, 2012; Mallakpour and Villarini, 2015). A historical record by US National Weather Service (NWS, 2018) suggests that the Ohio River at Cincinnati (located in the most downstream portion of the basin) came near or above the designated flood stage 108 times within a span of 160 years (1858-2018, i.e., nearly 1 potential extreme event in every 1.5-year period). Previous studies analyzing land management and/or climate change effects also confirmed the increasing flood frequency in ORB (e.g., Du et al., 2018; Kunkel et al., 2013; Tayyebi et al., 2015). Despite this rising threats of flood hazard, flood modeling studies specifically focusing on ORB do not exist in literature. 3.2 Hydrologic modeling 3.2.1 SWAT model setup Setting up a SWAT model in the GIS interface (ArcSWAT) requires three geospatial datasets: digital elevation model (DEM), land use, and soil texture. These datasets were obtained from the following sources: 30-m DEM from the US Geological Survey (USGS) National Elevation Dataset (NED) (USGS, 2013a), 30-m land use data from the 2011-National Land Cover Database (NLCD) (USGS, 2013b), and 1:250,000 scale State Soil Geographic Data (STATSGO) built-in with SWAT geodatabase. To reduce the computational burden, all topographic analyses

9

(e.g., sub-basin delineation) were performed by resampling the DEM to a 90-m spatial resolution. Following several other large-scale SWAT applications (e.g., Schuol et al., 2008; Faramarzi et al., 2017; Du et al., 2018), a sub-basin was treated as the smallest spatial unit of hydrologic simulations. Elevation difference across the basin was represented via three slope classes (0-4%, 4-10%, and >10%). To enable comparison with similar flood modeling initiatives in the US (prevously noted, e.g., Follum et al., 2017; Maidment, 2017; Tavakoly et al., 2017), the NHDPlus stream network (National Hydrography Dataset Plus; McKay et al., 2012) was superimposed (burned-in) on the 90-m DEM, along with a 100 km2 threshold area for headwater stream identification. This 100 km2 threshold was selected via multiple iterations to spatially match the DEM-generated stream network with NHDPlus dataset. The model could also use NHDPlus stream network directly; however, such an option would require a significant amount of GIS pre-processing and quality assessment tasks to ensure continuous connectivity of streams (see, e.g., David et al., 2013). Nonetheless, our DEM-based network delineation resulted into ~26,000 individual stream reaches (and sub-basins) (Figure 1). For every stream reach, hydraulic geometry, i.e., bank-full width (wc) and depth (dc) were defined by relating upstream drainage area (DA) with semi-empirical power law relationships (wc = 1.29DA0.6 and dc = 0.13DA0.4 where DA is in km2 and w is in m; Her et al., 2017). This approach, similar to other large-scale hydrologic/river routing models (Hoch et al., 2017; Li et al., 2013; Paiva et al., 2013), allowed w values in lower order streams to be smaller than the DEM’s spatial resolution. For instance, the minimum wc in the ORB model was 2-m, whereas the underlying DEM cell-size was 90-m. The above-bank water was simply routed through a hypothetical floodplain with a constant 5wc width (Neitsch et al., 2011). 10

Weather forcing data were obtained from the ~12 km North American Data Assimilation System (NLDAS-phase II; NASA, 2019) and National Centers for Environmental Prediction's (NCEP) ~38 km Climate Forecast System Reanalysis (CFSR; Saha et al., 2010). Specifically, total daily precipitation were obtained from NLDAS, whereas other energy-related inputs including minimum/maximum daily temperature, solar radiation, relative humidity and wind speed were obtained from CFSR. Potential evapotranspiration (PET) was estimated using SWAT’s built-in Penman-Monteith approach. Nearly 30% of ORB has artificial subsurface drainage (tile-drainage) (Du et al., 2018). Therefore, tile-drainage routine was activated for all the sub-basins having poorly-drained soils (e.g., Du et al., 2018; Green et al., 2006). Reservoir storage-discharge could not be explicitly considered in the current ORB setup because of limited access to dam operation data, although this limitation was implicitly addressed via rigorous streamflow calibration across a large number of gauge stations. Processes related to infiltration/ surface runoff generation and channel routing were simulated using the Curve Number and Variable Storage methods, respectively (Neitsch et al., 2011). 3.2.2 Calibration and validation of SWAT The ORB SWAT model was first initialized for 1-year (2009) and then calibrated for a 2year period (2010-2011) using daily streamflow observations at 50 USGS gauge stations (Figure 2a and supplementary information S1). The simultaneous multi-site calibration involved 18 parameters related to surface, subsurface, snow accumulation, and in-stream hydrologic responses (supplementary information S2). Du et al. (2018) previously calibrated a coarse-resolution ORB SWAT model using 80 years (1935-2014) of streamflow data at 9 gauge stations. For calibrating the current ORB SWAT model, initial parameter ranges were defined based on the optimization 11

results of Du et al. (2018). The parameter optimization was conducted with Sequential Uncertainty Fitting algorithm-version 2 (SUFI-2; Abbaspour, 2015) using a weighted Kling-Gupta Efficiency (KGE) (Gupta et al., 2009; Rajib et al., 2016b, 2018b) as the objective function. KGE was maximized to obtain the most optimal parameter set which produces the best possible agreement between simulated and observed streamflow hydrographs at target locations (KGE =1 means 100% agreement). Relying on a single streamflow output time-series (the so-called best hydrograph) may not be reasonable because of the uncertainties in extreme flow predictions – a problem common to any large-scale hydrologic model (Abbaspour et al., 2015; Faramarzi et al., 2017). Therefore, a “band” hydrograph representing the 95% prediction uncertainty (Abbaspour, 2015) was produced for every reach in addition to the “best” hydrograph representing the most optimal streamflow (corresponding the most optimal parameter set). To allow an independent posterior evaluation, the model was validated over a separate 2-year period (2012-2013), and at 5 separate gauge stations across the basin targeting different upstream drainage areas, topography, land use, and climatic conditions (Figure 2a). < Figure 2> 3.3 Hydrodynamic modeling 3.3.1 Coupling SWAT and LISFLOOD-FP Simulated streamflow outputs from SWAT were fed as the upstream boundary conditions in LISFLOOD-FP. This was a “loose coupling” prototype architecture where both SWAT and LISFLOO-FP were executed in stand-alone mode, and an intermediate relational data model (e.g., Hoch et al., 2017; Peckham et al., 2013) allowed the one-way transfer of information. We 12

performed three separate sets of simulations (M1–M3) by varying the boundary conditions of LISFLOOD-FP (Table 1). Irrespective of their differences, the basic setup and parameterization procedure in LISFLOOD-FP were kept identical across all the configurations. Throughout the following discussion, we name the framework SWAT-LISFP. 3.3.2 LISFLOOD-FP setup and calibration The geospatial datasets (topography, land use, and stream network) required to set up LISFLOOD-FP were the same as in the ORB SWAT model. In all the configurations (M1–M3), a sub-grid scale hydrodynamic scheme (Neal et al., 2012) was applied to solve the momentum and continuity equations for channel flow and floodplain flow, respectively (Eq.1-3).

𝑄𝑡𝑐 + ∆𝑡 =

𝑄𝑡𝑐 ― 𝑔𝐴𝑡𝑐,𝑓𝑙𝑜𝑤∆𝑡𝑆𝑡𝑐

{

| | (𝑅𝑡 )4/3𝐴𝑡 [ 𝑐,𝑓𝑙𝑜𝑤 𝑐,𝑓𝑙𝑜𝑤]

𝑡 1 + 𝑔∆𝑡𝑛2 𝑐 𝑄𝑐

𝑞𝑡

𝑄𝑡𝑓,𝑖++∆𝑡1/2 =

𝑓,𝑖 +

1 2

[

― 𝑔ℎ𝑡𝑓,𝑓𝑙𝑜𝑤∆𝑡𝑆𝑡

𝑓,𝑖 +

1+

| |

𝑡 𝑔∆𝑡𝑛2 𝑓 𝑞𝑓

]

}

(1)

1 2

(2)

(∆𝑥 ― 𝑤𝑐,𝑓𝑙𝑜𝑤)

7

(ℎ𝑡𝑓,𝑓𝑙𝑜𝑤)3

∆𝑡 = 𝑄𝑡𝑐,𝑖++∆𝑡1/2 + 𝑄𝑡𝑓,𝑖++∆𝑡1/2 𝑄𝑡𝑖 ++ 1/2

(3)

Here, indices c, f , and t indicate channel, floodplain, and instance of time respectively; Q = volumetric flow rate; Ac = cross-sectional area of the channel; S = slope; n = Manning’s roughness coefficient; i and j are grid-cell spatial indices in two spatial dimensions, ∆𝑥 = grid-cell width (defined by the spatial resolution of the input DEM); qf = unit width flow rate in floodplain; wc = channel bank-full width; ℎ𝑓 = depth of floodplain flow; g = gravitational acceleration. 13

Simulation of flood inundation in LISFLOOD-FP requires each stream reach to be described using four parameters: bank-full width (wc), bank elevation (Z), bed elevation (Zb) and Manning’s n (Bates et al., 2013). Given the loose nature of integration between SWAT and LISFLOOD-FP, it is challenging to minimize the propagation of uncertainty resulting from the obvious differences in models’ respective floodplain representation and corresponding numerical schemes. One way to address this issue is to maintain the “best possible” consistency in channel/floodplain representation by parameterizing both models with identical values of wc, Z, Zb and/or n. In the current setup of the framework, reach-average values for each of these input parameters were adopted as described below. (i)

Bank-full width (wc): Reach-average wc values were directly exported from the ORB SWAT model. These can also be derived from existing global databases (Andreadis et al., 2013) or using at-station hydraulic geometry (Leopold and Maddock, 1953). Most of the previous applications of LISFLOOD-FP adopted either of these two approaches (e.g., Neal et al., 2012; Schumann et al., 2013). However, global databases are mostly derived from available coarse-resolution DEMs so that they provide wc values only for higher order streams. Calculation of at-station hydraulic geometry does not produce reasonable results in relatively ungauged areas, not to mention the heavy computational overhead required to perform at-a-station analysis across thousands of stream reaches. As such, exporting SWAT wc, a drainage area-based estimate similar to other large-scale models (Hoch et al., 2017; Li et al., 2013; Paiva et al., 2013; noted in section 3.2.1), deemed as an efficient way for parameterizing LISFLOOD-FP.

14

(ii)

Bank elevation (Z): For any grid-cell that has w associated with it, LISFLOOD-FP picked the elevation data from the input DEM and assigned that as the corresponding Z value (Bates et al., 2013).

(iii)

Bed elevation (Zb): This was defined as (Z - dc), where dc is the bank-full depth. Similar to wc, dc was exported from SWAT as well. Alternatively, LISFLOOD-FP can approximate dc as a(wc)b following Leopold and Maddock (1953); here, a and b are user-defined coefficients which can be optimized through calibration. Using SWAT-estimated dc values was more practical than seeking so-called “calibrated” values of a and b for 26,000 reaches. While maintaining consistency between the two model components and reducing computational overhead, this approach eliminated potential uncertainties induced from inappropriate values of a and b.

(iv)

Roughness coefficient for floodplain (Manning’s coefficient nf ): A set of nf values were adopted from the supporting literature (Kalyanapu et al., 2010) and assigned to each of the land use classes in LISFLOOD-FP. In this manner, nf was varied across the basin allowing the model to effectively represent floodplain surface heterogeneity.

(v)

Roughness coefficient for channel (Manning’s coefficient nc): A unique nc value (=0.03) was assigned throughout the stream network. This unique value was derived from a calibration experiment based on a coarse-resolution LISFLOOD-FP setup (Merwade et al., 2018; Rajib et al., 2016c). In this preliminary experiment, nc values were varied between 0.01 and 0.05 (in 0.005 increments, one-at-a-time, every iteration assigning a unique nc throughout the stream network). After each iteration, the relative change in spatial agreement of simulated inundation extents was measured against the US Federal Emergency

Management

Agency

100-year

flood

hazard

maps

(FEMA;

15

https://msc.fema.gov/portal). Accordingly, 0.03 was found to be a representative estimate of channel roughness for ORB considering the maximum agreement between FEMA maps and LISFLOOD-FP simulations. Although FEMA maps are mostly model simulated outcomes (Xian et al., 2015), and not the “observed” extents of actual flood events, using these maps to compare the results of other hydrodynamic models is an approach frequently followed in flood modeling studies (e.g., Afshari et al., 2018). Although floodplain roughness (nf) in the ORB model was varied spatially, representing channel roughness with a unique nc value for the entire stream network might have posed some inaccuracies in simulated flood maps. However, it is not uncommon to use a unique nc value in flood modeling especially over large spatial scales (see, e.g., Afshari et al., 2018; Hoch et al., 2017; Li et al., 2013; Rudorff et al., 2014). In fact, the nc value (=0.03) used for ORB is the same as the one used for the Mississippi and Amazon River Basins by Follum et al. (2017; AutoRAPID framework) and Hoch et al. (2017; GLOFRIM framework), respectively. 3.4 Validation of simulated inundation extents 3.4.1 Comparison with remotely sensed observations Real-life “observed” flood inundation maps derived from open-access remotely sensed imagery were used as the primary reference data for model validation. Specifically, Landsat satellite images for May 4, 2011 and April 22, 2013 were obtained from USGS earth explorer (http://earthexplorer.usgs.gov; Figure 3) so that they represent different flood magnitudes across different areas within the basin with different topography and land use conditions. Yet, model validation based on satellite imagery is not always ideal due to cloud cover contamination (Wang et al., 2019) as well as the potential temporal mismatch between satellite (e.g., repeat cycle, exact

16

time of image acquisition) and model outputs (e.g., daily average data). We strategically designed our approach so that these issues impart minimal influence on validation results. Specifically, despite the 16-day repeat cycle of Landsat satellite, availability of relatively cloud-free image was one of our main criteria of selecting flood events for model validation. Further, each of the days selected for model validation was within a multi-day span of flood inundation, unlike flash floods that dissipate within a few hours (stakeholder inputs; not discussed here). In such cases, a Landsat image could represent the average flood condition throughout the day irrespective of the satellite’s exact image acquisition time. This assumption, despite imposing some limitations on validation results, allows consistent comparison of satellite data with daily-average outputs derived from daily time-step of model simulations. “Raw” Landsat images were processed through a supervised image classification technique in ArcGIS to distinguish flood inundation extents. This classification was conducted by (i) creating training polygons for known land use classes such as waterbodies, forest, agricultural and urban areas, and then (ii) applying a maximum likelihood method to separate flooded areas. All the flooded areas in an image were then converted (and aggregared) into GIS polygons and thus a reference map for evaluating simulated inundation extents. This so-called remotely sensed reference inundation map is subject to uncertainty due to satelittte’s image acquisition, processing, and the subsequent image classification techniques. Moreover, the Landsat images available through USGS platform are in 30-m spatial resolution which may be too coarse to capture flooding along the lower order streams. Effects of these uncertainties on model evaluation results were not considered in this study. < Figure 3> 3.4.2 Comparison with an existing flood modeling framework 17

It is important to acknowledge how a newly developed flood prediction framework might compare with the frameworks that are already available for a particular region of interest. While this cross-validation step has been commonly ignored in the previous studies, we sought evidence on SWAT-LISFP’s predictability and reliability using a secondary set of reference flood maps generated from AutoRAPID. In the basic setup of AutoRAPID (Follum et al., 2017), first the Routing Application for Parallel computatIon of Discharge (RAPID; David et al., 2011) model computes streamflow using gridded runoff data from land surface models (LSMs) (e.g., Tavakoly et al., 2017). A parsimonious hydraulic model AutoRoute (Follum et al., 2013) uses this streamflow and Manning’s equation to calculate the normal flow-depth across a large number of channel cross-sections, where the cross-section geometry is derived directly from topography data. Finally, AutoRoute simulates inundation extents using a volume-filling numerical method at each cross-section. To facilitate a comparison with the current SWAT-LISFP setup, we executed AutoRAPID across ORB only for the two specific flood events mentioned above (Figure 3) using NHDPlus stream network and 10-m NED DEM. Here, we limit this comparison only to flood inundation extents. Potential inconsistencies between these two frameworks, related to different model inputs, process-conceptualizations and parameter calibrations, are not addressed in this study. 3.4.3 Model performance metrics F (fit) and C (correctness) indices (Alfieri et al., 2014; Liu and Merwade, 2018) were used to measure prediction accuracy of simulated flood inundation extents (Equations 4-5). 𝐹=𝐴 𝐶=

𝐴𝑟𝑚 𝑟 + 𝐴𝑚 - 𝐴𝑟𝑚

𝐴𝑟𝑚 𝐴𝑟

(4)

(5) 18

Here, Ar and Am refer to the reference (remote sensing/AutoRAPID) and simulated inundation extents, respectively, and Arm refers to the common inundated area (intersection) between the two flood maps. By definition, both indices range from 0 to 1. F index indicates the degree to which reference and simulated extents match (F=1 means 100% overlap). As F decreases, the simulated inundation extent starts to deviate from the reference (either over-estimation or under-estimation) until no overlap can be found (F=0). On the contrary, C index is a relatively less conservative metric of model accuracy signifying the percentage of the reference extent that is correctly predicted by simulation. When C equals 1, reference flood extent falls entirely within the simulated extent even though the model might actually be over-estimating the inundated area. Therefore, C is always greater or equal to F for a given set of simulated and reference inundation maps. 4. RESULTS AND DISCUSSION 4.1 Overall predictability and representativeness of the framework 4.1.1 Streamflow Streamflow simulation performance, in terms of KGE across 55 gauge stations, indicated a robust model (Figure 4a). KGE between USGS data and the most optimal SWAT simulations ranged between 0.20 and 0.90, with a value 0.4-0.5 or higher in 64% of the target locations (35 out of 55 gauge stations; Figure 4b). The equally reasonable performance regardless of calibration or validation run, especially considering non-overlapping time-periods and a separate set of gauge stations, approved consistent predictability of the model. While streamflow output based on the most optimal parameter set may underperform at some locations (hence, relatively low KGE), a well-calibrated model should have a narrow uncertainty band (r-factor<1) with most of the reference data points captured therein (p-factor~1) (Abbaspour, 2015). We exemplify this in Figure 4c for one of the validation locations, where the model produced relatively low KGE but acceptable 19

p-factor and r-factor estimates. The robustness of streamflow simulation as noted above might have been due to our methodological design that involved substantially rigorous calibration compared to many other large-scale flood modeling frameworks. For instance, our 50-site SWAT calibration largely contrasts 3-site PCR-GLOBWB calibration in a similar-sized Amazonian sub-basin (GLOFRIM framework by Hoch et al., 2017; also compare with 8-site MGB-IPH calibration across the entire Amazon River Basin by Paiva et al., 2013). Although the short run-time (5 years) of our prototype framework is similar to that used in GLOFRIM (6 years; Hoch et al., 2017), we selected parameter initial ranges based on a prior 80-year streamflow calibration (Du et al., 2018; noted in section 3.2.2). These comparisons, considered alongside the overall streamflow simulation performance (Figure 4), suggest that our current ORB SWAT model is capable of reproducing (and forecasting) hydrologic variabilities with reasonable accuracy.
4.1.2 Flood inundation extents The simulated flood inundation maps matched considerably well with remotely sensed observations and AutoRAPID outputs (Figure 5). Specifically, when the upstream boundary conditions in LISFLOOD-FP were set with the most optimal SWAT streamflow (M1 configuration; Table 1), simulated flood maps showed F=0.6-0.65 and C=0.75-0.8 with respect to remotely sensed observations. These performance measures suggest a fairly good accuracy notwithstanding the deterministic streamflow forcing, complete lack of in-situ channel geometry data, and challenges they imposed on models’ representation of 26,000 stream reaches across the 500,000 km2 basin. The predictability and representativeness of our SWAT-LISFP framework was further

20

justified when we found resemblance between SWAT-LISFP and AutoRAPID flood maps (F=0.70.75 and C=0.75-0.85; Figure 6). Clearly, SWAT-LISFP flood maps were consistently accurate across different flood events and regions within the basin, and regardless of the reference dataset used in evaluating model performance, which confirms the reliability of the framework for reallife applications.
4.2 Effect of streamflow prediction uncertainty on simulated flood inundation Inaccuracies of deterministic inundation maps and their limited implications on efficient flood management planning have been long-standing discussions in flood modeling practices (Merwade et al., 2008). Most of the previous studies on probabilistic inundation were conducted at reach or watershed scales (e.g., Di Baldassarre et al., 2010). To comply with the physical meaning of a probabilistic flood map, the most ideal way is to perform delicate numerical analyses and distinguish both epistemic and aleatory uncertainties in model simulations (Alfonso et al., 2016), which may not be feasible for a very complex model setup such as the one presented in this study. Given the above, one simple alternative would be to at least “inform” the hydrodynamic model about the uncertainties in hydrologic (streamflow) simulation. Our M2 configuration ran such a scenario by setting up the boundary conditions in LISFLOOD-FP with a band of streamflow (Table 1; Figure 4c). The outcome was a flood map showing the minimum and maximum extents of flood inundation (Figure 7). Here, the ‘maximum’ and ‘minimum’ inundation extents resulted respectively from the upper and lower bounds of the 95% prediction uncertainty in streamflow simulation.

21

It is evident that simulated flood inundation based on band hydrographs captured nearly 100% of the remotely sensed inundation, whereas the same model setup using the most optimal streamflow data as boundary conditions captured less than 80% (compare C index in Figure 5a and Figure 7). In areas where the maximum simulated extent overestimated the reference map, the minimum extent produced a better match. Therefore, the maximum simulated extent does not mean that it has a greater likelihood of occurrence than the minimum extent; together they delineate an area that is likely to be inundated during a flood event. These findings clearly suggest that, instead of predicting a single deterministic flood inundation extent based on the most optimal streamflow output (M1), predicting a potential areal range of inundation via integrating streamflow uncertainty into the channel-floodplain hydrodynamic simulation (M2) greatly minimizes bias. This, with different representations of channel geometry and roughness in LISFLOOD-FP, would produce an “end-to-end” probabilistic flood map (e.g., Liu and Merwade, 2018). Our study demonstrates that SWAT-LISFP can efficiently account for these uncertainties, implying meaningful practical implications of the framework for near real-time disaster management.
4.3 Effect of densified streamflow boundary conditions on simulated flood inundation In the first two configurations (M1 and M2), LISFLOOD-FP ingested SWAT simulated streamflow at a limited number of locations where the channel becomes wider than 90 m (Table 1; Figure 2b). In order to evaluate the role of upstream boundary conditions on large-scale flood inundation, the M3 configuration ingested streamflow at lower order streams where the channel becomes wider than 30 m (Figure 2c). Results obtained from M3 configuration revealed prominent improvement in simulated inundation (compare Figure 8 (M3): F=0.70 and C=0.91 versus Figure 22

5b (M1): F=0.60 and C=0.80). This improvement in model predictability was more noticeable in lower order streams. Like any other hydrodynamic model, it is theoretically impossible for LISFLOOD-FP to produce fluvial/floodplain inundation if lower order streams (hence, the associated streamflow inputs) do not exist in the model. This was why the ability to set up abundant streamflow boundary conditions via a high-resolution stream network enabled better prediction of flood inundation extents in M3. This, in other words, justified our definition of “locally relevant” flood modeling where spatial resolution is defined in terms of the number of streams being modeled, not the grid size of the DEM or the model simulation.
5. POTENTIAL OPPORTUNITIES FOR IMPROVED PREDICTABILITY Logic suggests that the possible increase in streamflow accuracy should correspondingly lead to more accurate flood inundation maps. But attempting high accuracy in streamflow simulation simply by parameter calibration is a challenging task irrespective of the model used in a flood prediction framework (e.g., an early prototype version of NWM by Lin et al., 2018b). For instance, despite involving 50 gauge stations in the calibration process, the relatively underperforming streamflow simulation at some locations in the current ORB SWAT model may be due to the lack of spatially explicit reservoir storage-discharge functions. This problem could be minimized by incorporating reservoir geometry data (i.e., maximum water surface area and storage volume) in hydrologic models (Neitsch et al., 2011). While these information are accessible from global databases for some of the major reservoirs (e.g., Lehner & Döll, 2004), Salas et al. (2018) and Tavakoly et al. (2017) showed how streamflow simulation accuracy in a large-scale flood prediction framework can be improved by directly assimilating available gauge data at reservoir 23

outlets or reservoir release flow without requiring explicit inputs on reservoir geometry. This also underscores the potential for assimilating remotely sensed streamflow data at reservoir outlet locations (e.g., Surface Water and Ocean Topography (SWOT) satellite mission; Yoon et al., 2016). Yet, assimilating reservoir outflow data and/or considering more gauge stations in parameter calibration may not improve overall fidelity (Kannan et al., 2019) because hydrologic models often give right answers (acceptable streamflow at the watershed outlet) for wrong reasons (biased water balance across the watershed). Therefore, in addition to targeting improved streamflow predictability at discrete locations, the overall physical state of the hydrologic model needs to be improved by assimilating remotely sensed soil moisture and/or evapotranspiration data (e.g., Han et al., 2012; Rajib et al., 2018a,b) so that the model remains well-informed of the antecedent watershed conditions at the onset of extreme events. To enhance hydrodynamic modeling accuracy, improving the channel geometry representation is perhaps the most widely-acknowledged measure (Cook and Merwade, 2009). We suggest that channel geometry is important for both hydrologic and hydrodynamic models. This is because a realistic representation of channel width/depth would enable improved simulation of floodplain storage and hence the timing and magnitude of peak flow in the hydrologic model (Her et al., 2017), which would subsequently allow the hydrodynamic model to precisely capture inundation pattern (Cook and Merwade, 2009). Global databases on channel width/depth exist (e.g., Andreadis et al., 2013); however, these databases do not provide information for lower order streams. Calibrating the power-law equations that conceptualize channel width/depth (e.g., sections 3.2.1 and 3.3.2) may be a pursuable approach only where relevant reference data (stage height) are available. To fill this information gap, reconstructing channel geometry from highresolution topography data (e.g., Dey et al., 2019) appears to be an efficient solution. These 24

algorithms may be tightly coupled with both hydrologic and hydrodynamic model architectures, enabling seamless assimilation of channel geometry data in respective simulations. Besides this, use of a unique channel roughness (nc) throughout the stream network is another persistent limitation in large-scale hydrodynamic model setups (e.g., Follum et al., 2017; Hoch et al., 2017; also noted in section 3.3.2). It would be ideal to select nc values based on a regional calibration of Manning’s equation across clusters of land use and slope. While these are important considerations for improved predictability of large-scale locally relevant flood models, associated data-model interoperability and computational challenges cannot be overstated. 6. SUMMARY AND FUTURE DIRECTION Contrary to the long-standing practice of reach- or watershed-scale flood modeling, there is a need to build large-scale flood prediction capabilities. Additionally, it is necessary that a socalled “large scale” model is “locally relevant”. Evolved from this critical yet often overlooked perspective, our study provides the first assessment of a new large-scale locally relevant flood prediction framework that loosely couples hydrologic model SWAT with hydrodynamic model LISFLOOD-FP (hence SWAT-LISFP). In the prototype framework, SWAT simulates streamflows for nearly 26,000 stream reaches across the ~500,000 km2 Ohio River Basin, which feed LISFLOOD-FP as the upstream boundary conditions to simulate flood inundation extents potentially at near real-time. Based on three contrasting modeling configurations, the following conclusions are drawn. (i) Streamflow outputs are reasonably accurate in terms of KGE performance metric. Based on the calibration-validation results across 55 gauge stations, KGE values range between 0.20 to 0.90. Especially, KGE ≥ 0.5 in more than 60% of the target locations indicates a robust model. Using the so-called “most optimal or best-fit” SWAT streamflow as the upstream boundary 25

conditions (M1 configuration), LISFLOOD-FP produces flood inundation extents that are consistently 70-80% accurate (F and C indices) with respect to the remotely sensed observations of real-life flood events. The simulated flood inundation extents also show strong resemblance with the outputs produced by an existing flood prediction framework AutoRAPID (Follum et al., 2017). This two-step model validation using two independent reference datasets (remote sensing and AutoRAPID) approves the reliability of SWAT-LISFP framework. (ii) While the above findings are based on a “single” deterministic flood map generated by a bestfit streamflow forcing (boundary condition), SWAT-LISFP can also delineate a “range” of inundated areas by considering the uncertainty in streamflow simulation. Specifically, the simulated extents considering the 95% streamflow uncertainty band can potentially capture the entirety of actual (remotely sensed) flooding. This is a key finding indicating the ability of our prototype framework to efficiently minimize prediction bias. Emergency management decisions looking onto this range of inundation, as opposed to a single deterministic map, is more useful for first responders. (iii) Our results clearly show that setting up the streamflow boundary conditions only at the large rivers (e.g., locations beyond which a channel is wider than 90 m) leaves a considerable portion of flood inundation undetected. In comparison, model predictability noticeably improves when the boundary conditions are set at the lower order streams. A high-resolution stream network implies availability of densified streamflow estimates from a hydrologic model. Densified streamflow would allow more rigorous constraining of the hydrodynamic model, boosting its capacity to detect flooded areas across large spatial scales. These findings rationalize the use of a high-resolution stream network and hence the new theme of locally relevant flood modeling – an important aspect disregarded in many of the existing large-scale frameworks.

26

While these results are promising, an added benefit of our framework is the parameter interoperability between SWAT and LISFLOOD-FP. With the current inter-model data transfer mechanism, LISFLOOD-FP can export channel geometry (bank-full width and depth) for each of the stream reaches (N=26,000) directly from SWAT’s algorithm. While greatly minimizing dependency on in-situ data acquisition and/or hydrodynamic model calibrations, this interoperability also eliminates propagation error that may stem from the driver hydrologic model and subsequently affect the simulated inundation extents. Accordingly, it is possible that improvements in SWAT’s floodplain/river corridor representation through process modifications, integrating spatially resolved global datasets and/or remote sensing techniques would automatically increase the overall accuracy of LISFLOOD-FP. Our perspective is not limited to coupling models and evaluating their performances. Currently, efforts are underway to equip SWAT with a reproducible workflow of multi-source multi-variate remotely sensed data assimilation (e.g., Rajib et al., 2018a,b). To enable near realtime predictions, input management especially to handle sub-daily weather forecasts and associated data transfer mechanism from SWAT to LISFLOOD-FP will be restructured as well. Eventually, the framework will be prompted from a cyber-infrastructure so that it can be Findable, Accessible, Interoperable, and Reproducible (FAIR). SWATShare, a recently developed cyberinfrastructure for sharing, data assimilation, high performance simulation, and visualization of environmental models (Rajib et al., 2016a), is an example attesting our motive towards that direction. With these emergent cohesive efforts, it is possible to transform the current prototype framework into a sustainable, globally applicable flood modeling and information system.

27

ACKNOWLEDGEMENTS This work was conducted with partial funding support from the US National Science Foundation (NSF # ACI-1261727) and the US Army Engineer Research and Development Center (ERDC) Military Hydrology Program. All opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF and ERDC. Authors would also like to thank editors and two anonymous reviewers for providing constructive suggestions on the earlier version of the manuscript. Authors also acknowledge Ms. Liuying Du, former graduate research assistant at Purdue University, for her contribution during the initial stage of the project. AUTHOR CONTRIBUTION VM provided scientific guidance and organized necessary resources, while AR assembled the team. AAT and MLF produced the AutoRAPID results. AR and ZL equally contributed to writing the paper.

References Abbaspour, K.C., Rouholahnejad, E., Vaghefi, S., Srinivasan, R., Yang, H. and Kløve, B., 2015. A continental-scale hydrology and water quality model for Europe: Calibration and uncertainty of a high-resolution large-scale SWAT model. Journal of Hydrology, 524, 733752. Abbaspour, K.C., 2015. SWAT-CUP 2012: SWAT calibration and uncertainty programs - a user manual. http://swat.tamu.edu/media/114860/usermanual_swatcup.pdf. Afshari, S., Tavakoly, A.A., Rajib, A., Zheng, X., Follum, M.L., Omranian, E. and Fekete, B.M., 2018. Comparison of new generation low-complexity flood inundation mapping tools with a hydrodynamic model. Journal of Hydrology, 556, 539-556. Alfieri, L., Salamon, P., Bianchi, A., Neal, J., Bates, P. and Feyen, L., 2014. Advances in pan‐European flood hazard mapping. Hydrological Processes, 28(13), 4067-4077. Alfonso, L., Mukolwe, M., Di Baldassarre, G., 2016. Probabilistic Flood Maps to support decision‐making: Mapping the Value of Information. Water Resources Research, 52(2), 1026-1043. 28

Andreadis, K.M., Schumann, G.J.P., Pavelsky, T., 2013. A simple global river bankfull width and depth database. Water Resources Research, 49(10): 7164-7168. Arnold, J.G., Moriasi, D.N., Gassman, P.W., Abbaspour, K.C., White, M.J., Srinivasan, R., Santhi, C., Harmel, R.D., Van Griensven, A., Van Liew, M.W. and Kannan, N., 2012. SWAT: Model use, calibration, and validation. Transactions of the ASABE, 55(4): 1491-1508. Baldassarre, G.D. and Montanari, A., 2009. Uncertainty in river discharge observations: a quantitative analysis. Hydrology and Earth System Sciences, 13(6), pp.913-921. Ballesteros, J., Bodoque, J., Díez-Herrero, A., Sanchez-Silva, M., Stoffel, M., 2011. Calibration of floodplain roughness and estimation of flood discharge based on tree-ring evidence and hydraulic modelling. Journal of Hydrology, 403(1-2), 103-115. Bates, P.D., De Roo, A., 2000. A simple raster-based model for flood inundation simulation. Journal of hydrology, 236(1): 54-77. Bates, P., Neal, J., Dabrowa, A., 2013. LISFLOOD-FP User Manual. University of Bristol, Bristol. Biancamaria, S., Bates, P.D., Boone, A. and Mognard, N.M., 2009. Large-scale coupled hydrologic and hydraulic modelling of the Ob river in Siberia. Journal of Hydrology, 379(1-2): 136-150. Cook, A., Merwade, V., 2009. Effect of topographic data, geometric configuration and modeling approach on flood inundation mapping. Journal of Hydrology, 377(1): 131-142. Chen, Y., Li, J., Wang, H., Qin, J., Dong, L., 2017. Large-watershed flood forecasting with highresolution distributed hydrological model. Hydrology And Earth System Science, 21, 735– 749. DOI:10.5194/hess-21-735-2017 Daggupati, P., Yen, H., White, M.J., Srinivasan, R., Arnold, J.G., Keitzer, C.S. and Sowa, S.P., 2015. Impact of model development, calibration and validation decisions on hydrological simulations in West Lake Erie Basin. Hydrological Processes, 29(26), 5307-5320. David, C.H., Maidment, D.R., Niu, G.Y., Yang, Z.L., Habets, F. and Eijkhout, V., 2011. River network routing on the NHDPlus dataset. Journal of Hydrometeorology, 12(5), 913-934. David, C.H., Yang, Z.-L., Hong, S., 2013. Regional-scale river flow modeling using off-the-shelf runoff products, thousands of mapped rivers and hundreds of stream flow gauges. Environmental modelling & software, 42, 116-132. Dey, S., Saksena, S. and Merwade, V., 2019. Assessing the effect of different bathymetric models on hydraulic simulation of rivers in data sparse regions. Journal of Hydrology, 575, 838851. Di Baldassarre, G., Montanari, A., 2009. Uncertainty in river discharge observations: a quantitative analysis. Hydrology and Earth System Sciences, 13, 913-921. Di Baldassarre, G., Schumann, G., Bates, P.D., Freer, J.E., Beven, K.J., 2010. Flood-plain mapping: a critical discussion of deterministic and probabilistic approaches. Hydrological Sciences Journal, 55(3), 364-376. Dottori, F., Salamon, P., Bianchi, A., Alfieri, L., Hirpa, F.A. and Feyen, L., 2016. Development and evaluation of a framework for global flood hazard mapping. Advances in Water Resources, 94, 87-102. Du, L., Rajib, A., Merwade, V., 2018. Large scale spatially explicit modeling of blue and green water dynamics in a temperate mid-latitude basin. Journal of Hydrology, 562, 84-102. Faramarzi, M., Abbaspour, K.C., Lu, W., Fennell, J., Zehnder, A.J. and Goss, G.G., 2017. Uncertainty based assessment of dynamic freshwater scarcity in semi-arid watersheds of Alberta, Canada. Journal of Hydrology: Regional Studies, 9, 48-68.

29

Follum, M.L., 2013. AutoRoute: Rapid Flood Inundation Model, ERDC/CHL-CHETN-IV-88, Coastal and Hydraulics Laboratory, US Army Engineer Research and Development Center, Vicksburg, Mississippi, USA. Available online at: https://apps.dtic.mil/dtic/tr/fulltext/u2/a583052.pdf (last accessed: Auguest 26, 2019). Follum, M.L., Tavakoly, A.A., Niemann, J.D., Snow, A.D., 2017. AutoRAPID: A model for prompt streamflow estimation and flood inundation mapping over regional to continental extents. Journal of the American Water Resources Association, 53(2), 280-299. Gallegos, H.A., Schubert, J.E., Sanders, B.F., 2009. Two-dimensional, high-resolution modeling of urban dam-break flooding: A case study of Baldwin Hills, California. Advances in Water Resources, 32(8): 1323-1335. Green, C., Tomer, M., Di Luzio, M., Arnold, J., 2006. Hydrologic evaluation of the soil and water assessment tool for a large tile-drained watershed in Iowa. Transactions of the ASAE, 49(2): 413-422. Gupta, H.V., Kling, H., Yilmaz, K.K. and Martinez, G.F., 2009. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. Journal of hydrology, 377(1-2), 80-91. Han, E., Merwade, V., Heathman, G.C., 2012. Implementation of surface soil moisture data assimilation with watershed scale distributed hydrological model. Journal of Hydrology. 416417, 98–117. DOI:10.1016/j.jhydrol.2011.11.039 Hartnett, M., Nash, S., 2017. High-resolution flood modeling of urban areas using MSN_Flood. Water Science and Engineering, 10 (3), 175-183. Her, Y., Jeong, J., Bieger, K., Rathjens, H., Arnold, J. and Srinivasan, R., 2017. Implications of conceptual channel representation on SWAT streamflow and sediment modeling. JAWRA Journal of the American Water Resources Association, 53(4), 725-747. Hoch, J.M., Neal, J., Baart, F., van Beek, R., Winsemius, H., Bates, P., Bierkens, M., 2017. GLOFRIM v1.0 – A globally applicable computational framework for integrated hydrological–hydrodynamic modelling. Geoscientific Model Development, 10, 39133929. Holmes, R.R., Dinicola, K., 2010. 100-year flood—It’s all about chance. US Geological Survey General Information Product 106. Available online at: https://pubs.usgs.gov/gip/106/pdf/100-year-flood-handout-042610.pdf. Last accessed on September 3, 2018. Holmes, R.R., Schwein, N.O., Shadie, C.E., 2012. Flood risk awareness during the 2011 floods in the central United States: Showcasing the importance of hydrologic data and interagency collaboration. Leadership and Management in Engineering, 12(3), 101-110. Horritt, M., Bates, P., 2001. Predicting floodplain inundation: raster‐based modelling versus the finite‐element approach. Hydrological processes, 15(5): 825-842 Huang, S. and Hattermann, F.F., 2017. Coupling a global hydrodynamic algorithm and a regional hydrological model for large-scale flood inundation simulations. Hydrology Research, 49(2), pp.438-449. Kalyanapu, A.J., Burian, S.J., McPherson, T.N., 2010. Effect of land use-based surface roughness on hydrologic model output. Journal of Spatial Hydrology, 9(2), 51-71. Kannan, N., Santhi, C., White, M.J., Mehan, S., Arnold, J.G. and Gassman, P.W., 2019. Some Challenges in Hydrologic Model Calibration for Large-Scale Studies: A Case Study of SWAT Model Application to Mississippi-Atchafalaya River Basin. Hydrology, 6(1).

30

Kauffeldt, A., Wetterhall, F., Pappenberger, F., Salamon, P., Thielen, J., 2016. Technical review of large-scale hydrological models for implementation in operational flood forecasting schemes on continental level. Environmental Modelling & Software, 75, 68-76. Kiedrzyńska, E., Kiedrzyński, M. and Zalewski, M., 2015. Sustainable floodplain management for flood prevention and water quality improvement. Natural Hazards, 76(2), pp.955-977. Kim, J., Warnock, A., Ivanov, V.Y. and Katopodes, N.D., 2012. Coupled modeling of hydrologic and hydrodynamic processes including overland and channel flow. Advances in Water Resources, 37, pp.104-126. Komi, K., Neal, J., Trigg, M.A., Diekkrüger, B., 2017. Modelling of flood hazard extent in data sparse areas: a case study of the Oti River basin, West Africa. Journal of Hydrology: Regional Studies, 10: 122-132. Kunkel, K.E. et al., 2013. Probable maximum precipitation and climate change. Geophysical Research Letters, 40(7), 1402-1408. Lehner, B. and Döll, P., 2004. Development and validation of a global database of lakes, reservoirs and wetlands. Journal of Hydrology, 296(1-4), 1-22. Leopold, L.B., Maddock, T., 1953. The hydraulic geometry of stream channels and some physiographic implications. US Geoogical Survey Professional Paper 252. Available online at https://pubs.usgs.gov/pp/0252/report.pdf. Last cited on September 3, 2018. Li, H., Wigmosta, M.S., Wu, H., Huang, M., Ke, Y., Coleman, A.M., Leung, L.R., 2013. A physically based runoff routing model for land surface and earth system models. Journal of Hydrometeorology, 14(3), 808-828. Liang, X., Xie, Z., Huang, M., 2003. A new parameterization for surface and groundwater interactions and its impact on water budgets with the variable infiltration capacity (VIC) land surface model. Journal of Geophysical Research: Atmospheres, 108(D16). DOI: 10.1029/2002JD003090 Lin, P., Rajib, M.A., Yang, Z.L., Somos‐Valenzuela, M., Merwade, V., Maidment, D. et al., 2018a. Spatiotemporal Evaluation of Simulated Evapotranspiration and Streamflow over Texas Using the WRF‐Hydro‐RAPID Modeling Framework. Journal of the American Water Resources Association, 54(1): 40-54. Lin, P., Yang, Z.-L., Gochis, D., Yu, W., Maidment, D. et al., 2018b. Implementation of a vectorbased river network routing scheme in the community WRF-Hydro modeling framework for flood discharge simulation. Environmental Modelling & Software, 107, 1-11. Lin, P., Pan, M., Beck, H. E., Yang, Y., Yamazaki, D., Frasson, R., et al., 2019. Global reconstruction of naturalized river flows at 2.94 million reaches. Water Resources Research, 55. https://doi.org/10.1029/2019WR025287 Liu, Z., Merwade, V., Jafarzadegan, K., 2019a. Investigating the role of model structure and surface roughness in generating flood inundation extents using 1D and 2D hydraulic models. Journal of Flood Risk Management, 12(1). DOI: 10.1111/jfr3.12347 Liu, Z., Merwade, V., 2019. Separation and prioritization of uncertainty sources in a raster based flood inundation model using hierarchical Bayesian model averaging. Journal of Hydrology, 578, 124100. DOI: 10.1016/j.jhydrol.2019.124100 Liu, Z., Merwade V., 2018. Accounting for model structure, parameter and input forcing uncertainty in flood inundation modeling using Bayesian model averaging. Journal of Hydrology, 565, 138-149.

31

Logah, F., Amisigo, A., Obuobie, E., Kankam-Yeboah, K., 2017. Floodplain hydrodynamic modelling of the Lower Volta River in Ghana. Journal of Hydrology: Regional Studies, 14, 1-9. DOI: 10.1016/j.ejrh.2017.09.002 Maidment, D., 2017. Conceptual framework for the national flood interoperability experiment. Journal of the American Water Resources Association, 53(2): 245-257. Maidment, D., Rajib, A., Lin, P., Clark, E. (Eds.) 2016. National Water Center Innovators Program Summer Institute Report 2016, CUAHSI Technical Report 13, USA, 122p. DOI: 10.4211/technical.20161019. Mallakpour, I., Villarini, G., 2015. The changing nature of flooding across the central United States. Nature Climate Change, 5, 250–254. McKay, L., Bondelid, T., Dewald, T., Johnston, J., Moore, R., Rea, A., 2012. NHDPlus Version 2: user guide. National Operational Hydrologic Remote Sensing Center, Washington, DC. Merwade, V., Olivera, F., Arabi, M., Edleman, S., 2008. Uncertainty in flood inundation mapping: current issues and future directions. Journal of Hydrologic Engineering, 13(7): 608-620. Merwade, V., Rajib, A., Liu, Z. 2018. “An integrated approach for flood inundation modeling on large scales”. In Jung and Wang (Eds.), Bridging Science and Policy Implication for Managing Climate Extremes, pp. 133-155. World Scientific Publication Company, Singapore. DOI: 10.1142/9789813235663_0009. Download link: goo.gl/d2GePV Musser, J.W., Dyar, T.R., 2007, Two-dimensional flood-inundation model of the Flint River at Albany, Georgia: U.S. Geological Survey Scientific Investigations Report 2007–5107, 49 p., https://pubs.usgs.gov/sir/2007/5107. NASA 2019. README Document for North American Land Data Assimilation System Phase 2 (NLDAS-2) Products. National Aeronautics and Space Administration Goddard Earth S cience D ata Information and Services Center. Available online at: https://hydro1.gesdisc.eosdis.nasa.gov/data/NLDAS/README.NLDAS2.pdf. Accessed on October 14, 2019. Neal, J., Schumann, G., Bates, P., 2012. A subgrid channel model for simulating river hydraulics and floodplain inundation over large and data sparse areas. Water Resources Research, 48, W11506. DOI:10.1029/2012WR012514 Neitsch, S.L., Arnold, J.G., Kiniry, J.R., Williams, J.R., 2011. Soil and water assessment tool theoretical documentation version 2009, Texas Water Resources Institute Technical Report 406, Texas A&M University, College Station, Texas, USA. Nelson, J.M., Bennett, J.P., Wiele, S.M., 2003. “Flow and sediment transport modeling”. In Kondolf and Piegay (Eds.), Tools in Fluvial Geomorphology, pp. 539-576. John Wiley & Sons Ltd., Chichester, West Sussex, England. Nguyen, P., Thorstensen, A., Sorooshian, S., Hsu, K., AghaKouchak, A., Sanders, B., Koren, V., Cui, Z. and Smith, M., 2016. A high resolution coupled hydrologic–hydraulic model (HiResFlood-UCI) for flash flood modeling. Journal of Hydrology 541: 401-420. NWS, 2018. Ohio River History - Cincinnati. National Weather Service, Wilmington weather forecast office, Ohio. Available online at https://www.weather.gov/iln/ohioriverhistory. Last cited on August 8, 2018. Oubeidillah, A., Kao, S.-C., Ashfaq, M., Naz, B., Tootle, G., 2014. A large-scale, high-resolution hydrological model parameter data set for climate change impact assessment for the conterminous US. Hydrology and Earth System Science, 18, 67–84.

32

Paiva, R.C., Collischonn, W., Tucci, C.E., 2011. Large scale hydrologic and hydrodynamic modeling using limited data and a GIS based approach. Journal of Hydrology, 406(3-4): 170-181. Paiva, R.C.D., Buarque, D.C., Collischonn, W., Bonnet, M.P., Frappart, F., Calmant, S. and Mendes, C.A.B., 2013. Large‐scale hydrologic and hydrodynamic modeling of the Amazon River basin. Water Resources Research, 49(3), pp.1226-1243. Peckham, S.D., Hutton, E.W., Norris, B., 2013. A component-based approach to integrated modeling in the geosciences: The design of CSDMS. Computers & Geosciences, 53, 3-12. Pappenberger, F., Dutra, E., Wetterhall, F., Cloke, H.L., 2012. Deriving global flood hazard maps of fluvial floods through a physical model cascade. Hydrology and Earth System Sciences, 16(11), 4143-4156. Pervez, M.S., Henebry, G.M., 2015. Assessing the impacts of climate and land use and land cover change on the freshwater availability in the Brahmaputra River basin. Journal of Hydrology: Regional Studies, 3, 285-311. Rajib, A., Evenson, G.R., Golden, H.E. and Lane, C.R., 2018a. Hydrologic model predictability improves with spatially explicit calibration using remotely sensed evapotranspiration and biophysical parameters. Journal of hydrology, 567, pp.668-683. Rajib, M.A., Merwade, V., Yu, Z., 2018b. Rationale and efficacy of assimilating remotely sensed potential evapotranspiration for reduced uncertainty of hydrologic models. Water Resource Resource. 54, 4615-4637. DOI: 10.1029/2017WR021147 Rajib, A., Merwade, V., 2017. Hydrologic response to future land use change in the Upper Mississippi River Basin by the end of 21st century. Hydrological Processes, 31, 3645-3661. Rajib, A., Merwade, V., Kim, L.I, Zhao, L., Song, C., Zhe, S., 2016a. SWATShare – A web platform for collaborative research and education through online sharing, simulation and visualization of SWAT models, Environmental Modelling & Software, 75, 498-512. Rajib, A., Merwade, V., Yu, Z., 2016b. Multi-objective calibration of a hydrologic model using spatially distributed remotely sensed/in-situ soil moisture. Journal of Hydrology, 536: 192207. Rajib, A., Merwade, V., Liu, Z., 2016c. "Large scale high resolution flood inundation mapping in near real-time." 40th Anniversary of the Association of State Flood Plain Managers National Conference, Grand Rapids, Michigan, USA, 19-24 June. Available online at: http://www.asfpmfoundation.org/ace-images/Adnan_Full2016.pdf. Accessed on December 21, 2018. Rudorff, C.M., Melack, J.M., Bates, P.D., 2014. Flooding dynamics on the lower Amazon floodplain: 1. Hydraulic controls on water elevation, inundation extent, and river‐floodplain discharge. Water Resources Research, 50(1), 619-634. Safeeq, M., Mauger, G., Grant, G., Arismendi, I., Hamlet, A., Lee, S.-Y., 2014. Comparing largeScale hydrological model predictions with observed streamflow in the Pacific Northwest: Effects of climate and groundwater. Journal of Hydrometeorology, 15, 2501-2521. Salas, F., Somos-Valenzuela, M., Dugger, A., Maidment, D., Gochis, D. et al., 2018.Towards realtime continental scale streamflow simulation in continuous and discrete space. Journal of the American Water Resources Association, 54(1), 7-27. Sampson, C.C., Smith, A.M., Bates, P.D., Neal, J.C., Alfieri, L. and Freer, J.E., 2015. A high‐resolution global flood hazard model. Water resources research, 51(9): 7358-7381.

ciation7-27 54(1): (JAWRA)

33

Saha, S., Moorthi, S., Pan, H.L., Wu, X., Wang, J., Nadiga, S., Tripp, P., Kistler, R., Woollen, J., Behringer, D. and Liu, H., 2010. The NCEP climate forecast system reanalysis. Bulletin of the American Meteorological Society, 91(8), 1015-1058. Schumann, G.P., Neal, J.C., Voisin, N., Andreadis, K.M., Pappenberger, F., Phanthuwongpakdee, N., Hall, A.C., Bates, P.D., 2013. A first large‐scale flood inundation forecasting model. Water Resources Research, 49(10): 6248-6257. Schuol, J., Abbaspour, K.C., Srinivasan, R., Yang, H., 2008. Estimation of freshwater availability in the West African sub-continent using the SWAT hydrologic model. Journal of Hydrology, 352(1-2), 30-49. Smith, A., Bates, P.D., Wing, O., Sampson, C., Quinn, N. and Neal, J., 2019. New estimates of flood exposure in developing countries using high-resolution population data. Nature communications, 10(1), p.1814. Tavakoly, A.A., Snow, A.D., David, C.H., Follum, M.L., Maidment, D.R. and Yang, Z.L., 2017. Continental-scale river flow modeling of the Mississippi River Basin using high-resolution NHDPlus dataset. JAWRA Journal of the American Water Resources Association, 53(2), 258-279. Tayyebi, A., Pijanowski, B.C., Pekin, B.K., 2015. Land use legacies of the Ohio River Basin: Using a spatially explicit land use change model to assess past and future impacts on aquatic resources. Applied Geography, 57, 100-111 USACE, 2016. HEC-RAS River Analysis System 2D Modeling user’s manual version 5.0. US Army Corps of Engineers Hydrologic Engineering Center, Davis, California. USGS-NED, 2013. National Elevation Dataset: United States Geological Survey National Map Viewer. Available at: http://viewer.nationalmap.gov/viewer/. Accessed 10 March, 2013. USGS-NLCD, 2013. National Elevation Dataset: United States Geological Survey National Map Viewer. Available at: http://viewer.nationalmap.gov/viewer/. Accessed 10 March, 2013. van Beek, L.P.H., Wada, Y. and Bierkens, M.F., 2011. Global monthly water stress: 1. Water balance and water availability. Water Resources Research, 47(7). van Beek, L. P. H. and Bierkens, M. F. P., 2009. The Global Hydrological Model PCR-GLOBWB: Conceptualization, Parameterization and Verification. Available at: http://vanbeek.geo.uu. nl/suppinfo/vanbeekbierkens2009.pdf (last accessed: 20 December 2018). Vanderkimpen, P., Melger, E., Peeters, P., 2009. Flood modeling for risk evaluation – a MIKE FLOOD vs. SOBEK 1D2D benchmark study. In Samuel et al. (eds.), Flood Risk Management: Research and Practice. Taylor & Francis Group, London. Wang, T., Shi, J., Letu, H., Ma, Y., Li, X., Zheng, Y., 2019. Detection and removal of clouds and associated shadows in satellite imagery based on simulated radiance fields. Journal of Geophysical Research: Atmospheres, 124, 7207– 7225. Wilson, M., Bates, P., Alsdorf, D., Forsberg, B., Horritt, M., Melack, J., Frappart, F. and Famiglietti, J., 2007. Modeling large-scale inundation of Amazonian seasonally flooded wetlands. Geophysical Research Letters, 34(15). Wing, O.E., Bates, P.D., Sampson, C.C., Smith, A.M., Johnson, K.A., Erickson, T.A., 2017. Validation of a 30 m resolution flood hazard model of the conterminous United States. Water Resources Research, 53(9): 7968-7986. Wing, O.E., Sampson, C.C., Bates, P.D., Quinn N., Smith, A.M., Neal, J. C., 2019. A flood inundation forecast of Hurricane Harvey using a continental-scale 2D hydrodynamic model, Journal of Hydrology X, 100039, 2589-9155.

34

Wood, E., Roundy, J., Troy, T., van Beek, L., Bierkens, M., Blyth, E. et al., 2011. Hyperresolution global land surface modeling: Meeting a grand challenge for monitoring Earth's terrestrial water. Water Resources Research, 47, W05301, DOI:10.1029/2010WR010090 Xian, S., Lin, N., Hatzikyriakou, A., 2015. Storm surge damage to residential areas: a quantitative analysis for Hurricane Sandy in comparison with FEMA flood map. Natural Hazards, 79(3), 1867-1888. Yoon, Y., Beighley, E., Lee, H., Pavelsky, T. and Allen, G., 2015. Estimating flood discharges in reservoir-regulated river basins by integrating synthetic SWOT satellite observations and hydrologic modeling. Journal of Hydrologic Engineering, 21(4), p.05015030. Zheng, X., Lin, P., Keane, S., Kesler, C., Rajib, M.A. 2016. “NHDPlus-HAND evaluation”. In Maidment et al. (Eds.), National Water Center Innovators Program Summer Institute Report, Technical Report 13, Consortium of Universities for the Advancement of Hydrologic Science Inc. (CUAHSI), USA, pp. 26-36. DOI: 10.4211/technical.20161019 Zheng, X., Tarboton, D., Maidment, D., Liu, Y., Passalacqua, P., 2018a. River channel geometry and rating curve estimation using Height above the Nearest Drainage. Journal of the American Water Resources Association, 54(4), 785-806. Zheng, X., Maidment, D., Tarboton, D., Liu, Y., Passalacqua, P., 2018b. GeoFlood: Large-scale flood inundation mapping based on high-resolution terrain analysis. Water Resources Research, 54(12), 10013-10033.

Table 1. Modeling configurations, their design and intended objectives.

Objective

M1 Overall predictability and representativeness of the framework

Configurations M2 Effect of streamflow prediction uncertainty on simulated inundation extent

M3 Effect of densified streamflow boundary conditions (via highresolution stream network) on simulated flood 35

inundation extent

Streamflow boundary conditions obtained from hydrologic model

Daily average streamflow based on the most optimal parameter set

95% prediction uncertainty band

Same as M1

Locations of streamflow boundary conditions in hydrodynamic model

Locations beyond which a channel becomes wider than 90 m (wc > 90m; Figure 2b)

Same as M1

Densified boundary conditions compared to M1 (wc >30m; Figure 2c)

Flood events for model validation

2011 and 2013 flood events

2011 flood event

2013 flood event

Reference inundation extent for model validation

Satellite imagery and AutoRAPID modeling framework (Follum et al., 2017)

Satellite imagery

Satellite imagery

36

(b) Ohio River Basin

(a) The Mississippi River System in the USA

The Mississippi River system

Ohio River Basin

± (e) NLDAS ~12 km weather grid overlapped on stream network

stream network

(d) Topography

(c) Land use

Elevation (m) High (1745) stream network

Low (30)

Land Use

Water Urban Agriculture Forest

Figure 1. (a-b) The Ohio River Basin (ORB) within the Mississippi River system; (b) high-resolution stream network in the ORB with 26,000 reaches; (c-d) topography and land use data used to set up the models; (e) the ~12 km grid of weather data is superimposed on the stream network to demonstrate the high spatial resolution of stream network. 37

Legend (a) Streamflow gauge stations used in SWAT calibration and validation !

# !.

weatherst

validation locations USGS_val_site calibration locations USGS_cal_site stream Reach network Watershed

(b) Streamflow boundary conditions in LISFLOOD-FP setup

(c) Densified streamflow boundary conditions in LISFLOOD-FP setup

Figure 2. (a) The 55 gauge stations used for SWAT calibration and validation. USGS station ID, location coordinates, and drainage area for each of these locations are provided in supplementary information S1. Figures (b) and (c) indicate the variation of streamflow input boundary conditions for LISFLOOD-FP across different model configurations (see Table 1).

38

Oct 16, 2010

Dry condition

April 25, 2014

Dry condition

May 4, 2011

Flood condition

April 22, 2013

Flood condition

Figure 3. Actual Landsat images of two flood events (May 4, 2011 and April 22, 2013). Images corresponding to dry conditions show the relative change in “wet” areas during flood condition. Reference flood maps were processed from the May 4, 2011 and April 22, 2013 images. These reference maps were used to evaluate the accuracy of LISFLOOD-FP outputs. 39

(a) Goodness of streamflow simulation Kling Gupta Efficiency (KGE)

KGE

Circle size indicates a specific range of values 0.2 – 0.3 0.3 – 0.4 0.4 – 0.5

03057000

0.5 – 0.6 0.6 – 0.7 0.7 – 0.8 0.8 – 0.9

Circle color indicates calibration or validation location

Calibration (50 stations) Validation (5 stations)

(b)

Daily streamflow (m3/s)

03316500

(c)

3000

95ppu

2000

USGS

USGS 03316500

SWAT (most optimal)

KGE = 0.85 p-factor = 0.90 r-factor = 1.15

1000 0 01/2010

06/2010

12/2010

06/2011

12/2011

Daily streamflow (m3/s)

600 95ppu

400

USGS

USGS 03057000

SWAT (most optimal)

KGE = 0.36 p-factor = 0.75 r-factor = 0.82

200 0 01/2012

06/2012

12/2012

06/2013

12/2013

Days

Figure 4. (a) Evaluation of SWAT simulated daily streamflow in terms of KGE at 55 gauge stations. KGE measures the agreement between gauge data and the most optimal simulation; (b) number of stations are categorized with respect to the corresponding range of KGE value; (c) timeseries comparison of gauge data with both the most optimal simulated hydrograph and band hydrograph. The band refers to 95% prediction uncertainty (95ppu). The p-factor and r-factor in (c) indicate percentage of gauge data captured within 95ppu band and the band width, respectively (Abbaspour, 2015). Therefore, p-factor and r-factor values closer to 1 suggest overall good simulation, although a lower KGE may indicate disagreement between simulation and reference (gauge) time-series. 40

(a) May 4, 2011

(b) April 22, 2013

Figure 5. Evaluation of simulated inundation extent for two real-life flood events across two different regions in the Ohio River Basin (see Figure 3). Here, (a) and (b) respectively depict a 5000 km2 and 30,000 km2 region. The “observed” extent was obtained via classification of satellite imagery (section 3.4.1). For each of the events, the observed extent was considered as the reference to measure model accuracy (F and C indices). Simulated flood maps shown here correspond to the model configuration where LISFLOOD-FP was driven by the most optimal SWAT streamflow data (M1 in Table 1).

41

(b) April 22, 2013

(a) May 4, 2011

F = 0.7 C = 0.75

F = 0.75 C = 0.85

Figure 6. Evaluation of simulated inundation extent for the same flood events as shown in Figure 5. Here, the flood inundation extent simulated by AutoRAPID framework (Follum et al., 2017; section 3.4.2) was considered as the reference to measure the accuracy of SWAT-LISFP framework (F and C indices). Similar to Figure 5, the SWAT-LISFP flood maps shown here correspond to the M1 configuration (Table 1).

42

Figure 7. Effect of streamflow prediction uncertainty on simulated flood inundation extent. Model results shown here correspond to the configuration in which LISFLOOD-FP is driven by SWAT’s band streamflow hydrograph (M2 in Table 1). The ‘maximum’ and ‘minimum’ in the simulated flood map refer respectively to the upper and lower bound of the 95% prediction uncertainty band in SWAT streamflow. Forcing LISFLOOD-FP with band streamflow helps capturing almost the entirely of flooding (compared to M1 in Figure 5).

43

Figure 8. Effect of streamflow input boundary conditions on simulated flood inundation. Results from two model configurations (M1 and M3; Table 1) are compared here with remotely sensed inundation extent. LISFLOOD-FP configured with densified streamflow boundary conditions (M3) better captures flooded areas, especially along smaller reaches. These flooded areas could not be detected in the M1 configuration in which LISFLOOD-FP had lesser number of streamflow pour-points/boundary conditions (M1 in Figure 5). AUTHOR CONTRIBUTION 44

VM provided scientific guidance and organized necessary resources, while AR assembled the team. AAT and MLF produced the AutoRAPID results. AR and ZL equally contributed to writing the paper.

Highlights • Degree of local relevance is defined by the spatial resolution of stream network. • Simulation of real-life flood events across 26,000 streams in the Ohio River Basin. • Satellite images and another similar framework verify flood mapping accuracy. • Incorporating streamflow uncertainty in flood maps minimizes prediction bias. • Streamflow input in lower order streams is essential for accurate flood mapping.

45