Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP
Journal Pre-proofs Research papers Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP Adnan Rajib, ...
Journal Pre-proofs Research papers Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP Adnan Rajib, Zhu Liu, Venkatesh Merwade, Ahmad A. Tavakoly, Michael L. Follum PII: DOI: Reference:
Please cite this article as: Rajib, A., Liu, Z., Merwade, V., Tavakoly, A.A., Follum, M.L., Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP, Journal of Hydrology (2019), doi: https://doi.org/10.1016/j.jhydrol.2019.124406
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Towards a large-scale locally relevant flood inundation modeling framework using SWAT and LISFLOOD-FP Adnan Rajib1*#, Zhu Liu2*, Venkatesh Merwade3, Ahmad A. Tavakoly4,5, and Michael L. Follum5 1Department 2Department 3Lyles 4Earth 5Coastal
of Environmental Engineering, Texas A&M University, Kingsville, Texas, USA
of Land, Air and Water Resource, University of California, Davis, California, USA
School of Civil Engineering, Purdue University, West Lafayette, Indiana, USA
System Science Interdisciplinary Center, University of Maryland, College Park, Maryland, USA
and Hydraulics Laboratory, US Army Engineer Research and Development Center, Vicksburg, Mississippi, USA
*Shared lead authorship #Corresponding
author: Frank H. Dotterweich College of Engineering, 700 University Blvd, MSC 213, Kingsville, TX 78363, USA; email: [email protected]
Highlights
Degree of local relevance is defined by the spatial resolution of stream network.
Simulation of real-life flood events across 26,000 streams in the Ohio River Basin.
Satellite images and another similar framework verify flood mapping accuracy.
Incorporating streamflow uncertainty in flood maps minimizes prediction bias.
Streamflow input in lower order streams is essential for accurate flood mapping.
ABSTRACT
1
Lack of geospecificity or local relevance is a major limitation in contemporary large-scale flood modeling frameworks. There is a little practical value for configuring a large-scale model if the model produces streamflow and/or inundation maps only along the large rivers while numerous lower order streams remain overlooked. This study fills the gap through a new flood prediction framework based on the loose coupling of a hydrologic model Soil and Water Assessment Tool (SWAT) and a 1D/2D hydrodynamic model LISFLOOD-FP (hence, SWAT-LISFP). The prototype SWAT-LISFP framework was configured with ~26,000 stream reaches across the ~500,000 km2 Ohio River Basin, United States. After being calibrated against 50 gauge stations across the basin, SWAT simulated streamflow outputs were fed as upstream boundary conditions in LISFLOOD-FP. The resultant flood inundation extents consistently captured 70-80% of the remotely sensed inundation, irrespective of the flood events or locations within the basin. This was also confirmed via cross-validation with an existing flood modeling framework AutoRAPID (Follum et al., 2017). Additional modeling experiments were conducted to facilitate two critical discussions – how simulated inundation extent is affected by the uncertainty in streamflow prediction and the density of streamflow boundary conditions. Taking into account the uncertainties in SWAT streamflow, LISFLOOD-FP showed a remarkable improvement with more than 95% of remotely sensed inundation captured within the simulated extent. While this approach produces a variable-area flood map (i.e., a range of areas likely to be inundated at a particular point of time), inundation in the lower order streams can still remain undetected. A solution to this problem was demonstrated by setting up streamflow boundary conditions across further lower order streams, which subsequently justified the need for high-resolution stream network, and hence, the essence of locally relevant flood inundation modeling. The new contributions of his study, particularly through introducing SWAT as a functional hydrologic alternative to supplement a
2
hydrodynamic model such as LISFLOOD-FP and the series of experiments to draw insights on addressing lack of accuracy and local relevance, will enhance the global flood modeling initiatives.
Keywords: Flood mapping; floodplain; global flood hazards, hydrologic modeling; hydrodynamic modeling; Ohio River Basin
1. INTRODUCTION With frequently occurring high magnitude floods and associated socio-economic damages in many regions across the world, researchers and decision makers are actively seeking suitable approaches for flood inundation mapping. Traditionally, flood inundation maps are created by driving a hydrodynamic model with available stage or streamflow data (e.g., Cook and Merwade, 2009; Liu et al., 2019a; Liu and Merwade, 2019; Musser and Dyar, 2007). However, this approach cannot represent the propagation, retention, and attenuation of flooding across the entire stream network. This is because gauges that record flow and stage data are mostly installed along higher order streams, thereby leaving many lower order streams unmonitored. Even the data available at higher order streams are of too short length or too poor quality to accurately capture floodplain hydrodynamics (Di Baldassarre and Montanari, 2009). An effective way to address this information gap, at both gauged and ungauged streams, is through an integrated hydrologichydrodynamic modeling framework. In such a framework, one or more hydrologic models can simulate streamflow, which can provide upstream boundary conditions (or forcing data) to one or more hydrodynamic models for producing flood inundation maps. The coupling of hydrologic and hydrodynamic models is a well-recognized approach, but common only to small watersheds (e.g., Huang and Hattermann, 2018; Kim et al., 2012; Komi et
3
al., 2017; Nguyen et al., 2016). Development of “large-scale” hydrologic-hydrodynamic modeling frameworks covering the world’s major river basin(s) (e.g., Biancamaria et al., 2009; Hoch et al., 2017; Paiva et al., 2013; Paiva et al., 2011; Schuman et al., 2013) are not yet full-fledged. We identified three persistent limitations in the current state-of-the-art large-scale flood modeling initiatives: (i) lack of geospecificity or “local relevance” in predicted flood information, (ii) applications restricted only to identify a zone of potential food hazards (so-called floodplain maps), not the actual flooding, and (iii) acute inaccuracy problem. To justify the rationale of our study, these limitations are elaborately discussed below. Large-scale predictions of streamflow and inundation maps are immensely helpful for operational disaster information systems. What seriously impedes such potential is the lack of “local relevance”. Spatial resolution of topography data and computational mesh configuration in a hydrologic and/or hydrodynamic model are frequently used to define the local relevance of its outputs (e.g., Chen et al., 2017; Gallegos et al., 2009; Hartnett and Nash, 2017; Wood et al., 2011). We suggest that this definition is a misnomer for (fluvial) flood prediction and mitigation planning. For instance, there is little practical value for configuring a model with meter-scale computational mesh if the so-called high- or hyper-resolution model produces streamflow and/or inundation maps only along the main river channels (see, e.g., Biancamaria et al., 2009; Chen et al., 2017; Komi et al., 2017; Logah et al., 2017; Schuman et al., 2013; Wilson et al., 2007). On the contrary, it is physically more meaningful and thus practically informative to construct flood models so that the degree of local relevance is dictated by the resolution (spatial density) of stream network. Considering as many lower order streams as possible via a high-resolution stream network would ensure the maximum local relevance of streamflow and inundation maps across large spatial scales
4
– a fundamental aspect of flood prediction previously ignored in research and operational management. In line with efforts to build “large-scale locally relevant” flood prediction frameworks, global flood hazard models have received considerable attention in recent times (e.g., Alfieri et al., 2014; Dottori et al., 2016; Pappenberger et al., 2012; Sampson et al., 2015; Wing et al., 2017). However, these initiatives have only produced static maps with a specific probability of flood occurrence (e.g., 100-year return period). While these maps stimulate new-generation of spatially explicit floodplain ecohydrologic and socio-economic management (Kiedrzyńska et al., 2015; Smith et al., 2019), they cannot be considered as actual flood maps. Near real-time mapping of flood inundation extents (with corresponding streamflow data) is the main requirement for a decision-support system that can inform real-life flood management. To facilitate near-real time predictions, National Oceanic and Atmospheric Administration in the United States developed the National Water Model (NWM) framework (Maidment, 2017). NWM predicts streamflow and potential flood inundation extents across ~2.7 million streams in the continental US – an unprecedented combination considering both spatial scale and resolution of stream network (Maidment et al., 2016). Follum et al. (2017) introduced the AutoRAPID framework, demonstrating its capability for fast, parsimonious flood map simulation across millions of streams in the Midwest US and Mississippi Delta. There is a concomitant trend of similar initiatives for other parts of the world as well (e.g., GLOFRIM by Hoch et al., 2017). While these emergent frameworks offer increased flood information capacity in public domain, their accuracy during real-life flood events and means to efficiently address their prediction uncertainties have not been thoroughly investigated (see, e.g., Afshari et al., 2018; Salas et al., 2018; Zheng et al., 2018a). What is also being overlooked in the previous studies is the possibility 5
that two such frameworks, independently developed for a particular basin, may produce largely inconsistent flood information. Accordingly, there remains the urgency to continue developing robust hydrologic-hydrodynamic model combinations and ensure reliable flood information via real-life accuracy assessments and inter-framework comparisons. This study aims to fill these gaps with a newly developed large-scale locally relevant flood prediction framework. At its core, the framework loosely couples hydrologic model SWAT (Soil and Water Assessment Tool; Arnold et al., 2012; Neitsch et al., 2011) with 1D/2D hydrodynamic model LISFLOOD-FP (Bates and De Roo, 2000; Horritt and Bates, 2001; Bates et al., 2013; Neal et al., 2012). With a prototype daily-simulation setup involving ~26,000 stream reaches across the Ohio River Basin, a ~500,000 km2 region within the Mississippi River system (Figure 1), we addressed the following questions to infer a comprehensive understanding on the framework’s predictability and representativeness: 1. How accurate is the new framework during real-life flood events? Does the existing
framework(s) corroborate the new framework’s predicted flood inundation extents? 2. Does knowing the level of hydrologic uncertainty help increase the predictability of inundation
extents? 3. How does local relevance (i.e., spatial resolution of stream network) influence models’ ability to capture inundation extents over large spatial scales?
2. MODEL SELECTION Many 1D/2D hydrodynamic models have been developed as stand-alone software packages to support reach- or watershed-scale simulations (Nelson et al., 2003; Musser and Dyar,
6
2007; Vanderkimpen et al., 2009; Ballesteros et al., 2011; Nguyen et al. 2016; USACE, 2016). For a large-scale flood modeling framework involving numerous streams, choice of a hydrodynamic model depends primarily on its easy interoperability with a driver hydrologic model, opportunities for cyber-enabled high performance computation, and flexibility for data assimilation (Kauffeldt et al., 2016; Merwade et al., 2018). This is why “low-complexity” parsimonious inundation mapping tools relying on topography, roughness parameters, and simple hydraulic equations, such as those used in the NWM or AutoRAPID frameworks (Follum et al., 2017; Zheng et al., 2016, 2018a,b), have gained great community attention. However, because these tools lack physical process descriptions on floodplain mechanisms, their application involves a trade-off between fast computation power and accuracy. As such, they may underperform compared to 1D/2D hydrodynamic models (Afshari et al., 2018). To this end, LISFLOOD-FP has emerged as a suitable hydrodynamic model. Although nearly all of the large-scale frameworks using LISFLOOD-FP are designed for mapping a static areal extent with a specific probability of inundation (e.g., the “floodplain maps” by Alfieri et al., 2014; Dottori et al., 2016; Sampson et al., 2015; Wing et al., 2017), it is possible to configure LISFLOOD-FP for near real-time prediction of potential flood events (Wing et al., 2019). Hoch et al. (2017) and Schuman et al. (2013) demonstrated this by coupling LISFLOOD-FP respectively with Variable Infiltration Capacity (VIC; Liang et al., 2003) and PCRaster GLOBal Water Balance (PCR-GLOBWB; van Beek and Bierkens, 2009) models, however, using coarse-resolution stream networks. Therefore, how LISFLOOD-FP would perform in a much complex setup (e.g., 26,000 stream reaches across the 500,000 km2 Ohio River Basin; Figure 1) is still an outstanding question. For large-scale simulation of streamflow across high-resolution stream networks, coupled atmospheric-land surface-river routing models are being increasingly used in recent times (e.g.,
7
Lin et al., 2019, 2018a,b; Tavakoly et al., 2017). For instance, the NOAH-MP model, under the auspices of WRF-Hydro modeling system, is adopted in the NWM flood prediction framework (Salas et al., 2018). There have been several large-scale applications of VIC and PCR-GLOBWB as well, but these applications often involve coarse-resolution stream network (e.g., Hoch et al., 2017; Schuman et al., 2013; as noted above) or monthly time-scale simulation (e.g., Oubeidillah et al., 2014; Safeeq et al., 2014; van Beek et al., 2011). SWAT is another process-based model frequently applied on the world’s major river basins (e.g., Abbaspour et al., 2015; Daggupati et al., 2015; Du et al., 2018; Pervez and Henebry, 2015; Rajib and Merwade, 2017; Schuol et al., 2008). In addition to the well-documented evaluations that approve SWAT’s effectiveness (Arnold et al., 2012), there are web-based platforms such as SWATShare that can be used to publish and share SWAT models with a broader community, and perform extensive calibration using high performance computational resource (Rajib et al., 2016a). The companion tools of SWATShare such as HydroGlobe can assimilate multi-source earth observations (e.g., Rajib et al., 2018a,b) and SWATFlow can perform dynamic visualization of hydrographs and flood inundation extents. These are the emergent opportunities which led to SWAT’s selection for our flood prediction framework. To the best of our knowledge, this is the first study to show the application of SWAT model as the hydrologic component of a large-scale flood prediction framework. Our framework also involves the most complex assembly of LISFLOOD-FP till date, specifically considering the combined overburden of large basin area, high-resolution stream network and topography, and continuous daily simulation. While this study demonstrates a prototype framework using recent flood events in the Ohio River Basin as test cases, efforts are underway to extend the models over a much larger domain and perform near real-time predictions.
8
3. METHODOLOGY 3.1 Ohio River Basin as the flood modeling testbed The ~500,000 km2 Ohio River Basin (ORB) (Figure 1) is the largest sub-basin of the Mississippi River system in terms of water volume, thus carrying a great socio-economic and ecological significance for the North American continent. An intriguing factor to consider ORB as a flood modeling testbed is the increasing frequency of flood events over the past decades (Holmes et al., 2010; Holmes and Dinicola, 2012; Mallakpour and Villarini, 2015). A historical record by US National Weather Service (NWS, 2018) suggests that the Ohio River at Cincinnati (located in the most downstream portion of the basin) came near or above the designated flood stage 108 times within a span of 160 years (1858-2018, i.e., nearly 1 potential extreme event in every 1.5-year period). Previous studies analyzing land management and/or climate change effects also confirmed the increasing flood frequency in ORB (e.g., Du et al., 2018; Kunkel et al., 2013; Tayyebi et al., 2015). Despite this rising threats of flood hazard, flood modeling studies specifically focusing on ORB do not exist in literature. 3.2 Hydrologic modeling 3.2.1 SWAT model setup Setting up a SWAT model in the GIS interface (ArcSWAT) requires three geospatial datasets: digital elevation model (DEM), land use, and soil texture. These datasets were obtained from the following sources: 30-m DEM from the US Geological Survey (USGS) National Elevation Dataset (NED) (USGS, 2013a), 30-m land use data from the 2011-National Land Cover Database (NLCD) (USGS, 2013b), and 1:250,000 scale State Soil Geographic Data (STATSGO) built-in with SWAT geodatabase. To reduce the computational burden, all topographic analyses
9
(e.g., sub-basin delineation) were performed by resampling the DEM to a 90-m spatial resolution. Following several other large-scale SWAT applications (e.g., Schuol et al., 2008; Faramarzi et al., 2017; Du et al., 2018), a sub-basin was treated as the smallest spatial unit of hydrologic simulations. Elevation difference across the basin was represented via three slope classes (0-4%, 4-10%, and >10%). To enable comparison with similar flood modeling initiatives in the US (prevously noted, e.g., Follum et al., 2017; Maidment, 2017; Tavakoly et al., 2017), the NHDPlus stream network (National Hydrography Dataset Plus; McKay et al., 2012) was superimposed (burned-in) on the 90-m DEM, along with a 100 km2 threshold area for headwater stream identification. This 100 km2 threshold was selected via multiple iterations to spatially match the DEM-generated stream network with NHDPlus dataset. The model could also use NHDPlus stream network directly; however, such an option would require a significant amount of GIS pre-processing and quality assessment tasks to ensure continuous connectivity of streams (see, e.g., David et al., 2013). Nonetheless, our DEM-based network delineation resulted into ~26,000 individual stream reaches (and sub-basins) (Figure 1). For every stream reach, hydraulic geometry, i.e., bank-full width (wc) and depth (dc) were defined by relating upstream drainage area (DA) with semi-empirical power law relationships (wc = 1.29DA0.6 and dc = 0.13DA0.4 where DA is in km2 and w is in m; Her et al., 2017). This approach, similar to other large-scale hydrologic/river routing models (Hoch et al., 2017; Li et al., 2013; Paiva et al., 2013), allowed w values in lower order streams to be smaller than the DEM’s spatial resolution. For instance, the minimum wc in the ORB model was 2-m, whereas the underlying DEM cell-size was 90-m. The above-bank water was simply routed through a hypothetical floodplain with a constant 5wc width (Neitsch et al., 2011). 10
Weather forcing data were obtained from the ~12 km North American Data Assimilation System (NLDAS-phase II; NASA, 2019) and National Centers for Environmental Prediction's (NCEP) ~38 km Climate Forecast System Reanalysis (CFSR; Saha et al., 2010). Specifically, total daily precipitation were obtained from NLDAS, whereas other energy-related inputs including minimum/maximum daily temperature, solar radiation, relative humidity and wind speed were obtained from CFSR. Potential evapotranspiration (PET) was estimated using SWAT’s built-in Penman-Monteith approach. Nearly 30% of ORB has artificial subsurface drainage (tile-drainage) (Du et al., 2018). Therefore, tile-drainage routine was activated for all the sub-basins having poorly-drained soils (e.g., Du et al., 2018; Green et al., 2006). Reservoir storage-discharge could not be explicitly considered in the current ORB setup because of limited access to dam operation data, although this limitation was implicitly addressed via rigorous streamflow calibration across a large number of gauge stations. Processes related to infiltration/ surface runoff generation and channel routing were simulated using the Curve Number and Variable Storage methods, respectively (Neitsch et al., 2011). 3.2.2 Calibration and validation of SWAT The ORB SWAT model was first initialized for 1-year (2009) and then calibrated for a 2year period (2010-2011) using daily streamflow observations at 50 USGS gauge stations (Figure 2a and supplementary information S1). The simultaneous multi-site calibration involved 18 parameters related to surface, subsurface, snow accumulation, and in-stream hydrologic responses (supplementary information S2). Du et al. (2018) previously calibrated a coarse-resolution ORB SWAT model using 80 years (1935-2014) of streamflow data at 9 gauge stations. For calibrating the current ORB SWAT model, initial parameter ranges were defined based on the optimization 11
results of Du et al. (2018). The parameter optimization was conducted with Sequential Uncertainty Fitting algorithm-version 2 (SUFI-2; Abbaspour, 2015) using a weighted Kling-Gupta Efficiency (KGE) (Gupta et al., 2009; Rajib et al., 2016b, 2018b) as the objective function. KGE was maximized to obtain the most optimal parameter set which produces the best possible agreement between simulated and observed streamflow hydrographs at target locations (KGE =1 means 100% agreement). Relying on a single streamflow output time-series (the so-called best hydrograph) may not be reasonable because of the uncertainties in extreme flow predictions – a problem common to any large-scale hydrologic model (Abbaspour et al., 2015; Faramarzi et al., 2017). Therefore, a “band” hydrograph representing the 95% prediction uncertainty (Abbaspour, 2015) was produced for every reach in addition to the “best” hydrograph representing the most optimal streamflow (corresponding the most optimal parameter set). To allow an independent posterior evaluation, the model was validated over a separate 2-year period (2012-2013), and at 5 separate gauge stations across the basin targeting different upstream drainage areas, topography, land use, and climatic conditions (Figure 2a). < Figure 2> 3.3 Hydrodynamic modeling 3.3.1 Coupling SWAT and LISFLOOD-FP Simulated streamflow outputs from SWAT were fed as the upstream boundary conditions in LISFLOOD-FP. This was a “loose coupling” prototype architecture where both SWAT and LISFLOO-FP were executed in stand-alone mode, and an intermediate relational data model (e.g., Hoch et al., 2017; Peckham et al., 2013) allowed the one-way transfer of information. We 12
performed three separate sets of simulations (M1–M3) by varying the boundary conditions of LISFLOOD-FP (Table 1). Irrespective of their differences, the basic setup and parameterization procedure in LISFLOOD-FP were kept identical across all the configurations. Throughout the following discussion, we name the framework SWAT-LISFP.
3.3.2 LISFLOOD-FP setup and calibration The geospatial datasets (topography, land use, and stream network) required to set up LISFLOOD-FP were the same as in the ORB SWAT model. In all the configurations (M1–M3), a sub-grid scale hydrodynamic scheme (Neal et al., 2012) was applied to solve the momentum and continuity equations for channel flow and floodplain flow, respectively (Eq.1-3).
𝑄𝑡𝑐 + ∆𝑡 =
𝑄𝑡𝑐 ― 𝑔𝐴𝑡𝑐,𝑓𝑙𝑜𝑤∆𝑡𝑆𝑡𝑐
{
| | (𝑅𝑡 )4/3𝐴𝑡 [ 𝑐,𝑓𝑙𝑜𝑤 𝑐,𝑓𝑙𝑜𝑤]
𝑡 1 + 𝑔∆𝑡𝑛2 𝑐 𝑄𝑐
𝑞𝑡
𝑄𝑡𝑓,𝑖++∆𝑡1/2 =
𝑓,𝑖 +
1 2
[
― 𝑔ℎ𝑡𝑓,𝑓𝑙𝑜𝑤∆𝑡𝑆𝑡
𝑓,𝑖 +
1+
| |
𝑡 𝑔∆𝑡𝑛2 𝑓 𝑞𝑓
]
}
(1)
1 2
(2)
(∆𝑥 ― 𝑤𝑐,𝑓𝑙𝑜𝑤)
7
(ℎ𝑡𝑓,𝑓𝑙𝑜𝑤)3
∆𝑡 = 𝑄𝑡𝑐,𝑖++∆𝑡1/2 + 𝑄𝑡𝑓,𝑖++∆𝑡1/2 𝑄𝑡𝑖 ++ 1/2
(3)
Here, indices c, f , and t indicate channel, floodplain, and instance of time respectively; Q = volumetric flow rate; Ac = cross-sectional area of the channel; S = slope; n = Manning’s roughness coefficient; i and j are grid-cell spatial indices in two spatial dimensions, ∆𝑥 = grid-cell width (defined by the spatial resolution of the input DEM); qf = unit width flow rate in floodplain; wc = channel bank-full width; ℎ𝑓 = depth of floodplain flow; g = gravitational acceleration. 13
Simulation of flood inundation in LISFLOOD-FP requires each stream reach to be described using four parameters: bank-full width (wc), bank elevation (Z), bed elevation (Zb) and Manning’s n (Bates et al., 2013). Given the loose nature of integration between SWAT and LISFLOOD-FP, it is challenging to minimize the propagation of uncertainty resulting from the obvious differences in models’ respective floodplain representation and corresponding numerical schemes. One way to address this issue is to maintain the “best possible” consistency in channel/floodplain representation by parameterizing both models with identical values of wc, Z, Zb and/or n. In the current setup of the framework, reach-average values for each of these input parameters were adopted as described below. (i)
Bank-full width (wc): Reach-average wc values were directly exported from the ORB SWAT model. These can also be derived from existing global databases (Andreadis et al., 2013) or using at-station hydraulic geometry (Leopold and Maddock, 1953). Most of the previous applications of LISFLOOD-FP adopted either of these two approaches (e.g., Neal et al., 2012; Schumann et al., 2013). However, global databases are mostly derived from available coarse-resolution DEMs so that they provide wc values only for higher order streams. Calculation of at-station hydraulic geometry does not produce reasonable results in relatively ungauged areas, not to mention the heavy computational overhead required to perform at-a-station analysis across thousands of stream reaches. As such, exporting SWAT wc, a drainage area-based estimate similar to other large-scale models (Hoch et al., 2017; Li et al., 2013; Paiva et al., 2013; noted in section 3.2.1), deemed as an efficient way for parameterizing LISFLOOD-FP.
14
(ii)
Bank elevation (Z): For any grid-cell that has w associated with it, LISFLOOD-FP picked the elevation data from the input DEM and assigned that as the corresponding Z value (Bates et al., 2013).
(iii)
Bed elevation (Zb): This was defined as (Z - dc), where dc is the bank-full depth. Similar to wc, dc was exported from SWAT as well. Alternatively, LISFLOOD-FP can approximate dc as a(wc)b following Leopold and Maddock (1953); here, a and b are user-defined coefficients which can be optimized through calibration. Using SWAT-estimated dc values was more practical than seeking so-called “calibrated” values of a and b for 26,000 reaches. While maintaining consistency between the two model components and reducing computational overhead, this approach eliminated potential uncertainties induced from inappropriate values of a and b.
(iv)
Roughness coefficient for floodplain (Manning’s coefficient nf ): A set of nf values were adopted from the supporting literature (Kalyanapu et al., 2010) and assigned to each of the land use classes in LISFLOOD-FP. In this manner, nf was varied across the basin allowing the model to effectively represent floodplain surface heterogeneity.
(v)
Roughness coefficient for channel (Manning’s coefficient nc): A unique nc value (=0.03) was assigned throughout the stream network. This unique value was derived from a calibration experiment based on a coarse-resolution LISFLOOD-FP setup (Merwade et al., 2018; Rajib et al., 2016c). In this preliminary experiment, nc values were varied between 0.01 and 0.05 (in 0.005 increments, one-at-a-time, every iteration assigning a unique nc throughout the stream network). After each iteration, the relative change in spatial agreement of simulated inundation extents was measured against the US Federal Emergency
Management
Agency
100-year
flood
hazard
maps
(FEMA;
15
https://msc.fema.gov/portal). Accordingly, 0.03 was found to be a representative estimate of channel roughness for ORB considering the maximum agreement between FEMA maps and LISFLOOD-FP simulations. Although FEMA maps are mostly model simulated outcomes (Xian et al., 2015), and not the “observed” extents of actual flood events, using these maps to compare the results of other hydrodynamic models is an approach frequently followed in flood modeling studies (e.g., Afshari et al., 2018). Although floodplain roughness (nf) in the ORB model was varied spatially, representing channel roughness with a unique nc value for the entire stream network might have posed some inaccuracies in simulated flood maps. However, it is not uncommon to use a unique nc value in flood modeling especially over large spatial scales (see, e.g., Afshari et al., 2018; Hoch et al., 2017; Li et al., 2013; Rudorff et al., 2014). In fact, the nc value (=0.03) used for ORB is the same as the one used for the Mississippi and Amazon River Basins by Follum et al. (2017; AutoRAPID framework) and Hoch et al. (2017; GLOFRIM framework), respectively. 3.4 Validation of simulated inundation extents 3.4.1 Comparison with remotely sensed observations Real-life “observed” flood inundation maps derived from open-access remotely sensed imagery were used as the primary reference data for model validation. Specifically, Landsat satellite images for May 4, 2011 and April 22, 2013 were obtained from USGS earth explorer (http://earthexplorer.usgs.gov; Figure 3) so that they represent different flood magnitudes across different areas within the basin with different topography and land use conditions. Yet, model validation based on satellite imagery is not always ideal due to cloud cover contamination (Wang et al., 2019) as well as the potential temporal mismatch between satellite (e.g., repeat cycle, exact
16
time of image acquisition) and model outputs (e.g., daily average data). We strategically designed our approach so that these issues impart minimal influence on validation results. Specifically, despite the 16-day repeat cycle of Landsat satellite, availability of relatively cloud-free image was one of our main criteria of selecting flood events for model validation. Further, each of the days selected for model validation was within a multi-day span of flood inundation, unlike flash floods that dissipate within a few hours (stakeholder inputs; not discussed here). In such cases, a Landsat image could represent the average flood condition throughout the day irrespective of the satellite’s exact image acquisition time. This assumption, despite imposing some limitations on validation results, allows consistent comparison of satellite data with daily-average outputs derived from daily time-step of model simulations. “Raw” Landsat images were processed through a supervised image classification technique in ArcGIS to distinguish flood inundation extents. This classification was conducted by (i) creating training polygons for known land use classes such as waterbodies, forest, agricultural and urban areas, and then (ii) applying a maximum likelihood method to separate flooded areas. All the flooded areas in an image were then converted (and aggregared) into GIS polygons and thus a reference map for evaluating simulated inundation extents. This so-called remotely sensed reference inundation map is subject to uncertainty due to satelittte’s image acquisition, processing, and the subsequent image classification techniques. Moreover, the Landsat images available through USGS platform are in 30-m spatial resolution which may be too coarse to capture flooding along the lower order streams. Effects of these uncertainties on model evaluation results were not considered in this study. < Figure 3> 3.4.2 Comparison with an existing flood modeling framework 17
It is important to acknowledge how a newly developed flood prediction framework might compare with the frameworks that are already available for a particular region of interest. While this cross-validation step has been commonly ignored in the previous studies, we sought evidence on SWAT-LISFP’s predictability and reliability using a secondary set of reference flood maps generated from AutoRAPID. In the basic setup of AutoRAPID (Follum et al., 2017), first the Routing Application for Parallel computatIon of Discharge (RAPID; David et al., 2011) model computes streamflow using gridded runoff data from land surface models (LSMs) (e.g., Tavakoly et al., 2017). A parsimonious hydraulic model AutoRoute (Follum et al., 2013) uses this streamflow and Manning’s equation to calculate the normal flow-depth across a large number of channel cross-sections, where the cross-section geometry is derived directly from topography data. Finally, AutoRoute simulates inundation extents using a volume-filling numerical method at each cross-section. To facilitate a comparison with the current SWAT-LISFP setup, we executed AutoRAPID across ORB only for the two specific flood events mentioned above (Figure 3) using NHDPlus stream network and 10-m NED DEM. Here, we limit this comparison only to flood inundation extents. Potential inconsistencies between these two frameworks, related to different model inputs, process-conceptualizations and parameter calibrations, are not addressed in this study. 3.4.3 Model performance metrics F (fit) and C (correctness) indices (Alfieri et al., 2014; Liu and Merwade, 2018) were used to measure prediction accuracy of simulated flood inundation extents (Equations 4-5). 𝐹=𝐴 𝐶=
𝐴𝑟𝑚 𝑟 + 𝐴𝑚 - 𝐴𝑟𝑚
𝐴𝑟𝑚 𝐴𝑟
(4)
(5) 18
Here, Ar and Am refer to the reference (remote sensing/AutoRAPID) and simulated inundation extents, respectively, and Arm refers to the common inundated area (intersection) between the two flood maps. By definition, both indices range from 0 to 1. F index indicates the degree to which reference and simulated extents match (F=1 means 100% overlap). As F decreases, the simulated inundation extent starts to deviate from the reference (either over-estimation or under-estimation) until no overlap can be found (F=0). On the contrary, C index is a relatively less conservative metric of model accuracy signifying the percentage of the reference extent that is correctly predicted by simulation. When C equals 1, reference flood extent falls entirely within the simulated extent even though the model might actually be over-estimating the inundated area. Therefore, C is always greater or equal to F for a given set of simulated and reference inundation maps. 4. RESULTS AND DISCUSSION 4.1 Overall predictability and representativeness of the framework 4.1.1 Streamflow Streamflow simulation performance, in terms of KGE across 55 gauge stations, indicated a robust model (Figure 4a). KGE between USGS data and the most optimal SWAT simulations ranged between 0.20 and 0.90, with a value 0.4-0.5 or higher in 64% of the target locations (35 out of 55 gauge stations; Figure 4b). The equally reasonable performance regardless of calibration or validation run, especially considering non-overlapping time-periods and a separate set of gauge stations, approved consistent predictability of the model. While streamflow output based on the most optimal parameter set may underperform at some locations (hence, relatively low KGE), a well-calibrated model should have a narrow uncertainty band (r-factor<1) with most of the reference data points captured therein (p-factor~1) (Abbaspour, 2015). We exemplify this in Figure 4c for one of the validation locations, where the model produced relatively low KGE but acceptable 19
p-factor and r-factor estimates. The robustness of streamflow simulation as noted above might have been due to our methodological design that involved substantially rigorous calibration compared to many other large-scale flood modeling frameworks. For instance, our 50-site SWAT calibration largely contrasts 3-site PCR-GLOBWB calibration in a similar-sized Amazonian sub-basin (GLOFRIM framework by Hoch et al., 2017; also compare with 8-site MGB-IPH calibration across the entire Amazon River Basin by Paiva et al., 2013). Although the short run-time (5 years) of our prototype framework is similar to that used in GLOFRIM (6 years; Hoch et al., 2017), we selected parameter initial ranges based on a prior 80-year streamflow calibration (Du et al., 2018; noted in section 3.2.2). These comparisons, considered alongside the overall streamflow simulation performance (Figure 4), suggest that our current ORB SWAT model is capable of reproducing (and forecasting) hydrologic variabilities with reasonable accuracy.