Full article: Leveraging mesh modularization to lower the computational cost of localized updates to regional 2D hydrodynamic model outputs

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Hydrodynamic model outputs are used in urban flood risk modelling, flood alert systems, and Monte Carlo hazard assessment. This study tackles an under-explored challenge wherein regular updates to the spatial characteristics of the watershed – due to factors such as changing land use – alter the watershed’s response to rainfall forcing, thus rendering existing model outputs obsolete. Because state-of-the-art hydrodynamic models are computationally expensive, frequently re-running simulations can be costly. Modularization addresses this problem by requiring re-computation only for a limited domain affected by the land use changes. This article introduces a novel approach by modularizing the 2D domain into independent sub-domains before (‘discrete’) and after (‘abstract’) the numerical computations. Using the Hydrologic Engineering Center River Analysis System (HEC-RAS) 2D model of a large urban watershed in Houston as an illustrative and generalizable testbed, we show that both the discrete and abstract modularization closely approximates the results from re-running the entire model. The computational cost of modularization scales linearly with model size for memory requirements as storing the solution on the interior boundaries (discrete) or throughout the domain (abstract) are necessary. This trade-off of memory for computation may facilitate advances in surrogate modelling or Monte Carlo flood risk assessment.

KEYWORDS:

1. Introduction

Fast, detailed, and accurate, but scalable, hydrodynamic models are usually infeasible without High Performance Computing (HPC), yet the information they provide in resolving a floodplain response both temporally and spatially could influence flood hazard assessment, stormwater management, and emergency planning/response in urban systems. As the scale (both spatial and temporal), level of detail (i.e. resolution), and complexity of processes (e.g. presence of hydraulic structures) represented by the hydrodynamic model increase, so does the computational cost of performing a simulation. If complex large-scale models were viable, building a regional integrated flood information system that provides real-time flood warning (e.g. sub-hourly alerts on inundation hotspots and road network accessibility) and flood risk and flood hazard prediction (e.g. probabilistic floodplain mapping and damage estimates) might become feasible.

Recently, various studies have implemented Machine Learning (ML) methods in hydrodynamic applications to address this scale issue by leveraging the availability of big datasets to approximate a solution with significantly reduced computing time. Examples include using support vector machines (Bermúdez et al., Citation2019; Liu & Pender, Citation2015), random forests (Mobley et al., Citation2021; Wang et al., Citation2015), gradient boosting decision trees (Wu et al., Citation2020), extreme gradient boosting or XGBoost (Sanders et al., Citation2022), as well as deep learning methods such as convolutional neural networks (Hosseiny, Citation2021; Kabir et al., Citation2020) and Long Short-Term Memory (LSTM) (Nevo et al., Citation2022; Yang et al., Citation2022; Zou et al., Citation2023). While the use of ML and deep learning methods in flood risk, flood hazard, and flood prediction have grown increasingly popular owing to their promising results and endless potential applications, they could be dynamically unstable (Mosavi et al., Citation2018) and might require frequent retraining due to changes in boundary conditions, which is an issue common in urban hydrology.

To minimize the computational burden of constant model updates/retraining, we propose a mesh modularization framework that divides the modelling domain/mesh into multiple predefined subsets or modules. Since these subsets act as surrogates, one would only need to update or retrain specific modules that have encountered change instead of re-running the entire modelling domain. This article details an original proof of concept with discrete, pre-determined modules in a two-dimensional (2D) hydraulic model. Although we used specific hydraulic software – the Hydrologic Engineering Center River Analysis System (HEC-RAS) (Brunner, Citation2021) – the framework introduced in this study would be applicable to other hydrodynamic models and solvers. Additionally, the article also presents and discusses a pilot study using abstract modules generated after computation, with boundaries intended for eventual use in a real-time flood alert system for Harris County, Texas.

2. Background

In the last decade, 2D hydrodynamic solvers have gone from academic codes like TELEMAC-2D (World Bank, Citation2022) that solve governing Partial Differential Equations (PDEs) to widespread adoption in Hydrologic and Hydraulic (H/H) applications with many fully graphical interfaced models such as TUFLOW (BMT, Citation2022), InfoWorks-ICM (Innovyze, Citation2022), and HEC-RAS. These programs can compute detailed and accurate spatio-temporal floodplains and can show how these computed water surface elevations affect critical infrastructure. However, these tools are still computationally expensive, which poses a challenge to end-users. While numerous methods have been proposed to address this computational burden, we summarize them into three classes of method: Simplification, Supercomputing, and ML.

One class of method uses models that reduce computational cost by representing a dominant subset of physical processes. An example of this approach is LISFLOOD-FP (Bates, Citation2022), which uses a 1D solver on a 2D structured mesh similar to the diffusive wave approximation of the Shallow Water Equations (SWEs) (vector form shown in Equation 1). This approximation decreases the computational cost and provides good results when those assumptions can be applied (Alonso et al., Citation2008). However, these simplified structured mesh solvers cannot account for control structures like reservoirs or other major retention owing to the irregular geometries of real applications, leading to mesh leakage and other root issues. In addition, structured solvers are often poor at preventing mesh leakage for coarser grids as the model scales up (Shustikova et al., Citation2019). Of particular relevance to the problem of urban flood risk management, representing the effects of control structures and producing estimates at the scale of buildings and infrastructure present challenges to flood risk modelling. As an example, releases from the Addicks and Barker reservoirs during Hurricane Harvey 2017 significantly contributed to the flooding experienced in the Houston region (Garcia et al., Citation2020). Although these simplified tools are useful when applicable, excluding relevant physical processes can limit their suitability for many problems.

The second major class of approach for PDE solvers is the implementation of supercomputing, which is used in hurricane forecasting and other complex fields. This could be a viable option in theory but might not be fiscally or logistically feasible for many municipalities in practice, where most floodplain modelling resides. Supercomputing works well but does not address the fundamental trade-off of computational expenses against model detail and scope.

The third class of approach is to train surrogate models or emulators. Surrogate models use machine learning to generate a computationally cheap approximation of a more complex model by using output from the complex model as training data. However, surrogate methods require the problem being learned to be static (L’Heureux et al., Citation2017). Consequently, every time there is an edit to the modelled domain, the entire training process needs to be redone to learn a new approximation. Since urban landscapes are inherently dynamic systems that change frequently, a static system assumption is ill-suited for certain applications (e.g. urban flood modelling). Adding land use and other geometric edits as another degree of freedom for the the surrgoate to learn adds a multiplier for the data requirements. These requirements already present a nearly infeasible task, which may see improvements as computer science methods and data storage continue to improve but are still far from implementation.

3. Methods

The introduction of modularization starts with an applied definition, and then a more formalized definition of the method. Next will be the sequence of tests performed, starting with the discrete case, or pre-determined modules in the domain setup followed by a full abstraction of the method, or modules created after the modelling has been completed. Both tests comprise three model runs with the same input rainfall data, one with the original terrain (the existing condition) and two with the edited terrain (the edited condition). One of the edited conditions is identical to the 2D domain of the existing condition, and the other is a subset of that 2D domain using model data from the existing condition as external boundary conditions, this setup ending with the two edited condition runs showing the same results as a comparative run being the goal.

3.1. Mesh modularization

Modularization draws from multi-domain methods intended to solve varied mesh refinement detail in large simulations (Tezduyar & Osawa, Citation2001) and domain decomposition methods for SWEs intended to parallelize PDEs (Navon & Cai, Citation1993). Conceptually, modularization involves defining the contiguous portion of the domain containing the drainage outlet where an edit to the model would affect the output water depth (i.e. outside the portion of the domain selected, or module, the change in the solution due to the edit is far less than the error of the base model and in practical use is unchanged). Below in Figure , a watershed bounded by a 2D domain $Ω$ is discretized on a grid shown as squares. Figure (A) shows the existing condition with the location of a future edit being highlighted in red, and the channel being highlighted in blue.

Figure 1. Graphical overview of modularization: (A) the red square is the location of an edit in the domain $Ω$ and the blue represents the main channel; (B) the red edit will change the water surface elevation in the green region and will change the channel volumes along the purple; (C) the boundary of the changed regions in B are denoted by $Ω^{*}$ . Using this boundary as an upstream boundary condition, a new solution in the pink region can be solved in place of $Ω$ for the red edit. The forming of the new domain without prior modelling discretization will be defined later as abstract modularization. If the model had been built having $Ω$ and $Ω^{*}$ components from the start, the modularization would be discrete.

3.1.1. Practical explanation of modularization

As a more concrete example of modularization, if a large new development offset their runoff with a retention pond (red square in Figure (A)), the only region of the floodplain, or Water Surface ELevation (WSEL), that would differ from this change is the immediate region surrounding the terrain edit (green in Figure (B)) and the volumetric changes to the channels downstream of it (purple in Figure (B)). Anything draining into the region affected by the new development region or into the channels downstream of the development region would not be influenced by it, and as a result would not have any effect on the computed floodplain solution assuming the same external forcings. Defining this as an internal boundary where inside the new developed pond will change the floodplain, and outside of it there will be no influence (green and purple in Figure (B)), a contiguous subset is produced of the full domain with a known solution along the boundary except for the already dependent boundary condition of the outlet for the watershed (pink in Figure (C)).

This subset of the full domain is called a module ( $Ω^{*}$ in Figure (C)) as it can be edited independently, replacing the existing portion of the full model that it occupied (the red dashed outlined regions in Figure (C)). The naming was then extended to the entire method of splitting the physics-based domain, called modularizing the domain. In normal engineering applications, the storage of the solution information along these modular boundaries (or all cell faces in the abstract case) would be excessive, but in the same scenario of ML methods trading off computation for stored information with the training dataset, this storage minimizes the necessary computations to update the full model as it only computes the subset of the domain where edits to the output will occur.

3.1.2. Formally defining a module by its boundary

As a more formal definition of this module, we will define it as the boundary, or $Ω^{*}$ , in Figure (C). Given that the governing PDE, in this case the SWE, is well posed and solved under reasonable physical conditions (Duruk Mutlubas et al., Citation2020; Ghader & Nordström, Citation2014), any edit (defined as a localized change to topography or land use) has a continuous relationship to the change in the solution, or water depth, on a discretized 2D domain. Below is a walkthrough of this statement that implies the existence of the previously defined internal boundary line in the domain where there is an insignificant solution difference from the edit to form a module.

Beginning with the vector form of the momentum equation from the HEC-RAS hydraulic reference manual (Brunner, Citation2021), (2–150) with (2–153) and (2–154) are included as Equation (1) below: $\vec{V}$ is the velocity vector, $f_{c}$ is the Coriolis parameter, is the virtual unit vector, $g$ is the gravitational acceleration, $z_{s}$ is the water surface elevation, $h$ is the water depth, $v_{t}$ is the eddy viscosity tensor, $n$ is Manning’s roughness coefficient, $R$ is the hydraulic radius, and $τ_{s}$ is the wind surface stress vector. (1) $\begin{aligned} \frac{\partial \vec{V}}{\partial t} + (\vec{V} \cdot \nabla) \vec{V} + f_{c} k \times \vec{V} \\ = - g \nabla z_{s} + \frac{1}{h} \nabla \cdot (v_{t} h \nabla \vec{V}) - \frac{n^{2} g | | \vec{V} | | \vec{V}}{R^{4 / 3}} + \frac{τ_{s}}{ρ h} \end{aligned}$ (1) The PDE in Equation (1) can be rearranged to isolate the portion influenced by the topographic and roughness changes to the right-hand side (RHS), putting the other terms on the left-hand side (LHS) in Equations (2): (2) $\begin{aligned} \frac{\partial \vec{V}}{\partial t} + (\vec{V} \cdot \nabla) \vec{V} + f_{c} k \times \vec{V} - \frac{1}{h} \nabla \cdot (v_{t} h \nabla \vec{V}) \\ - \frac{τ_{s}}{ρ h} = - g \nabla z_{s} - \frac{n^{2} g | | \vec{V} | | \vec{V}}{R^{4 / 3}} \end{aligned}$ (2) Next, each side is renamed as a function of its dependent variables in Equations (3) and (4): (3) $\begin{aligned} S (\vec{V}, h) & = \frac{\partial \vec{V}}{\partial t} + (\vec{V} \cdot \nabla) \vec{V} + f_{c} k \times \vec{V} \\ - \frac{1}{h} \nabla \cdot (v_{t} h \nabla \vec{V}) - \frac{τ_{s}}{ρ h} \end{aligned}$ (3) (4) $\begin{aligned} b (\vec{V}) & = - g \nabla z_{s} - \frac{n^{2} g | \vec{V} | \vec{V}}{R^{4 / 3}} \end{aligned}$ (4) with the following three points: the RHS and LHS remain equal, the total energy of the system outside of the edit remains constant, and the total mass of the full system is unchanged. It can be implied that a change to $b$ has a continuous relationship to a change in the solution of $h$ for a given velocity. This implication also suggests that the norms of these changes are continuously related, as shown in Equation (5): (5) $\begin{aligned} S (\vec{V}, h) = b (\vec{V}) \Rightarrow | | Δ b {| |}^{2} \mapsto | | Δ S (h) {| |}^{2} \end{aligned}$ (5) This continuity further implies that there exists some maximum distance L in the 2D domain $Ω$ of x, y coordinates, where the local change to the solution $ε$ is less than the error of the model $e$ for each $Δ b_{x, y}$ , or the x, y location of a topographic or roughness edit, as shown in Equations (6a) and (6b): (6a) $\begin{aligned} \Rightarrow \exists L; L_{Δ b_{x, y}} & = max (| | Δ b_{x, y} - (Δ S (h), ε) {| |}_{L^{2} (Ω)}) \end{aligned}$ (6a) (6b) $\begin{aligned} s .t . ε & \leq | e |; e, ε \in R; x, y \in Ω \end{aligned}$ (6b) In total, this implies there exists a line in the 2D domain where the solution along that line is unchanged with respect to the model error. Therefore, if all the cell faces or discrete module boundaries lie beyond this line L away from the domain edit, the solution along this line can be used as an external boundary condition for the portion of the domain lying inside it that contains the edit. The resulting module, or $Ω^{*},$ produces the same model solution for the PDE as the full domain when it is run with the same edit.

3.2. Discrete modularization test

The initial modularization testing was done on a highly urbanized, upstream portion of the White Oak Bayou watershed in Houston (Figure ). This section was chosen owing to the limited amount of natural infiltration and dense quantity of flood control structures. The scale of the modelling domain was minimized to both limit computation time and limit the number of modelling elements that were still significant with 25 individual 2D modelling domains or discrete modules and 97 individual connections between them. The discrete modules can be seen outlined in Figure .

Figure 2. Overview of testing and case study modelling in Texas, USA. The yellow region shows the discrete testing region zoomed in on the upper right with the black lines delineating the sub-domains of 2D models. A detail of the terrain edits for a local pond fill is shown in the lower right for location T1. The blue outline is the location of the abstract testing region for the case study.

Some of these modules were part of a segmented 2D channel section implemented for downstream volume conveyance changes. Each module was interconnected with every adjacent module with HEC-RAS ‘SA/2D Area Connections’ (or on-terrain weirs highlighted purple in Figure ). Each 2D module was discretized using an internal mesh generation program in HEC-RAS with 61 m (200 ft) cell spacing and used mesh refinement along the channels with 30.5 m (100 ft) cell spacing (Figure ). The delineations for the cuts defining the modules were determined from the Harris County 1 m LiDAR Digital Elevation Map (DEM) (HGAC, Citation2018) of the region using an ArcMap™ program called ArcHydro (now an integrated tool in ArcGIS^® Pro) (ESRI, Citation2013) effectively defining sub-basins of the modelling domain.

Figure 3. Stage and flow comparisons of the discrete modularization tests performed. The rows display the stage and flow of the outlet for the tested region. The columns of existing conditions and edited condition differences highlight the module edit run in the context of the change the geometry caused.

ArcHydro determines the direction of drainage across the domain with a slope raster it generates from the DEM and delineates drainage boundaries from the user defined watershed outlet and an eight directional slope raster. A channel shapefile is similarly determined from a threshold value calculation from the same slope raster and accompanied flow accumulation raster made with ArcHydro. The generated channel lines were then buffered 122 m (400 ft) to ensure it was well outside the floodplain and manually cut perpendicular to the flow direction to divide into smaller modules. These manually made channel modules were then placed over the ArcHydro delineations of sub-basins superseding the existing boundaries to produce the complete set of modules after some manual dissolving of insignificantly sized sub-basins resulting in what is shown in the call out of Test in Figure .

Other geometry inputs for the discrete testing model included the land use data, which was the NLCD 2016 data from the Multi-Resolution Land Characteristics Consortium (MRLC, Citation2020), and the associated roughness values, which were derived from Kalyanapu et al. (Citation2010). The roughness values were adjusted to 75% of those reported study values owing to previous calibration work done (Garcia et al., Citation2020) on a model in the neighbouring Addicks watershed. For the single rainfall input for this phase in testing, an approximately 100-year two-hour event of 17.78 cm (7.0 in.) was used as the rain-on-grid input for the model and was derived from the NOAA Atlas 14 data statistics (NOAA, Citation2018) of the location. The rainfall was temporally split into 6.35 cm (2.5 inch) and 11.43 cm (4.5 inch) volumes over the two one-hour periods to minimize the computation time of these tests by simulating a flash flood condition. The only boundary condition aside from the rainfall is the normal depth condition at the watershed outlet using the average slope of the region making the model a rain-on-grid simulation.

To test the modularization methodology, a comparison of the outputs from a contiguous set of modules with full domain outputs was conducted. To compare the edited condition runs effectively, significant changes to the output hydrograph needed to be created. In this study, significant terrain modifications were made through filling selected detention ponds to a constant DEM elevation across each scenario. The edited condition for these few tests consisted of various detention pond modifications and were each named after the pond(s) being edited. The ponds selected for testing were intended to sample variations effectively that could occur from both the size and location of the detention changes made in the domain. The three ponds selected were compared in a total of four evaluation runs, one condition for each pond being filled individually: T1, T2, and T3, and one of all three ponds being filled together, T123 (Figure ). Each edited condition was compared to the existing condition run to ensure that significant changes were made to the outflow hydrographs at the watershed outlet and to validate their use for the comparative analysis.

3.3. Abstract modularization case study

Along with the implementation of a distributed net rainfall solver for the model, HEC-RAS 6.0.0 onward allowed for the abstraction of the module development such that a single domain could be used, and a single module could be generated after the fact from the internal stored data. The inputs to the HEC-RAS model were the same but with updated land use (MRLC, Citation2020) and the addition of soils data for HEC-RAS to use Green-and-Amp to solve for net rainfall in each cell. These input soils were from the USDA Natural Resources Conservation Service Soils database (USDA, Citation2022) and the parameters were based on USDA soils documentation as well (USDA, Citation2019). The rainfall rasters were scrubbed from the MTArchive hosted by Iowa State (Herzmann, Citation2022) and were processed into gridded [.dss] formats do be read easily by HEC-RAS using the Vortex API from the USACE HEC Github (Lehman, Citation2022). The storm chosen was Tax Day 2016 (HCFCD, Citation2022), which caused significant flooding in parts of Harris County including Brays Bayou where the edited condition will focus. As with the discrete case, the rain-on-grid simulation had a normal depth boundary condition at the outlet, but additionally some normal depth conditions on the perimeter as the terrain-based watershed was not computed. Water can freely exit the model on a majority of the boundary with the average terrain slope as constructed.

Since all of the data at each cell face, centre, and more can be pulled from the computation run data, the modules do not need to be defined explicitly in the model development but rather to be constructed after the domain edit using the stored data. This method has some requirements for necessary meta-data from the domain since the pulled internal data will not directly map to a newly constructed module. This includes which cells and faces are in the new domain, which faces are now on the domain boundary, what is the order of these boundary faces, which directions are the computed norms of these faces on the boundary, and what faces are being combined to a single external condition (since a corner no longer has external references). All this meta-data needs to be computed or pulled to allow an arbitrary cut to form a new module that can be updated independently.

Exactly how this was accomplished for HEC-RAS is not elaborated here as it would be very model/solver specific. Overall, the automation of this process will produce a set of boundary conditions each of which corresponds to a single cell face with the exception of corners and which has the flow passing through those faces for each computation timestep (or as near as feasible to each timestep). The dependent boundary conditions, such as the watershed outlet, will remain the same, and no cell centre will move or change volume from the original model to the module.

The defined single cut in the original domain this automation will produce (the blue line in Figure ) is the same as described earlier and defines the boundary where everything inside is affected by a terrain edit, and everything outside is not. The addition of abstracting this cut to any selection of continuous cell faces rather than discretely defining it in the construction of the model allows any arbitrary contiguous subset of cells containing the outlet to be used. This not only refines the improvements modularization can yield, but also truly minimizes the future computations and maximizes the improvements the methodology can bring.

Once the module computer questions are answered in the automation and the abstract module is generated, it needs to be tested in the same manner as in the discrete case. The test setup for this case study adopted a similar approach to the original test, however it was done in a more realistic manner. There are two golf courses in the middle of Brays Bayou that have been discussed with a view to becoming detention ponds for regional mitigation strategies. For this study, artificial ponds were cut into the LiDAR over the area of the existing golf courses to the same depths as the adjacent channel. These new detention ponds were used as the terrain edit on a hindcast of the Tax Day 2016 event meteorology. The existing condition Tax Day hindcast was used as the input dataset for the generated module with the new detention ponds for the edited condition and the full model was run again the with the edited condition. This case study was intended to show that the abstract modularization can work in real-world conditions and be used in a future pilot study for the model.

4. Results

As stated before, the comparison of the two edited conditions shows the viability of modularization for either the discrete case or the abstract case. This section will begin with the initial testing done with the discrete case and multiple scenarios (T1, T2, T3, and T123) with different terrain edits and then show the case study of the abstract case for the Tax Day hindcast.

4.1. Discrete modularization test

To start the analysis, the difference hydrographs (existing minus the full model edited conditions) are highlighted in the top panels of Figure . These graphs show the washed response to the detention pond filling for each tested condition and will highlight that the conditions did significantly edit the hydrographs. The top panels of Figure clearly show that the most downstream pond, T1, shows the most significant individual difference to watershed outlet response, and this is confirmed by the combined condition of T123 showing only a marginal difference.

To visualize the performance of the module run in isolation against the full model being re-run (full edit) for each condition, the bottom panels of Figure show the outlet hydrograph differences of the edited conditions (full edit minus module edit). As these graphs in Figure show, there is no longer a correlation to the terrain changes of the tested conditions and the differences in outlet flow. The consistent temporal shape of the bottom panels in contrast with the top panels shows that the differences are associated with the model setup and domain specific errors for this exact computation. Since the nominal differences do not impact the temporal shape of the graphs, the on-terrain weir approach seems to be a main driver of the errors. Further support for this theory can be produced when calculating the averages of the data displayed in Figure as shown in Table .

Table 1. Tested condition average differences and average percentage differences.

Download CSV Display Table

The percentage difference data shown in Table was computed by dividing the differences by the full edit condition value for each timestep producing the percentage with the convention that the full edit condition is true and then averaging that time series. As the results of Table show, the average nominal difference is quite small, but most importantly the average percentage difference is at or less than 0.77% in all cases and is extremely consistent across the runs showing the deltas to be a modelling setup error rather than stemming from the modularization. In addition, the near zero percentage average difference rate on the stage values further supports the application for inundation mapping, and by extension real-time flood warning. These results show a strong case in favour of this methodology, even in this crude discrete implementation, being able to be quite successful when building a scalable ML training dataset with these potential errors being kept in mind from the start.

4.2. Harris county case study for abstract modules

As stated in the previous sections, the flow averaging of the physically implemented discrete module boundaries was a major driver of errors in the methodology. To provide a case study addressing this, the abstract implementation was performed, and the results are displayed below.

Figure shows the depth hydrographs of the lower, middle, and upstream portions of the tested module labelled points C1, C2, and C3. The flood depth difference map is coloured such that all values between −0.15 and 0.15 m are blue, and the few outliers are either green or red. In these hydrographs the existing condition run for the Tax Day hindcast can be seen in light grey labelled as existing and the two edited conditions share the same naming conventions as the discrete test. In all of these graphs, it is evident that there are significant differences from the existing condition and the full and module edited conditions. This shows that the conversion of the golf courses in mid-stream Brays Bayou to detention ponds decreased peak flows during the Tax Day event, which is consistent with common practice in urban hydrology. An important point that can be observed in these figures is that the module edit and full edit are still not perfectly identical, although they may seem so from the image. This is due to the lack of energy conservation across the cut boundary. Since the external boundary condition can only be set as a flow hydrograph with an Energy Gradient (EG) slope to distribute (Brunner, Citation2021), only the flow, and by extension mass, from the shallow water equations is conserved across the boundary. The velocities of these flow time series are not identical to the original dataset since only the flow rate and a static EG slope were input. This can be observed with the mild amount of time delay and peak drop shown in Figure . This is a problem that can be corrected further through a decomposition of the face velocities and flow rates to calculate the average EG slope for each cell face and adding that into the automated processes. However, the differences observed between the full model and this abstract module are far less than the errors of the original model, so it was decided to show the results and address those final differences later if needed for the full surrogate development.

Figure 4. Spatial maximum and hydrograph outputs of the flood stage.

To show a distributed comparison of the module to the full domain, a table was generated using the WSEL time series at each cell in the edit condition module so as to compare it to the same cell in the edit condition for the full domain, which can be seen in Table .

Table 2. WSEL meta-analysis of individual cell errors.

Download CSV Display Table

After observing some significant outliers for all the commutated cells, some of those outliers can be seen in Figure .

These results show that modularization of the 2D domain can in practice reproduce the results of a solver with only a subset of the domain, meaning that a training dataset produced from a mass conservative solver can be updated for edits with minimal computation outside the locations where the solution will change. As a result, an application of this modularization is to facilitate the deployment of ML-based surrogate methods. Modularization can support this aim by reducing the computational cost of updating hydrodynamic model outputs, which ML models use as training data, as regional land use patterns shift. This can facilitate Monte Carlo risk assessments and advanced real-time flood warning and information systems.

5. Summary

Modularization is a way of leveraging the fact that data storage is far cheaper than computation. Using stored internal solve data on the faces of a finite volume solver can allow us to construct a boundary condition from this data. A set of these boundary conditions can be used to solve a small subset of the domain where edits are occurring (with some conditions), called a module. As the name implies, the module solution can be used in place of re-computing the full domain and will produce results with errors smaller than the original model making it viable for practical applications like flood warning systems or regional flood mapping. Owing to the minimization of the update cost (as only the edited portions are re-computed), dynamic systems like urban landscapes become worth the computational investment of developing large regional models of un-simplified physics-based solvers.

The methods shown in this article are intended to push flood warning technology forward now and enable connected fields to develop uses like ML based H/H surrogate models. Some fields, such as hazard research and risk assessment, already have a need for fast cheap fully distributed hydrodynamic outputs to enable more data intensive statistical modelling like Monte Carlo methods. Modularization could allow research to leap forward with feasible large-scale probabilistic modelling for areas like water resources management, emergency planning, or structural optimization. On top of these benefits, implementing these with ML methods for H/H can foster shorter time horizons for interconnections like real-time traffic alerts (Panakkal et al., Citation2019) and real-time decision support services for critical infrastructure. Pushing forward ML based flood warning systems that are implementable and scalable will have major short- and long-term impacts across many fields, and the modularization of the hydrodynamic model will make it feasible today.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

Alonso, R., Santillana, M., & Dawson, C. (2008). On the diffusive wave approximation of the shallow water equations. European Journal of Applied Mathematics, 19(5), 575–606. https://doi.org/10.1017/S0956792508007675
Google Scholar
Bates, P. (2022). LISFLOOD-FP [WWW Document]. LISFLOOD-FP. http://www.bristol.ac.uk/geography/research/hydrology/models/lisflood/.
Google Scholar
Bermúdez, M., Cea, L., & Puertas, J. (2019). A rapid flood inundation model for hazard mapping based on least squares support vector machine regression. Journal of Flood Risk Management, 12(S1), e12522. https://doi.org/10.1111/jfr3.12522
Google Scholar
BMT Commercial Australia. (2022). TUFLOW Homepage. https://www.tuflow.com/.
Google Scholar
Brunner, G. W. (2021). HEC-RAS, river analysis system hydraulic reference manual. https://www.hec.usace.army.mil/software/hec-ras/documentation/HEC-RAS_6.0_Reference_Manual.pdf.
Google Scholar
Duruk Mutlubas, N., Geyer, A., & Quirchmayr, R. (2020). Well-posedness of a highly nonlinear shallow water equation on the circle. Nonlinear Analysis, 197, 111849. https://doi.org/10.1016/j.na.2020.111849
Web of Science ®Google Scholar
esri. (2013). Arc hydro: GIS for water resources 2. https://www.esri.com/content/dam/esrisites/sitecore-archive/Files/Pdfs/library/fliers/pdfs/archydro.pdf.
Google Scholar
Garcia, M., Juan, A., & Bedient, P. (2020). Integrating reservoir operations and flood modeling with HEC-RAS 2D. Water, 12(8), 2259. https://doi.org/10.3390/w12082259
Web of Science ®Google Scholar
Ghader, S., & Nordström, J. (2014). Revisiting well-posed boundary conditions for the shallow water equations. Dynamics of Atmospheres and Oceans, 66, 1–9. https://doi.org/10.1016/j.dynatmoce.2014.01.002
Web of Science ®Google Scholar
HCFCD, H.C.F.C.D. (2022). Spring floods 2016 [WWW Document]. https://www.hcfcd.org/About/Harris-Countys-Flooding-History/Spring-Floods-2016.
Google Scholar
Herzmann, D. (2022). Iowa Environmental Mesonet [WWW Document]. https://mesonet.agron.iastate.edu/.
Google Scholar
HGAC, H.-G.A.C. (2018). LiDAR Imagery [WWW Document]. https://www.h-gac.com/imagery/lidar.
Google Scholar
Hosseiny, H. (2021). A deep learning model for predicting river flood depth and extent. Environmental Modelling & Software, 145, 105186. https://doi.org/10.1016/j.envsoft.2021.105186
Web of Science ®Google Scholar
Innovyze. (2022). InfoWorks ICM product page. https://www.innovyze.com/en-us/products/infoworks-icm.
Google Scholar
Kabir, S., Patidar, S., Xia, X., Liang, Q., Neal, J., & Pender, G. (2020). A deep convolutional neural network model for rapid prediction of fluvial flood inundation. Journal of Hydrology, 590, 125481. https://doi.org/10.1016/j.jhydrol.2020.125481
Web of Science ®Google Scholar
Kalyanapu, A. J., Burian, S. J., & McPherson, T. N. (2010). Effect of land use-based surface roughness on hydrologic model output. Journal of Spatial Hydrology, 9.
Google Scholar
Lehman, W. (2022). GitHub – hydrologic engineering center [WWW Document]. GitHub. https://github.com/HydrologicEngineeringCenter.
Google Scholar
L’Heureux, A., Grolinger, K., Elyamany, H. F., & Capretz, M. A. M. (2017). Machine learning With Big data: Challenges and approaches. IEEE Access, 5, 7776–7797. https://doi.org/10.1109/ACCESS.2017.2696365
Web of Science ®Google Scholar
Liu, Y., & Pender, G. (2015). A flood inundation modelling using v-support vector machine regression model. Engineering Applications of Artificial Intelligence, 46, 223–231. https://doi.org/10.1016/j.engappai.2015.09.014
Web of Science ®Google Scholar
Mobley, W., Sebastian, A., Blessing, R., Highfield, W. E., Stearns, L., & Brody, S. D. (2021). Quantification of continuous flood hazard using random forest classification and flood insurance claims at large spatial scales: A pilot study in southeast Texas. Natural Hazards and Earth System Sciences, 21(2), 807–822. https://doi.org/10.5194/nhess-21-807-2021
Web of Science ®Google Scholar
Mosavi, A., Ozturk, P., & Chau, K. (2018). Flood prediction using machine learning models: Literature review. Water, 10(11), 1536. https://doi.org/10.3390/w10111536
Web of Science ®Google Scholar
MRLC, M.-R.L.C.C. (2020). Data | multi-resolution land characteristics (MRLC) consortium [WWW Document]. https://www.mrlc.gov/data?f%5B0%5D=category%3Aland%20cover&f%5B1%5D=region%3Aconus.
Google Scholar
Navon, I. M., & Cai, Y. (1993). Domain decomposition and parallel processing of a finite element model of the shallow water equations. [WWW Document]. Computer Methods in Applied Mechanics and Engineering, 106(1–2), 179–212.
Web of Science ®Google Scholar
Nevo, S., Morin, E., Gerzi Rosenthal, A., Metzger, A., Barshai, C., Weitzner, D., Voloshin, D., Kratzert, F., Elidan, G., Dror, G., Begelman, G., Nearing, G., Shalev, G., Noga, H., Shavitt, I., Yuklea, L., Royz, M., Giladi, N., Peled Levi, N., … Matias, Y. (2022). Flood forecasting with machine learning models in an operational framework. Hydrology and Earth System Sciences, 26(15), 4013–4032. https://doi.org/10.5194/hess-26-4013-2022
Web of Science ®Google Scholar
NOAA, N.O. and A.A. (2018). Atlas14_Volume11. https://www.weather.gov/media/owp/oh/hdsc/docs/Atlas14_Volume11.pdf.
Google Scholar
Panakkal, P., Juan, A., Garcia, M., Padgett, J.E., & Bedient, P. 2019. Towards enhanced response: Integration of a flood alert system with road infrastructure performance models. Confrence paper from SEI part of ASCE. https://doi.org/10.1061/9780784482223.029
Google Scholar
Sanders, W., Li, D., Li, W., & Fang, Z. N. (2022). Data-Driven flood alert system (FAS) using extreme gradient boosting (XGBoost) to forecast flood stages. Water, 14(5), 747. https://doi.org/10.3390/w14050747
Web of Science ®Google Scholar
Shustikova, I., Domeneghetti, A., Neal, J. C., Bates, P., & Castellarin, A. (October 26, 2019). Comparing 2D capabilities of HEC-RAS and LISFLOOD-FP on complex topography. Hydrological Sciences Journal, 64(14), 1769–1782. https://doi.org/10.1080/02626667.2019.1671982
Web of Science ®Google Scholar
Tezduyar, T., & Osawa, Y. (2001). The multi-domain method for computation of the aerodynamics of a parachute crossing the far wake of an aircraft. Computer Methods in Applied Mechanics and Engineering, Minisymposium on Methods for Flow Simulation and Modeling, 191, 705–716. https://doi.org/10.1016/S0045-7825(01)00310-3
Web of Science ®Google Scholar
USDA. (2022). Tools | NRCS soils [WWW Document]. https://www.nrcs.usda.gov/wps/portal/nrcs/main/soils/survey/tools/.
Google Scholar
USDA, U.S.D. of A. (2019). Part 630 hydrology national engineering handbook. https://directives.sc.egov.usda.gov/OpenNonWebContent.aspx?content=43924.wba
Google Scholar
Wang, Z., Lai, C., Chen, X., Yang, B., Zhao, S., & Bai, X. (2015). Flood hazard risk assessment model based on random forest. Journal of Hydrology, 527, 1130–1141. https://doi.org/10.1016/j.jhydrol.2015.06.008
Web of Science ®Google Scholar
World Bank. (2022). TELEMAC-2D [WWW Document]. http://www.appsolutelydigital.com/ModelPrimer/chapter4_section7.html.
Google Scholar
Wu, Z., Zhou, Y., Wang, H., & Jiang, Z. (2020). Depth prediction of urban flood under different rainfall return periods based on deep learning and data warehouse. Science of The Total Environment, 716, 137077. https://doi.org/10.1016/j.scitotenv.2020.137077
PubMed Web of Science ®Google Scholar
Yang, S.-Y., Jhong, B.-C., Jhong, Y.-D., Tsai, T.-T., & Chen, C.-S. (2022). Long short-term memory integrating moving average method for flood inundation depth forecasting based on observed data in urban area. Natural Hazards, 116, 2339–2361. https://doi.org/10.1007/s11069-022-05766-1
Google Scholar
Zou, Y., Wang, J., Lei, P., & Li, Y. (2023). A novel multi-step ahead forecasting model for flood based on time residual LSTM. Journal of Hydrology, 620, 129521. https://doi.org/10.1016/j.jhydrol.2023.129521
Web of Science ®Google Scholar

Leveraging mesh modularization to lower the computational cost of localized updates to regional 2D hydrodynamic model outputs

Abstract

1. Introduction

2. Background