An inverse problem methodology to identify flow channels in fractured media using synthetic steady-state head and geometrical data

Advances in Water Resources 33 (2010) 782–800 Contents lists available at ScienceDirect Advances in Water Resources j o u r n a l h o m e p a g e : ...

Download PDF

5MB Sizes 0 Downloads 1 Views

Report

Full Text

Advances in Water Resources 33 (2010) 782–800

Contents lists available at ScienceDirect

Advances in Water Resources j o u r n a l h o m e p a g e : w w w. e l s ev i e r. c o m / l o c a t e / a d v wa t r e s

An inverse problem methodology to identify ﬂow channels in fractured media using synthetic steady-state head and geometrical data R. Le Goc a,b,⁎, J.-R. de Dreuzy a, P. Davy a a b

Géosciences Rennes, UMR 6118 CNRS, Université de Rennes 1, CS 74205, F-35042 Rennes Cedex, France Itasca Consultants SAS, 64 chemin des mouilles, F-69134, Écully Cedex, France

a r t i c l e

i n f o

Article history: Received 17 December 2009 Received in revised form 12 April 2010 Accepted 14 April 2010 Available online 26 April 2010 Keywords: Inverse problem Simulated annealing Fractured media Flow channels

a b s t r a c t We present a methodology for identifying highly-localized ﬂow channels embedded in a signiﬁcantly less permeable medium using steady-state head and geometrical data. This situation is typical of fractured media where ﬂows are often strongly channeled at the scales of interest (10 m–1 km). The objective is to identify both geometrical and hydraulic characteristics of the conducting structures. Channels are identiﬁed in decreasing order of importance by successive optimizations of an objective function. The identiﬁcation strategy takes advantage of the hierarchical ﬂow organization to restrict the dimension of the solution space of each individual optimization step. The characteristics of the secondary channels are strongly determined by the main ﬂow channels. The latter are slightly modiﬁed by the secondary channels through the addition of a regularization term to the main channel characteristics in the objective function. As the objective function is strongly non-convex with numerous local minima, inversion is performed using a stochastic algorithm (simulated annealing). We assess the possibilities of the hierarchical identiﬁcation strategy on simple synthetic steady-state ﬂow conﬁgurations where hydraulic data are made up of 25 regularly spaced heads and of the boundary conditions. Those ﬂow structures that are dominated by at most two simple channels can be identiﬁed with these head data only. Conﬁgurations comprising up to three complex and interconnected channels can still be identiﬁed with additional geometrical information including the distances of piezometers to their closest channel. The capabilities of the hierarchical identiﬁcation strategy are limited to ﬂow structures dominated by at most three equivalent ﬂow channels. We ﬁnally discuss the perspectives of application of the method to transient-state data obtained on a more restricted number of piezometers. © 2010 Elsevier Ltd. All rights reserved.

1. Introduction Flow channeling is widespread in fractured media, mainly in lowpermeability rocks where ﬂows concentrate in highly transmissive ﬂow paths within a restricted number of fractures [39,51]. Highly-channeled media refer here to media where the main channels, i.e. those carrying the largest part of the ﬂow, are sparse at the scale of interest. The hydrological properties of highly-channeled media are mainly controlled by the continuity and connectivity of the channels [21,28,46]. In such media, characterizing the hydraulic parameters comes down to identify these main channels and their hydraulic properties. Existing approaches based on extensive resistivity [22] or hydraulic data [30] yield zones of higher permeability rather than highly-localized channels. The objective of this article is to explore the possibilities of identifying highly-localized ﬂow channels using head data. This is a rather new approach to inverse

⁎ Corresponding author. Itasca Consultants SAS, 64 chemin des mouilles, F-69134, Écully Cedex, France. Tel.: +33 4 72 18 04 20. E-mail address: [email protected] (R. Le Goc). 0309-1708/$ – see front matter © 2010 Elsevier Ltd. All rights reserved. doi:10.1016/j.advwatres.2010.04.011

problems in hydrogeology, where most of the developments have focused on continuous porous media with permeability ﬁelds parameterized by zones or geostatistical functions [8,17,42]. We thus aim at solving the inverse problem with a parameterization relying on discrete ﬂow structures. Parameterization is known to be a key issue for handling inverse problems, since optimization algorithms can effectively identify only a limited number of parameters [57]. Parameterization is also required for adapting the model complexity to the information available in the data to avoid under- and over-parameterization. Applied to fractured media, continuous-like parameterizations have led to the deﬁnition of wide fracture zones within a stochastic less permeable continuous medium [35,54], whereas 3D Discrete Fracture Network (DFN) parameterizations have led to more localized ﬂow structures but failed to identify the major ﬂow channels [6,7]. Because of the complexity of the problem, either simpliﬁcations or introduction of additional constraints have been considered. Gwo [27] simpliﬁed the inverse problem by discretizing the channels on a regular grid, where grid elements were either highly transmissive fractures or an almost impervious matrix. Fracture geometries and transmissivities were identiﬁed by a genetic algorithm but ﬂow path geometries could be correctly estimated for single

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

straight-line ﬂow paths only. Mauldon et al. [40] modeled a fracture network with a regular lattice where each element was either conductive or not. The geometry of conductivity was calibrated with simulated annealing using steady-state heads and the resulting models could identify the most transmissive areas rather than main ﬂow channels. Reductions of the parameter space have also been proposed by replacing the network structure by self-similar structures like Iterated Function Systems (IFSs) conditioned to transient ﬂow tests [19]. The main advantage of IFSs is to yield a large variety of shapes with very few parameters. Optimization performed by a stochastic method (simulated annealing) yielded a broad range of highly different structures. Another simpliﬁcation consisted in identifying in 2D only one or two single structures [44,49] with non-linear least-square regression or co-kriging. Identiﬁcation was successful only in those cases where the observation network surrounds the transmissive structure and covers an area close to the size of the fracture zone. Considering the lack of constraints, Bruel [3] used hydraulic and stress data to calibrate fracture transmissivities in a DFN with a previously identiﬁed geological structure. For the same reasons, Day-Lewis [14] used inter-well connectivity data and hydraulic data to estimate successively the geometry and the transmissivity of four fractured zones. They showed that accounting for discrete highly transmissive structures signiﬁcantly improves the model accuracy. Following the same spirit, the standard parameterizations used in the petroleum industry and in the storage of high-level nuclear wastes now rely ﬁrst on the deﬁnition of a reference geometrical structure of the highly transmissive zones, and second on the identiﬁcation of hydraulic properties of ﬁxed fracture families [4,5,18,26].

783

In the previous approaches, as in continuous models, the ﬂow structure is deduced from geological observations or is limited to a very small number of structures (1 or 2) and only the hydrological properties (i.e. the transmissivities) are calibrated. The purpose of this paper is to set up an identiﬁcation strategy that identiﬁes both the structure and its hydrological properties by focusing on preferential ﬂow pathways. This is a rather new approach since, whereas continuous approaches or equivalent porous media calibrations aim at identifying the mean property of media, we aim at identifying the extreme properties caused by major discontinuities. The strategy relies on an iterative procedure that identiﬁes ﬁrst the major ﬂow and then, if the data contain enough information, the second-order channels. We assess the method on a set of synthetic conﬁgurations with an increasing level of complexity. In addition, we ﬁrst use only synthetic steady-state head data and then introduce, for more complex conﬁgurations, basic synthetic geometrical information. The purpose is not to remain close to natural media and realistic data but rather to use simple synthetic conﬁgurations to assess the feasibility of the approach in terms of structure complexity and data requirement. We thus propose a parameterization designed to identify both the channel geometry and permeability. The parameterization is based on the main ﬂow channels rather than on the fracture zones. This leads to much more localized ﬂows than those obtained in heterogeneous porous parameterizations, while requiring fewer parameters and details than DFN-like parameterizations. The identiﬁcation strategy is based on a hierarchical analysis of the ﬂow paths. Structures of decreasing importance are identiﬁed using successive objective function optimizations performed by a simulated annealing algorithm. A stepby-step procedure is developed where results from ﬁrst-order channels

Fig. 1. (a) Synthetic fracture network. (b) Flows computed with a constant sub-horizontal hydraulic gradient of direction given by the arrow. All fractures have the same transmissivity. The grey color is proportional to the logarithm of ﬂow normalized by the total ﬂow entering the domain. (c) Simpliﬁed fracture structure carrying 70% of the ﬂow. (d) Sketch of the possible main ﬂow channels, with a main channel (black), a secondary channel (dark grey) and third-order structures (light grey).

784

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

are slightly modiﬁed when adding second-order links. The ﬁrst objective of this paper is to deﬁne the parameterization in terms of ﬂow channels and to setup the identiﬁcation methodology on 2D synthetic ﬂow channel structures using synthetic hydraulic and geometrical data. Meanwhile, the second objective is to assess which type of ﬂow conﬁgurations can be identiﬁed with head data only and to which extent the use of geometrical data can help solving the problem. In other words, we aim at deﬁning the information concealed behind hydraulic and geometrical data. We present in Section 2 the parameterization adapted to highly-channeled media underpinning the strategy adapted to solve the inverse problem deﬁned in Section 3. We test the methodology in Section 4 on three increasingly complex fracture ﬂow structures using ﬁrst hydraulic and then geometrical data. Section 5 presents a further evaluation of the methodology on a larger set of 20 examples with both hydraulic and geometrical data. Advantages and limitations of the data used as well as possible extensions of the inversion methodology to more classical transient-state data are discussed in Section 6. 2. Deﬁnition and principle of the hierarchical channel identiﬁcation We only consider highly-channeled ﬂow cases where ﬂows are focused in very few structures at the scale of interest. We use this assumption to reduce the parameter space. When ﬂows are more evenly distributed in the medium, geostatistically-based parameterizations are advised since they are dedicated to this kind of media [8,17,42]. Section 5 will assess quantitatively the domain of applicability of these

two different approaches by using the channeling indicators previously developed in Le Goc et al. [36].

2.1. Hierarchical channel organization In fractured media, ﬂows are highly channeled in a small number of fractures below the homogenization scale (i.e. Representative Elementary Volume, REV), which is roughly the distance between two main channels [37]. Below this scale, ﬂows are focused in preferential ﬂow channels, or “paths of least resistance” [52], and are organized hierarchically in channels of decreasing importance [15,16]. Fig. 1 gives an example of channeled ﬂows obtained on a synthetic 2D fracture network composed of a square system where fracture positions and orientations are drawn from uniform distributions, and lengths are power-law distributed. Among all generated fractures (Fig. 1a), only a few carry a non-negligible amount of ﬂow (Fig. 1b) and even fewer carry 70% of the total ﬂow (Fig. 1c). Channels are visually identiﬁed and organized in decreasing order of importance in Fig. 1d with a dominant channel in black carrying 25% of the ﬂow, a secondary channel in dark grey carrying 15% of the ﬂow and higher-order structures in light grey carrying from 5% to 15% of the ﬂow. The highly simpliﬁed network in Fig. 1d leads to a ﬂow structure very similar to that of the simpliﬁed network in Fig. 1c, providing that the segment transmissivities are increased to keep the same total ﬂow. All the remaining ﬂowing structures of Fig. 1a are integrated in the background matrix.

Fig. 2. Principle of the identiﬁcation strategy applied to the ﬂow structure of Fig. 1d. Channels are identiﬁed in decreasing order of importance. At each step, an objective function with a restricted number of parameters is deﬁned and optimized. Steps 1 to 3 illustrate the identiﬁcation of the ﬁrst channel with a shape of increasing complexity. Steps 4 to 5 illustrate the addition and reﬁnement of a secondary channel. Step 6 shows how channel connectivity can increase with a channel connecting two previously identiﬁed channels. Steps 7 to 9 complete the identiﬁcation strategy. The background color is correlated to the background permeability. It is expected to decrease with the number of structures identiﬁed.

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

2.2. Quantitative deﬁnition of a channel We deﬁne a channel as a succession of highly transmissive ﬂow paths maintaining high ﬂow rates over long distances. Thus, for twodimensional domains, a channel is a polyline, i.e. a series of connected line segments, deﬁned by the location of its nodes and by its transmissivity. The channel extremities belong either to the boundary of the domain or to sinks/sources or to other channels. Fig. 1d displays ﬁve channels of varying complexities along which ﬂows are at least one tenth of the total ﬂow entering the domain. The channel parameterization is directly derived from this deﬁnition. The full fracture network

785

is parameterized by a restricted set of channels superimposed on a homogeneous background matrix of smaller permeability. This parameterization can be organized in three classes of parameters comprising channel geometry and topology (connectivity between channels), channel transmissivities, and matrix permeability. 2.3. Iterative channel identiﬁcation The identiﬁcation strategy relies on the hierarchical organization of the ﬂow structure and consists in an iterative identiﬁcation of the channels in decreasing order of importance. Fig. 2 illustrates this

Fig. 3. Flowchart of the ﬂow channel identiﬁcation strategy consisting of four nested loops underlined by different grey colors. The four loops are the calibration loop, the single channel identiﬁcation loop, the hierarchical channel identiﬁcation loop and the post-processing analysis loop. P is the last parameterization considered as valid, P ′ is the current parameterization, ns is the simulation number, nmax is the number of simulations to be performed, and the stopping criteria are given by (5) and (6).

786

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

algorithm. In this section, we present these four loops together with two additional sections dedicated to the deﬁnition of the objective function and to the method used for solving the direct problem.

strategy applied to the example in Fig. 1. At each step, we solve an inverse problem on a single channel while modifying only marginally the already identiﬁed ones (e.g. Fig. 2, steps 1 to 3). We thus restrict the complexity of each step of the inverse problem by reducing the parameter space. Variations in the previously identiﬁed parameters are strongly damped by the addition of regularization terms in the objective function. Only the newly added parameters are allowed to vary freely within their possible range. Because of the hierarchical channel organization, our strategy consists in reﬁning the shape of the most important channels before introducing less important additional channels. This is illustrated in the simpliﬁed ﬂow structure of Fig. 1d by an idealized succession of identiﬁcation steps (Fig. 2). The channel reﬁnement is performed by increasing the number of nodes in the polyline (Fig. 2, steps 2, 3, 5, 8 and 9). Once a channel has been identiﬁed, we add a new channel in the identiﬁcation process (Fig. 2, steps 1, 4, 6, and 7). Each added channel enhances the connectivity between the system limits (Fig. 2, steps 1, 4, and 6) or between the channels (Fig. 2, step 7). The strategy consists in identifying ﬁrst the main channel (Fig. 2, steps 1 to 3) and then the other channels in decreasing order of importance (Fig. 2, steps 4 to 9). One of the critical point of the algorithm is the criterion used to shift from the reﬁnement of an existing channel to the addition of a new channel. This will be detailed in the following section.

3.1. Hierarchical channel identiﬁcation We ﬁrst describe the “hierarchical channel identiﬁcation” loop in Fig. 3. This loop adds new channels as long as they yield a better agreement between the model and data. Each step of the loop consists in initializing a new channel, optimizing its structure parameters from the single channel identiﬁcation loop and, after a stopping criterion (deﬁned below) has been met, analyzing the beneﬁts of its addition. The new channel is initialized by drawing its characteristics from uniform distributions between their extreme values given in Table 1. The positions of the channel extremities are drawn uniformly from the already existing channels and boundaries. The log10-transmissivity of the channel is chosen at least two orders of magnitude greater than the background permeability in a uniform distribution in the interval [min(log10Tmat)+2, max(log10Tmat)+2], where min(log10Tmat) and max(log10Tmat) are the limit values allowed for the decimal logarithm of the background matrix transmissivity. The improvement of the model by the possible addition of the channel is assessed by two criteria. The ﬁrst criterion is given by a measure of the agreement between the model and data. The measure is the classical quadratic mismatch sum of squares RE between observed data d and modeling results d ′(nc) obtained with the number of channels nc of the current model:

3. Inverse problem methodology and implementation The algorithm is composed of four nested loops (Fig. 3). As will be shown in Section 3.3, the objective function is highly non-convex and possesses numerous local minima. We thus use a stochastic optimization yielding a distribution of sub-optimal solutions rather than a unique optimal solution. The external loop drives a complete identiﬁcation process that ends with the ﬁnal solution being derived from all the suboptimal solutions obtained using the identiﬁcation strategy. This strategy, directly derived from the discussion in Section 2.3, is made up of three loops. The outer loop concerns the hierarchical channel identiﬁcation that adds and identiﬁes a new channel at each iteration. It calls the single channel identiﬁcation loop that reﬁnes the structure of the channel by adding new nodes to the channel discretization. The inner parameter calibration loop identiﬁes the characteristics of the polyline nodes. It is in this loop that the optimization of an objective function takes place. Once the distribution of solutions has been built up, we look for similarities and differences among the solutions with a clustering

Nw

REðnc Þ = ∑

!2 0 di −di ðnc Þ

i=1

σid

;

ð1Þ

where Nw is the number of observations, di and d′i are the ith observed and simulated data, respectively, and σid is the error associated with the ith observed data. The second criterion is given by the comparison of the parameter uncertainty PU with the structure discrimination SD. PU is based on the covariance matrix of the estimated parameters deﬁned by Yeh and Yoon [55]. For a parameterization with channels nc and parameters NP(nc), PU writes [50,55]:

PU ðnc Þ =

∥

∥

−1 Nw −NP ðnc Þ T Jd ⋅Jd ; RE

ð2Þ

Table 1 Parameter list and possible values for the whole identiﬁcation strategy. Parameter

Deﬁnition

Range of possible values

Value chosen for the method

Log10(Tmat) Log10(Tch) B1ch, B2ch

Log-transmissivity of the background matrix Log-transmissivity of channels Borders or channels connected by the current channel

Initialized at 10− 3 Drawn in a uniform distribution Drawn in a uniform distribution

C1ch, C2ch (x,y)ch σih

Position of the channel's extremity Position of points deﬁning the channel shape Weight associated to the ith head value (Eq. (7))

10− 5 to 100 10− 3 to 102 0 to nb + nc where nc is the number of channels and nb the number of system borders 0 to the border/channel length Inside the system 10− 3 to ∞

σid σik α1 α2 λj,k

Weight associated to the ith distance value (Eq. (7)) Weight associated to the ith parameter (Eq. (7)) Stopping criterion associated to RE (Eq. (5)) Stopping criterion associated to PU and SD (Eq. (6)) Weighting term associated to previously identiﬁed parameters

N0 N0 ]0;1] ≥1 N0

ξi

Weighting term associated to channels during the post-processing

N0

Itmax min Fobj Tinit ns dc

Allowed number of iterations of the simulated annealing Aimed value of the objective function Initial temperature of the simulated annealing Number of single solutions used in the post-treatment Threshold of the clustering algorithm

≥1 ≥0 N0 ≥1 N0

Drawn in a uniform distribution Drawn in a uniform distribution 1 or equal to the difference between the channeled and the homogeneous conﬁguration 1 1 1 2 ncur(k) − nk where ncur is the current step and nk the step where the kth parameter were introduced. Fobj/Si with Fobj the objective function value corresponding to the channel and Si its sensitivity. 5.104 10− 4 100, 101 or 102 50 Between 0.5 and 3.5 as a function of the conﬁguration

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

with α1 ≤ 1 and that:

where Jd is the Jacobian matrix of RE:

Jd =

p ∂di σi ⋅ ∂pk σkd

! :

ð3Þ

i;k

In Eq. (3), di refers to the ith observation, σid to its associated error, pk to the kth parameter and σkp to its associated error. SD is used as a criterion to discriminate between two parameterizations and is based on the distance between the two corresponding solutions. It is given by Tsai et al. [50]: vﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ !ﬃ u u 1 Nw d ðn + 1Þ−d ðn Þ 2 i c i c SDðnc + 1Þ = t ∑ : Nw i = 1 σid

ð4Þ

The addition of channel “nc + 1” signiﬁcantly contributes to the improvement of the model providing that: REðnc + 1Þ bα1 ; REðnc Þ

787

ð5Þ

PU ðnc + 1Þ bα2 : SDðnc + 1Þ

ð6Þ

The ﬁrst criterion α1, based on RE, indicates an improved agreement between simulated and observed data. The second criterion α2, based on PU and SD, ensures that the two models with channels nc and nc + 1 can be discriminated according to the parameter uncertainty. Overparameterization, leading to a higher parameter uncertainty [31,50], is avoided. These criteria have been used for the identiﬁcation of zones in heterogeneous continuous media with α1 = 1 and α2 = 2 [10,50]. We take these same simple values in this study. 3.2. Single channel identiﬁcation All channels are identiﬁed iteratively. At ﬁrst, each channel is a single segment initialized in the hierarchical channel identiﬁcation loop. Its parameters, i.e. the position of its extremities and its transmissivity, are optimized by the parameter calibration loop. Once optimized, the segment is the starting element of a polyline made up of a series of segments and nodes. At each further step, a new node is added and both its position and the transmissivity of the polyline are optimized. The stopping criteria of the reﬁnement are those of the hierarchical channel

Fig. 4. Objective function versus model parameters in a simple test case comprising two channels. (a) Conﬁguration used to compute the objective function; red dots indicate observation locations. Objective function values (Eq. (7)) with σhi = 1 and σri = ∞ as a function of (b) the ﬁrst channel parameter, (c) the second channel parameters and (d) the matrix permeability.

788

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

Fig. 5. Sketch of the post-processing clustering algorithm identifying the prevalent characteristics (right) within a set of individual solutions (left). Colors in the individual solutions refer to the channels identiﬁed as prevalent characteristics.

identiﬁcation loop (5 and 6). The full identiﬁcation procedure stops when no improvement results from the last channel or from the addition of a new channel. 3.3. Deﬁnition of the objective function Determination of the polyline parameters is performed by the parameter calibration loop based on the optimization of an objective function made up of two terms. The ﬁrst term is the mismatch between modeled and observed head data RE (1). The second term is a regularization term for current and previously identiﬁed channels [8]. The objective function writes: Nw Fobj nc; ; NP ðnc Þ = ∑

i=1

!2 di −di′ ðnc Þ σid

nc −1 NPðjÞ

+ ∑ ∑ λj;k ⋅ j=1 k=1

!2 pk ðjÞ−pk′ ðjÞ σkp

;

(nc) is the sum of NP(j) for the ﬁrst channels nc and λj,k is the strength of the regularization of the kth parameter of the jth channel. The regularization term (second term in Eq. (7)) restricts the possible variations in the parameters of the previously identiﬁed channels nc − 1. This strategy is critical to the success of the identiﬁcation strategy. Neglecting it precludes the possibility to modify ﬁrst-order channels when adding higher-order channels. On the opposite, searching all parameters simultaneously would exceed the abilities of existing optimization algorithms. The objective is to allow some degree of freedom to the previously identiﬁed parameters. In fact, the previously identiﬁed channels may be impacted by subsequent channel additions and modiﬁcations while the regularization term forces the former parameters to remain close to their previously identiﬁed values. We denote ncur the current step index and nk the step index where parameter k was introduced. The weighting coefﬁcient λj,k is a function that linearly increases with the number of steps between ncur and nk:

ð7Þ λj;k = ncur ðkÞ−nk : where pk is the kth parameter introduced in the model with channels nc, pk′ is its value derived from previous steps, σkp is the associated error, NP(j) is the number of parameters describing the jth channel, NP

ð8Þ

Increasing the value of λj,k restricts the possible modiﬁcations of the already identiﬁed corresponding parameter, which means that the

Fig. 6. Flow rates ϕ of three tested conﬁgurations, ordered in increasing complexity. The simplest conﬁguration (a) is composed of two straight-line channels. The intermediary conﬁguration (b) is composed of two channels that are more complex. The complex conﬁguration (c) has several second-order well-connected structures.

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

parameters added earlier in the model remain closer to the value identiﬁed in the previous steps. 3.4. Parameter calibration At each iteration, the single channel identiﬁcation loop performs the optimization of the objective function (7) with a restricted number of parameters. In the case of the addition of a new channel, the number of parameters to be identiﬁed is 6: the position of the extremities of the channel (the connected structure and the curvilinear coordinate of the connection point), the log-transmissivity of the channel and the transmissivity of the background matrix. In the case of the modiﬁcation of a channel, the number of parameters to be identiﬁed is 4: the position of the nodes of the polyline (2 parameters), the log-transmissivity of the channel and the log-transmissivity of the background matrix. The background “matrix” transmissivity is systematically calibrated as it is expected to decrease through the identiﬁcation process. We note here that log-transmissivity as a model parameter was found to yield much better results than transmissivity. This is mainly due to the range of transmissivities varying over several orders of magnitude while geometrical parameters have a narrower variation range. The optimization algorithm has been constructed based on the properties of the objective function. Fig. 4 shows the objective function as a function of parameter values. Each curve represents the variations of the objective function according to the variations of a single parameter, the others remaining equal to their optimal value. The 1st and 2nd connected structures refer to the borders or channels

789

connected by the channel (between 0 and 4) and the position refers to the curvilinear coordinate of the intersection along the connected structure. All curves show that the objective function is not convex and displays numerous local minima (Fig. 4b and d). We thus choose a simulated annealing algorithm, which is among the simplest and most common stochastic inversion algorithms [48] used in groundwater calibration problems [14,38,43,56]. We give in Appendix A the chosen implementation of the simulated annealing. The algorithm is stopped either when a maximum number of iterations itmax is reached or when min the objective function falls below a minimum value Fobj . 3.5. Numerical method for solving the direct problem The direct problem consists in solving the ﬂow equation in wellconnected channels embedded in a uniform homogeneous medium. We set up a fast ﬂow simulation method by discretizing the matrix so that the discretization nodes of the matrix include the wells where the observed data are made available. Practically, the discretization grid is regular with a characteristic scale equal to the inter-well distance. The discretized matrix superimposed on the channels issues a network of 1D segments in which we solve the classical ﬂow equation: ∇·ðT∇hÞ = 0;

ð9Þ

where T is the segment transmissivity. The boundary conditions are described in Section 4.1. The discretized equation leads to a linear

Fig. 7. (a) Objective function against iteration number during one particular optimization run out of the 50 optimizations performed for the simple conﬁguration of Fig. 6a. Variation in the parameter values of (b) the ﬁrst identiﬁed channel, (c) the second identiﬁed channel and (d) the background matrix against the number of iterations.

790

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

system solved by the multifrontal method implemented in the software UMFPACK [13].

3.6. Post-processing analysis of the solution distribution Both because of the existence of local minima and because of the use of a stochastic inversion method, the result of the identiﬁcation is not necessarily the absolute minimum of the objective function. When running the identiﬁcation strategy several times with different initial conditions, each solution depends on the initial conditions of the algorithm, represented here by the position of the newly added channels and points within the channels, and on the changes in the conﬁgurations made at each step of the simulated annealing algorithm. For a given problem, we obtain a distribution of sub-optimal solutions. We use these solutions to construct the distribution of solutions in the most external loop of the inversion scheme (Fig. 3). We set up a methodology to analyze the similarities and differences of the simulations based on hierarchical clustering algorithms [12,24]. It consists in ﬁnding the most common channels and in quantifying their probability of occurrence (Fig. 5). The method described in Appendix B enables a limited number of most probable channels also called meta-channels to be extracted from the channel characteristics and sensitivity.

3.7. Synthesis of parameters of the identiﬁcation strategy The identiﬁcation strategy deﬁned in the previous paragraph is controlled by several parameters (Table 1). First, the results of the simulated annealing algorithm depend on the initial conditions, i.e. the initial parameterization and the initialization of the pseudo-random number generator. Second, the balance between the optimization duration and accuracy is controlled by the stopping criteria of the single channel identiﬁcation loop, i.e. the maximal number of iterations itmax min and the expected objective function value Fobj The reﬁnement level of the ﬁnal parameterization depends on the stopping criteria of the hierarchical identiﬁcation loop α1 (Eq. (5)) and α2 (Eq. (6)). The optimization algorithm itself is controlled by the initial temperature, the temperature schedule and the size of the searching neighborhood (Appendix A, 16). The shape of the objective function is a critical factor that depends on the weighting coefﬁcients λj,k of the regularization term in (Eq. (7)). The ﬁnal clustering algorithm based on the analysis of single solutions is controlled by the number of solutions ns used in the analysis, by the weighting coefﬁcient ξi (Appendix B, 18) and by the stopping threshold dc. The parameters have been assigned values that are either those most commonly used in other studies or those resulting from tests in simple conﬁgurations containing one or two straight channels.

4. Results After a synthetic presentation of the tested conﬁgurations, we assess the identiﬁcation strategy on synthetic ﬂow structures of increasing complexity ﬁrst by using head data only, and second by adding geometrical data.

4.1. Flow conﬁgurations and data density We test the identiﬁcation strategy on a set of fracture ﬂow patterns dominated by large fractures. The conﬁgurations are obtained from random fracture networks slightly above the percolation threshold from which we have extracted the elastic backbone. Transmissivities are all equal to 1.0. In these conﬁgurations, ﬂow is channeled within major structures and present second-order ﬂow paths, consistently with the deﬁnition of hierarchical ﬂow. Three typical conﬁgurations of increasing complexity are illustrated in Fig. 6. The simple conﬁguration (Fig. 6a) is composed of two channels, while the intermediary conﬁguration further comprises some secondary ﬂow (Fig. 6b) and the complex conﬁguration has several second-order well-connected structures (Fig. 6c). For each tested conﬁguration, we compute the steady-state head data and extract Nw values from Nw wells regularly distributed over the square domain according to a grid pattern (like in Fig. 4a). These Nw values will be used as the “observed” data di in (Eq. (7)) for the inverse problem. We take the inter-well distance dw as the fundamental length unit. The domain size L is expressed as a function of Nw and dw as L = N1/2 w ·dw. We use Nw = 25 and a typical inter-well distance of the order of 100 m, which corresponds to a domain size of the order of 1 km. The boundary conditions are derived from a ﬁxed regional gradient that is not aligned with the edge of the system, the head gradient being ﬁxed at 1. Instead of (Eq. (7)), results are analyzed according to the squared difference between head data and results Δh2 over the whole domain and obtained with:

2

Δh =

2 1 ∫ h−h′ dS; SS

ð10Þ

where S is the system surface. Practically, this integral is computed by using a 100 × 100-discretization ﬁner than the data density. The advantage of Δh2 compared to RE is that a more precise quantitative measure of the agreement between the observed and resulting heads is obtained.

Fig. 8. Distribution of the squared difference between observed and model results on the whole surface Δh2 (10) derived from the 50 individual solutions for (a) the simple conﬁguration of Fig. 6a, (b) the intermediary conﬁguration of Fig. 6b and (c) the complex conﬁguration of Fig. 6c.

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

791

4.2. Hierarchical inversion with steady-state heads At ﬁrst, we analyze the behavior of the inversion. Fig. 7a–d display the variation of both the objective function and the parameter values as functions of the iteration number during one particular optimization run out of the 50 optimizations performed for the simple conﬁguration of Fig. 6a. The objective function (Fig. 7a) follows several staircase-like decreases that are typical of simulated annealing algorithms [33,45]. The addition of new parameters through the single and hierarchical channel identiﬁcation loops triggers successive sharp increases and decreases. As a whole, the channel addition or modiﬁcation can reduce either signiﬁcantly or marginally the objective function as shown by the addition and the modiﬁcation of the second channel (Fig. 7a). The identiﬁcation strategy stops on a two-channel conﬁguration, showing that the procedure correctly avoids over-parameterization. As expected, the channel and background log-transmissivity values ﬂuctuate much more than the geometrical parameters (Fig. 7b-c) because head data are less sensitive to permeability than ﬂow data. Moreover, transmissivities are only relative to each other, since heads are only sensitive to the difference between channel transmissivities, with the absolute ﬂow being unknown. The channel importance hierarchy is respected since the ﬁrst identiﬁed channel has a transmissivity around three orders of magnitude larger than the next. The background matrix plays efﬁciently its role of substitution of less important channels (Fig. 7d). It has indeed a high relative transmissivity value (Tmat = 100) when only one channel is involved, and a small value (Tmat b 10− 5) when the optimal number of channels is reached. For the three conﬁgurations of Figs. 6, 8 gives the distribution of the squared difference between head data and results Δh2 over the whole domain as derived from the 50 individual solutions. The distribution is given at the intermediary steps containing 1, 2 and 3 channels and at the ﬁnal stage of the identiﬁcation (Fig. 8). It shows that, for the simple and intermediary conﬁgurations, the addition of one and two channels signiﬁcantly restricts the occurrence of large Δh2 values. With three channels, most of the models yield head maps that are close to the observed heads (Δh2 b 10− 1) and the distribution of Δh2 is only slightly shifted towards lower values by the introduction of additional channels. For the complex conﬁguration, however, results are less signiﬁcant and show that the optimal Δh2 value is reached with one or two channels and the resulting Δh2 remains higher (Δh2∼ 10− 0.2). Fig. 9 shows the mean and the standard deviation of the position of the identiﬁed channels resulting from the combination of all solutions. The percentage next to the channel indicates the proportion of channels containing this channel in all the conﬁgurations resulting from optimization. Channels are colored according to their relative transmissivity. For the simple case (Fig. 9a), the method provides a globally good agreement between the 50 solutions and the initial structure. Both channels are well located and have a transmissivity ratio equal to 2.8 of the same order as the real one equal to 1.0. For the intermediary conﬁguration (Fig. 9b), the map shows the presence of the two main ﬂow channels representing slightly less than 45% of all the determined channels while the remaining channels (∼55%) are not in the true model (Fig. 9b). Finally, for the complex conﬁguration (Fig. 9c), the most transmissive structures are sub-vertical and located in the rightmost part of the domain (Fig. 9c). They identify only partly the right vertical crossing channel while the other channels are not resolved but replaced by an average diagonal channel. The hierarchical identiﬁcation method yields good results for the simple case in terms of both location and transmissivity. However, the two main channels of the intermediary conﬁguration are resolved but the solution contains two dummy additional channels. The solution obtained for the complex case is far from the true model. To explain these results, we study what we call the head deviation equal to the difference between the observed head and the head computed for a

Fig. 9. Illustrations of the results of the post-processing clustering algorithm for the conﬁgurations of Fig. 6 by using head data only. The objective function is deﬁned with σih equal to 1.

homogeneous case having the same boundary conditions. The idea is to damp some of the inﬂuence of the boundary conditions and, conversely, to provide graphic illustrations of the information

792

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

Fig. 10. Interpolated maps of the head deviation from the 25 data (a) for the simple conﬁguration of Fig. 6a, (b) for the intermediary conﬁguration of Fig. 6b and (c) for the complex conﬁguration of Fig. 6c.

contained in the head. Fig. 10 displays the “head deviation” for the three tested conﬁgurations. In fact, head deviation maps are consistent with the position of most of the correctly identiﬁed channels except for the complex conﬁguration. Furthermore, the two dummy additional channels obtained for the intermediary conﬁguration are mainly located in poorly sensitive areas. Finally, as expected, Fig. 10 denotes that head data are less sensitive to each channel when the number of channel increases. These results show a strong inﬂuence of the boundary conditions on head sensitivities. 4.3. Use of head deviation To damp some of the inﬂuence of the boundary conditions, we introduce the head deviation information in the inversion process. The principle is to use the inverse of head deviations as head uncertainties (σid in (Eq. (7))) to increase the weight of the heads that are more sensitive to the channel structure than to the boundary conditions. We present the same synthetic map as in Fig. 9 by using hydraulic heads and their deviations (σid = 1/“head deviation”) (Fig. 11). For the simple conﬁguration, the position of the channels remains accurate and the ratio between transmissivities is reduced to 1.4, closer to the true value of 1.0 (Fig. 11a). For the intermediary conﬁguration, introducing the head deviation in the objective function moves the dummy channels to the corner of the domain crossing a zone of low sensitivity (Fig. 11b). We also note that the standard deviation of the channel location is inversely correlated to the head deviation. The two channel transmissivities are almost equal with a ratio of 0.9, which is close to the actual value of 1.0. Four of the ﬁfty solutions obtained in this conﬁguration are displayed in Fig. 12. They all contain the most important ﬂow channel and three of the four contain the secondary channel, while embranchments are not resolved. Regarding the complex conﬁguration (Fig. 11c), the result is not improved by the use of head deviation, since the two diagonal structures crossing the domain from the left to the right do not exist in the true model. However, their importance is limited because of their relatively small transmissivity (Fig. 10c). The inversion process yields relevant results for the simple and intermediary conﬁgurations of Fig. 6a and b. Head data are sufﬁcient and the use of head deviation precludes the occurrence of channels in the areas where head data are poorly sensitive to the channels. However, the exclusive use of head data ﬁnds its limitations in the complex channeling case.

improvements with the addition of basic geometrical information. The objective is to check if the inverse problem can be solved with additional data of a different nature. We deliberately choose to add geometrical data in the same spirit as what has been done when adding a regularization term in more classical inverse problems for continuous permeability structures. An alternative not explored in this paper would be to use transient-state head data. 4.4.1. Geometrical data As basic geometrical data, we use the distance between a well and the nearest non-negligible channel. Although this kind of information is not at ﬁrst designed to be realistic, some of it could be derived from well test interpretation [11] or from geophysical measurements like GPR [25,53]. Introducing explicitly distances in the objective function (7) leads to: !2 2 # hi −h′i ðnc Þ ri −r i′ ðnc Þ + σir σih i=1 !2 nc −1 NP ðjÞ p ðjÞ−p′k ð jÞ + ∑ ∑ λj;k ⋅ k ; σkp j=1 k=1

Nw Fobj nc; ; NP ðnc Þ = ∑

"

ð11Þ

where h stands for observed heads, h′ for simulated heads, r for observed distances, r′ for simulated distances, and σih and σir are the uncertainties linked to head and distance data, respectively. The choice of σih and σir balances the relative importance of head and distance data. 4.4.2. Results with distance data alone We ﬁrst check the possibility to identify the channels with the sole distance data excluding temporarily the head data by choosing σir = 1 and σih = ∞. With distance data only (Fig. 13a), the optimal results of the inversion are made up of a correctly identiﬁed vertical structure on the left. The other channels are not correctly identiﬁed. To explain this result, we analyze the distance map interpolated from the Nw = 25 data points (Fig. 13b). The dummy channels connect indeed small-distance zones but do not locate existing channels. The exclusive use of distance data is limited by the impossibility to identify the right connectivity of small-distance areas, the lack of sensitivity of distances to the hierarchical channel organization and the absence of information provided on transmissivities.

4.4. Addition of geometrical data Because of the impossibility to identify the channels in the complex structure of Fig. 6c with only hydraulic data, we test the possible

4.4.3. Results with heads and distances When used simultaneously, hydraulic and geometrical information are balanced by choosing σir = 2 and σid = 1/“head deviation”.

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

793

solution is more accurate (Fig. 13c) and only the location of the bottom channel is inaccurate. It may be due to the short distance between the ﬂow structures and the wells at the bottom of the model domain. The diagonal structure already observed with only distance or head data is persistent when using both head and distance data. Its importance is however limited by its transmissivity that is twice to four times smaller than that of the other channels. Finally, the use of additional information represented here by the distance data lowers the dispersion of the retrieved structures. It is ﬁrst observed in the error bars of the identiﬁed channels. It is also apparent from the increase in the cumulative percentage of the identiﬁed structures. This increase ranges from 70% in Fig. 11c to 80% in Fig. 13c. 4.4.4. Conclusion For the complex structure of Fig. 6c, the identiﬁcation yields acceptable results only when using head and distance data simultaneously. Transmissivity still appears to be a regulating factor of the importance of the different structures. Resulting structures that are not present in the reference conﬁguration have indeed a signiﬁcantly lower transmissivity. 5. Systematic evaluation of the hierarchical identiﬁcation strategy In this paragraph, we investigate the performance of the identiﬁcation strategy on a larger set of 20 conﬁgurations. We use for the inverse problem the complete set of information, i.e. heads, head deviations and distances. 5.1. Tested conﬁgurations We apply the inversion strategy to the 20 conﬁgurations of Fig. 15. They are numbered in increasing order of apparent degree of complexity and, from C1 to C20, networks contain gradually more intricately connected channels. All transmissivities are equal to 1.0. The conﬁgurations analyzed in the previous sections are C2, C10 and C12. Even if this visual rating is subjective, it is closely consistent with the product of the channeling indicators Dcc and Dic introduced in Le Goc et al. [36] (Table 2). Dcc is a characteristic channel continuity scale taken as the distance over which ﬂow rates remain consistently high. Dic is a characteristic inter-channel distance, with Dic/L N 0.5 denoting a hierarchical ﬂow. Therefore, the channeling degree increases with Dcc·Dic. 5.2. Analyses of results

Fig. 11. Illustrations of the results of the post-processing clustering algorithm for the conﬁgurations of Fig. 6 by using head data and head deviation.

Even if distances cannot be used alone, they provide additional information that is different from that provided by hydraulic data. Thus, combining head and distance data yields a much better identiﬁcation (Fig. 13c). Most of the principal channels are resolved and single solutions contain at most three channels including, for more than half of them, the main channels (Fig. 14). The optimal

The results of the identiﬁcation strategy are compared to the reference conﬁgurations in Fig. 15. The agreement between the results and the true models globally tends to decrease with the apparent complexity as measured by the number of the conﬁguration. This is conﬁrmed for most of the tested conﬁgurations by the increase in RE as a function of the conﬁguration complexity as measured by Dcc·Dic/L2 (black squares in Fig. 16). Three more complex conﬁgurations (C14, C15, and C20) depart from this tendency with still low RE values (white squares in Fig. 16). These smaller RE values, as compared to those of the other conﬁgurations having similar Dic.Dcc/L2 values, are characterized by the absence of the largest head deviations ampliﬁed by the squared difference in RE (Eq. (1)). If Dic and Dcc give a global rating of the ﬂow structure complexity and the ﬁnal value of RE gives a global rate of agreement between the ﬁnal solution and the reference structure, we are also interested in the quality of the identiﬁcation of the structural characteristics. Therefore, we use two additional visual indicators for the number of independent channels and for the agreement between the solution and the reference structure. These indicators are clearly subjective. However,

794

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

Fig. 12. Four individual solutions of the inversion method applied to the intermediary complex conﬁguration of Fig. 6b. Fobj refers to the value of the objective function (7). The individual solutions identify the diagonal main ﬂow structure represented in green. Three of them identify the secondary ﬂow structures in red. Embranchments are structures too small to be resolved and are identiﬁed only in the third result (blue). The channeled structure is recalled in the background.

Fig. 13. Results for the complex conﬁguration of Fig. 6c. (a) Results of the post-processing clustering algorithm with the objective function deﬁned with distances alone. (b) Interpolated maps of the distances from the 25 data points. (c) Results with the objective function deﬁned using both heads and distances.

for the simple structures handled here, we assume that the variability in their estimation remains limited. The number of independent channels ranges from 1 for C1 to 6 for C19. The visual quality of the agreement between the reference and resulting structures increases from 1 to 5 (Table 2). As previously noticed, the increase in the channel number, and hence in the ﬂow structure complexity, induces on average a decrease in the solution quality (Fig. 17). We identify two different groups. The ﬁrst group comprises the wellidentiﬁed structures characterized by a visual rating greater or equal to 3.

Apart from C13 and C18, it also corresponds to the networks made up of less than 4 independent channels where ﬂow is hierarchical (Dic/L N 0.5). The other conﬁgurations including C14, C15, C17, C19, and C20 have more than 4 independent channels and lead to a visual rating smaller or equal to 2. These more complex conﬁgurations do not correspond to a hierarchical ﬂow (Dic/L b 0.5), which explains the lack of efﬁciency of the proposed method. We ﬁnally note that the three complex conﬁgurations C14, C15 and C20 having a small RE value are among the worse ones in terms of visual rating, consistently with their small Dic/L values.

Fig. 14. Four individual solutions in red and their associated objective function value obtained for the identiﬁcation strategy with the full objective function applied to the conﬁguration of Fig. 6c. The reference conﬁguration is recalled with gray colors in the background and Fobj refers to the complete objective function (11).

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

795

Fig. 15. Conﬁgurations used to test the identiﬁcation strategy. Left and right columns show the reference conﬁguration and the modeling result given by the post-processing analysis. Green dots stand for well locations.

796

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

Fig. 15 (continued).

Consequently, the mismatch between the resulting and reference heads gives only a very crude estimate of the inversion quality. 6. Discussion We discuss ﬁrst the results obtained in Sections 4 and 5, and second the extension of the method to more classical transient-state data. 6.1. Discussion of the results To be able to solve the inverse problem with increasingly complex channel structures, we have progressively added more information. The simplest conﬁgurations (C1 to C4, Fig. 15) can be identiﬁed with the sole knowledge of hydraulic data. The wells surround the channels and the characteristic inter-well distance is smaller than the channel scale, a case for which identiﬁcation has already been proved possible with only head data and a restricted number of nine wells [49]. For more complex conﬁgurations made up of three and four channels (C5 to C13, Fig. 15), the knowledge of the boundary conditions and distance from the well to

the nearest channels are sufﬁcient to determine most of the channel organization. The most complex among the handled structures (C14 to C20, Fig. 15) cannot be identiﬁed. A possible explanation is that the data obtained from the 25 piezometers do not contain enough information. Consequently, we have applied the hierarchical identiﬁcation strategy to 100 wells instead of 25 and found no signiﬁcant improvement on the identiﬁed channels. The limitation may not come from the quantity of data but from the impossibility for the algorithm to identify conﬁgurations. In fact, a more detailed analysis shows that conﬁgurations C14 to C20 are not extremely channeled since transmissivities are all equal. Because the hierarchical ﬂow organization assumption is not valid, conﬁgurations C14 to C20 cannot be identiﬁed with the proposed algorithm and should be treated with more classical inverse problem strategies. This shows that the nature of the ﬂow structure between highly channeled and more evenly distributed should be determined a priori. This information might be given by other data like the distribution of ﬂowing fractures or of water inﬂows in wells. It might also be deduced from a ﬁner analysis of the hydraulic data, head and ﬂow distribution, a possibility that we will explore in the future with transient-state data.

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

797

6.2. Potential extension to more classical transient-state data Model calibration based on more advanced hydraulic and geometrical data has been the topic of several studies. So far, most inverse problem methods seeking discrete ﬂow structures have been performed on more complex data but with a simpler parameterization. Doughty et al. [19] sought IFS parameters using transient-state drawdown data. Day-Lewis et al. [14] determined inter-well connectivity structures also using transient-state drawdown data. Bruyelle et al. [5] worked on a known fracture network structure and restricted the parameterization to the identiﬁcation of the fracture transmissivities. Gwo [27] determined the occupation probability of links on a discrete structure using transport solute breakthrough data. In this paper, we proceed in two steps. First, we identify complex structures with basic steady-state head and geometrical data, and second we replace these simple data with more advanced geometrical and hydraulic data. Geophysical data will likely not give such precise information as the channel closest to a well but will yield some similar information that should be used to constrain the hydraulic structure close to the wells. Concerning hydraulic data, the objective is to use transient-state data from a more restricted number of wells. The interest in using transient-state data is that they give more information on the ﬂow structure and are less dependent on the boundary conditions than their steady-state counterpart. At not too late times, the hydraulic response is more inﬂuenced by the pumping location than by the far away boundary conditions. Pumping systematically in different wells provides a possible varying coverage of the sensitive zones while potentially modifying the main ﬂow structures (e.g. hydraulic tomography [1,30,41]). Since our identiﬁcation strategy relies on the hierarchical organization of the ﬂow channels, it may lead to different and only partially consistent ﬂow structures for varying pumping locations. Combining them in a consistent discrete transmissivity structure is the ﬁrst critical issue in shifting from steady-state to transient-state ﬂow data [20]. 6.3. Optimization of the hierarchical identiﬁcation strategy A key issue in shifting to transient-state simulation is the increased computational demand. Choosing 10 discrete times in 10 pumping tests performed in 10 of the 25 previous wells leads to the necessity to

Table 2 Residual errors RE deﬁned by Eq. (1), channeling indicators (Dic and Dcc) associated to the tested conﬁgurations (Fig. 15), threshold of the clustering algorithm (dc) and visual indicators of the channel number and solution quality. Dic/L and Dcc/L are in the interval [0;1]. The number of channel and the quality of the solution range between 0 and 5. Conﬁguration

RE

Dic/L

Dcc/L

dc

Visual number of channels

Visual quality of the solution

C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 C11 C12 C13 C14 C15 C16 C17 C18 C19 C20

0.0014 0.0080 0.0033 0.0080 0.0059 0.0083 0.023 0.0059 0.0080 0.0099 0.020 0.020 0.050 0.012 0.0068 0.029 0.037 0.034 0.044 0.013

0.97 0.65 0.94 0.72 0.85 0.62 0.86 0.79 0.65 0.65 0.64 0.75 0.40 0.34 0.46 0.63 0.42 0.30 0.26 0.43

1.0 0.97 0.98 0.92 0.89 0.88 0.96 0.94 0.92 0.97 0.98 0.78 0.98 0.56 0.65 0.77 0.76 0.71 0.62 0.76

3.5 1.0 1.0 1.5 1.0 2.0 1.5 1.5 0.5 0.7 1.0 0.5 1.0 0.5 1.3 1.0 0.4 1.4 1.0 1.5

1 2 2 2 2 2.5 2 2 2.5 2 2.5 3.5 3.5 4 4 3 3.5 5 6 4

5 5 5 5 4.5 4 4 4 3.5 4 3.5 4 3 1 1 4 2 3 1 1.5

Fig. 16. Quadratic mismatch sum RE between reference data and modeling results as a function of the product of channeling indicators Dic and Dcc normalized by the square of the system size L2. Black squares identify conﬁgurations for which RE increases with more complex conﬁgurations corresponding to smaller Dic·Dcc/L2 values. White squares stand for three complex conﬁgurations resulting in small errors RE.

solve 250 ﬂow problems at speciﬁed times. The most appropriate method is the Laplace method with the Stehfest algorithm for the inverse Laplace transform [47]. To achieve a reasonable precision, eight full ﬂow problems need to be solved at each time. In this example, the computational load increases by a factor of 2000 compared to a single steady-state ﬂow simulation. The optimization algorithms should thus be improved to lower the number of ﬂow simulations by one to three orders of magnitude. We argue that this may be possible, ﬁrst because we took a very simple simulated annealing algorithm and did not optimize its performances, and second because transient-state ﬂow data contain more information than steady-state ﬂow data. The simulated annealing algorithm may be replaced by more advanced optimization methods like the Covariance Matrix Adaptation or Monte-Carlo Markov Chain methods [23,29]. The combination of the different terms of the objective functions may also be improved to minimize the required number of simulations [2,9,42]. The other interest in modifying the objective function is to increase its regularity and use gradient-like optimization methods. The optimization method may combine an initial stochastic search and a subsequent gradient-like scheme like what is done for the identiﬁcation of permeability zones [50].

Fig. 17. Visual rating of the ﬁnal post-processing solution versus the visual channel number.

798

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

7. Conclusion In this paper, we present a hierarchical identiﬁcation method designed to identify both hydraulic properties and hydraulic structures using simple steady-state hydraulic and geometrical data. The hierarchical identiﬁcation method is derived from the hierarchical organization of ﬂow. The method identiﬁes ﬁrst the main ﬂow channels before calibrating second-order channels. We test the method on synthetic conﬁgurations typical of low-permeability rocks where ﬂows concentrate in highly transmissive ﬂow paths within a restricted number of fractures. Conﬁgurations range from a couple of straight channels to conﬁgurations comprising up to six interconnected channels. The principal novelty of the proposed approach is to focus on preferential ﬂow paths rather than on equivalent media properties. This is why all other ﬂow settings and optimization algorithms have been taken as simple as possible. The boundary conditions are derived from a uniform head gradient and synthetic data are made up of steady-state hydraulic heads. For the most complex conﬁgurations, additional data including distances from the wells to the nearest channel are used. We use these distance data as highly-idealized data possibly provided by near-well geophysical measurements. Channels are identiﬁed in decreasing order of importance by using successive optimizations of an objective function. The identiﬁcation strategy takes advantage of the hierarchical ﬂow organization to restrict the dimension of the solution space of each individual optimization. Because of the successive optimizations, main ﬂow channels strongly determine the characteristics of secondary channels. Additionally, main ﬂow channels can be slightly modiﬁed by secondary channels through the introduction of a regularization term in the main channel characteristics in the objective function. The regularization term is weighting the rate of variations of the formerly identiﬁed structures compared to the newly added structures. Reﬁnement of main channels is performed ﬁrst before introducing additional less essential channels. The identiﬁcation is stopped when both the improvements of the channel structures and the channel number become marginal. A matrix has been introduced in order to replace the channels that should be identiﬁed later. The matrix permeability sharply decreases as the identiﬁcation proceeds. Modiﬁcations of the objective function have been introduced to allow marginal modiﬁcations of the main ﬂow structures by secondary ﬂow structures. The classical simulated annealing method has been chosen as the optimization algorithm because of the strong non-convex nature of the objective function. For each conﬁguration, the identiﬁcation strategy has been run 50 times yielding 50 solutions. The prevalent channels are extracted from the 50 solutions by a post-processing algorithm. We conclude that the simplest conﬁgurations comprising a couple of straight channels are identiﬁed with only the steady-state head data. The identiﬁcation of similar conﬁgurations having complex rather than straight channels requires additionally the knowledge of the head deviations. For intricate interconnected conﬁgurations comprising up to three channels with complex shapes, head data are insufﬁcient and the identiﬁcation requires additional data. In this paper, we have introduced the distances from the wells to the closest channel as highly-idealized geometrical information in the channeling structure. The tested conﬁgurations comprising more than four complex channels were not identiﬁed even with both hydraulic and geometric data because of the absence of a strong enough hierarchy in the channel structure. Extension of the method to transient-state head data obtained from a more restricted number of wells is the natural outcome of this work. It will require improving the optimization method to lower the number of direct ﬂow simulations by one to two orders of magnitude. Acknowledgements The French National Research Agency ANR is acknowledged for its ﬁnancial funding through the MOHINI project (ANR-07-VULN-008)

and for its contribution to the development of numerical methods through the MICAS project (ANR-07-CIS7-004). Additional funding was provided by the French Association for Research and Technology ANRT (CIFRE-747/2006). The authors thank Olaf Kolditz and three anonymous reviewers for their helpful and insightful comments. Appendix A. Simulated annealing parameterization Simulated annealing performs a random walk in the parameter space directed towards the minimization of the objective function [34]. The acceptance of parameter sets leading to an increase in the objective function is necessary to get out of local minima. Its probability, however, decreases slowly in order to force the algorithm to converge to the global minimum. The simulated annealing algorithm is parameterized by the acceptance mechanisms of worse parameter sets, by the deﬁnition of the random walk in the parameter space and by its condition of termination. The probability of acceptance is handled by an energy criterion characterized by a temperature. The scheduled temperature has been chosen according to Ingber [32]: 1 Tk = Tinit ⋅exp −ci ⋅kD ;

ð12Þ

where Tinit is the initial temperature, D is the dimension of the parameter space, k is the number of accepted states and ci is a userdeﬁned value that can be adapted to improve the algorithm performances, the default value of which is set to: ci = − log 10

−5

1 : ⋅exp − log100 D

ð13Þ

The acceptance test deﬁnes whether a new parameter state is accepted or rejected. It writes: F −F u b exp − i i−1 ; Ti

ð14Þ

where u is a random value drawn from a uniform distribution in [0;1], Fi is the value of the objective function with the new parameters, Fi − 1 is the value of the objective function at the last accepted state and Ti is the current temperature. Consequently, if the new parameters induce a decrease in the objective function, they are accepted for sure. Otherwise, they are accepted with a probability that depends on how they penalize the solution. This probability decreases as temperature decreases. The random walk within the parameter space is characterized by the parameter modiﬁcations from steps i to i + 1. For parameter p j: j j j j pi = pi−1 + yi ⋅ pmax −pmin ;

ð15Þ

j j and pmax are the minimal and maximal possible values for where pmax j p , and yi is the following scaling factor:

yi = sign ðu−0:5Þ⋅Ti ⋅

1 2⋅u−1 1+ −1 ; Ti

ð16Þ

where u is a random value drawn from a uniform distribution in [0;1]. With this formulation, the random walk shifts progressively from global to local thanks to the decrease in temperature. The optimization stops when the objective function becomes smaller than a ﬁxed minimal value Fi ≤ Fmin or when the maximal number of iterations allowed is reached.

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

Appendix B. Clustering algorithm The clustering algorithm takes as input a set of channels and groups them together according to the distance between channels, accounting for their relevancy and their sensitivity. The degree of similarity between two channels i and j is given by the distance between them: 2

2 312 yi ðxÞ−y j ðxÞ dx7 6 dij = 4 ∫ 5 Δ2ij Δ

;

ð17Þ

ij

where i and j are the indices of the two channels, y i(x) and y j(x) characterize the ordinate of the channels corresponding to abscissa x and Δij is the overlapping x-range between the channels. The clustering algorithm gathers the channels in meta-channels by minimizing the distances between their elements. It is controlled by the maximum distance dc between two channels in a meta-channel. Large dc values reduce the number of meta-channels but may lower their internal consistency. On the contrary, small dc values yield larger numbers of meta-channels with higher internal consistency. After investigating the clustering results for several different conﬁgurations, we ﬁx dc between 5% and 35% of the characteristic domain scale L. Each meta-channel is then characterized by its mean position and transmissivity, and the uncertainty of its parameters. The mean location of a meta-channel is derived from the position of its components weighted by the coefﬁcients ξi given by: ξi =

Si ; i Fobj

ð18Þ

i is the value of the objective function for the solution where Fobj containing channel i and Si is the weight of the channel within this solution. The transmissivity of the meta-channel is chosen as the geometric mean weighted by the coefﬁcients ξi of its components. Larger weights ξi are obtained for smaller objective function values and for more sensitive channels. Si is more difﬁcult to obtain than the ﬁnal value of the objective function. First, we rank the channels within a solution by their relative weight. The weight of channel i (Si) is given by the difference in RE values (Eq. (1)) between the whole solution and the whole solution from which channel I has been removed. Second, from a system void of channels, we add progressively the channels within the solution according to their decreasing weight as deﬁned above. We deﬁne Si as the difference in RE values before and after the addition of channel i. With this method, the channel with the largest Si is likely to be the main channel. The uncertainty on the meta-channel position is obtained as the standard deviation of its components weighted by the coefﬁcients ξi.

References [1] Brauchler R, Liedl R, Dietrich P. A travel time based hydraulic tomographic approach. Water Resour Res 2003;39:1370. [2] Bruckner G, Handrock-Meyer S, Langmach H. An inverse problem from the 2D groundwater modelling. Inverse Probl 1998;14:835–51. [3] Bruel D. Using the migration of the induced seismicity as a constraint for fractured Hot Dry Rock reservoir modelling. Int J Rock Mech Min Sci 2007;44:1106–17. [4] Bruyelle J, Lange A. Automated characterization of fracture conductivities from well tests inversion, 71st EAGE Conference & Exhibition, 8-11 June 2009. Amsterdam, The Netherlands: Society of Petroleum Engineers; 2009. [5] Bruyelle J, Lange A. An extended evolution strategy for the characterization of fracture conductivities from well tests. Proceedings of the 11th Annual conference on Genetic and evolutionary computation. Québec, Canada: ACM, Montreal; 2009. [6] Cacas MC, Ledoux E, de Marsily G, Tillie B, Barbeau A, Durand E, et al. Modeling fracture ﬂow with a stochastic discrete fracture network: calibration and validation, 1, the ﬂow model. Water Resour Res 1990;26:479–89. [7] Cacas MC, Ledoux E, de Marsily G, Tillie B, Barbeau A, Durand E, et al. Modeling fracture ﬂow with a stochastic discrete fracture network: calibration and validation, 2, the transport model. Water Resour Res 1990;26:491–500.

799

[8] Carrera J, Alcolea A, Medina A, Hidalgo J, Slooten LJ. Inverse problem in hydrogeology. Hydrogeol J 2005;13:206–22. [9] Carrera J, Neuman SP. Estimation of aquifer parameters under transient and steady state conditions: 1. Maximum likelihood method incorporating prior information. Water Resour Res 1986;22. [10] Chang L-F, Sun NZ, Yeh WW-G. Optimal observation network design for parameter structure identiﬁcation in groundwater modeling. Water Resour Res 2005;41. [11] Chang YC, Yeh HD, Huang YC. Determination of the parameter pattern and values for a one-dimensional multi-zone unconﬁned aquifer. Hydrogeol J 2008;16:205–14. [12] Chelcea S, Bertrand P, Trousse B. A New Agglomerative 2–3 Hierarchical Clustering Algorithm, Innovations in Classiﬁcation, Data Science, and Information Systems; 2005. p. 3–10. [13] Davis TA. Algorithm 832: UMFPACK, an unsymmetric-pattern multifrontal method. ACM Trans Math Softw 2004;30:196–9. [14] Day-Lewis FD, Hsieh PA, Gorelick SM. Identifying fracture-zone geometry using simulated annealing and hydraulic-connection data. Water Resour Res 2000;36: 1707–21. [15] de Dreuzy J-R, Davy P, Bour O. Hydraulic properties of two-dimensional random fracture networks following a power law length distribution 1. Effective connectivity. Water Resour Res 2001;37:2065–78. [16] de Dreuzy J-R, Davy P, Bour O. Hydraulic properties of two-dimensional random fracture networks following a power law length distribution 2. Permeability of networks based on lognormal distribution of apertures. Water Resour Res 2001;37. [17] de Marsily G, Delhomme J-P, Delay F, Buoro A. Four decades of inverse problems in hydrogeology. In: Zhang DW, C.L., editors. Theory, modeling and ﬁeld investigation in hydrogeology : a special volume in honor of Shlomo P. Neuman's 60th birthday. Colorado, USA: Geological society of America, Boulder; 2000. p. 1–17. [18] Donado LD, Sanchez-Vila X, Ruiz E, Elorza FJ, Bajos C, Vela-Guzman A. Calibration of hydraulic and tracer tests in fractured media represented by a DFN model. In: IAHS W, UK, editors. International Conference on Calibration and Reliability in Groundwater Modelling : From Uncertainty to Decision Making, IAHS, Wallingford, UK, Hague , The Netherlands; 2005. [19] Doughty C, Long JCS, Hestir K, Benson SM. Hydrologic characterization of heterogeneous geologic media with an inverse method based on iterated function systems. Water Resour Res 1994;30:1721–45. [20] Fienen MN, Clemo T, Kitanidis PK. An interactive Bayesian geostatistical inverse protocol for hydraulic tomography. Water Resour Res 2008;44:W00B01. [21] Fogg GE. Groundwater ﬂow and sand body interconnectedness in a thick, multiple aquifer system. Water Resour Res 1986;22:679–94. [22] Francese R, Mazzarini F, Bistacchi A, Morelli G, Pasquare G, Praticelli N, et al. A structural and geophysical approach to the study of fractured aquifers in the Scansano-Magliano in Toscana Ridge, southern Tuscany, Italy. Hydrogeol J 2009;17:1233–46. [23] Fu J, Gómez-Hernández J. A blocking Markov Chain Monte Carlo method for inverse stochastic hydrogeological modeling. Math Geosci 2009;41:105–28. [24] Gordon AD. Hierarchical classiﬁcations. Classiﬁcation, 2nd edition (Monographs on Statistics and Applied Probability, 82). Chapman & Hall / CRC; 1999. p. 69–111. [25] Grasmueck M. 3-D ground-penetrating radar applied to fracture imaging in gneiss. Geophysics 1996;61:1050–64. [26] Grenier C, Bernard-Michel G, Benabderrahmane H. Evaluation of retention properties of a semi-synthetic fractured block from modelling at performance assessment time scales (Aspo Hard Rock Laboratory, Sweden). Hydrogeol J 2009;17:1051–66. [27] Gwo J-P. In search of preferential ﬂow paths in structured porous media using a simple genetic algorithm. Water Resour Res 2001;37. [28] Hanor JS. Effective hydraulic conductivity of fractured clay beds at a hazardous waste landﬁll; Louisiana Gulf Coast. Water Resour Res 1993;29:3691–8. [29] Hansen N, Ostermeier, A. Adapting arbitrary normal mutation distributions in evolution strategies: the covariance matrix adaptation. IEEE International Conference on Evolutionary Computation 1996, IEEE Press, 1996, p. 312–7. [30] Hao YH, Yeh TCJ, Xian JW, Illman WA, Ando K, Hsu KC, et al. Hydraulic tomography for detecting fracture zone connectivity. Ground Water 2008;46:183–92. [31] Hill MC. The practical use of simplicity in developing groundwater models. Ground Water 2006;44:775–81. [32] Ingber AL. Adaptive Simulated Annealing (ASA). McLean, VA: Lester Ingber Research; 1993. [33] Ingber AL. Simulated annealing — practice versus theory. Math Comput Model 1993;18:29–57. [34] Kirkpatrick S, Gelatt Jr. CD, Vecchi MP. Optimization by simulated annealing. Science 1983;220:671–80. [35] Lavenue M, de Marsily G. Three-dimensional interference test interpretation in a fractured aquifer using the pilot point inverse method. Water Resour Res 2001;37. [36] Le Goc R, de Dreuzy JR, Davy P. Statistical characteristics of ﬂow as indicators of channeling in heterogeneous porous and fractured media. Adv Water Resour 2010;33:257–69. [37] Long JCS, Remer JS, Wilson CR, Witherspoon PA. Porous media equivalents for networks of discontinuous fractures. Water Resour Res 1982;18. [38] Marryote RA, Dougherty DE, Stollar RL. Optimal groundwater management; 2. Application of simulated annealing to a ﬁeld-scale contamination site. Water Resour Res 1993;29:847–60. [39] Martinez-Landa L, Carrera J. An analysis of hydraulic conductivity scale effects in granite (Full-scale Engineered Barrier Experiment (FEBEX), Grimsel, Switzerland). Water Resour Res 2005;41:13. [40] Mauldon AD, Karasaki K, Martel SJ, Long JCS, Landsfeld M, Mensch A, et al. An inverse technique for developing models for ﬂuid ﬂow in fracture systems using simulated annealing. Water Resour Res 1993;29:3775–89.

800

R. Le Goc et al. / Advances in Water Resources 33 (2010) 782–800

[41] McDermott CI, Sauter M, Liedl R. New experimental techniques for pneumatic tomographical determination of the ﬂow and transport parameters of highly fractured porous rock samples. J Hydrol 2003;278:51–63. [42] McLaughlin D, Townley LR. A reassessment of the groundwater inverse problem. Water Resour Res 1996;32. [43] Nakao S, Najita J, Karasaki K. Hydraulic well testing inversion for modeling ﬂuid ﬂow in fractured rocks using simulated annealing: a case study at Raymond ﬁeld site, California. J Appl Geophys 2000;45:203–23. [44] Renshaw CE. Estimation of fracture zone geometry from steady-state hydraulic head data using iterative sequential cokriging. Geophys Res Lett 1996;23. [45] Sambridge M, Mosegaard K. Monte Carlo methods in geophysical inverse problems. Rev Geophys 2002;40. [46] Silliman SE. An interpretation of the difference between aperture estimates derived from hydraulic and tracer tests in a single fracture. Water Resour Res 1989;25: 2275–83. [47] Stehfest H. Remark on algorithm 368. Numerical inversion of laplace transforms. Commun ACM 1970;13:47–9. [48] Tarantola, A. Inverse Problems Theory, Methods for Data Fitting and Model Parameter Estimation. Elsevier: Netherlands; 1987. [49] Tiedeman CR, Shieh PA, Christian SB. Characterization of a high-transmissivity zone by well-test analysis: Steady state case. Water Resour Res 1995;31:27–37.

[50] Tsai FT-C, Sun N-Z, Yeh WW-G. A combinatorial optimization scheme for parameter structure identiﬁcation in ground-water modeling. Ground Water 2003;41:156–69. [51] Tsang C-F, Neretnieks I. Flow channeling in heterogeneous fractured rocks. Rev Geophys 1998;36. [52] Tsang YW, Tsang CF. Flow channeling in a single fracture as a two dimensional strongly heterogeneous permeable medium. Water Resour Res 1989;25:2076–80. [53] Tsoﬂias GP, Van Gestel JP, Stoffa PL, Blankenship DD, Sen M. Vertical fracture detection by exploiting the polarization properties of ground-penetrating radar signals. Geophysics 2004;69:803–10. [54] Vesselinov VV, Neuman SP, Illman WA. Three-dimensional numerical inversion of pneumatic cross-hole tests in unsaturated fractured tuff 2. Equivalent parameters, highresolution stochastic imaging and scale effects. Water Resour Res 2001;37:3019–41. [55] Yeh WWG, Yoon YS. Aquifer parameter identiﬁcation with optimum dimension in parameterization. Water Resour Res 1981;17:664–72. [56] Zheng C, Wang P. Parameter structure identiﬁcation using tabu search and simulated annealing. Adv Water Resour 1996;19:215–24. [57] Zimmerman DA, de Marsily G, Gotway CA, Marietta MG, Axness CL, Beauheim RL, et al. A comparison of seven geostatistically based inverse approaches to estimate transmissivities for modeling advective transport by groundwater ﬂow. Water Resour Res 1998;34:1373–413.

An inverse problem methodology to identify flow channels in fractured media using synthetic steady-state head and geometrical data

An inverse problem methodology to identify flow channels in fractured media using synthetic steady-state head and geometrical data

Recommend Documents