1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning. Part 2: Applications and software

1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning. Part 2: Applications and software

Journal Pre-proof 1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning. Part 2: Applications and software H...

3MB Sizes 0 Downloads 6 Views

Journal Pre-proof 1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning. Part 2: Applications and software Hadrien Michel, Thomas Hermans, Thomas Kremer, Ann Elen, Frédéric Nguyen PII:

S0098-3004(19)30602-8

DOI:

https://doi.org/10.1016/j.cageo.2020.104456

Reference:

CAGEO 104456

To appear in:

Computers and Geosciences

Received Date: 18 June 2019 Revised Date:

20 December 2019

Accepted Date: 20 February 2020

Please cite this article as: Michel, H., Hermans, T., Kremer, T., Elen, A., Nguyen, Fréé., 1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning. Part 2: Applications and software, Computers and Geosciences (2020), doi: https://doi.org/10.1016/j.cageo.2020.104456. This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2020 Published by Elsevier Ltd.

1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning. Part 2: Applications and software Authors1: Hadrien MICHEL1,2,3, Thomas HERMANS2, Thomas KREMER1, Ann ELEN4, Frédéric NGUYEN1 (1) University of Liège, Urban and Environmental Engineering Department, Faculty of Applied Sciences, Liège, Belgium, (2) Ghent University, Department of Geology, Ghent, Belgium, (3) F.R.S.-FNRS (Fonds de la Recherche Scientifique), Brussels, Belgium, (4) KU Leuven, Department of Earth and Environmental Sciences, Leuven, Belgium Corresponding author: Hadrien MICHEL University of Liège Allée de la découverte, 9 B-4000, Liège (Belgium) E-mail: [email protected] Declaration of interest: none Link to the code: https://github.com/hadrienmichel/BEL1D Highlights: -

Application of the Bayesian Evidential Learning 1D imaging framework.

-

A set of MATLAB graphical user interfaces is provided.

-

Consistent results for both SNMR and surface waves.

-

BEL1D can be generalized to any type of 1D geophysical data.

1

Authorship statement: Hadrien Michel developed the BEL1D codes and produced the results presented in this paper. Thomas Kremer has significantly contributed and improved all the SNMR related elements developed in this paper. The Mont Rigi SNMR dataset was acquired and pre-processed by Ann Elen for her Master Thesis. Frédéric Nguyen and Thomas Hermans both co-supervised the whole work. 1

Abstract: Bayesian Evidential Learning 1D imaging (BEL1D) is a framework that enables the rapid interpretation of geophysical data in terms of 1D models of the subsurface with associated uncertainty. In this paper, we present a set of MATLAB toolboxes for the interpretation of SNMR data and dispersion curves from surface waves methods under the BEL1D framework, along with some results on field data. The obtained results are compared to the solutions provided by other approaches for the interpretation of these datasets, to demonstrate the robustness of BEL1D. Even if the method is presented for these two specific applications, it is not constrained to any type of geophysical data, as it only requires the knowledge of the forward model to be used. A generalized toolbox is therefore also proposed, only requiring the user to provide a consistent forward model in order to use BEL1D. Keywords:

1D imaging, uncertainty, Prediction Focused Approach, Bayesian Evidential

Learning,

machine

learning,

inversion,

SNMR,

2

Surface

Waves,

MATLAB.

1

1. Introduction

2

Geophysics offers a large ensemble of methods to investigate subsurface properties.

3

However, building consistent images of the subsurface based on geophysical data remains a

4

challenge for all methods. Often, this process is handled by deterministic inversion algorithms

5

(Aster et al., 2013). Such methods consist in minimizing an objective function (often the sum

6

of the data misfit and a model misfit) using gradient-based approaches. Doing so requires the

7

computation of the (often large) Jacobian matrix of the problem at each iteration, a CPU-

8

costly operation. To establish uncertainty estimates on the models, deterministic inversions

9

typically relies on linear error propagation (Aster et al., 2013; Kemna et al., 2007).

10

More recently, due to the rise of computer performances, stochastic methods have

11

been applied to geophysics and are able to quantify more properly uncertainty. Most

12

algorithms rely on Markov chains Monte Carlos (McMC) methods in order to explore the

13

prior model space (Sambridge, 2002). Those methods offer the advantage to provide a much

14

more complete appraisal of the ensemble of possible solutions. Nevertheless, stochastic

15

methods are often linked to high computation costs, since they require a large number of

16

forward model runs to sample the posterior distribution in the prior model space.

17

For example, deterministic inversion is routinely applied for the interpretation of

18

surface nuclear magnetic resonance data (SNMR), where QT-inversion (an inversion that

19

takes into account the full pulse-moment-time dataset) is the state of the art methodology

20

(Müller-Petke and Yaramanci, 2010). To estimate uncertainty on the obtained model, Müller-

21

Petke et al. (2016) proposed a bootstrap algorithm. On the other end of the spectrum, surface

22

waves dispersion curves are more and more often interpreted using stochastic approaches

23

(Socco et al., 2010), using genetic algorithm (GA), simulated annealing (SA), and

24

neighborhood algorithm (NA) which all sample models randomly under a guidance system.

3

25

In this paper, we present applications of a new stochastic method for imaging the

26

subsurface called Bayesian Evidential Learning 1D imaging (BEL1D), a new framework for

27

modeling the subsurface with geophysical data relying on the constitution of statistics-based

28

relationships between model parameters and data from geophysical experiments. The

29

algorithm consists of following steps (Figure 1Error! Reference source not found.) (see

30

companion paper ‘1D geological modeling of the subsurface from geophysical data with

31

Bayesian Evidential Learning Part 1: Method and demonstration’):

32



33 34

(yellow box in Figure 1) and associated data

(blue box in Figure 1) by forward modeling •

Reduction of the dimensionality of the data and model using principal ( )

component analysis (PCA):

35 36

Generation of prior models



and

( )

Constitution of statistical relationships between the models parameters and the

37

reduced data (green cloud in Figure 1), using canonical correlation analysis

38

(CCA) :

39



(

,

distribution (

41

density estimators •

(

,

)

Generation of posterior distributions of

40

42

)

,

) to field data

by constraining the bivariate (red box in Figure 1) using kernel

Sampling of the constituted distributions and back-transformation of the

43

samples into the original space

44

subsurface

45

(purple box in Figure 1)

, delivering a set of 1D models of the

that are constrained to the knowledge of the geophysical data

46

The method offers the advantage not to require multiple runs of the CPU-costly

47

inversion of the data, as it is based solely on the use of the forward model. BEL1D offers an

48

alternative to some stochastic algorithms, often requiring much larger computation times, as it 4

49

relies on a limited number of independent prior samples to deduce the posterior distribution.

50

One of the key advantage of the approach is that the relationship between the model

51

parameters and the data can be reused for several data sets, as long as they belong to similar

52

prior model, largely reducing the computational cost of BEL1D. On the other hand, the

53

method is also an efficient alternative to deterministic inversion, as the CPU cost remains

54

acceptable and the results are much more informative due to their probabilistic nature. Above

55

these advantages, BEL1D does not require the definition of regularization parameters or of

56

any other form of bias. In the algorithm presented in this paper, we only consider a finite

57

number of layers in the models. This approach,

58

methodologies (e.g. García-Jerez et al., 2016; Günther and Müller-Petke, 2012; Müller-Petke

59

et al., 2016; Wathelet, 2008), reduces significantly the number of parameters to consider.

applied in numerous other inversion

60 61

Figure 1: Illustration of the BEL1D process.

5

62

In this contribution, we validate the effectiveness of BEL1D for two types of

63

geophysical data: a dispersion curve from surface wave analysis and the free induction decay

64

curves from surface nuclear magnetic resonance. Surface waves are seismic waves whose

65

propagation are parallel to the earth’s surface. They propagate according to mechanical and

66

geometric properties of the subsurface. Their amplitude is generally superior to body waves,

67

making their acquisition much easier. The propagation of these waves is governed by

68

geometric dispersion, often modeled by the dispersion curve. In most interpretations, the

69

focus is made on the fundamental mode dispersion curve, hence the strongest one (Socco et

70

al., 2010). The current state-of-the art inversion method for the interpretation of dispersion

71

curves is an improved version of the neighborhood algorithm implemented in Geopsy

72

(Wathelet, 2008). The resulting models are expressed in terms of P-wave velocity (

73

wave velocity ( ) and density ( ). However, surface waves are mostly sensitive to S-wave

74

velocity (Xia et al., 1999). Often, a link between

75

Poisson’s ratio ( ) (Equation 1) as is the case in Geopsy (Wathelet, 2008).

=

and

), S-

is considered through the

&

0.5 − ' × % 1−

(1)

76

Surface Nuclear Magnetic Resonance (SNMR) is a near-surface geophysical method

77

used to retrieve information about the groundwater in the first hundred meters of the

78

subsurface. The method relies on the quantum mechanics behavior of protons composing the

79

water molecules. At rest, the protons are precessing about the Earth’s magnetic field at their

80

Larmor frequency. By applying a magnetic field tuned at this precise frequency, we modify

81

the thermal equilibrium of the proton’s spin. As soon as the injection is stopped, the spins

82

relax back to their thermal equilibrium, emitting a secondary magnetic field that is registered

83

through induced currents in the receiver antenna (free induction decay). The obtained signal

84

can be interpreted in terms of water content and decay time, the latter being correlated with

6

85

hydrogeological parameters of the subsurface such as pore sizes and hydraulic conductivities

86

(Behroozmand et al., 2015). The current state-of-the-art interpretation method is the QT

87

inversion (Müller-Petke and Yaramanci, 2010). This inversion method uses the full QT (pulse

88

moment-time) dataset originating from the field acquisition, in order to estimate both the

89

water content (or partial water contents) and the relaxation time. The method is fully

90

implemented in MRSmatlab (Müller-Petke et al., 2016), a set of toolboxes for the processing,

91

modeling and interpretation of SNMR data.

92

The results for the two tested cases are compared on different aspects. First, the

93

accuracy of the obtained distributions is quantified, either comparing the results to other

94

studies, to benchmark values, or to results originating from state-of-the-art inversion codes.

95

Second, the efficiency is discussed, both in terms of computation time and of memory usage,

96

and compared to the state-of-the-art method for deterministic inversion.

97

2. Implementation of BEL1D

98

2.1. Program workflow

99

The code is built on the Bayesian Evidential Learning 1D imaging (BEL1D) presented in

100

the companion paper “1D geological modeling of the subsurface from geophysical data with

101

Bayesian Evidential Learning Part 1: Method and demonstration“.

102

In the proposed codes, the models are described as a succession of finite number of

103

horizontal layers of unknown thickness, with associated parameters that are related to the

104

geophysical experiment (water content, velocities, densities, etc.). This leads to small

105

numbers of uncorrelated parameters. Although the latter is not strictly required by BEL1D, it

106

makes it straightforward to define the prior in a general way. Then, BEL1D requires the

107

generation of geophysical datasets associated with the models, i.e. running the geophysical

108

forward model. 7

109

The method relies on the constitution of statistical relationships between models

110

parameters and the associated datasets. Contrary to classical inversions processes, this method

111

does not require the use of regularization parameters that are known to bias geophysical data

112

interpretation (Aster et al., 2013). Moreover, this method can explore large prior model spaces

113

as efficiently as narrowed ones, due to its construction. It is therefore a very efficient method

114

for the construction of models from geophysical data without the introduction of prior biasing

115

information, contrary to other Bayesian methods that sometimes use unrealistic small prior to

116

converge faster.

117

As BEL1D is ubiquitous, a generalized toolbox has first been developed and can be

118

used with any forward model available. The input for the GUI is the data (A) in a MATLAB

119

‘.mat’ file containing a matrix called data. The user can specify the names and units of the X

120

and Y variables on the graph. Those names are later used in the model display (RMS error

121

scale bar). The user can then describe the prior model space in panel B. There, the names and

122

the ranges of the parameters (up to 3 parameters plus the thickness) can be set for each layer

123

and the sampler selected.

8

124 125 126

Figure 2: Graphical user interface layout in the generalized case.

2.2. Implementing Surface Waves as a toolbox in BEL1D

127

In the MATLAB graphical user interface (GUI) (Figure 3), the models are presented

128

as sets of a finite known number of layers, with unknown thicknesses, seismic P and S

129

velocities and densities. The last layer is considered an infinite half-space. At the user 9

130

requests, it is possible to insert constraints on the velocities through an interval of Poisson’s

131

ratio and to only consider increasing velocities and densities through the layers. Such

132

conditions will relax the independence between the parameters. The used data is the

133

fundamental mode of the dispersion curve (however, it should be stated that BEL1D could be

134

used with multiple modes). This dispersion curve is exploited in the frequency/phase velocity

135

space. The forward modeling is performed via the GPDC command line tool provided by

136

Geopsy (Wathelet, 2008) called through the “jsystem” MATLAB function (Rosenberg, 2018),

137

a function similar to the MATLAB-integrated ‘system’ function.

138

In this case, the user inputs are in the data panel (A), the file describing the dispersion

139

curve is selected. The file formatting is the same as the one from Geopsy (Wathelet, 2008);

140

the prior model space description (B) is similar to the SNMR case, with the choice of the

141

number of layers, the ranges for the different parameters, the type of sampler and the number

142

of models for the prior realizations. However, in this case, the user can also specify if the

143

models should have increasing velocities and densities with depth (checkbox “increasing”)

144

and whether or not to consider Poisson’s coefficient (NaNs to ignore and doubles to consider).

145

Finally, once all the inputs are present, the user can launch BEL1D in panel C.

10

146 147

Figure 3: Graphical user interface layout for the interpretation of dispersion curves from multichannel analysis

148

of surface waves.

149

2.3. Implementing Surface Nuclear Magnetic Resonance as a toolbox in BEL1D

11

150 151

Figure 4: Graphical user interface layout for the interpretation of (multiple-loops) SNMR datasets

152

In this section, we present a new MATLAB graphical user interface (GUI) for the

153

interpretation of SNMR datasets through BEL1D. This GUI is adapted to multiple-loops

154

datasets but is also able to manage more classical single receiver experiments. For SNMR, the

155

models are described through a finite number of layers, with unknown thicknesses (e), water

156

contents (W), and decay times (T2*). The data formatting is the one issued from MRSmatlab

157

(Müller-Petke et al., 2016) as is the kernel function.

158

The input parameters for the GUI are separated in three different panels: data (A), Prior model

159

space (B) and kernels (C). To enter SNMR data, the user must specify first the path to the

160

folder containing the data and then the name(s) of the file(s) containing the experimental data

161

in the table (A1). As the GUI is adapted to multiple loops configurations, the user can enter

162

multiple files corresponding to different acquisitions performed on the same site with 12

163

different configurations. Then, the user must select the configuration that they want to use

164

(A2). To do so, they choose a transmitter size in column 1 and check the chosen receivers

165

used with this transmitter. If multiple files where selected, the configuration can differ for all

166

files, leading to complementary information. The prior model space definition is performed in

167

panel B. The user can choose the number of layers (up to 6), describe the parameter ranges,

168

select the type of sampler and the number of prior realizations. Finally, the kernels for the

169

forward modeling (C) are either computed under the assumption of a resistive Earth or a 100

170

Ohm.m resistivity, either loaded from previously computed files (that are located in the same

171

folder as the data) to handle more complex resistivity models. The software can handle up to

172

16 different configurations at the same time (up to four transmitters and four receivers. Both

173

the data and the kernels are in the MRSmatlab format (Müller-Petke et al., 2016). Finally,

174

once all the inputs are present, the user can launch BEL1D in panel D.

175

3. Results

176

The different toolboxes that are presented are automatically generating most of the

177

different figures that are presented in the first part of those companion papers: the CCA space,

178

the load of each parameter in the different CCA space dimensions, the posterior parameters

179

and models distributions. However, in this section, we will mainly focus on the posterior

180

(parameters and models distributions) analysis and compare it with our knowledge of the

181

sites. Nonetheless, a brief analysis of the CCA space will be proposed as it may give an

182

insight on the sensitivity of the experiment towards the parameters of interest.

183

3.1. Case 1: InterPacific – Surface Waves

184

The InterPacific project (Intercomparison of methods for site parameter and velocity

185

profile characterization) aimed at assessing the reliability of in-hole and surface wave

186

methods for the estimation of shear wave velocity (Garofalo et al., 2016b). In this section, we

13

187

will use the mean dispersion curve that arose from these experts’ analyses of the raw data to

188

demonstrate BEL1D applicability to retrieve the shear wave velocity profile. We will focus on

189

the case proposed by Garofalo et al. (2016b) in the Mirandola test-site located in the Po plain.

190

The site is characterized by soft sediments (sand, clays and gravels) overlaying a Pliocene

191

bedrock (sand and marine pelite), found at a depth between 50 and 150 meters (Garofalo et

192

al., 2016b, 2016a). We will here assume that the soil is constituted of three layers: two soft

193

layers, representing the alluvial plain and its variations and a third layer representing the

194

bedrock. A full description of the prior model space is given in Table 1, with the additional

195

information that the parameters

196

coefficient is always within 0.2 and 0.5 for each layer. Thickness (e) [m]

,

and

P-wave (

are increasing with depth and that the Poisson’s

velocity S-wave

) [m/s]

velocity Density

( ) [m/s]

( )

[kg/m³]

Min

Max

Min

Max

Min

Max

Min

Max

Layer 1

5

50

200

4000

100

500

1500

3500

Layer 2

45

145

200

4000

100

800

1500

3500

200

4000

300

2500

1500

3500

Layer 3 197

Table 1: Mirandola – Prior model space

198

We generated 5000 prior realizations and simulated their fundamental mode dispersion

199

curve (the associated data). To sample the models, we used uniformly distributed variables

200

with a rejection sampler in order to satisfy the increase of the parameters values with depth

201

and the Poisson’s coefficient interval. From the original 64 dimensions of the data (the 64

202

couples frequency-phase velocity of the sampled dispersion curve), PCA reduced the

203

dimensionality to 11 (the minimum to be able to perform further steps of BEL1D, as the

204

dimension of

205

– the (PCA reduced) model space). In the CCA model space, the first two dimensions (where

(

– the PCA reduced dataset – must be larger or equal to the one of

14

(

(=

)

206

shear-wave velocities and thicknesses are gathered) show high correlations but all the 9

207

remaining dimensions presents scattered values (Figure 5). This implies that the field

208

experiment may enable a significant reduction of uncertainty for the shear-wave velocities

209

and thicknesses, whereas no substantial reduction is expected for the other parameters. Due to

210

the lack of existing error models for dispersion curves (to our knowledge), BEL1D is applied

211

without specific noise analysis. This is further discussed later on. The process results in the

212

production of 1000 models in the posterior.

213 214

Figure 5: Mirandola – Data versus model space in the CCA space (first 5 dimensions out of 11). The blue dots

215

represent each prior realizations and the orange line the position of the field data.

15

216 217

Figure 6: Mirandola - Comparison of the prior (blue) and posterior (orange) parameters distributions.

218

As expected, the main reductions of uncertainty are obtained for shear-wave velocities

219

(Figure 6), which are known to be the most sensitive to surface waves. Then, the reduction of

220

uncertainty is substantial for compression waves velocities (especially for the third layer), due

221

to the relationship constrained by the Poisson’s coefficient interval. Finally, the model is

222

nearly insensitive to the density in agreement with previous studies (e.g.: Xia et al., 1999). No

223

particular correlation is observed between the parameters as all but one Pearson’s correlation

224

coefficient are below 0.5 in absolute value, the only exception being observed between

16

,)

,)

225

and

226

coefficient.

(0.59) due to the limitation of possible values for

, constrained by the Poisson’s

227

The depth of the interface with the bedrock still remains very uncertain, but the most

228

probable value matches borehole (Garofalo et al., 2016a) measurements (Figure 7) whereas

229

the prior space did not imply such values (borehole information were not taken into account to

230

build the prior model space). Moreover, our uncertainty covers the range proposed by the

231

different experts (Figure 8). The comparison of the results obtained through BEL1D with the

232

results from the InterPacific project (Figure 8) shows that BEL1D provides a posterior

233

distribution that encompass the InterPacific results for the alluvial plain. However, the

234

bedrock seems to present larger

235

This may originate from the use of the average dispersion curve from the aggregated data

236

issued by the whole set of analysts, instead of the individual dispersion curves of each analyst.

values compared to the results of 5 out of the 14 analysts.

237 238

Figure 7: Mirandola – Estimated depth to the bedrock. The borehole value originates from Garofola (2016a).

17

239 240

Figure 8 : Mirandola – Posterior models distributions and their associated RMS error. The black lines in the

241

profile represent the models obtained from surface wave methods by the experts that participated in the

242

InterPacific project (Garofalo et al., 2016b).

243

Figure 9 shows that BEL1D was able to reject unlikely areas in the data space. From

244

this figure, we can see that the posterior distribution of dispersion curves all exhibits a

245

stepwise decrease, also observed in the prior distribution, instead of the steady decrease of the

246

experimental phase velocity observed from 2 to 15 Hz. We attribute this to the limited number

247

of layers used in our prior, inherent to a blocky imaging framework. From Figure 9, it appears

248

that most of the dispersion curves picked by the experts teams are fitting inside the most

249

probable area (lowest RMS). If we assume that the range of dispersion curves picked by the

250

expert group is representative of the noise level, BEL1D samples posterior models within this

18

251

range. Our result is therefore valid, even when accounting for the intrinsic uncertainty on the

252

dispersion curve.

253 254

Figure 9 : Mirandola – Comparison of the prior (grey) and posterior (blue to red) data spaces. The colors of the

255

posterior dispersion curves represent the RMS error with the same scale as the one used in Figure 8 (blue = low

256

RMSE and red = high RMSE). The white dispersion curve represents the average dispersion curve as defined in

257

Garofalo (2016b). The black dots represent picked dispersion curves from the different teams of experts.

258

To further validate our results, we used the same dataset in Geopsy (Wathelet, 2008)

259

in order to produce models that explain the dataset (Figure 10). We used the same prior model

260

space to run the inversion. The results show that the models that explain best the dataset are

261

similar to the one resulting from BEL1D.

19

262 263 264

Figure 10: Mirandola – Results from Geopsy on the dataset.

3.2. Case 2: Mont Rigi (Belgium) – SNMR

265

The Mont Rigi is located in the Belgian Fagnes region, in the Eastern part of the

266

country. This site presents an ideal case study for SNMR as it is remote and far from any

267

electromagnetic noise sources, due to its natural reserve classification.

20

268 269

Figure 11: Peat thickness at the Mont Rigi test site according to GPR prospection. The coordinates are in

270

Belgian Lambert 72 (EPSG:31370). (Modified from Wastiaux and Schumacker, 2003)

271

Geologically, the site is characterized by a metric peat layer on top of a Cambrian

272

bedrock (La Venne formation) known as an aquiclude with a very low water content (Gilson

273

et al., 2017). In contrast, peat is known to present very high water contents with observed total

274

porosities around 90% (Wastiaux, 2008). According to previous GPR exploration (Wastiaux

275

and Schumacker, 2003), the peat layer should have a thickness between 2.5 and 4.5 meters

276

(Figure 11). The SNMR response should therefore allow to easily distinguish the two layers

277

with a properly designed experiment. The designed on-site experiment consisted of a classical

278

coincident loops transmitter/receiver couple with a diameter of 20 meters.

21

279 280

Figure 12 : Extract of the obtained SNMR signal from Mont Rigi after processing in MRSmatlab (Müller-Petke

281

et al., 2016).

282

Figure 12 shows the measured data set, after signal processing operations (despiking,

283

harmonic modelling, reference noise cancellation and despiking again). We see that the noise

284

amplitude is quite low (18 nV), and a clear decay is observed for the smallest pulse moments

285

(those most sensitive to the first meters where the peat layer is located).

286

First, the signals have been truncated to 0.2 seconds to decrease the impact of noise,

287

mostly present at large timeframes, where the NMR signal is absent. Then, the signals have

288

been inverted using the QT Inversion (Müller-Petke and Yaramanci, 2010), to constitute a

289

benchmark to compare to the results of BEL1D. We used a smooth-mono model description

290

(smooth models with a single relaxation time for a given depth) and a regularization

291

parameter of 6000 for both water content and relaxation time using the L-curve criteria. Then, 22

292

BEL1D was applied with the uniformly distributed prior model space described in Table

293

2Error! Reference source not found.. We sampled 1000 prior realizations and produced

294

1000 models for the posterior.

295

Thickness [m]

Water content [%]

Relaxation time [ms]

Minimum

Maximum

Minimum

Maximum

Minimum

Maximum

Layer 1

0

7.5

30

80

0

200

Layer 2

/

/

0

15

100

400

Table 2: Prior model space for the Mont Rigi SNMR experiment

296

The CCA space analysis (Figure 13) shows that we could expect a significant

297

reduction of uncertainty for some parameters, as the first three dimensions show high

298

correlations. The first dimension, dominated by the water content of the half-space shows an

299

especially narrow relationship between the reduced data and model space. On the other hand,

300

the last dimension, mostly represented by the water content of the first layer, shows no

301

specific correlation. We should therefore expect a much better reduction for the parameters of

302

the half-space than for the first layer.

23

303 304

Figure 13: Mont Rigi – Data versus model space in the CCA space. The blue dots represent each prior

305

realizations and the orange line the position of the field data.

306

The noise level (after denoising) impact on BEL1D was assessed by analyzing noise

307

propagation in the PCA space (Hermans et al., 2016) for a random sample of 50 models from

308

the prior. The values of the covariance matrix

309

of the order of 10*&+ ). However, once transformed into CCA space, it resulted in significant

310

changes in bandwidths for the kernel density estimation, with between 0.04 and 0.13 instead

311

of the default 0.01 value.

(

in PCA space are relatively low (maximum

24

312 313

Figure 14 : SNMR at Mont Rigi - Prior (blue) and posterior (orange) distributions of the parameters.

314

The analysis of the posterior parameters model space (Figure 14) shows that the

315

reduction of uncertainty from the prior model space to the posterior is significant and

316

produces consistent results, when compared to the geological context and the QT inversion

317

solution (dashed gray lines in Figure 15). The thickness of the peat layer tends to be smaller

318

than the maximum estimate (4.5 m) from to the GPR interpretation (Wastiaux and

319

Schumacker, 2003). This is explained by the equivalence of the produced signal from a high

320

water content in a thin layer and a lower water content in a thick layer. The use of a narrower

321

prior taking into account the GPR information would prevent such a behavior.

25

322

No specific correlation between the parameters is arising from BEL1D. However, we

323

observe that some relations are created. For example, the maximum value of the thickness for

324

the first layer tend to decrease when the water content of the aforementioned layer increases.

325

This behavior arises from the non-uniqueness of the SNMR response stated above.

326

Analyzing the set of models produced by BEL1D (Figure 15), it is observed that a

327

large majority of the models show a root mean square error lower than the noise level in the

328

dataset, implying that the set of probable models provides an efficient estimation of the sites

329

parameters. The RMS errors of the posterior samples show that they all explain the data to a

330

similar level; the uncertainty is thus intrinsic to the data set and the non-unicity of the

331

solution. The non-uniqueness of the solution is easily observable, with two distinct behaviors:

332

one with a median water content and a thick first layer and another with a very high water

333

content but a thinner first layer. Those two cases show similar RMS errors, hence are

334

explaining the data at the same level. The probability of each posterior model is shown in the

335

RMS error colorbar for which each color represents the same number of models. Therefore, it

336

is observed that the low RMS error models also correspond to the most probable ones.

26

337 338

Figure 15: SNMR at Mont Rigi - Posterior models distributions with their associated misfit and results of the QT

339

inversion (dashed gray line). The black line in the RMS error graph represents the level of noise in the original

340

dataset.

341

In the data space, it is observed that the simulated data (exponential decays) from the

342

posterior space are corresponding to the experimental data (Figure 16). We show that, even if

343

the raw data are noisy, BEL1D is able to dramatically reduce the uncertainty on the posterior

344

distributions, centering the posterior data on the experimental ones. However, the fit is better

345

for the first pulse moments, hence the most sensitive to this particular soil configuration.

27

346 347 348

Figure 16: SNMR at Mont Rigi - Comparison of the prior (gray), posterior (blue) and experimental (red) data.

3.3. CPU timing and memory usage

349

In this section, we monitor computation time and RAM usage using the MATLAB®

350

integrated profiler (Table 3). The reported computation time only accounts for BEL1D run in

351

the GUI (‘pushbutton_Run_Callback’). The RAM usage corresponds to the peak memory

352

observed when using all the possibilities of the GUI (run BEL1D, show parameters, models

353

and save results). All the tests have been performed on a workstation equipped with an Intel® 28

354

Core™ i7-6800K hexacore CPU with 64 Go of RAM at 2400MHz and a GTX 1050 GPU in

355

MATLAB R2018a. Even if the GUIs enable the optional use of the parallel computing

356

toolbox of MATLAB, this toolbox was not used for these tests. Computation time [sec] Peak Memory [Gb] (forward run time/noise impact estimation) Case 1 (5000 models in the prior)

139.640

(90.273/Not 0.178

Available) Neighborhood algorithm on case 1

<5

Not available

Case 2 (1000 models in the prior)

46.951 (0.575/16.917)

0.118

QT inversion on case 2

7.717

0.012

L-curve on case 2 (QT inversion)

71.905

0.043

(2977 models visited)

357

Table 3: Computation time and RAM usage

358

When comparing the results of BEL modeling on the second case to QT inversion in

359

MRSMatlab (Müller-Petke et al., 2016) performances, it is observed that QT inversion is

360

faster and uses way less memory. However, this timing assumes that we know the optimal

361

regularization parameter in advance. If we want to account for the computation of the L-curve

362

in the CPU time, we should add 71.905 seconds to the computation time. Moreover, the

363

solution obtained through QT-inversion is deterministic, whereas BEL1D provides a

364

stochastic solution in a timeframe that is very similar.

365

In the case of surface waves, it is observed that the neighborhood algorithm (Wathelet,

366

2008) performs way better in terms of CPU time than BEL1D. This is mainly explained by

367

the highly optimized forward model used in Geopsy, that is strongly hindered by multiple

29

368

system calls from MATLAB (jsystem function (Rosenberg, 2018)) followed by read and write

369

operations from a hard drive in our case.

370

From Table 3, it is observed that the performances of BEL1D highly depends on the

371

efficiency of the forward model run. The other operations mainly depend on the complexity of

372

the dataset (PCA is more demanding for SNMR than for surface waves as the datasets are

373

much larger) and the number of models in the prior (mainly hindering the kernel density

374

estimation). At equal performance of the forward model, the efficiency of BEL1D is thus

375

directly proportional to the number of models sampled from the prior.

376

Overall, the use of BEL1D appears as an efficient method to model the earth based on

377

geophysical data, both in terms of accuracy, CPU usage and memory cost. In the case of large

378

campaigns, a set of prior realizations and the associated CCA spaces could be created once

379

and reused for the interpretation of all the data, removing the costly forward model runs from

380

the computation times, as well as the dimensionality reduction steps thus leading to very rapid

381

stochastic interpretations (less than a minute). For example, a SNMR survey at Mont-Rigi to

382

determine the thickness of the peat layer could use the same prior for all the interpretation of

383

the sounding curves.

384

4. Conclusion

385

We successfully applied BEL1D to two different field cases using completely different

386

geophysical methods: surface waves analysis and surface nuclear magnetic resonance. The

387

first case showed the applicability of BEL1D to dispersion curves obtained by analysis of

388

surface waves. We showed that the resulting posterior models were coherent with the range of

389

results independently obtained by experts (Garofalo et al., 2016b) and is inherent to the

390

uncertainty around the dispersion curve. Moreover, we compared with the results from a

391

direct search inversion technique, the Neighborhood Algorithm, and obtained similar results. 30

392

In addition, we observed that the most likely depth to the bedrock as originated from the

393

process corresponds to the depth measured in the borehole.

394

For the second case, BEL1D has been applied to SNMR data acquired in the Belgian

395

Fagnes. We showed that the process outputs reasonable ranges of uncertainty on the

396

parameters and includes the QT Inversion results. The careful analysis of the RMS error of

397

the proposed models indicates that the range of uncertainty delivered by BEL1D is intrinsic to

398

the non-unicity of the solution for geophysical inverse problems: many models explain the

399

data to the same level.

400

Those two applications demonstrate the versatility of BEL1D and its applicability to

401

multiple geophysical methods with very few adaptations. Provided that a forward model

402

already exists, the adaptation of the codes to any other 1D method would be straightforward.

403

For example, the forward operator from HV-Inv (García-Jerez et al., 2016) could be used for

404

the interpretation of H/V data. System calls may also help further extend the applicability of

405

BEL1D. This may be the case for the use of all the 1D forward operators implemented in

406

pyGIMLi (Rücker et al., 2017) – vertical electrical sounding, frequency- and time-domain

407

electromagnetic and magnetotelluric.

408

BEL1D also gives a total freedom in the definition of the prior model as it is efficient

409

with both large and narrow prior. As demonstrated with the surface wave, constraints between

410

parameters can be easily implemented.

411

In term of performance, the computation cost of BEL1D is directly proportional to the

412

efficiency of the forward model and the number of samples in the prior model space. It must

413

be noted that the prior samples being independent, the simulation of synthetic data can be

414

fully parallelized. The performance of BEL1D can thus be easily estimated from the available

415

computing facilities. The cost is therefore much smaller than other stochastic methods

31

416

requiring ten to hundreds thousands of runs to converge towards a solution. In the SNMR

417

case, we show that BEL1D provides the posterior distribution in a computation time similar to

418

the state-of-the-art deterministic QT inversion. Moreover, computation timing could be

419

dramatically reduced by pre-field forward modelling operations for large geophysical

420

campaigns where the prior would be similar. Prior realizations and the associated datasets

421

would then be reused (PCA and CCA operations also), letting only the kernel density

422

estimation, sampling and back-transformation operations to be applied after field acquisition,

423

leading to extremely rapid imaging.

424

In its present version, BEL1D is dedicated to the 1D inversion of geophysical data in

425

layered Earth model where the contrast between layer is supposed to be sharp. In the future,

426

we plan to extend the method to smoothly varying systems accounting for a large number of

427

thin, correlated layers.

428

5. Acknowledgment

429

The authors would like to thank the contributors to the development of MRSmatlab,

430

the cornerstone of SNMR data interpretation in BEL1D. Marc Wathlet directed us towards the

431

InterPacific project for the surface wave application. Professor S. Foti and Federico Passeri

432

shared with us the raw dispersion curves from the different teams of experts for the Mirandola

433

case. Hadrien Michel is a fellow funded by the F.R.S.-FNRS.

434

6. Computer codes availability

435

The different MATLAB codes that have been used to produce the results presented in

436

this paper are available at github.com/hadrienmichel/BEL1D. They are released with this

437

paper under the BSD-2-Clause license. The codes consist in a series of graphical user

438

interfaces for the interpretation of SNMR data (BEL1DSNMR), dispersion curves from

439

surface waves analyses (BEL1DSW) and for global cases, assuming the user provides a 32

440

forward model (BEL1DGENERAL). All those functions are standalone (no specific toolbox

441

required) in MATLAB but can run faster when the Parallel Computing toolbox is available.

442

The main codes have been developed by H. Michel (ULiège, Belgium):

443

Hadrien MICHEL

444

Urban and Environmental Engineering Department

445

Allée de la découverte, 9

446

B-4000 Liège

447

Belgium

448

The forward models that are used are Open Source MATLAB toolboxes for SNMR

449

(MRSMatlab: Müller-Petke et al., 2016) and command line tools for surface waves (Geopsy:

450

Wathelet, 2008).

451

In order to run the codes, a MATLAB license is required. Minimum requirements for

452

the computations are the ones for your MATLAB license, at the exception of RAM, for which

453

a minimum of 8 Go is recommended and more could be required for more complex

454

computations. If using the parallel capabilities of MATLAB (parallel computing toolbox), a

455

large number of cores is preferred and, globally, a 4-core (or more) processor is advised.

456

7. References

457

Aster, R., Borchers, B., Thurber, C., 2013. Parameters Estimation and Inverse Problems, 2nd

458

ed. Academic Press, New York. 376pp.

459

Behroozmand, A.A., Keating, K., Auken, E., 2015. A Review of the Principles and

460

Applications of the NMR Technique for Near-Surface Characterization. Surveys in

461

Geophysics 36, 27–85. https://doi.org/10.1007/s10712-014-9304-0

33

462

García-Jerez, A., Piña-Flores, J., Sánchez-Sesma, F.J., Luzón, F., Perton, M., 2016. A

463

computer code for forward calculation and inversion of the H/V spectral ratio under

464

the

465

https://doi.org/10.1016/j.cageo.2016.06.016

diffuse

field

assumption.

Computers

&

Geosciences

97,

67–78.

466

Garofalo, F., Foti, S., Hollender, F., Bard, P.Y., Cornou, C., Cox, B.R., Dechamp, A.,

467

Ohrnberger, M., Perron, V., Sicilia, D., Teague, D., Vergniault, C., 2016a.

468

InterPACIFIC project: Comparison of invasive and non-invasive methods for seismic

469

site characterization. Part II: Inter-comparison between surface-wave and borehole

470

methods.

471

https://doi.org/10.1016/j.soildyn.2015.12.009

Soil

Dynamics

and

Earthquake

Engineering

82,

241–254.

472

Garofalo, F., Foti, S., Hollender, F., Bard, P.Y., Cornou, C., Cox, B.R., Ohrnberger, M.,

473

Sicilia, D., Asten, M., Di Giulio, G., Forbriger, T., Guillier, B., Hayashi, K., Martin,

474

A., Matsushima, S., Mercerat, D., Poggi, V., Yamanaka, H., 2016b. InterPACIFIC

475

project: Comparison of invasive and non-invasive methods for seismic site

476

characterization. Part I: Intra-comparison of surface wave methods. Soil Dynamics

477

and

478

https://doi.org/10.1016/j.soildyn.2015.12.010

Earthquake

Engineering

82,

222–240.

479

Gilson, M., Briers, P., Ruthy, I., Dassargues, A., 2017. Carte hydrogéologique de Wallonie,

480

Planchettes Elsenborn - Langert - Dreiherrenwald n°50/3-4, 50A/1. Service public de

481

Wallonie, DGO 3, Belgique. [In french]

482

Günther, T., Müller-Petke, M., 2012. Hydraulic properties at the North Sea island of Borkum

483

derived from joint inversion of magnetic resonance and electrical resistivity

484

soundings.

485

https://doi.org/10.5194/hess-16-3279-2012

Hydrology

and

Earth

System

34

Sciences

16,

3279–3291.

486

Hermans, T., Oware, E., Caers, J., 2016. Direct prediction of spatially and temporally varying

487

physical properties from time-lapse electrical resistance data. Water Resources

488

Research 52, 7262–7283. https://doi.org/10.1002/2016WR019126

489

Kemna, A., Nguyen, F., Gossen, S., 2007. On linear model uncertainty computation in

490

electrical imaging. Presented at the SIAM Conferance on Mathematical and

491

Computational Issues in Geosciences, Santa Fe.

492

Müller-Petke, M., Braun, M., Hertrich, M., Costabel, S., Walbrecker, J., 2016. MRSmatlab —

493

A software tool for processing, modeling, and inversion of magnetic resonance

494

sounding data. GEOPHYSICS 81, WB9–WB21. https://doi.org/10.1190/geo2015-

495

0461.1

496

Müller-Petke, M., Yaramanci, U., 2010. QT inversion — Comprehensive use of the complete

497

surface

NMR

data

set.

498

https://doi.org/10.1190/1.3471523

GEOPHYSICS

75,

WA199–WA209.

499

Rosenberg, A.A., 2018. jsystem: Fast drop-in replacement for matlab’s slow “system”

500

command: avivrosenberg/matlab-jsystem. https://github.com/avivrosenberg/matlab-

501

jsystem (Accessed on 16th of June 2019).

502

Rücker, C., Günther, T., Wagner, F.M., 2017. pyGIMLi: An open-source library for

503

modelling and inversion in geophysics. Computers & Geosciences 109, 106–123.

504

https://doi.org/10.1016/j.cageo.2017.07.011

505 506

Sambridge, M., 2002. Monte Carlo methods in geophysical inverse problems. Reviews of Geophysics 40 (3), 1009. https://doi.org/10.1029/2000RG000089

507

Socco, L.V., Foti, S., Boiero, D., 2010. Surface-wave analysis for building near-surface

508

velocity models — Established approaches and new perspectives. GEOPHYSICS 75,

509

A83-A102. https://doi.org/10.1190/1.3479491

35

510 511

Wastiaux, C., 2008. Les tourbières sont-elles des éponges régularisant l’écoulement ? Bulletin de la Société géographique de Liège 50, 57-66. [In french]

512

Wastiaux, C., Schumacker, R., 2003. Topographie de surface et de subsurface des zones

513

tourbeuses des réserves naturelles domaniales des Hautes-Fagnes (No. C60/1-2-3).

514

Université de Liège, Service de phytosociologie, flore et végétation des Hautes-

515

Fagnes. [In french]

516

Wathelet, M., 2008. An improved neighborhood algorithm: Parameter conditions and

517

dynamic

518

https://doi.org/10.1029/2008GL033256

519

scaling.

Geophysical

Research

Letters

35,

L09301.

Xia, J., Miller, R.D., Parl, C.B., 1999. Estimation of near-surface shear-wave velocity by

520

inversion

of

Rayleigh

521

https://doi.org/10.1190/1.1444578

waves.

522

36

GEOPHYSICS

64,

691–700.

-

A new approach for 1D imaging from geophysical data. A set of MATLAB graphical user interfaces is provided. Estimation of the uncertainty on the model parameters. Consistent results for SNMR. BEL1D can be generalized to any type of 1D geophysical data.

Declaration of interests ☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. ☐The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: