Decoupled joint inversion with variable splitting: example scheme for magnetotelluric, seismic and gravity data

SUMMARY

We present a general framework for multiphysics joint inversion of any number of geophysical data sets. Its main feature is the use of the variable splitting approach: an auxiliary multiparameter model space is introduced in which minimization of the coupling and stabilizing functionals is carried out. The use of rediscretization and interpolation to map between this auxiliary space and the model spaces allows the coupled models to have completely different parametrizations. Joint inversion is decoupled into the individual inversion and the coupling-regularization subproblems, each of which can be solved by a different optimization algorithm. For each subproblem, the linking term controlling the distance between the model and the corresponding auxiliary variable takes the form of a quadratic regularization with a reference model. As a result, any existing inversion code supporting such regularization can be integrated without modifications into the developed framework. As a concrete example scheme, we consider an application of the framework to 3-D joint inversion of magnetotelluric, seismic refraction and gravity data. We discuss different coupling functionals, mainly those corresponding to the more universal structural constraints: joint total variation, joint minimum support, cross-gradient, one-way cross-gradient and their combinations for a general multimodel case. The use of coupling based on explicit ‘petrophysical’ relationship between the properties is also considered. Performance of the developed framework is studied on three synthetic cases: a time-lapse joint inversion of full-tensor gravity gradiometry and seismic data, a joint inversion of magnetotelluric, seismic and gravity data and a joint inversion for electrical resistivity tomography and audio-magnetotellurics.

Gravity anomalies and Earth structure, Electrical resistivity tomography (ERT), Magnetotellurics, Inverse theory, Joint inversion, Seismic tomography

Subject

General Geophysical Methods

Issue Section:

General Geophysical Methods

1 INTRODUCTION

Joint inversion is an actively developing approach to the integration of multiple geophysical methods that aims at reducing the uncertainties inherent to the geophysical inversions and subsequent geological interpretation. Related approach, when a model obtained with one geophysical method is incorporated in the inversion of another method, is termed ‘model fusion’ (Haber & Holtzman Gazit 2013) or ‘cooperative inversion’ (Moorkamp et al. 2016). Both approaches can be applied to methods sensitive to the same physical property, for example magnetotellurics and electrical resistivity tomography, which are both sensitive to the resistivity (see Vozoff & Jupp 1975), or to methods sensitive to different properties. In the latter case, some coupling between the models has to be assumed (the single-property joint/cooperative inversion can be viewed as a special case with trivial coupling).

Depending on the nature of the available prior information, coupling between the geophysical models can be based on structural similarity (see Crestel et al. 2018, for an overview), explicitly defined common structural parameters (e.g. Juhojuntti & Kamm 2015; Zheglova et al. 2018) or relationships between the physical properties, defined either analytically (e.g. Heincke et al. 2017; Colombo & Rovetta 2018) or using statistical data analysis techniques such as fuzzy clustering (e.g. Lelièvre et al. 2012; Sun & Li 2016) and Gaussian mixture models (Astic et al. 2021). A promising approach to model coupling is based on maximization of various measures of statistical dependence: Haber & Holtzman Gazit (2013) and Mandolesi & Jones (2014) experimented with the mutual information; as an alternative, the use of the variation of information (Moorkamp 2021) and the joint entropy (Zhdanov et al. 2022) have been suggested. Gramian constraints (Zhdanov et al. 2012; Malovichko et al. 2020) allow for efficient maximization of the correlation coefficients between the models or their transforms. Alternatively to the coupling of geophysical models, a joint inverse problem can be solved directly for lithologies (Bosch 1999) or for thermochemical (Khan et al. 2007; Afonso et al. 2013) or petrophysical (Chen et al. 2007; Gao et al. 2012) parameters.

Deterministic inversion methods based on local optimization have well-known limitations: the uncertainty estimation, when available, is restricted to linearized estimates, solutions depend on the choice of the regularization parameters (as a scalarization of a multiobjective optimization problem) and, for nonlinear problems, on the initial model. However, at the present time, for most 3-D inverse problems with a large number of unknowns, stochastic, global optimization methods are not computationally feasible. This situation may change with the advances in techniques like the reduced order modelling (Manassero et al. 2021) and deep learning (Colombo et al. 2021).

In this paper we consider a deterministic 3-D joint inversion framework, with the primary aim of applying it to large magnetotelluric (MT), gravity and seismic first-arrival traveltime (FATT) data sets, on both regional and local scales. During the preparation of this publication, magnetic data inversion was added to the framework. In the future, we plan to add inversion of seismic surface wave dispersion data. Thus, we need a framework to which geophysical methods can be easily added, preferably as existing inversion codes. Model discretization required by a particular method is dictated by the physics (resolution, boundary conditions, etc.) and by the numerical implementation of the forward problem. Although it is possible to use a common grid for the joint inverse problem and individual, problem-specific grids for the forward problems (e.g. see Moorkamp et al. 2011, Heincke et al. 2017), our choice here is to use an individual grid for each inverse problem—this makes it easier to adapt inversion codes and ensures that a model parametrization is capable of entirely explaining each data set. The ‘philosophy’ of our approach is that the priority is given to the individual inversions (i.e. the model parametrization, the forward solver and the optimization method are optimal for each data set in the sense that the same choices would have been made in a ‘separate’ inversion case), while the ‘joint’ part, that is the coupling, is added as a logically independent and easily modifiable block. We show how this can be achieved with a simple technique, often referred to as variable splitting (e.g. Wang et al. 2008): terms of an objective function are decoupled by introducing an auxiliary variable and penalizing the distance between this variable and the model vector. By combining this technique with interpolation between the grids, the joint inverse problem is decoupled into the separate inversion (with standard quadratic regularization) and the coupling (and regularization) subproblems that are solved in an alternating manner.

The paper is organized as follows. First, we derive a general split-variable formulation of joint inversion and discuss its practical implementation as an alternating minimization algorithm. Next, we describe several structure-coupling functionals, which are implemented in our joint inversion framework: the joint total variation (JTV), the joint minimum support (JMS), the cross-gradient (XG) and the one-way cross-gradient (OWXG). We then discuss the main aspects of the application of the proposed scheme to MT, FATT and gravity problems. Finally, performance of the framework is demonstrated on synthetic data: in the first example we consider a time-lapse joint inversion of FATT and gravity gradiometry data using JMS constraints, in the second example we compare performance of several combinations of JTV, XG, OWXG and ‘petrophysical’ constraints in a joint inversion of MT, FATT and gravity data, and in the third example we consider joint inversion of audio-magnetotelluric (AMT) and electrical resistivity tomography (ERT) data combining 2-D finite-element and 3-D finite-difference grids.

2 JOINT INVERSION BY VARIABLE SPLITTING AND ALTERNATING MINIMIZATION

Consider a multiphysics joint inverse problem—the same region of the Earth’s interior is imaged with N geophysical methods, each sensitive to a different physical property of the medium and modelled by a forward operator $f_{i}$ ⁠; the goal is, given N observed data vectors $d_{i}$ ⁠, to simultaneously recover the physical properties described by the model vectors $m_{i}$ ⁠:

\begin{array}{r} d_{i} = f_{i} (m_{i}) + ϵ_{i}, i = 0, 1, \dots, N - 1. \end{array}

(1)

For each method, assuming the noise $ϵ_{i}$ is Gaussian with zero mean, the maximum likelihood solution can be found by minimizing the data misfit functional

\begin{array}{r} Φ_{d_{i}} (m_{i}) = \frac{1}{2} [f_{i} (m_{i}) - d_{i}]^{T} C_{d_{i}}^{- 1} [f_{i} (m_{i}) - d_{i}], \end{array}

(2)

where $C_{d_{i}}$ is the covariance matrix of the observed data. Because this solution is generally non-unique due to ill-posedness of the inverse problem (1) and unattainable for local optimization due to nonlinearity of $f_{i} (m_{i})$ ⁠, prior information about the model must be introduced—in the form of regularization and, in the case of the multiphysics joint inversion, also by postulating some coupling between the individual models. This coupling, as well as the regularization, can generally be imposed as a penalty functional. Denoting by $R$ the combination of all the ‘prior’ functionals, we can set the joint inversion as an unconstrained optimization problem:

\begin{array}{r} min_{m_{0}, m_{1}, \dots, m_{N - 1}} \sum_{i = 0}^{N - 1} λ_{i} Φ_{d_{i}} (m_{i}) + R (m_{0}, m_{1}, \dots, m_{N - 1}) . \end{array}

(3)

Coefficients $λ_{i} > 0$ weight the individual data misfit functionals relative to each other and to the ‘prior’ terms. Practical implementation of eq. (3) can be difficult due to the complexity and heterogeneity of the objective function—the individual inverse problems, as well as the ‘prior’ terms, often have different requirements to model parametrization and optimization techniques (and regardless of this, maximum reuse of the existing codes is always desirable). A straightforward workaround is based on the block coordinate descent (BCD) approach (Hu et al. 2009; Haber & Holtzman Gazit 2013) or alternating minimization: the idea is to cycle through the models $m_{i}$ (i.e. blocks of $[m_{0}^{T}, m_{1}^{T}, \dots, m_{N - 1}^{T}]^{T}$ ⁠) each time minimizing eq. (3) with respect to one model while keeping the other models fixed; this decouples the $Φ_{d_{i}}$ terms, significantly simplifying practical implementation of the joint inversion (at the cost of theoretically slower convergence). We develop a different alternating minimization approach, whose advantages over BCD will be demonstrated below.

Let us introduce for each $m_{i} \in R^{M_{i}}$ an auxiliary variable $u_{i} \in R^{M_{i}}$ and, instead of eq. (3), write:

\begin{array}{r} min_{m_{i}, u_{i}} \sum_{i = 0}^{N - 1} [λ_{i} Φ_{d_{i}} (m_{i}) + \frac{α_{i}}{2} ‖ u_{i} - m_{i} ‖^{2}] + R (u_{0}, u_{1}, \dots, u_{N - 1}) . \end{array}

(4)

With all $α_{i} \to \infty$ ⁠, this problem is equivalent to eq. (3). This technique, known as variable splitting, following the works of Wang et al. (2008) and Goldstein & Osher (2009) is widely used with $ℓ_{1}$ -regularized problems as a means of separating the non-differentiable $ℓ_{1}$ and the $ℓ_{2}$ terms of the objective function. Here it separates the $Φ_{d_{i}}$ terms from the regularization and coupling term $R$ ⁠.

Assume that each vector $m_{i} = [m_{i, 1}, m_{i, 2}, \dots, m_{i, M_{i}}]^{T}$ parametrizes a grid-based discretization of a physical property $m_{i} (x)$ of the medium in the following form:

\begin{array}{r} m_{i} (x) = \sum_{j = 1}^{M_{i}} m_{i, j} φ_{i, j} (x), x \in Ω_{i} \subset R^{3}, \end{array}

(5)

where $φ_{i, j} (x)$ are some basis functions defining a 3-D grid. Thus, we assume that individual models are discretized independently and the model regions $Ω_{i}$ are, in general, different (with a necessary limitation that the intersection of all $Ω_{i}$ is not empty). In contrast to $m_{i}$ ⁠, vectors $u_{i}$ in eq. (4) are coupled by the term $R$ which implies that they must, at some point, be mapped to the same basis. Let us discretize all $u_{i}$ by the same grid, that is analogously to eq. (5), $u_{i} (x) = \sum_{j = 1}^{M} u_{i, j} φ_{j} (x)$ ⁠, with $x \in Ω : ⋂_{i = 0}^{N - 1} Ω_{i} \subset Ω$ ⁠, and denote by matrix $P_{i}$ interpolation onto this grid from the grid of $m_{i}$ ⁠. Let us also define the multiparameter model vector $u = [u_{0}^{T}, u_{1}^{T}, \dots, u_{N - 1}^{T}]^{T}$ ⁠. Using this notation, we can rewrite problem (4) as follows:

\begin{array}{r} {\begin{cases} \begin{aligned} min_{m_{i}} Φ_{d_{i}} (m_{i}) + \frac{α_{i}}{2 λ_{i}} ‖ u_{i} - P_{i} m_{i} ‖^{2}, i = 0, 1, \dots, N - 1, \\ min_{u} R (u) + \sum_{i = 0}^{N - 1} \frac{α_{i}}{2} ‖ u_{i} - P_{i} m_{i} ‖^{2} . \end{aligned} \end{cases} \end{array}

(6)

The problem is thus decomposed into a system of N data-fitting (inversion) subproblems, each regularized by a simple quadratic functional, and the coupling-regularization subproblem. Derivatives of the linking terms with respect to $m_{i}$ required by the inversion subproblems include transpose $P_{i}^{T}$ of the interpolation operators, which is often difficult to implement; an iterative solver such as the conjugate gradient method would require computationally expensive (for large 3-D grids) application of $P_{i}$ and $P_{i}^{T}$ at each iteration. However, given that both $m_{i}$ and $u_{i}$ adequately represent the models (e.g. satisfy the Nyquist criterion), distance between the models can as well be calculated in the $m_{i}$ space. Another difficulty with the inversion subproblems, especially when the gradient of $Φ_{d_{i}} (m_{i})$ is ‘rough’ (as, e.g. in ray-based tomography), is that, even if the smoothness of $u_{i}$ is controlled by $R (u)$ ⁠, lack of smoothness in the updates to $m_{i}$ at early stages, when $m_{i}$ and $u_{i}$ are far apart, may lead to instability and breakdown of the inversion. The required additional smoothing can be introduced by replacing the Euclidean norm in the linking terms by a Sobolev norm. Applying these changes to system (6), we get:

\begin{array}{r} {\begin{cases} \begin{aligned} min_{m_{i}} Φ_{d_{i}} (m_{i}) + \frac{{\hat{α}}_{i}}{2} ‖ m_{i} - Q_{i} u_{i} ‖_{W_{2}^{1}}^{2}, i = 0, 1, \dots, N - 1, \\ min_{u} R (u) + \sum_{i = 0}^{N - 1} \frac{α_{i}}{2} ‖ u_{i} - P_{i} m_{i} ‖^{2}, \end{aligned} \end{cases} \end{array}

(7)

where ${\hat{α}}_{i} = α_{i} / λ_{i}$ ⁠, $Q_{i}$ is interpolation from the grid of $u$ onto the grid of $m_{i}$ and $W_{2}^{1}$ denotes (weighted) discretized Sobolev norm of order one:

\begin{array}{r} ‖ m ‖_{W_{2}^{1}}^{2} = ‖ m ‖^{2} + w m^{T} L m, \end{array}

(8)

where $L$ is a negative discrete Laplacian and w is a weighting coefficient. In the coupling-regularization subproblem, we omit additional smoothing since in the cases considered here it is either redundant or conflicting with the regularization present in $R$ ⁠. Note that if all the models and $u$ use the same discretization and $α_{i} \to \infty$ ⁠, problem (7) is equivalent to eq. (3).

Discretizations of $m_{i}$ are dictated by the inverse problems. Since the regularization and coupling should not create excessive details, discretizing $u$ by a grid that is not coarser than the finest $m_{i}$ grid is always safe. However, this is not always necessary and computationally feasible. For example, inversion of MT data often requires a very fine grid in the near surface, while its joint inversion with tomography and/or gravity aims at resolving the deeper (and larger scale) structures. In this case, discretizing $u$ by a regular grid with the smallest step from the resistivity grid would be prohibitively expensive, but since $u$ enters MT inversion as a reference model only, using a coarser grid will still allow for recovery of a fine resistivity structure in the near surface (see Section 5.2).

System (7) can be solved by alternating minimization: several (or one, for Newton-type methods) iterations of the inversion subproblems are followed by several iterations of the coupling-regularization subproblem, after which this cycle, which we will refer to as the outer iteration, is repeated. We use a continuation (see, e.g. Nocedal & Wright 1999) in $α_{i}$ and ${\hat{α}}_{i}$ ⁠: following Wang et al. (2008), $α_{i}$ are gradually increased with the outer iterations as a geometric series $α_{i, n} = q^{n} α_{i, 0}$ ⁠, where $q > 1$ ⁠. On the other hand, ${\hat{α}}_{i} = α_{i} / λ_{i}$ ⁠. Recalling eq. (3), we recognize $λ_{i}$ as the Lagrange multipliers of the constraints $2 Φ_{d_{i}} (m_{i}) = N_{d_{i}}$ ⁠, where $N_{d_{i}}$ is the dimension of $d_{i}$ ⁠; a common practical approach to determining these Lagrange multipliers, known as cooling or adaptive regularization, is to use continuation $λ_{i, n} = q_{λ}^{n} λ_{i, 0}$ with $q_{λ} > 1$ ⁠, starting from a “small” value $λ_{i, 0}$ ⁠, until the constraint is satisfied (see, e.g. Haber et al. 2000; Zhdanov 2002). Assuming that $q_{λ} \approx q$ ⁠, we can keep ${\hat{α}}_{i}$ constant during the inversion–this simplification was used in all our numerical examples and proved to be working satisfactorily. Overall, with ${\hat{α}}_{i}$ fixed and $α_{i}$ increasing, the balance is gradually shifted from the prior $R$ towards the data terms, similar to the (separate inversion) cooling. A finer tuning of $α_{i}$ and ${\hat{α}}_{i}$ is usually needed at the final iterations in order to prevent overfitting and to assure simultaneous convergence of all the methods (see lines 11–15 of Algorithm 1 below). Convergence is controlled through the root-mean-square (RMS) values of the covariance-weighted data misfits:

\begin{array}{r} {RMS}_{i} = \sqrt{\frac{2 Φ_{d_{i}} (m_{i})}{N_{d_{i}}}}, \end{array}

(9)

and the relative mean misfits between the models and the auxiliary variables:

\begin{array}{r} r_{i} = \frac{‖ u_{i} - P_{i} m_{i} ‖}{‖ P_{i} m_{i} ‖} . \end{array}

(10)

Parameters $α_{i, 0}$ and ${\hat{α}}_{i}$ are problem-dependent and are selected in such a way that the effects of coupling and regularization are maximized while acceptable values of ${RMS}_{i}$ and $r_{i}$ are reached. Moreover, $α_{i, 0}$ control relative importance of the individual methods in the joint inversion. Practical aspects of this are discussed in Section 5.1.

The resulting inversion scheme is summarized below as Algorithm 1.

Open in new tab

Decoupling minimization problems for the different data misfit functionals simplifies programming and allows each inverse subproblem to be solved using both numerical grid and optimization method tailored to the specific physics. Compared to the BCD, Algorithm 1 does not require implementation of $R$ on the side of each inversion subproblem. Another advantage over the BCD is that the steps in different $m_{i}$ can be taken simultaneously, that is the inverse problems can be run in parallel. The linking terms act as the standard quadratic ‘damping + smoothing’ stabilizers and the information exchange between the subproblems is performed by means of the reference models, thus most existing (separate) inversion codes can be used without any modification. With $N = 1$ ⁠, Algorithm 1 effectively changes regularization in an existing inversion code. Similarly, isolation of the coupling-regularization subproblem simplifies its implementation and modification and allows the use of tailored optimization methods. Strategies of Um et al. (2014) and Gao & Zhang (2018) demonstrate a similar level of decoupling, but instead of variable splitting they essentially rely on sequential minimization of components of the joint objective function: decoupled inversions are restarted from the models that are ‘preconditioned’ by minimization of cross-gradient with damping. Another similar strategy, although restricted to the case of explicit petrophysical relationships between the models, is that of Heincke et al. (2017) and Kamm et al. (2015).

3 REGULARIZATION AND COUPLING

Here we give several illustrative examples of $R (u)$ ⁠, focusing on the more universal structural priors.

For discretization of the regularization-coupling subproblem, we use a ‘voxel’ regular grid with step h and $M = M_{x} \times M_{y} \times M_{z}$ cells. With (⁠ $x_{1}, x_{2}, \dots, x_{M}$ ⁠) being lexicographically ordered coordinates $x_{i, j, k} = (x_{i}, y_{j}, z_{k})$ of the cell centres, the multiparameter model vector is defined as follows:

\begin{array}{r} u = [u_{0} (x_{1}), \dots, u_{0} (x_{M}), u_{1} (x_{1}), \dots, u_{N - 1} (x_{M})]^{T} . \end{array}

(11)

Partial derivatives are approximated by forward differences and zero Neumann boundary conditions are applied:

\begin{array}{r} \partial_{x} u_{l} (x_{i, j, k}) \approx {\begin{cases} \begin{aligned} \frac{u_{l} (x_{i + 1, j, k}) - u_{l} (x_{i, j, k})}{h} if i < M_{x}, \\ 0 if i = M_{x}, \end{aligned} \end{cases} \end{array}

(12)

which we write in matrix form as $\partial_{x} u_{i} \approx D_{x} u_{i}$ ⁠, and analogously for y and z directions.

To update $u$ at each inner iteration of Algorithm 1, the following system of linear equations is solved:

\begin{array}{r} [H_{R} + A] δ u = - \nabla_{u} R - A (u - u_{ref}), \end{array}

(13)

where $H_{R}$ is some approximation of the Hessian of $R$ ⁠, $A = diag [α_{i}] \otimes I$ ⁠, $I$ is $M \times M$ identity matrix and $\otimes$ denotes Kronecker product. Then, $u \leftarrow u + μ δ u$ with the step length $μ \leq 1$ determined by backtracking line search. Here we focus on Gauss–Newton-like forms of eq. (13), as from our experience they combine ease of implementation with reasonable efficiency; the provided expressions for the gradient can be used to implement gradient-based and quasi-Newton schemes.

3.1 Joint total variation

Total variation (TV) regularization is efficient in many inverse problems because of its edge-preserving properties (Rudin et al. 1992). The TV is defined as $ℓ_{1}$ norm of the magnitude of the model’s gradient; in order to make the absolute value differentiable at zero it is commonly ‘smoothed’ using a small positive constant $β$ ⁠:

\begin{array}{r} R_{TV} (u_{i}) = \int_{Ω} \sqrt{| \nabla u_{i} (x) |^{2} + β} d x . \end{array}

(14)

TV of the multiparameter model $u$ can be defined simply as a sum of TV functionals of the individual models: $R_{TV} (u) = \sum_{i} R_{TV} (u_{i})$ ⁠.

Another way to generalize TV to the multiparameter model case is by using the $ℓ_{1, 2}$ mixed norm: taking $ℓ_{2}$ norm of the vector $[| \nabla u_{0} |, | \nabla u_{1} |, \dots, | \nabla u_{N - 1} |]^{T}$ at each spatial position and then $ℓ_{1}$ norm of the result gives functional known as colour, vectorial or joint total variation (JTV, Blomgren & Chan 1998; Haber & Holtzman Gazit 2013; Crestel et al. 2018). We use a slightly generalized, weighted version of JTV:

\begin{array}{r} R_{JTV} (u) = \int_{Ω} \sqrt{\sum_{i = 0}^{N - 1} \nabla u_{i} (x) \cdot W_{i} \nabla u_{i} (x) + β} d x, \end{array}

(15)

where $W_{i} = diag [w_{i, x}, w_{i, y}, w_{i, z}]$ ⁠, $w_{i, ξ} > 0$ ⁠. This functional is convex (see the references above) and thus a stabilizing functional according to Tikhonov & Arsenin (1977), at the same time it introduces coupling between the models by aligning their edges, that is local maxima of the magnitude of the gradient. The coupling property of JTV can be illustrated by the following example: consider $u$ consisting of two piecewise constant models with the jumps of unit amplitude and with unit values of the total variation; it is clear that if the jumps in the two models are disjoint, then $R_{JTV} (u) = 2$ ⁠, but if instead they are fully coincident, $R_{JTV} (u) = \sqrt{2}$ ⁠.

A simple and relatively efficient way to minimize TV and JTV functionals is by using the iteratively reweighted least squares (IRLS) method (Vogel & Oman 1996; Molodtsov & Troyan 2017). It leads to an approximation of the Hessian of JTV that is block-diagonal, that is system (13) is decoupled into the following N systems:

\begin{array}{r} [L_{w; i} + α_{i} I] δ u_{i} = - L_{w; i} u_{i} - α_{i} (u_{i} - u_{ref, i}), i = 0, 1, \dots N - 1, \end{array}

(16)

where

\begin{array}{r} L_{w; i} = \sum_{ξ = x, y, z} D_{ξ}^{T} diag [\frac{w_{i, ξ}}{\sqrt{\sum_{i = 0}^{N - 1} g_{w; i, j}^{2} + β}}] D_{ξ}, \end{array}

(17)

and $g_{w; i, j} = \sqrt{w_{i, x} [D_{x} u_{i}]_{j}^{2} + w_{i, y} [D_{y} u_{i}]_{j}^{2} + w_{i, z} [D_{z} u_{i}]_{j}^{2}}$ ⁠.

Matrix $L_{w; i}$ can be viewed as a finite-difference diffusion operator with anisotropic (due to the weighting $W_{i}$ ⁠) diffusivity that depends on the $u$ computed at the previous IRLS iteration. The edges (large $g_{w; i, j}$ ⁠) locally reduce all the diffusivities and in this way persist between the iterations, resulting in edge-preserving smoothing (similar to TV), and between the models, resulting in coupling. It follows that coincident edges are more resistant to smoothing, but new edges can only be introduced by $u_{ref}$ ⁠, that is by the data.

Because systems (16) are symmetric positive definite, they can be efficiently solved by the preconditioned conjugate gradient (PCG) method. We use Jacobi preconditioner and, as usual in Gauss–Newton-like schemes (see, e.g. Nocedal & Wright 1999), terminate PCG early by setting relative tolerance to 0.001–0.01. The condition number of the system depends on $β$ ⁠; if $β$ is too small (and when $α_{i}$ are small) PCG may stagnate. We find that, given the model gradients are reasonably scaled (see below), $β = 10^{- 7}$ provides a good balance between PCG convergence rate and the accuracy of $ℓ_{1}$ norm approximation.

Scaling of the models, which represent different physical quantities, is important for efficient coupling; apart from the bounding transforms that are discussed below, it is achieved by defining $W_{i} = s_{i}^{- 1} diag [w_{x}, w_{y}, w_{z}]$ ⁠, where $s_{i}$ is the RMS magnitude of the gradient of $u_{ref, i}$ ⁠. Similarly to the classical $ℓ_{2}$ -norm regularization case (e.g. Li & Oldenburg 1998), constants $w_{x} = w_{y} \approx 10$ and $w_{z} = 1$ usually work well for ‘subhorizontal’ structures, while some scenarios with steeply dipping geology may benefit from $w_{z} > w_{x} = w_{y}$ ⁠.

3.2 Joint minimum support

For linear inverse problems, regularization based on the $ℓ_{1}$ norm leads to sparse solutions (Loris et al. 2007). Similarly, the $ℓ_{1, 2}$ mixed norm promotes joint sparsity—same sparsity pattern between the individual model vectors (e.g. Gramfort et al. 2012). These approaches are particularly efficient because the corresponding stabilizers are convex. Non-convex stabilizers are more problematic from the optimization standpoint, but have stronger sparsity-promoting properties; a well-known example is the minimum support (MS) functional introduced by Last & Kubik (1983) and generalized by Portniaguine & Zhdanov (1999):

\begin{array}{r} R_{MS} (u_{i}) = \int_{Ω} \frac{[u_{i} (x) - u_{b; i} (x)]^{2}}{[u_{i} (x) - u_{b; i} (x)]^{2} + β} d x . \end{array}

(18)

With $β \to 0$ ⁠, it approximates the volume of the regions of $Ω$ where $u_{i}$ deviates from a (known) background model $u_{b; i}$ ⁠, that is the volume of the support of $u_{i} (x) - u_{b; i} (x)$ ⁠. In the multiparameter case, by analogy with mixed norms, we arrive at the joint minimum support (JMS) functional (Molodtsov & Troyan 2017):

\begin{array}{r} R_{JMS} (u) = \int_{Ω} \frac{\sum_{i = 0}^{N - 1} [u_{i} (x) - u_{b; i} (x)]^{2}}{\sum_{i = 0}^{N - 1} [u_{i} (x) - u_{b; i} (x)]^{2} + β} d x . \end{array}

(19)

Similarly to the $ℓ_{1, 2}$ norm, JMS is a stabilizing functional and introduces coupling through joint sparsity. The coupling property follows from the fact that the integrand of eq. (19), the function $t \mapsto t / (t + β)$ with $t \geq 0$ ⁠, is subadditive. JMS regularization tends to ‘focus’ the models $u_{i} - u_{b; i}$ such that they have the same support, which can be especially useful in time-lapse problems. It was applied to magnetic, gravity and marine controlled-source electromagnetic data (Jorgensen & Zhdanov 2021; Tu & Zhdanov 2022).

To illustrate the behaviour of JMS, let us consider an ‘inverse problem’ with one data point $d = f (m_{0})$ and two model parameters $m_{0} = (m_{0, 1}, m_{0, 2})$ representing two voxels and let $m_{1} = (m_{1, 1}, m_{1, 2})$ be a ‘fixed’ reference model. JMS of $m_{0}$ and $m_{1}$ (with zero background models and $β = 0.001$ ⁠) is plotted in Fig. 1: at $m_{1} = 0$ ⁠, JMS regularization selects two sparse solutions to $d = f (m_{0})$ ⁠, but as we move away from zero along $m_{1, 1}$ ⁠, only the solution having the same sparsity pattern as $m_{1}$ remains.

$Joint minimum support (JMS) of two 2-D model vectors $\mathbf {m}_0 = (m_{0,1}, m_{0,2})$ and $\mathbf {m}_1 = (m_{1,1}, m_{1,2})$ plotted as a function of $\mathbf {m}_0$ for four different values of $\mathbf {m}_1$. Minimization of JMS under the constraint $d = f(\mathbf {m}_0)$ (black line) leads to sparse $\mathbf {m}_0$ with the same sparsity pattern as $\mathbf {m}_1$.$

Figure 1.

Joint minimum support (JMS) of two 2-D model vectors $m_{0} = (m_{0, 1}, m_{0, 2})$ and $m_{1} = (m_{1, 1}, m_{1, 2})$ plotted as a function of $m_{0}$ for four different values of $m_{1}$ ⁠. Minimization of JMS under the constraint $d = f (m_{0})$ (black line) leads to sparse $m_{0}$ with the same sparsity pattern as $m_{1}$ ⁠.

Open in new tab Download slide

Minimization of JMS can be carried out using IRLS (by analogy with MS, following Last & Kubik (1983) and Portniaguine & Zhdanov (1999)). In the vicinity of a model $u^{(k)}$ found at IRLS iteration k, (eq. 19) is approximated by a quadratic functional:

\begin{array}{r} R_{JMS} (u) \approx \sum_{i = 0}^{N - 1} (u_{i} - u_{b; i})^{T} diag [c_{j}] (u_{i} - u_{b; i}), \end{array}

(20)

with

\begin{array}{r} c_{j} = \frac{1}{\sum_{i = 0}^{N - 1} {(u_{i, j}^{(k)} - u_{b; i, j})}^{2} + β} . \end{array}

(21)

Substituting it into the eq. (13) and taking unit step length, we get an explicit formula for the updated model vector components:

\begin{array}{r} u_{i, j}^{(k + 1)} = \frac{2 c_{j} u_{b; i, j} + α_{i} u_{ref, i, j}}{2 c_{j} + α_{i}}, \end{array}

(22)

for $i = 0, \dots, N - 1$ ⁠, $j = 1, \dots, M$ ⁠. JMS is straightforward to modify to include scaling of the models similarly to JTV (but with the coefficients $s_{i}$ equal to RMS values of $u_{ref, i} - u_{b; i}$ ⁠).

3.3 Modifications of cross-gradient

The cross-gradient constraint of Gallardo & Meju (2003) can be imposed, for a pair of models, by minimizing $ℓ_{2}$ norm of the cross product of the model gradients:

\begin{array}{r} S (u_{i}, u_{j}) = \frac{1}{2} \int_{Ω} | \nabla u_{i} (x) \times \nabla u_{j} (x) |^{2} d x . \end{array}

(23)

The cross product is zero if the gradients (and, consequently, the level sets of the models) are parallel or if a gradient vanishes. The latter means that the constraint is rather inefficient for piecewise-constant models and also demonstrates that (eq. 23) is not a stabilizing functional. JTV, on the other hand, does not efficiently couple smooth models since diffusivity in eq. (17) depends only on the magnitude of the gradients. Naturally occurring models are usually combinations of the smooth and the piecewise constant cases, so it seems reasonable to use JTV as a stabilizing functional for the cross-gradient. Considering all pairs of the models and introducing for each pair a positive weighting coefficient $γ_{i j}$ ⁠, we have

\begin{array}{r} R (u) = R_{JTV} (u) + \sum_{\begin{array}{c} i = 0 \\ j = i + 1 \end{array}}^{N - 1} γ_{i j} S (u_{i}, u_{j}) . \end{array}

(24)

The gradient of eq. (24) consists of the blocks

\begin{array}{r} \nabla_{u_{i}} R = L_{w; i} u_{i} + \sum_{\begin{array}{c} j \neq i \\ ξ = x, y, z \end{array}} γ_{i j} D_{ξ}^{T} r_{i j, ξ}, \end{array}

(25)

where $r_{i j, ξ}$ is the discretized $ξ$ component of $\nabla u_{j} \times (\nabla u_{i} \times \nabla u_{j})$ ⁠. Using Gauss–Newton approximation for the cross-gradient terms, the Hessian of eq. (24) is approximated by

\begin{array}{r} H_{R} = [\begin{array}{c} L_{w; 0} + \sum_{i \neq 0} H_{i, i} & \dots & - H_{0, N - 1} \\ ⋮ & ⋱ & ⋮ \\ - H_{0, N - 1} & \dots & L_{w; N - 1} + \sum_{i \neq N - 1} H_{i, i} \end{array}], \end{array}

(26)

where

\begin{array}{r} H_{i, j} = γ_{i j} \sum_{ξ, η = x, y, z} D_{ξ}^{T} C_{ξ η} (u_{i}, u_{j}) D_{η}, \end{array}

(27)

and

\begin{array}{r} C_{ξ η} (u_{i}, u_{j}) = {\begin{cases} \begin{aligned} \sum_{ζ \neq ξ} diag [(D_{ζ} u_{i})_{k} (D_{ζ} u_{j})_{k}], & ξ = η, \\ - diag [(D_{ξ} u_{i})_{k} (D_{η} u_{j})_{k}], & ξ \neq η . \end{aligned} \end{cases} \end{array}

(28)

The resulting linear system (13) is solved by the PCG analogously to the JTV case, the main difference is that the search directions $δ u_{i}$ for the individual models are now coupled.

Denoting by $θ$ the angle between the two gradients, we can factorize the cross-gradient as follows:

\begin{array}{r} | \nabla u_{i} \times \nabla u_{j} |^{2} = | \nabla u_{i} |^{2} | \nabla u_{j} |^{2} (1 + \cos θ) (1 - \cos θ) . \end{array}

(29)

The first two factors zero the cross-gradient when the corresponding gradient vanishes, the third factor—when the gradients are antiparallel and the fourth—parallel. In many cases, the sign $σ_{i j}$ of correlation between the two reconstructed properties is known a priori and this information should be incorporated into the coupling constraints. This can be done (following Molodtsov et al. 2011) by selecting either the ‘parallel’ or the ‘antiparallel’ factor of eq. (29) and taking $ℓ_{2}$ norm of the result, which leads to the one-way cross-gradient functional:

\begin{array}{r} S_{σ_{i j}} (u_{i}, u_{j}) = \frac{1}{2} \int_{Ω} (| \nabla u_{i} | | \nabla u_{j} | - σ_{i j} \nabla u_{i} \cdot \nabla u_{j})^{2} d x, \end{array}

(30)

where $σ_{i j} = \pm 1$ ⁠. Minimization of this functional with $σ_{i j} = 1$ forces model gradients to be parallel, with $σ_{i j} = - 1$ – antiparallel and in both cases the gradients are free to vanish.

The gradient and Gauss–Newton approximation to the Hessian of eq. (30) are given by the following expressions:

\begin{array}{r} \nabla_{u_{i}} S_{σ_{i j}} = \sum_{ξ = x, y, z} D_{ξ}^{T} C_{ξ} (u_{i}) r_{i j}, \end{array}

(31)

\begin{array}{r} \nabla_{u_{i}, u_{j}}^{2} S_{σ_{i j}} \approx \sum_{ξ, η = x, y, z} D_{ξ}^{T} C_{ξ} (u_{i}) C_{η} (u_{j}) D_{η}, \end{array}

(32)

where $r_{i j}$ is the discretized $| \nabla u_{i} | | \nabla u_{j} | - σ_{i j} \nabla u_{i} \cdot \nabla u_{j}$ ⁠,

\begin{array}{r} C_{ξ} (u_{i}) = diag [\frac{g_{j, k}}{g_{i, k}} (D_{ξ} u_{i})_{k} - σ_{i j} (D_{ξ} u_{j})_{k}], \end{array}

(33)

and $g_{i, k} = \sqrt{\sum_{ξ} [D_{ξ} u_{i}]_{k}^{2} + β}$ ⁠. These expressions are substituted into eqs (25) and (26) when some of the cross-gradient terms in the functional (24) are replaced by eq. (30).

Regularization and coupling can be easily generalized by appending fixed models to the vector $u$ ⁠, for example in order to use geological or legacy geophysical models as a structural reference. Extra regularization (such as ‘damping’ towards a reference model) and coupling (such as ad hoc petrophysical relations, as demonstrated in the numerical example in Section 5.2) terms are often required when dealing with real data and are straightforward to implement.

4 APPLICATION TO MT, FATT AND GRAVITY

Here we outline the implementation of the inversion subproblems used in the numerical examples.

3-D MT inversion is performed using the nonlinear conjugate gradient (NLCG) method implemented in the ModEM software package (Kelbert et al. 2014). Resistivity is discretized on a non-uniform rectangular grid, with the cells increasing in size with the decay of the MT resolution with depth and laterally outside of the data-covered area; the model vector $m$ is formed from the natural logarithm of the cells’ resistivities. Regularization in ModEM has the form $(m - m_{ref})^{T} C_{m}^{- 1} (m - m_{ref})$ ⁠, with the model covariance $C_{m}$ implemented as a first-order autoregressive smoothing operator; it can be shown (Yanovskaya & Ditmar 1990; Xu 2005) that it is equivalent to a squared $W_{2}^{1}$ Sobolev norm. NLCG iterations are performed in transformed model space $\tilde{m} = C_{m}^{- 1 / 2} (m - m_{ref})$ ⁠. In $u$ ⁠, a logarithmic resistivity vector is used, so $m$ is transformed to $\tilde{m}$ and back at each outer iteration of Algorithm 1. To make use of conjugacy, several NLCG iterations are needed at each outer iteration; at the same time, for the alternating minimization with continuation, this number should not be too large. In our experiments, five iterations were sufficient for a satisfactory convergence rate.

Seismic first-arrival traveltime tomography (FATT) is implemented using the Gauss–Newton method, a single iteration of which is performed at each outer iteration of Algorithm 1. Velocity $v_{p}$ is defined over a voxel grid. The forward problem is solved by the 3-D finite-difference method of Podvin & Lecomte (1991) and its Jacobian is computed using our implementation of the ‘backtracking’ ray tracing, principle of which is described in the same paper; normal equations are solved by PCG; computations are parallelized over the sources. Inversion (as well as regularization and coupling) is performed in the transformed model domain, with the transform

\begin{array}{r} m (s) = \ln \frac{s - s_{min}}{s_{max} - s}, \end{array}

(34)

which guarantees that the slowness $s = 1 / v_{p}$ lies within the predefined interval $(s_{min}, s_{max})$ ⁠. Details on the bounding transform eq. (34) can be found in Commer & Newman (2008).

Our implementation of 3-D gravity inversion is based on the works of Li & Oldenburg (1998, 2003). A single Gauss–Newton iteration with PCG as a linear solver is performed at each outer iteration of Algorithm 1. The model is discretized with a non-uniform rectangular grid and analytical expressions for the gravity field of a constant-density rectangular prism are used to precompute the sensitivity matrix. Each row of the sensitivity matrix is compressed using a 3-D discrete wavelet transform (DWT) with the D4 Daubechies wavelet. Trade-off between the compression ratio and the forward problem accuracy is controlled by setting the relative error of the sensitivity reconstruction. The depth (or distance) weighting (Li & Oldenburg 1998) is applied prior to the compression; in addition to improving the recovery of deep structures, it also reduces the dynamic range of the sensitivity thus reducing the compression artefacts. Inversion is performed in the depth/distance-weighted model space, but the vector $u_{i}$ is unweighted, that is the weighting is only applied to the linking terms in the inversion subproblem. To constrain the density range, instead of a logarithmic barrier method used by Li & Oldenburg (2003), we use the bounding transform eq. (34).

Trilinear interpolation is used for $P_{i}$ ⁠, $Q_{i}$ ⁠, which is quite efficient when all the grids are rectangular. In case of a mismatch between the regions $Ω$ and $Ω_{i}$ ⁠, nearest-neighbour extrapolation is invoked and the interpolation-extrapolation result m is smoothly merged with the prior (or initial) model $m_{0}$ outside the interpolation region via a sigmoid function s centred on the regions boundary as $m \leftarrow s m + (1 - s) m_{0}$ ⁠.

5 NUMERICAL EXAMPLES

5.1 Example 1: time-lapse seismic–gravity joint inversion using JMS regularization

Joint inversion attempting to reconstruct compact, highly (co)localized features can be efficiently performed with the help of the joint minimum support regularization. To demonstrate this, we consider a simple synthetic example combining time-lapse FATT and full-tensor gravity gradiometry (FTG). Seismic tomography is widely used for time-lapse applications at various scales (e.g. Patanè et al. 2006; Ajo-Franklin et al. 2007; Hilbich 2010). The feasibility of time-lapse applications of FTG was studied, for example by Droujinine et al. (2007) and by Reitz et al. (2015); in this example, we use six components of the gravity gradient tensor (⁠ $U_{x x}$ ⁠, $U_{y y}$ ⁠, $U_{z z}$ ⁠, $U_{x y}$ ⁠, $U_{x z}$ ⁠, $U_{y z}$ ⁠) and the vertical gravity component $U_{z}$ ⁠.

Using notation introduced in Section 2, let us consider two experiments–FATT (⁠ $i = 0$ ⁠) and FTG (⁠ $i = 1$ ⁠), both performed twice—at time $t_{0}$ and $t_{1}$ ⁠:

\begin{array}{r} d_{i} (t_{j}) = f_{i} [m_{i} (t_{j})] + ϵ_{i} (t_{j}), i = 0, 1, j = 0, 1. \end{array}

(35)

In the context of time-lapse inversion, the states at $t_{0}$ and $t_{1}$ are usually referred to as baseline and monitor, respectively. If the data acquisition geometry stays constant, it is advantageous to work with the difference of monitor and baseline data, $Δ d_{i} = d_{i} (t_{1}) - d_{i} (t_{0})$ ⁠. A common strategy is: (1) find $m_{i} (t_{0})$ by inverting baseline data and (2) find the model temporal perturbation $Δ m_{i}$ by minimizing the double-difference data misfit functional (Waldhauser & Ellsworth 2000):

\begin{array}{r} Φ_{d_{i}} (Δ m_{i}) = \frac{1}{2} {‖ f_{i} [m_{i} (t_{0}) + Δ m_{i}] - f_{i} [m_{i} (t_{0})] - Δ d_{i} ‖}^{2} . \end{array}

(36)

Given smallness of $Δ m_{i}$ ⁠, the second step is relatively fast and both observational and modelling systematic errors are cancelled in eq. (36), increasing sensitivity to the temporal changes, compared with a straightforward sequential inversion of baseline and monitor data. The gravity forward problem is linear with respect to the density, so for the FTG inversion, eq. (36) simplifies to

\begin{array}{r} Φ_{d_{1}} (Δ m_{1}) = \frac{1}{2} {‖ F_{1} Δ m_{1} - Δ d_{1} ‖}^{2}, \end{array}

(37)

that is there is no need for the baseline density model if we are only interested in the temporal changes of density. Thus, after the reconstruction of the baseline velocity, which we perform by stand-alone FATT, time-lapse joint inversion reduces to the problem (7), in which the unknown models $m_{i}$ are replaced with $Δ m_{i}$ ⁠. Nevertheless, baseline gravity data can still be used in a scenario where baseline velocity model can be improved by joint inversion with gravity.

The SEG/EAGE Overthrust Model (Aminzadeh et al. 1997) is used as the baseline velocity model. The time-lapse velocity perturbation is a 1 km s⁻¹ cube with a 600 m edge, positioned horizontally at the centre of the model, at 1825 m depth (Fig. 2a). The true models are discretized by a regular cubic grid with 50 m spacing. 50 sources and 900 receivers are placed over a regular grid at 25 m depth (Fig. 2a); every source is registered by all the receivers. In the FTG problem, we directly model the time-lapse data difference. The density perturbation is 0.1 g cm⁻³ cube coincident with the velocity perturbation. Because there is no baseline density and discretizations of FTG and FATT are independent, a smaller grid can be used for density as shown in Fig. 2(b). The grid consists of a uniform core with 100 m horizontal and 90 m vertical steps padded at the lateral edges by three padding layers expanding by a factor of two, a total of $64^{3}$ cells. Vertical gravity and the gravity gradient components are calculated at $13^{2}$ points at 10 m height (Fig. 2b). Synthetic data are contaminated with Gaussian noise with the standard deviation of 1 ms for the traveltimes, 1 μGal for the gravity and 1 mE for the gravity gradient.

Figure 2.

Time-lapse inversion. (a) True monitor (⁠ $t_{1}$ ⁠) velocity model; locations of seismic sources and receivers are shown by big and small dots, respectively. (b) Corresponding density model (the time-lapse change) and FTG data locations; two faces display the model grid. (c) Baseline (⁠ $t_{0}$ ⁠) velocity model recovered by FATT.

Open in new tab Download slide

Gravity inversion is performed on the same grid that is used for generating the synthetic data. However, in the inversion, unlike in the synthetic data calculation, the sensitivity is compressed using DWT with the relative reconstruction error of 1 per cent, resulting in the compression ratio of 58; the depth weighting is set proportional to $1 / z$ ⁠. For the tomography, the model region that was used for the forward modelling is rediscretized with a 100 m step. Bounding transform eq. (34) is applied to the models such that $v_{p} \in (0.2, 6.5)$ km s⁻¹ and the density $σ \in (- 0.2, 0.2)$ g cm⁻³. Baseline traveltimes are inverted using Algorithm 1 with the TV regularization; initial model is 1-D, velocity linearly increases with depth from 2 to 6 km s⁻¹; reconstructed baseline model is shown in Fig. 2(c). The subsequent time-lapse inversion is essentially a simple ‘spike test’ (in which FTG problem is linear, but FATT is, of course, nonlinear and depends on the baseline).

Regularization for the time-lapse inversion is chosen based on the assumption that the volume of the support of the temporal changes is much smaller than the whole model volume—this corresponds to the MS functional (eq. 18) for separate inversions and the JMS functional (eq. 19) for joint inversion. The regularization-coupling grid is identical to the FATT inversion grid (⁠ $200 \times 200 \times 46$ cells, 100 m spacing).

To assess the coupling effect of JMS regularization separately from its regularizing effect and to quantify the degree of alignment of the reconstructed anomalies, we introduce the following quantity:

\begin{array}{r} γ (u) = \frac{R_{JMS} (u)}{\sum_{i = 0}^{N - 1} R_{MS} (u_{i})} . \end{array}

(38)

Given that the anomalous part of $u$ does not vanish everywhere, it follows from the definitions of MS and JMS functionals that $1 / N \leq γ (u) \leq 1$ (recall that $R_{MS}$ approximates the volume of the support of an individual model and $R_{JMS}$ —that of the multiparameter model, i.e. volume of the union of the individual supports). The upper bound, $γ (u) = 1$ ⁠, corresponds to fully disjoint anomalies, whose supports do not intersect, and the lower bound, $1 / N$ ⁠,—to full alignment, when the supports of all N anomalies are identical.

To determine the regularization parameters $α_{i}$ (to simplify the notation, instead of writing $α_{i, 0}$ we will understand $α_{i}$ as the initial values) and $\hat{α_{i}}$ ⁠, we use the following strategy:

1. Determine $\hat{α_{0}}$ ⁠, $\hat{α_{1}}$ ⁠. For each method, run a series of separate inversions with fixed $m_{ref, i}$ (i.e. Algorithm 1 without the coupling-regularization step) and varying $\hat{α_{i}}$ and select its optimal value according to the Occam/discrepancy principle—maximum $\hat{α_{i}}$ for which ${RMS}_{i} \approx 1$ ⁠.

2. Determine $α_{0}$ ⁠, $α_{1}$ ⁠. A reasonable starting point is $α_{i} = \hat{α_{i}}$ ⁠. Run a series of joint inversions with varying $α_{i}$ and select those that return minimum values of $R$ (of course, while the inversion converges in ${RMS}_{i}$ and $r_{i}$ ⁠). Such combinations of $α_{i}$ are, in general, non-unique. At the same time, we usually want to equalize contributions of the methods/data sets to the joint inversion as much as possible. Since the inverse problems are decoupled, relative weights of the data sets are primarily controlled by $α_{i}$ and not $\hat{α_{i}}$ ⁠: increasing $α_{i}$ increases the influence of ith data set on the joint inversion. It follows that good balance between the data sets can be achieved through similar final values of $r_{i}$ ⁠.

The values of $\hat{α_{i}}$ selected with separate inversions ensure convergence of the joint inversion in the data RMS; however, since in joint inversion, $m_{ref, i}$ are no longer fixed but move towards $m_{i}$ ⁠, it is usually possible to increase $\hat{α_{i}}$ slightly (usually less than by an order of magnitude) to enhance the regularization and coupling. In the current example we simply use the values found at the step 1: $\hat{α_{0}} = 100$ and $\hat{α_{1}} = 10^{8}$ ⁠.

With the $\hat{α_{i}}$ fixed, we perform grid search for $α_{0}$ and $α_{1}$ ⁠. For the purpose of finding the optimal parameters, full grid search is redundant, here it serves the illustrative purpose. By changing $α_{i}$ by an order of magnitude, we build a 15 $\times$ 15 logarithmic grid, centred roughly on (⁠ $\hat{α_{0}}$ ⁠, $\hat{α_{1}}$ ⁠). At each grid point $(α_{0}, α_{1})$ ⁠, we run the joint inversion with the maximum number of outer iterations set to 30, $q = 2$ and the number of the coupling-regularization iterations $n_{itn} = 100$ ⁠. To slightly optimize the search, if joint inversion does not reach the target value of $r_{0}$ ⁠, then the current $α_{0}$ is deemed too low and exploration of the current column of the grid is terminated, and analogously for $r_{1}$ and the respective rows. Fig. 3 shows the grid with the final values of JMS, $γ (u)$ ⁠, ${RMS}_{0}$ ⁠, ${RMS}_{1}$ ⁠, $r_{0} - r_{1}$ and the iteration count. For each i, as expected, as $α_{i}$ increases, $r_{i}$ and ${RMS}_{i}$ decrease and JMS increases. We are interested in the minimum of JMS and, to balance FTG and FATT, $r_{0} \approx r_{1}$ ⁠. The region satisfying these criteria, outlined by the rectangle in Fig. 3, is explored with a finer grid, with a factor of two step (Fig. 4). This time we use $q = 1.5$ ⁠, which increases the iteration count and, since the maximum number of iterations is unchanged, shifts the region of convergence towards larger $α_{i}$ ⁠. At the expense of more iterations, reducing q usually improves the inversion results. As we see from the fine grid, the quantities vary smoothly enough that determining $α_{i}$ to an order of magnitude should give satisfactory results.

$Scanning the initial values of $\alpha _0$ and $\alpha _1$ (index 0 corresponds to FATT, 1–to FTG) for JMS-regularized joint inversion, using grid with the step of a factor of ten; $\hat{\alpha _0} = 100$, $\hat{\alpha _1} = 10^8$, the outer iterations limit is set to 30 and $q=2$. Final values of (a) $\mathcal {R_{\text{JMS}}}$, (b) $\gamma$, (c) the iteration count, (d) $r_0-r_1$, (e) RMS$_0$ and (f) RMS$_1$. Hatching indicates positions where the inversion fails to converge; unexplored areas are grey. We are interested in minimizing $\mathcal {R_{\text{JMS}}}$ and $\gamma$ and, if there is no reason to prioritize either FATT or FTG, we also want $r_0 \approx r_1$. The region of interest, sampled with a finer grid in Fig. 4, is outlined by the rectangle.$

Figure 3.

Scanning the initial values of $α_{0}$ and $α_{1}$ (index 0 corresponds to FATT, 1–to FTG) for JMS-regularized joint inversion, using grid with the step of a factor of ten; $\hat{α_{0}} = 100$ ⁠, $\hat{α_{1}} = 10^{8}$ ⁠, the outer iterations limit is set to 30 and $q = 2$ ⁠. Final values of (a) $R_{JMS}$ ⁠, (b) $γ$ ⁠, (c) the iteration count, (d) $r_{0} - r_{1}$ ⁠, (e) RMS $_{0}$ and (f) RMS $_{1}$ ⁠. Hatching indicates positions where the inversion fails to converge; unexplored areas are grey. We are interested in minimizing $R_{JMS}$ and $γ$ and, if there is no reason to prioritize either FATT or FTG, we also want $r_{0} \approx r_{1}$ ⁠. The region of interest, sampled with a finer grid in Fig. 4, is outlined by the rectangle.

Open in new tab Download slide

$Same as in Fig. 3, but using a factor of two grid step to sample the central region of interest and with $q=1.5$. Note that decreasing q from 2 to 1.5 increases the iteration count and shifts the region of convergence towards higher $\alpha _i$, however, other quantities are less affected. Point ($6 \times 10^4, 10^{10}$) marked with ‘+’ corresponds to the models shown in Figs 6 c–d and point ($10^5, 10^8$) marked with ‘$\Delta$’ corresponds to the models shown in Fig. 5 and Figs 6 e–f.$

Figure 4.

Same as in Fig. 3, but using a factor of two grid step to sample the central region of interest and with $q = 1.5$ ⁠. Note that decreasing q from 2 to 1.5 increases the iteration count and shifts the region of convergence towards higher $α_{i}$ ⁠, however, other quantities are less affected. Point (⁠ $6 \times 10^{4}, 10^{10}$ ⁠) marked with ‘+’ corresponds to the models shown in Figs 6 c–d and point (⁠ $10^{5}, 10^{8}$ ⁠) marked with ‘ $Δ$ ’ corresponds to the models shown in Fig. 5 and Figs 6 e–f.

Open in new tab Download slide

The minimum of $R_{JMS}$ is associated with FATT dominating FTG (Figs 4a and d), which is consistent with the overall higher resolution of FATT. For the point $(10^{5}, 10^{8})$ in this region, marked with ‘ $Δ$ ’ in Fig. 4, the reconstructed velocity and density models are shown in Figs 5 and 6(e) and (f). Best balance between FTG and FATT, in terms of the smallest $| r_{0} - r_{1} |$ ⁠, along the $R_{JMS}$ valley bottom is reached at the point $(6 \times 10^{4}, 10^{10})$ marked with ‘+’ in Fig. 4; the corresponding models are shown in Figs 6(c) and (d).

$Time-lapse perturbations of (a) velocity and (b) density recovered by joint FATT–FTG inversion with JMS regularization, using initial $\alpha _0 = 10^5$, $\alpha _1 = 10^8$ and $q=1.5$. Axes mark the full extent of the inversion grids; true structure is shown by the green outline.$

Figure 5.

Time-lapse perturbations of (a) velocity and (b) density recovered by joint FATT–FTG inversion with JMS regularization, using initial $α_{0} = 10^{5}$ ⁠, $α_{1} = 10^{8}$ and $q = 1.5$ ⁠. Axes mark the full extent of the inversion grids; true structure is shown by the green outline.

Open in new tab Download slide

$A close view of the reconstructed velocity and density perturbations. (a–b) Separate inversions with MS regularization, $\alpha _0 = 10^6$, $\alpha _1 = 10^{10}$ and $n_{\text{itn}}=100$. Joint inversion with JMS regularization: (c–d) $\alpha _0 = 6 \times 10^4$, $\alpha _1 = 10^{10}$, $n_{\text{itn}}=100$, (e–f) $\alpha _0 = 10^5$, $\alpha _1 = 10^8$, $n_{\text{itn}}=100$ (same models as in Fig. 5) and (g–h) $\alpha _0 = 10^5$, $\alpha _1 = 10^8$, $n_{\text{itn}}=5$. True structure is shown by the green outline.$

Figure 6.

A close view of the reconstructed velocity and density perturbations. (a–b) Separate inversions with MS regularization, $α_{0} = 10^{6}$ ⁠, $α_{1} = 10^{10}$ and $n_{itn} = 100$ ⁠. Joint inversion with JMS regularization: (c–d) $α_{0} = 6 \times 10^{4}$ ⁠, $α_{1} = 10^{10}$ ⁠, $n_{itn} = 100$ ⁠, (e–f) $α_{0} = 10^{5}$ ⁠, $α_{1} = 10^{8}$ ⁠, $n_{itn} = 100$ (same models as in Fig. 5) and (g–h) $α_{0} = 10^{5}$ ⁠, $α_{1} = 10^{8}$ ⁠, $n_{itn} = 5$ ⁠. True structure is shown by the green outline.

Open in new tab Download slide

For separate inversion with MS regularization, grid search results in $α_{0} = 10^{6}$ and $α_{1} = 10^{10}$ ⁠, and the corresponding reconstructed models are shown in Figs 6(a) and (b). Not surprisingly, FTG inversion correctly positions the density anomaly in the horizontal plane but not vertically and its upward displacement is compensated by its greatly reduced amplitude. Joint inversion with $α_{0} = 6 \times 10^{4}$ and $α_{1} = 10^{10}$ (the best FTG–FATT balance) moves the density anomaly downwards (Figs 6c and d), but both velocity and density anomalies are poorly focused, as a result of the trade-off in JMS. Increasing the relative weight of FATT by moving down the valley of $R_{JMS}$ to $α_{0} = 10^{5}$ ⁠, $α_{1} = 10^{8}$ results in both anomalies satisfactorily aligned and positioned (Figs 6e and f); note the stronger focusing of the velocity anomaly compared to the separate FATT inversion.

Fig. 7 shows the progress of the inversions corresponding to the models in Fig. 6, in terms of ${RMS}_{i}$ ⁠, $r_{i}$ ⁠, $γ (u)$ and $R_{JMS}$ ⁠. The target RMS values are reached after the first few iterations, after which the models and the auxiliary variables continue to be gradually brought together by increasing $α_{i}$ until their relative difference $r_{i}$ reaches the target value of 0.1. Note that FATT is ahead of FTG in terms of $r_{i}$ ⁠, except in Fig. 7(b) where they converge simultaneously. While the stabilizer, $R_{JMS}$ ⁠, is being minimized, the structure of $u$ develops (by the transfer from $m_{i}$ ⁠), resulting in the overall growth of $R_{JMS}$ ⁠. At the same time, in the joint inversions, $γ (u)$ decreases as the support of the anomalies becomes more similar. In the MS-regularized separate inversion (Fig. 7a), the values of $R_{JMS}$ are similar, but $γ (u)$ stays close to 1 as the anomalies are almost disjoint (Figs 6a and b). Note that the points of the $γ (u)$ and $R_{JMS}$ curves correspond to the inner IRLS iterations of the coupling-regularization subproblem. The sawtooth-like appearance of these curves is a result of the continuation—at each following outer iteration, IRLS starts with new $u_{ref}$ and higher values of $α_{i}$ ⁠, the increased relative weight of the linking terms causes a jump in $R_{JMS}$ ⁠. With $n_{itn} = 100$ ⁠, we can clearly see the convergence of the IRLS. Terminating the IRLS very early with $n_{itn} = 5$ (Fig. 7d) results in higher final value of $R_{JMS}$ ⁠, which is manifested as somewhat smoother models in Figs 6(a) and (b), but the coupling remains satisfactory, both visually and as indicated by $γ (u)$ ⁠.

$Progress of the inversions whose final models are shown in Fig. 6. (a) Separate inversions with MS regularization, $\alpha _0 = 10^6$, $\alpha _1 = 10^{10}$ and $n_{\text{itn}}=100$. Joint inversion with JMS regularization: (b) $\alpha _0 = 6 \times 10^4$, $\alpha _1 = 10^{10}$, $n_{\text{itn}}=100$, (c) $\alpha _0 = 10^5$, $\alpha _1 = 10^8$, $n_{\text{itn}}=100$, (d) $\alpha _0 = 10^5$, $\alpha _1 = 10^8$, $n_{\text{itn}}=5$. For $\gamma (\mathbf {u})$ and $\mathcal {R_{\text{JMS}}}$, points of the curves correspond to the inner iterations of the coupling-regularization subproblem.$

Figure 7.

Progress of the inversions whose final models are shown in Fig. 6. (a) Separate inversions with MS regularization, $α_{0} = 10^{6}$ ⁠, $α_{1} = 10^{10}$ and $n_{itn} = 100$ ⁠. Joint inversion with JMS regularization: (b) $α_{0} = 6 \times 10^{4}$ ⁠, $α_{1} = 10^{10}$ ⁠, $n_{itn} = 100$ ⁠, (c) $α_{0} = 10^{5}$ ⁠, $α_{1} = 10^{8}$ ⁠, $n_{itn} = 100$ ⁠, (d) $α_{0} = 10^{5}$ ⁠, $α_{1} = 10^{8}$ ⁠, $n_{itn} = 5$ ⁠. For $γ (u)$ and $R_{JMS}$ ⁠, points of the curves correspond to the inner iterations of the coupling-regularization subproblem.

Open in new tab Download slide

5.2 Example 2: MT–seismic–gravity joint inversion using different regularization and coupling

This example is based on a modification of MT checkerboard-type benchmark model presented in Egbert & Kelbert (2012). Electrical resistivity, P-wave velocity and density models are displayed in Fig. 8; the models have the same pattern except for the top checkerboard layer (with the anomalous blocks being coincident, up to the grid discretization error). The ‘checkers’ are bounded by the planes $x, y = {- 27.6, - 11.7, - 8.7, 7.2, 10.2, 26.1}$ km and $z = {0, 1, 1.8, 5, 5.8, 10.6}$ km.

$Example 2, true models showing the full-extent inversion grids and the data geometry. (a) Resistivity model and MT data locations. (b) Velocity model: section through $y=20$ km and the $\pm 0.4$ km s−1 velocity anomalies; seismic sources (green dots) and receivers. (c) Density model and gravity data locations. The regularization-coupling grid is identical to the FATT grid.$

Figure 8.

Example 2, true models showing the full-extent inversion grids and the data geometry. (a) Resistivity model and MT data locations. (b) Velocity model: section through $y = 20$ km and the $\pm 0.4$ km s⁻¹ velocity anomalies; seismic sources (green dots) and receivers. (c) Density model and gravity data locations. The regularization-coupling grid is identical to the FATT grid.

Open in new tab Download slide

The resistivity values are 10, 100 and 1000 $Ω \cdot$ m. The resistivity model grid contains $66 \times 66 \times 59$ cells. It consists of the core area with the horizontal cell size of 1 km $\times$ 1 km and four lateral padding layers. In the vertical direction, the cell size increases with depth from 50 m to 16 km. Impedance tensor and vertical magnetic transfer functions data are computed over 15 $\times$ 15 regular grid of stations at the surface (Figs 8a and 9) for 12 logarithmically equispaced periods in 0.1–100 s range. Gaussian noise is added to the data, its standard deviation is 3 per cent of $\sqrt{| Z_{x y} | | Z_{y x} |}$ for all components of the impedance tensor and 0.03 for the tipper.

Figure 9.

Slices through the core parts of the true models shown in Fig. 8, locations of MT, seismic and gravity stations (green dots) and seismic sources (black dots). Black lines denote positions of the slices (note that 3 and 8 km horizontal slices are displayed in one subplot).

Open in new tab Download slide

The background velocity is linearly increasing from 4 km s⁻¹ at the surface to 6.8 km s⁻¹ at 20 km depth, the checkerboard pattern imposes $\pm 0.4$ km s⁻¹ anomalies with respect to this trend; the model is discretized on a regular cubic grid with 500 m step, resulting in $140 \times 140 \times 40$ cells. 64 sources and 225 receivers are placed at the surface over the regular grid with 8 and 4 km spacing, respectively (Figs 8b and 9); for each source, first arrival traveltimes are computed at all receivers within 60 km radius and contaminated with Gaussian noise with the standard deviation of 12.5 ms.

The density model contains two layers with $\pm 0.1$ g cm⁻³ anomalies and is discretized into $64^{3}$ cells; the cells are approximately 1 km $\times$ 1 km $\times$ 0.5 km in the core area, which is surrounded by two padding layers expanding with a factor of two (Figs 8c and 9). Vertical component of the gravity field is computed at 10 m above the surface at 225 points horizontally coinciding with the MT locations and Gaussian noise with a standard deviation of 0.1 mGal is added.

The synthetic data are inverted using Algorithm 1 with different stabilizing and coupling functionals: (a) TV, corresponding to separate inversions, (b) JTV, (c) JTV combined with the cross-gradient (XG), (d) TV combined with XG and the one-way cross-gradient (OWXG), (e) JTV combined with XG and OWXG and (f) JTV combined with XG, OWXG and a ‘petrophysics’ coupling; the corresponding reconstructed models are shown in Fig. 11. Apart from the choice of the stabilizing and coupling functionals, all the inversions are run with the same settings, which are summarized below.

$Progress of the inversions using different regularization and coupling functionals, corresponding to the final models shown in Fig. 11: (a) TV (separate inversions), (b) JTV, (c) JTV + XG, (d) TV + (OW)XG, (e) JTV + (OW)XG, (f) JTV + (OW)XG + functional (41). For $\gamma _{\operatorname{TV}} (\mathbf {u})$ and the ‘cross-gradient’ terms, points of the curves correspond to the inner iterations of the coupling-regularization subproblem.$

Figure 10.

Progress of the inversions using different regularization and coupling functionals, corresponding to the final models shown in Fig. 11: (a) TV (separate inversions), (b) JTV, (c) JTV + XG, (d) TV + (OW)XG, (e) JTV + (OW)XG, (f) JTV + (OW)XG + functional (41). For $γ_{TV} (u)$ and the ‘cross-gradient’ terms, points of the curves correspond to the inner iterations of the coupling-regularization subproblem.

Open in new tab Download slide

Figure 11.

Slices through the core parts of the resistivity, velocity and density models reconstructed using different regularization and coupling. Each column corresponds to one inversion, with the column caption indicating the coupling-regularization functionals used. Where ‘(OW)XG’ is specified, XG is applied to resistivity–density and resistivity–velocity pairs and OWXG—to velocity–density. Dashed lines denote the true structures, solid lines in the first column—positions of the slices (note that horizontal slices combine two depth levels). In vertical slices, vertical exaggeration is 2:1.

Open in new tab Download slide

The MT inversion is started from the homogeneous 100 $Ω \cdot$ m model. The number of NLCG iterations is set to five; the initial line-search step length is set to 20. The model covariance smoothing is applied twice with the factor 0.3 along each coordinate axis. FATT is started from the 1-D model corresponding to the true background velocity trend, with the model transform bounds 0.96 and 7.2 km s⁻¹; the relative tolerance of PCG is set to 0.01. Gravity inversion is started from a zero density model, with the model transform bounds −2.5 and 2.5 g cm⁻³. Depth weighting $\sim 1 / z$ is used and the DWT reconstruction error is set to 5 per cent, resulting in the compression ratio of 77; PCG tolerance is set to $10^{- 3}$ ⁠. All the inversion grids are the same as those of the true models used for computing the synthetic data.

In order to determine the regularization parameters $α_{i}$ and $\hat{α_{i}}$ ⁠, we follow the same strategy as in Example 1: first find $\hat{α_{i}}$ using separate inversions, then find $α_{i}$ that minimize JTV in the JTV-regularized joint inversion; however, because of the higher dimensionality and numerical complexity of the problem, we determine $α_{i}$ by trial and error (using order-of-magnitude steps) without exhaustive grid search, understanding that the results may be suboptimal. The same values of $α_{i}$ and $\hat{α_{i}}$ are then used in the rest of the inversions (with XG, OWXG and ‘petrophysics’ coupling), as we attempt to compare the effects of the coupling constraints. For all the cases, the results appear to be satisfactory, however, they probably can be improved by changing $α_{i}$ ⁠. To determine $γ_{i j}$ ⁠, with $α_{i}$ and $\hat{α_{i}}$ fixed, we choose the maximum values for which the convergence is maintained and the models are sufficiently ‘smooth’ (in terms of (J)TV).

Using the above procedure, the parameters are set as follows (with index 0 denoting FATT, 1–gravity, 2–MT): $α_{0} = 200$ ⁠, ${\hat{α}}_{0} = 20$ ⁠, $α_{1} = 10^{5}$ ⁠, ${\hat{α}}_{1} = 10^{6}$ ⁠, $α_{2} = 1$ ⁠, ${\hat{α}}_{2} = 10$ and $q = 1.3$ ⁠. In the inversions with XG, $γ_{01} = γ_{02} = 200$ ⁠, $γ_{12} = 20$ and when OWXG is applied to the FATT–gravity pair, $γ_{01} = 2000$ and $σ_{01} = - 1$ ⁠.

For FATT and gravity, the Sobolev norm weight $w = 1000$ ⁠. For TV and JTV, $w_{x} = w_{y} = 10$ and $w_{z} = 1$ ⁠. The number of Gauss–Newton iterations $n_{itn} = 4$ and PCG tolerance is set to 0.01. The regularization-coupling grid is the same as the FATT grid.

To measure the alignment of the models’ edges, similarly to the JMS case considered in Example 1, we normalize JTV and define

\begin{array}{r} γ_{TV} (u) = \frac{R_{JTV} (u)}{R_{TV} (u)}, \end{array}

(39)

which is bounded by $1 / \sqrt{N}$ and 1.

The change of RMS data misfits, relative distances $r_{i}$ between $m_{i}$ and $u_{i}$ ⁠, XG and OWXG functionals and $γ_{TV} (u)$ with the outer iterations is plotted in Fig. 10. Note that, except for the Fig. 10(c), the same functionals are plotted irrespective of whether or not they are part of the corresponding objective function. Table 1 summarizes the relative performance of the inversions in terms of the relative mean error with respect to the true models: $mRMS = ‖ m_{i} - m_{i, true} ‖ / ‖ m_{i, true} ‖ \cdot 100 %$ ⁠, as well as the RMS covariance-weighted data misfits and the number of iterations.

Table 1.

Open in new tab

Number of outer iterations, data and model normalized RMS errors for the models in Figs 11 and 12.

		MT		FATT		Gravity
	itn	RMS	mRMS [%]	RMS	mRMS [%]	RMS	mRMS [%]
Separate (TV)	24	1.10	13.63	1.06	73.25	0.99	107.37
JTV	24	1.10	13.34	1.04	73.41	1.08	106.40
JTV + XG	28	1.10	13.06	1.08	70.56	1.06	109.11
TV + (OW)XG	26	1.10	13.22	1.04	73.32	1.05	81.28
JTV + (OW)XG	29	1.10	12.88	1.08	71.56	1.09	67.91
JTV + (OW)XG + (41)	25	1.10	11.49	1.07	67.96	1.10	65.16
25%, separate (TV)	18	1.10	16.88	1.01	82.85	0.72	97.85
25%, JTV + OWXG	22	1.09	15.81	0.96	79.92	0.80	68.32

		MT		FATT		Gravity
	itn	RMS	mRMS [%]	RMS	mRMS [%]	RMS	mRMS [%]
Separate (TV)	24	1.10	13.63	1.06	73.25	0.99	107.37
JTV	24	1.10	13.34	1.04	73.41	1.08	106.40
JTV + XG	28	1.10	13.06	1.08	70.56	1.06	109.11
TV + (OW)XG	26	1.10	13.22	1.04	73.32	1.05	81.28
JTV + (OW)XG	29	1.10	12.88	1.08	71.56	1.09	67.91
JTV + (OW)XG + (41)	25	1.10	11.49	1.07	67.96	1.10	65.16
25%, separate (TV)	18	1.10	16.88	1.01	82.85	0.72	97.85
25%, JTV + OWXG	22	1.09	15.81	0.96	79.92	0.80	68.32

Table 1.

Open in new tab

Number of outer iterations, data and model normalized RMS errors for the models in Figs 11 and 12.

		MT		FATT		Gravity
	itn	RMS	mRMS [%]	RMS	mRMS [%]	RMS	mRMS [%]
Separate (TV)	24	1.10	13.63	1.06	73.25	0.99	107.37
JTV	24	1.10	13.34	1.04	73.41	1.08	106.40
JTV + XG	28	1.10	13.06	1.08	70.56	1.06	109.11
TV + (OW)XG	26	1.10	13.22	1.04	73.32	1.05	81.28
JTV + (OW)XG	29	1.10	12.88	1.08	71.56	1.09	67.91
JTV + (OW)XG + (41)	25	1.10	11.49	1.07	67.96	1.10	65.16
25%, separate (TV)	18	1.10	16.88	1.01	82.85	0.72	97.85
25%, JTV + OWXG	22	1.09	15.81	0.96	79.92	0.80	68.32

		MT		FATT		Gravity
	itn	RMS	mRMS [%]	RMS	mRMS [%]	RMS	mRMS [%]
Separate (TV)	24	1.10	13.63	1.06	73.25	0.99	107.37
JTV	24	1.10	13.34	1.04	73.41	1.08	106.40
JTV + XG	28	1.10	13.06	1.08	70.56	1.06	109.11
TV + (OW)XG	26	1.10	13.22	1.04	73.32	1.05	81.28
JTV + (OW)XG	29	1.10	12.88	1.08	71.56	1.09	67.91
JTV + (OW)XG + (41)	25	1.10	11.49	1.07	67.96	1.10	65.16
25%, separate (TV)	18	1.10	16.88	1.01	82.85	0.72	97.85
25%, JTV + OWXG	22	1.09	15.81	0.96	79.92	0.80	68.32

Fig. 11 presents a comparison of the models obtained with different combinations of the structural constraints. Using JTV instead of TV mainly increases the contrast of the anomalies, while the improvement in their structural similarity is minor, most visible in vertical sections of the density model. Adding XG further increases the structural similarity, as evidenced, for example by the shape of the low-velocity blocks (see 3 km depth slices) and the reduced ‘bleeding’ in the resistivity models. Postulating positivity of the velocity–density correlation via the use of OWXG appears to be critical for the reconstruction of density (cf. columns 3 and 4–6 of Fig. 11)—essentially, it brings depth resolution into the gravity inversion. This effect is similar to the effect of the positivity constraint on the susceptibility in magnetic data inversion (Li & Oldenburg 1998). However, also applying OWXG to the resistivity only marginally improves the models and does not result in reconstruction of the bottom-layer resistive blocks since their absence satisfies the OWXG constraint (see the irregular-geometry test below). Note that the pattern of the upper checkerboard layer is different for velocity, resistivity and density, that is structure present in one of the properties is absent in the others, and all the tested structural constraints work fine in this situation, producing no artefacts.

In the next test, we examine the effects of direct ‘petrophysics’ coupling, assuming such information is available for a certain subset of the model parameters. With the exception of the top checkerboard layer, true velocity $v_{p}$ and logarithm of resistivity $ρ$ are linearly related as follows:

\begin{array}{r} p (u_{0}, u_{2}) = - \frac{0.4}{\ln 10} u_{2} + v_{p} (u_{0}) - v_{p, 0} + 0.8 = 0, \end{array}

(40)

where $u_{0}$ is the slowness model transformed by eq. (34), $u_{2} = \ln ρ$ and $v_{p, 0}$ is the background velocity. This constraint is enforced for the bottom checkerboard layer only by adding to $R (u)$ the term

\begin{array}{r} \frac{1}{2} p (u_{0}, u_{2})^{T} W p (u_{0}, u_{2}), \end{array}

(41)

where $W$ is diagonal matrix whose entries are 1000 for the cells below $z = 5$ km and zero otherwise. Adding it to the combination of JTV, XG and OWXG from the previous test somewhat accelerates convergence of the inversion and dramatically improves models of all the properties (see Table 1), in particular, resistive blocks of the bottom checkerboard layer are now recovered (last column of Fig. 11).

To simulate an irregular acquisition geometry often encountered in the real data, particularly in regional studies, we randomly draw from the regular-grid data sets 25 per cent of the MT and gravity locations and 50 per cent of seismic sources and receivers (Fig. 12). These subsamples of the data are inverted separately using TV (Fig. 12a) and jointly using JTV and OWXG (Fig. 12b). OWXG is applied to all pairs of the models with $γ_{01} = γ_{02} = 2000$ ⁠, $γ_{12} = 200$ and $σ_{01} = σ_{02} = - 1$ ⁠, $σ_{12} = 1$ ⁠. Data and model RMS errors and the number of iterations for separate and joint inversions are summarized in Table 1. As we can see from Fig. 12 (especially the resistivity model), joint inversion, by making use of the spatially complementary sensitivities, improves recovery of the features that are poorly resolved by the individual inversions due to the missing stations.

$Slices through the core parts of the models reconstructed using randomly selected 25 per cent of the data. (a) Separate inversions, TV regularization. (b) Joint inversion, JTV + OWXG ($\sigma _{01}=\sigma _{02}=-1$, $\sigma _{12}=1$). Dashed lines denote the true structures, dots—horizontal positions of stations, ‘+’— horizontal positions of seismic sources.$

Figure 12.

Slices through the core parts of the models reconstructed using randomly selected 25 per cent of the data. (a) Separate inversions, TV regularization. (b) Joint inversion, JTV + OWXG (⁠ $σ_{01} = σ_{02} = - 1$ ⁠, $σ_{12} = 1$ ⁠). Dashed lines denote the true structures, dots—horizontal positions of stations, ‘+’— horizontal positions of seismic sources.

Open in new tab Download slide

5.3 Example 3: joint inversion of MT and electrical resistivity tomography data

In this example we demonstrate how the proposed approach can be applied to the situation where the coupled inverse problems use radically different grids. It is inspired by field data from Furnas Volcano, Azores (Hogg et al. 2018), where a 1.5 km electrical resistivity tomography (ERT) profile was acquired across a fault-zone associated fumarole area together with audio-magnetotelluric (AMT) data. For ERT we use 2-D finite-element modelling and inversion implemented in pyGIMLI (Rücker et al. 2017). For AMT, the 3-D version of ModEM (Kelbert et al. 2014) is used. The true model is 2-D, consisting of a 1 $Ω \cdot$ m ‘fault’, 10 $Ω \cdot$ m ‘clay cap’ and a 1000 $Ω \cdot$ m resistor embedded in 100 $Ω \cdot$ m background with topography (Fig. 13). The ERT data are generated using 64 electrodes with 24 m spacing in the Wenner configuration and contaminated by Gaussian noise with the standard deviation of 2 per cent + 1 μV. The AMT data are calculated for four stations with 380 m spacing at nine frequencies in 1–100 Hz range and contaminated by Gaussian noise with 3 per cent standard deviation; only the off-diagonal components of the impedance tensor are inverted.

$Resistivity models used for generating the synthetic (a) ERT and (b) AMT data. Red dots–positions of ERT electrodes, white–AMT stations. In (b), air cells and cells at $y\gt 0$ are removed, the grid is symmetric with respect to $y=0$.$

Figure 13.

Resistivity models used for generating the synthetic (a) ERT and (b) AMT data. Red dots–positions of ERT electrodes, white–AMT stations. In (b), air cells and cells at $y > 0$ are removed, the grid is symmetric with respect to $y = 0$ ⁠.

Open in new tab Download slide

The AMT data are inverted using a $46 \times 16 \times 40$ model grid consisting of a core area with the horizontal cell size of 50 m $\times$ 50 m which is expanded by 5 padding cells in each horizontal direction, with the cell size increasing by a factor of 2.5. In the vertical direction, the cell size increases with depth from 5 m to 1.4 km. The model covariance smoothing is applied twice with a factor of 0.3 along the x- and z-directions and 0.9 along the y-direction. The ERT data are inverted using 2-D Delaunay triangulation grid with the cells expanding with depth starting from two edges between neighbouring electrodes at the surface. Both ERT and AMT inversions start with 100 $Ω \cdot$ m homogeneous initial models.

For the joint inversion, using eq. (7) with $R (u) = ‖ u_{0} - u_{1} ‖^{2} = 0$ ⁠, we get

\begin{array}{r} u_{0} = u_{1} = \frac{1}{α_{0} + α_{1}} (α_{0} P_{0} m_{0} + α_{1} P_{1} m_{1}), \end{array}

(42)

so the reference models for the inverse subproblems are essentially the weighted average of the ERT and AMT models from the previous outer iteration. As both model vectors are logarithms of resistivity, this corresponds to the weighted geometric mean of the resistivities. We define $u$ on a $340 \times 130$ regular grid with a 5 m step and use bilinear interpolation between the cell centres for $P_{i}$ and $Q_{i}$ ⁠. In the AMT case, when mapping from the 3-D model to the 2-D $u$ ⁠, we average along the y-direction over the core part of the grid, to minimize the effect of the model ‘fading’ away from the profile, and when mapping back, ‘broadcast’ along the y-direction over the whole grid.

At each outer iteration of the joint inversion, the ERT inversion is terminated after three Gauss–Newton iterations and the AMT inversion—after 10 NLCG iterations. The regularization parameters are set as follows for all the inversions: ${\hat{α}}_{0} = 40$ (ERT) and ${\hat{α}}_{1} = 1$ (AMT) and are kept constant. To analyse the effect of ERT/AMT weighting we test three values of $α_{0} / α_{1}$ ⁠: 0.1, 1 and 10.

The models resulting from the joint and separate inversions are shown in Fig. 14. As we can see from the separate inversion results, the near-surface part of the model is well resolved by ERT, but not by AMT, due to large spacing between the stations and the lack of high frequencies, while the deeper structure is resolved by AMT. Joint inversions achieve better reconstruction of the whole model, but at the expense of slightly higher RMS values in AMT: all joint inversions were terminated after 30 outer iterations without the RMS falling below 1.1. Fig. 15 shows the progress of the joint inversions in terms of RMS and $r_{i}$ ⁠. The high RMS values in AMT are largely due to the resistive artefact to the left of the ‘fault’, which is produced by the ERT inversion on the edge of its high-sensitivity region. Using, instead of scalar $α_{i}$ ⁠, weighting matrices proportional to the integrated sensitivities should suppress such artefacts and further improve the joint inversion results. We see that the mapping between different grids based on a simple linear interpolation works reasonably well at least when the methods have comparable resolution lengths and the resistivity models are smooth. However, in more extreme cases accurate mapping of resistivities may require material averaging (see, e.g. Commer & Newman 2008) which will replace $P_{i}$ and $Q_{i}$ by nonlinear operators.

$Resistivity models recovered by separate ERT ($i=0$) and AMT ($i=1$) inversions and joint inversions with three different values of $\alpha _0 / \alpha _1$. For the joint inversions, $\mathbf {u}_i$ are also shown. The corresponding RMS, mRMS and $r_i$ values are indicated. The ERT models are masked according to the integrated sensitivity and for the AMT models the $y=0$ slices through the core part of the grid are shown. True structure is indicated by the black contour, positions of the electrodes/stations—by red dots.$

Figure 14.

Resistivity models recovered by separate ERT (⁠ $i = 0$ ⁠) and AMT (⁠ $i = 1$ ⁠) inversions and joint inversions with three different values of $α_{0} / α_{1}$ ⁠. For the joint inversions, $u_{i}$ are also shown. The corresponding RMS, mRMS and $r_{i}$ values are indicated. The ERT models are masked according to the integrated sensitivity and for the AMT models the $y = 0$ slices through the core part of the grid are shown. True structure is indicated by the black contour, positions of the electrodes/stations—by red dots.

Open in new tab Download slide

$Progress of the ERT–AMT joint inversions for three different values of $\alpha _0 / \alpha _1$.$

Figure 15.

Progress of the ERT–AMT joint inversions for three different values of $α_{0} / α_{1}$ ⁠.

Open in new tab Download slide

6 CONCLUSIONS

We have developed a flexible general framework for multiphysics joint inversion of any number of geophysical data sets. It is based on the variable splitting technique that introduces an auxiliary multiparameter model space in which minimization of the coupling and stabilizing functionals is carried out. By defining a mapping between this space and the model spaces via interpolation, completely different parametrizations can be applied to different models. Each of the inversion supbroblems and the coupling-regularization subproblem can be solved using a different optimization algorithm. For each subproblem, the linking term controlling the distance between the model and the corresponding auxiliary variable takes the form of a quadratic regularization with a reference model. Thus, any existing inversion code supporting such regularization can be integrated, without modifications, into this joint inversion framework. Using the developed framework, we have implemented 3-D joint inversion of magnetotelluric, seismic refraction and gravity data. We have implemented and tested on synthetic data different combinations of coupling constraints based on joint total variation, joint minimum support, cross-gradient and one-way cross-gradient, as well as an explicit relationship between the properties. The joint minimum support is a promising coupling regularization for multiphysics time-lapse inversions. The joint total variation and the cross-gradient are complementary, with JTV providing regularization and coupling edge-like features of the models by aligning their positions and XG coupling smoother features of the models by aligning gradient directions. The one-way cross-gradient can dramatically improve the effects of structural coupling in certain problems: in particular, for the gravity inversion it eliminates the non-uniqueness related to the lack of depth resolution. We expect that the developed approach will be especially useful for joint inversion with more complicated priors.

ACKNOWLEDGEMENTS

We would like to thank the editors, Colin Farquharson and two anonymous reviewers for their constructive comments that helped to improve the manuscript. This work has emanated from research supported in part by: a research grant to iCRAG from Science Foundation Ireland (SFI) under Grant Number 13/RC/2092 and co-funded under the European Regional Development Fund and by iCRAG industry partners; and a research grant to iCRAG2 from SFI under Grant Number 13/RC/2092_P2. Finally, we would like to thank Gary Egbert, Anna Kelbert and Naser Meqbel for making their ModEM code available to us.

DATA AVAILABILITY

The data underlying this paper will be shared on request to the corresponding author.

REFERENCES

Afonso

J.C.

Fullea

Yang

Connolly

Jones

2013

3-D multi-observable probabilistic inversion for the compositional and thermal structure of the lithosphere and upper mantle. II: general methodology and resolution analysis

J. geophys. Res.

118

(

1650

–

1676

Ajo-Franklin

J.B.

Minsley

B.J.

Daley

T.M.

2007

Applying compactness constraints to differential traveltime tomography

Geophysics

(

R67

–

R75

Aminzadeh

Jean

Kunz

1997

3-D Salt and Overthrust Models

Society of Exploration Geophysicists

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Astic

Heagy

L.J.

Oldenburg

D.W.

2021

Petrophysically and geologically guided multi-physics inversion using a dynamic gaussian mixture model

J. geophys. Int.

224

(

–

Blomgren

Chan

T.F.

1998

Color TV: total variation methods for restoration of vector-valued images

IEEE Trans. Image Process.

(

304

–

309

Bosch

1999

Lithologic tomography: from plural geophysical data to lithology estimation

J. geophys. Res.

104

(

749

–

766

Chen

Hoversten

G.M.

Vasco

Rubin

Hou

2007

A Bayesian model for gas saturation estimation using marine seismic AVA and CSEM data

Geophysics

(

WA85

–

WA95

Colombo

Rovetta

2018

Coupling strategies in multiparameter geophysical joint inversion

J. geophys. Int.

215

(

1171

–

1184

Colombo

Turkoglu

Rovetta

2021

Coupled physics-deep learning inversion

Comput. Geosci.

157

, doi:10.1016/j.cageo.2021.104917.

10.1016/j.cageo.2021.104917

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Commer

Newman

G.A.

2008

New advances in three-dimensional controlled-source electromagnetic inversion

J. geophys. Int.

172

(

513

–

535

10.1111/j.1365-246X.2007.03663.x

Crestel

Stadler

Ghattas

2018

A comparative study of structural similarity and regularization for joint inverse problems governed by pdes

Inverse Problems

(

), doi:10.1088/1361-6420/aaf129.

10.1088/1361-6420/aaf129

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Droujinine

Vasilevsky

Evans

2007

Feasibility of using full tensor gradient (FTG) data for detection of local lateral density contrasts during reservoir monitoring

J. geophys. Int.

169

(

795

–

820

10.1111/j.1365-246X.2007.02806.x

Egbert

G.D.

Kelbert

2012

Computational recipes for electromagnetic inverse problems

J. geophys. Int.

189

(

251

–

267

10.1111/j.1365-246X.2011.05347.x

Gallardo

L.A.

Meju

M.A.

2003

Characterization of heterogeneous near-surface materials by joint 2D inversion of dc resistivity and seismic data

Geophys. Res. Lett.

(

), doi:10.1029/2003GL017370.

10.1029/2003GL017370

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Gao

Abubakar

Habashy

T.M.

2012

Joint petrophysical inversion of electromagnetic and full-waveform seismic datajoint petrophysical em and seismic inversion

Geophysics

(

WA3

–

WA18

10.1190/geo2011-0157.1

Gao

Zhang

2018

An efficient sequential strategy for realizing cross-gradient joint inversion: method and its application to 2-D cross borehole seismic traveltime and dc resistivity tomography

J. geophys. Int.

213

(

1044

–

1055

Goldstein

Osher

2009

The split Bregman method for l1-regularized problems

SIAM J. Imag. Sci.

(

323

–

343

Gramfort

Kowalski

Hämäläinen

2012

Mixed-norm estimates for the M/EEG inverse problem using accelerated gradient methods

Phys. Med. Biol.

(

), doi:10.1088/0031-9155/57/7/1937.

10.1088/0031-9155/57/7/1937

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Haber

Holtzman Gazit

2013

Model fusion and joint inversion

Surv. Geophys.

(

675

–

695

10.1007/s10712-013-9232-4

Haber

Ascher

U.M.

Oldenburg

2000

On optimization techniques for solving nonlinear inverse problems

Inverse Problems

(

), doi:10.1088/0266-5611/16/5/309.

10.1088/0266-5611/16/5/309

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Heincke

Jegen

Moorkamp

Hobbs

R.W.

Chen

2017

An adaptive coupling strategy for joint inversions that use petrophysical information as constraints

J. appl. Geophys.

136

279

–

297

10.1016/j.jappgeo.2016.10.028

Hilbich

2010

Time-lapse refraction seismic tomography for the detection of ground ice degradation

Cryosphere

(

243

–

259

10.5194/tc-4-243-2010

Hogg

et al. ,

2018

3-d interpretation of short-period magnetotelluric data at Furnas Volcano, Azores Islands

J. geophys. Int.

213

(

371

–

386

Abubakar

Habashy

T.M.

2009

Joint electromagnetic and seismic inversion using structural constraints

Geophysics

(

R99

–

R109

Jorgensen

Zhdanov

M.S.

2021

Recovering magnetization of rock formations by jointly inverting airborne gravity gradiometry and total magnetic intensity data

Minerals

(

), doi:10.3390/min11040366.

10.3390/min11040366

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Juhojuntti

Kamm

2015

Joint inversion of seismic refraction and resistivity data using layered models-applications to groundwater investigation

Geophysics

(

EN43

–

EN55

10.1190/geo2013-0476.1

Kamm

Lundin

I.A.

Bastani

Sadeghi

Pedersen

L.B.

2015

Joint inversion of gravity, magnetic, and petrophysical data—a case study from a Gabbro intrusion in Boden, Sweden joint potential-field inversion

Geophysics

(

B131

–

B152

10.1190/geo2014-0122.1

Kelbert

Meqbel

Egbert

G.D.

Tandon

2014

Modem: a modular system for inversion of electromagnetic geophysical data

Comput. Geosci.

–

10.1016/j.cageo.2014.01.010

Khan

Connolly

Maclennan

Mosegaard

2007

Joint inversion of seismic and gravity data for lunar composition and thermal state

J. geophys. Int.

168

(

243

–

258

10.1111/j.1365-246X.2006.03200.x

Last

Kubik

1983

Compact gravity inversion

Geophysics

(

713

–

721

Lelièvre

P.G.

Farquharson

C.G.

Hurich

C.A.

2012

Joint inversion of seismic traveltimes and gravity data on unstructured grids with application to mineral exploration

Geophysics

(

–

K15

Oldenburg

D.W.

1998

3-D inversion of gravity data

Geophysics

(

109

–

119

Oldenburg

D.W.

2003

Fast inversion of large-scale magnetic data using wavelet transforms and a logarithmic barrier method

J. geophys. Int.

152

(

251

–

265

10.1046/j.1365-246X.2003.01766.x

Loris

Nolet

Daubechies

Dahlen

2007

Tomographic inversion using

ℓ

1-norm regularization of wavelet coefficients

J. geophys. Int.

170

(

359

–

370

10.1111/j.1365-246X.2007.03409.x

Malovichko

Khokhlov

Yavich

Zhdanov

M.S.

2020

Incorporating known petrophysical model in the seismic full-waveform inversion using the Gramian constraint

Geophys. Prospect.

(

1361

–

1378

10.1111/1365-2478.12932

Manassero

Afonso

J.C.

Zyserman

Jones

Zlotnik

Fomin

2021

A reduced order approach for probabilistic inversions of 3D magnetotelluric data II: joint inversion of mt and surface-wave data

J. geophys. Res.

126

(

–

Mandolesi

Jones

A.G.

2014

Magnetotelluric inversion based on mutual information

J. geophys. Int.

199

(

242

–

252

Molodtsov

Troyan

2017

Multiphysics joint inversion through joint sparsity regularization

, in

SEG Technical Program Expanded Abstracts 2017

1262

–

1267

Society of Exploration Geophysicists

Molodtsov

Kashtan

Roslov

2011

Joint inversion of seismic and magnetotelluric data with structural constraint based on dot product of image gradients

, in

SEG Technical Program Expanded Abstracts 2011

740

–

744

Society of Exploration Geophysicists

Moorkamp

2021

Joint inversion of gravity and magnetotelluric data from the ernest-henry iocg deposit with a variation of information constraint

, in

First International Meeting for Applied Geoscience & Energy

1711

–

1715

Society of Exploration Geophysicists

10.1190/segam2021-3582000.1

Moorkamp

Heincke

Jegen

Roberts

A.W.

Hobbs

R.W.

2011

A framework for 3-D joint inversion of mt, gravity and seismic refraction data

J. geophys. Int.

184

(

477

–

493

10.1111/j.1365-246X.2010.04856.x

Moorkamp

Lelièvre

P.G.

Linde

Khan

2016

Integrated Imaging of the Earth: Theory and Applications

, Vol.

218

John Wiley & Sons

10.1002/9781118929063

Nocedal

Wright

S.J.

1999

Numerical Optimization

Springer

Patanè

Barberi

Cocina

De Gori

Chiarabba

2006

Time-resolved seismic tomography detects magma intrusions at mount etna

Science

313

(

5788

821

–

823

10.1126/science.1127724

Podvin

Lecomte

1991

Finite difference computation of traveltimes in very contrasted velocity models: a massively parallel approach and its associated tools

J. geophys. Int.

105

(

271

–

284

10.1111/j.1365-246X.1991.tb03461.x

Portniaguine

Zhdanov

M.S.

1999

Focusing geophysical inversion images

Geophysics

(

874

–

887

Reitz

Krahenbuhl

2015

Feasibility of time-lapse gravity and gravity gradiometry monitoring for steam-assisted gravity drainage reservoirs

Geophysics

(

WA99

–

WA111

10.1190/geo2014-0217.1

Rücker

Günther

Wagner

F.M.

2017

pyGIMLi: an open-source library for modelling and inversion in geophysics

Comput. Geosci.

109

106

–

123

Rudin

L.I.

Osher

Fatemi

1992

Nonlinear total variation based noise removal algorithms

Physica D

(

1–4

259

–

268

10.1016/0167-2789(92)90242-F

Sun

2016

Joint inversion of multiple geophysical and petrophysical data using generalized fuzzy clustering algorithms

Geophys. Suppl. Mon. Not. R. astr. Soc.

208

(

1201

–

1216

Tikhonov

A.N.

Arsenin

V.Y.

1977

Solutions of Ill-Posed Problems

, Vol.

330

WH Winston

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Zhdanov

M.S.

2022

Joint focusing inversion of marine controlled-source electromagnetic and full tensor gravity gradiometry data

Geophysics

(

K35

–

K47

10.1190/geo2021-0691.1

E.S.

Commer

Newman

G.A.

2014

A strategy for coupled 3D imaging of large-scale seismic and electromagnetic data sets: application to subsalt imaging

Geophysics

(

ID1

–

ID13

10.1190/geo2013-0053.1

Vogel

C.R.

Oman

M.E.

1996

Iterative methods for total variation denoising

SIAM J. Sci. Comput.

(

227

–

238

Vozoff

Jupp

1975

Joint inversion of geophysical data

J. geophys. Int.

(

977

–

991

10.1111/j.1365-246X.1975.tb06462.x

Waldhauser

Ellsworth

W.L.

2000

A double-difference earthquake location algorithm: method and application to the Northern Hayward Fault, California

Bull. seism. Soc. Am.

(

1353

–

1368

Wang

Yang

Yin

Zhang

2008

A new alternating minimization algorithm for total variation image reconstruction

SIAM J. Imag. Sci.

(

248

–

272

2005

Representations of inverse covariances by differential operators

Adv. Atmos. Sci.

(

181

–

198

Yanovskaya

Ditmar

1990

Smoothness criteria in surface wave tomography

J. geophys. Int.

102

(

–

10.1111/j.1365-246X.1990.tb00530.x

Zhdanov

M.S.

2002

Geophysical Inverse Theory and Regularization Problems

, Vol.

Elsevier

Zhdanov

M.S.

Gribenko

Wilson

2012

Generalized joint inversion of multimodal geophysical data using Gramian constraints

Geophys. Res. Lett.

(

), doi:10.1029/2012GL051233.

10.1029/2012GL051233

Google Scholar

OpenURL Placeholder Text

WorldCat

Crossref

Zhdanov

M.S.

Čuma

2022

Cooperative inversion of multiphysics data using joint minimum entropy constraints

Near Surf. Geophys.

(

623

–

636

Zheglova

Lelièvre

P.G.

Farquharson

C.G.

2018

Multiple level-set joint inversion of traveltime and gravity data with application to ore delineation: a synthetic study

Geophysics

(

R13

–

R30

10.1190/geo2016-0675.1

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Download all slides

Month:	Total Views:
August 2024	62
September 2024	225
October 2024	92
November 2024	106
December 2024	106
January 2025	79
February 2025	44
March 2025	90
April 2025	41

Article Contents

Decoupled joint inversion with variable splitting: example scheme for magnetotelluric, seismic and gravity data

SUMMARY

1 INTRODUCTION

2 JOINT INVERSION BY VARIABLE SPLITTING AND ALTERNATING MINIMIZATION

3 REGULARIZATION AND COUPLING

3.1 Joint total variation

3.2 Joint minimum support

3.3 Modifications of cross-gradient

4 APPLICATION TO MT, FATT AND GRAVITY

5 NUMERICAL EXAMPLES

5.1 Example 1: time-lapse seismic–gravity joint inversion using JMS regularization

5.2 Example 2: MT–seismic–gravity joint inversion using different regularization and coupling

5.3 Example 3: joint inversion of MT and electrical resistivity tomography data

6 CONCLUSIONS

ACKNOWLEDGEMENTS

DATA AVAILABILITY

REFERENCES

Citations

Views

Altmetric

Email alerts

Astrophysics Data System

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Decoupled joint inversion with variable splitting: example scheme for magnetotelluric, seismic and gravity data

SUMMARY

1 INTRODUCTION

2 JOINT INVERSION BY VARIABLE SPLITTING AND ALTERNATING MINIMIZATION

3 REGULARIZATION AND COUPLING

3.1 Joint total variation

3.2 Joint minimum support

3.3 Modifications of cross-gradient

4 APPLICATION TO MT, FATT AND GRAVITY

5 NUMERICAL EXAMPLES

5.1 Example 1: time-lapse seismic–gravity joint inversion using JMS regularization

5.2 Example 2: MT–seismic–gravity joint inversion using different regularization and coupling

5.3 Example 3: joint inversion of MT and electrical resistivity tomography data

6 CONCLUSIONS

ACKNOWLEDGEMENTS

DATA AVAILABILITY

REFERENCES

Citations

Views

Altmetric

Email alerts

Astrophysics Data System

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only