Enhancing learning to solve multicomponent fractional viscoelastic equations with U-net Fourier neural operators

Abstract

The research of viscoelastic media is currently a hot topic in the interpretation and processing of seismic data. To accurately simulate the propagation of seismic waves in viscoelastic media, the fractional viscoelastic equation has emerged as an indispensable method. However, solving this equation numerically has proven to be challenging due to the complexity introduced by its fractional Laplacian operators. Recently, deep learning, especially Fourier neural operators (FNO), has shown excellent performance in learning to fast solve partial differential equations. Traditional FNO methods may face crosstalk problems and this make it difficult to achieve satisfactory accuracy when solving the multicomponent fractional order viscoelastic equation. To solve this problem, we introduce a novel approach based on U-net Fourier neural operator (U-FNO). As an enhanced learning method to the traditional FNO-based method, the U-FNO-based method integrates a U-Fourier layer following the standard Fourier layer as a form of regularization, thereby achieving superior prediction accuracy for multicomponent equations. Specifically, both the Fourier layers and U-Fourier layers in U-FNO are trained with the solutions of the equation from previous time steps as inputs. This training process enables the U-FNO to efficiently produce more accurate solutions for subsequent wavefield. Numerical simulations reveal that the U-FNO-based method efficiently learns to solve the fractional viscoelastic wave equation independent of fractional Laplacian operators. Additionally, U-FNO-based method offers superior prediction accuracy in comparison with the traditional FNO-based method.

fractional viscoelastic equations, partial differential equations, U-net Fourier neural operators, deep learning

Issue Section:

Research article

1. Introduction

Seismic wave forward modelling is essential in the process of seismic data processing (Carcione 1993; Carcione et al. 2002; Liu & Sen 2009a,b; Ren & Liu 2013; Xu et al. 2023). During their propagation through media, seismic waves are affected by attenuation and elasticity: inherent properties of the earth (Robertsson et al. 1994; Alkhalifah 2000; Duveneck & Bakker 2011). The viscoelastic equation, widely recognized for describing the characteristics of viscoelastic media, has garnered considerable attention. Numerous scholars have contributed to the advancement of the fractional viscoelastic wave equation, utilizing constant-Q models to simulate these properties during the propagation of the seismic wavefield (Kalyani et al. 2014). This equation can describe the attenuating effects in an attenuating medium (Li et al. 2016; Zhu 2017; Zhu et al. 2019). However, the solution to the complex equation needs several Fourier transforms because of the fractional Laplacian operators, which are associated with the quality factor Q. Considering the constraints of computing hardware, the necessity for multiple Fourier transforms constitutes a major problem (Yao et al. 2017). Furthermore, to accurately represent the complex propagation of the wavefield, the solution to the viscoelastic equation comprises multiple components, adding to the challenge of solving the equation. Recently, a range of techniques including low-rank decomposition (Sun et al. 2015; Chen et al. 2019), Taylor series expansion (Guo et al. 2016; Zhang et al. 2020), and independent fractional operators (Patnaik et al. 2020; Wang et al. 2022) are introduced into improve the simulation of viscoelastic media. To obtain seismic wavefields of higher frequency and resolution, these methods necessitate finer grid discretization, leading to increased computational costs in forward modelling (Liu & Sen 2011). This limitation often presents a significant challenge in applications such as seismic migration and inversion. Selecting the appropriate solution methods to enhance both accuracy and computational efficiency has emerged as a primary research focus in recent years (Xing & Zhu 2019; Xiong & Guo 2022; Zhou et al. 2023).

The successful implementation of deep learning, particularly through data-driven methods, has inspired numerous scholars to explore suitable approaches for solving partial differential equations (PDEs). A data-driven method for solving PDEs involves learning the mathematical and physical behaviour of these equations from data in a supervised learning framework without relying on complex traditional equations (LeCun et al. 2015; Qu et al. 2022). Recent progress in data-driven-based techniques has dramatically improved computational efficiency, often achieving speeds several orders of magnitude faster than those of traditional solvers. In the realm of data-driven strategies for seismic simulation, two popular methods are widely used. The initial method is the spatial mapping technique, which frequently utilizes convolutional neural networks (CNNs) to apply finite-dimensional operators within its category. These methods have demonstrated remarkable success in quickly and accurately predicting outputs for high-dimensional PDEs (Jiang et al. 2021; Zhong et al. 2023). However, the lack of physical information results in reduced prediction accuracy for this technique (Wang et al. 2017; Smith et al. 2020; Bhattacharya et al. 2021; Wen et al. 2021). The second technique employs the physical information constraint approach, integrating these constraints into a complex network to achieve solutions with high accuracy (Raissi et al. 2019; Zhang et al. 2023). During training, this method (such as physical information neural network (PINN)) automatically satisfies specific physical information using automatic differentiation, leading to enhanced accuracy and improved generalization. Unlike data-driven CNN-based methods, the PINN-based approach falls into the category of wave-equation methods and shares strong resemblances to classical numerical solvers (Moseley et al. 2020; Alkhalifah et al. 2021; Karniadakis et al. 2021; Song et al. 2021; Chen & Ge 2023). The PINN-based method is more suitable for equations that are amenable to solution by finite-difference methods. However, solving this fractional equation with the finite-difference method poses challenges, rendering the PINN-based method unsuitable for equation solving (Wang et al. 2018).

Currently, the Fourier neural operator (FNO) is introduced to solve the problem of PDEs in the supervised learning framework (Li et al. 2020a,b). Differing from the methods previously mentioned, FNO adopts an inductive bias strategy by integrating physical information within the structure of the convolutional blocks (Lu et al. 2019; Grady et al. 2023). The FNO-based method uses a framework that accurately captures the configuration of the wave propagator in the Fourier domain. It is achieved through the use of two fully connected layers, specifically designed for the spatial and wavenumber domains (Kosloff & Baysal 1982; Stoffa et al. 1990). This strategy estimates the mathematical–physical properties of PDEs by establishing the mapping with solutions of the equations at different times and spatial locations (Rashid et al. 2022). Subsequently, the FNO-based method has been widely applied to learning solutions for seismic wave equations (Song & Wang 2022; Zhang et al. 2023). The commonly used training strategy in the time domain is to use the wavefield snapshot at this time step to predict the wavefield snapshot at next time step and establish an appropriate loss function to update the network (Wei & Fu 2022). Recently, by introducing model parameters to modify the FNO network structure, the FNO method has been successfully applied to the rapid inversion of model parameters (Yang et al. 2023; Yin et al. 2023). The FNO-based method has shown great potential for development in the fields of forward modelling and inversion in geophysics. However, its prediction accuracy for complex equations falls short because of the intrinsic regularization effect present in the FNO framework. Meanwhile, solving the viscoelastic equation, especially with multiple component solutions, poses a significant challenge. Recently, U-net has become increasingly popular as a deep learning architecture for data analysis, especially in image processing (Long et al. 2015). It uses convolutional layers to extract features and has succeeded in tasks like image classification and object detection (Krizhevsky et al. 2012; He et al. 2016). Combining the benefits of both FNO and U-net, researchers have proposed the U-net Fourier neural operator (U-FNO) and solved the problems with complex equations, such as multiphase flow equations (Wen et al. 2022). Their research declares that the U-FNO-based method outperforms the traditional FNO in improving the accuracy of complex PDE solutions. Currently, U-FNO has yet to be applied in the field of geophysics.

We examine the performance of FNOs in learning viscoelastic wave equations and compare the predictive performance between the standard FNO-based method and the U-FNO-based method. The U-FNO method combines the advantages of both FNO- and U-net-based methods. The U-net, employed as a regularization term, effectively addresses the crosstalk problem in traditional FNO methods for multicomponent equations and achieves high-accuracy solutions for multicomponent fractional viscoelastic equations. This provides a foundation for achieving high-precision inversion by modifying the network.

This paper first introduces the fractional viscoelastic equation. Subsequently, we offer an overview of the FNO framework used for seismic modelling. Next, the theory of the U-FNO architecture for seismic modelling is introduced. Finally, we assess the precision and performance of our proposed method by a homogeneous model and a partial Hess model.

2. Methodology

In this section, we detail the fractional viscoelastic equation along with its solving techniques, discuss the structure and problem of the FNO, and introduce the U-FNO framework as an innovative solution to these problems.

2.1. The introduction of fractional viscoelastic equation

The 2D fractional viscoelastic equation, derived from the constant-Q model, is often formulated as follows (Zhu & Carcione 2014):

\begin{array}{r} {\begin{array}{l} \frac{\partial v_{x}}{\partial t} = \frac{1}{ρ} \frac{\partial τ_{x x}}{\partial x} + \frac{1}{ρ} \frac{\partial τ_{x z}}{\partial z}, \\ \frac{\partial v_{z}}{\partial t} = \frac{1}{ρ} \frac{\partial τ_{x z}}{\partial x} + \frac{1}{ρ} \frac{\partial σ_{z z}}{\partial x}, \\ \frac{\partial τ_{x x}}{\partial t} = D_{p} \frac{\partial v_{x}}{\partial x} + D_{p} \frac{\partial v_{z}}{\partial z} - 2 D_{s} \frac{\partial v_{z}}{\partial z}, \\ \frac{\partial t_{z z}}{\partial t} = D_{p} \frac{\partial v_{x}}{\partial x} - 2 D_{x} \frac{\partial v_{x}}{\partial x} + D_{p} \frac{\partial v_{z}}{\partial z}, \\ \frac{\partial τ_{x z}}{\partial t} = D_{s} \frac{\partial v_{z}}{\partial x} + D_{s} \frac{\partial v_{x}}{\partial z}, \end{array} \end{array}

(1)

where $v = (v_{x}, v_{z})$ and $t = (τ_{x x}, τ_{z x}, τ_{z z})$ represent the particle velocity and stress components, respectively, ρ denotes the density, and D_ξ can be described as follows:

\begin{array}{r} D_{ξ} (t) \approx a_{ξ} {(- \nabla^{2})}^{γ_{ξ}} + b_{ξ} {(- \nabla^{2})}^{γ_{ξ} - 0.5} \partial_{t}, \end{array}

(2)

where $ξ$ means the parameters related to the P- and S-waves individually; $γ_{ξ} = \arctan (1 / Q_{ξ}) / π$ represents the fractional operators, which are associated with the quality factor $Q_{ξ}$ ⁠; $\nabla^{2}$ denotes the Laplacian operator, $a_{ξ}$ and $b_{ξ}$ are the intermediate variables that can be described as follows:

\begin{array}{r} {\begin{array}{l} a_{ξ} = M_{ξ} {(V_{ξ} / ω_{0})}^{2 γ} \cos (π γ_{ξ}), \\ b_{ξ} = M_{ξ} {(V_{ξ} / ω_{0})}^{2 γ - 1} ω_{0}^{- 1} \sin (π γ_{ξ}), \\ M_{ξ} = ρ V_{ξ}^{2} \cos^{2} (\frac{π γ_{ξ}}{2}), \end{array} \end{array}

(3)

where $ω_{0} = 2 π f_{0}$ stands for the reference angular frequency and $V_{ξ} = (V_{p 0}, V_{s 0})$ represent the reference P-velocity and reference S-velocity.

Owing to the involvement of two fractional Laplacian operators ${(- \nabla^{2})}^{γ_{ξ}}$ and ${(- \nabla^{2})}^{γ_{ξ} - 0.5}$ ⁠, which are dependent on the quality factor Q and exhibit variations across different spatial positions, solving them using the conventional finite-difference method remains a substantial challenge. Currently, several methods have been proposed to achieve efficient and accurate solutions for fractional Laplacian operators, enabling rapid and precise propagation of viscoelastic wavefields. In this paper, the staggered grid pseudo-spectral (SGPS) method is employed as a widely used technique for solving the viscoelastic equation to obtain the training and testing data. To enable the SGPS method to solve variable fractional equations of complex models, we approximate the variable Q value with an average Q value. In solving Equation (1), the solution frequently manifests as particle velocity, represented by $v = (v_{x}, v_{z})$ ⁠. To obtain this solution, we further rewrite Equation (1) as follows:

\begin{array}{r} {\begin{array}{l} \frac{\partial v {(t)}_{x}}{\partial t} = \frac{1}{ρ} \frac{\partial τ {(t)}_{x x}}{\partial x} + \frac{1}{ρ} \frac{\partial τ {(t)}_{x z}}{\partial z}, \\ \frac{\partial v {(t)}_{z}}{\partial t} = \frac{1}{ρ} \frac{\partial τ {(t)}_{x z}}{\partial x} + \frac{1}{ρ} \frac{\partial σ {(t)}_{z z}}{\partial z}, \\ \frac{\partial τ {(t)}_{x x}}{\partial t} = D_{p} \frac{\partial v {(t - 1)}_{x}}{\partial x} + D_{p} \frac{\partial v {(t - 1)}_{z}}{\partial z} - 2 D_{S} \frac{\partial v {(t - 1)}_{z}}{\partial z} \\ \frac{\partial τ {(t)}_{z z}}{\partial t} = D_{p} \frac{\partial v {(t - 1)}_{x}}{\partial x} - 2 D_{S} \frac{\partial v {(t - 1)}_{x}}{\partial x} + D_{p} \frac{\partial v {(t - 1)}_{z}}{\partial z}, \\ \frac{\partial τ {(t)}_{x z}}{\partial t} = D_{S} \frac{\partial v {(t - 1)}_{z}}{\partial x} + D_{S} \frac{\partial v {(t - 1)}_{x}}{\partial z}, \end{array}, \end{array}

(4)

where t and $t - 1$ represent this time step and the previous time step, respectively. This can be further simplified to the following form:

\begin{array}{r} v (t) = G (v (t - 1)), \end{array}

(5)

where G is a function representing Equation (4). From Equation (5), to predict the wavefield snapshot at this time step, we only need the wavefield at previous time step. To simplify the subsequent use of symbols we let $v_{t}$ and $v_{t - 1}$ stand for $v (t)$ and $v (t - 1)$ ⁠, respectively.

2.2. The review of FNO architecture for seismic modelling

Currently, an innovative method known as the FNO is emerged, which entails the direct parameterization of the integral kernel within Fourier domain. Its main objective is to build a mapping in an infinite-dimensional space with a limited set of an input–output dataset, a critical task in geophysics for modelling complex wave propagation. This can be accomplished by employing convolution with functions related to low wavenumbers to capture global features, and then applying an activation function to these features to transition back to the high wavenumber model. This process is crucial for modelling seismic wave propagation. The exceptional capability of the FNO allows it to approximate functions characterized by Fourier modes. Its architecture is comprised of multiple Fourier layers, with each layer hosting two fully connected layers in both the spatial and wavenumber domains, as shown in Fig. 1b. The fully connected layers in the spatial domain, which includes a local transform W, plays a pivotal role as the trainable phase-screen compensator, enabling a model to accommodate local variations. Meanwhile, the fully connected layer within the wavenumber domain integrates a sequence of transformations: a Fourier transform F, followed by a linear transformation R, and concludes with an inverse Fourier transform $F^{- 1}$ ⁠; It serves as a non-local spatial convolution operator and functions as a wavenumber filter within the process of phase-shift. We introduce a four-layer FNO architecture designed for learning to solve the fractional viscoelastic equation, illustrated in Fig. 1a. The FNO layer receives two-component wavefields at a partial time step as input and predicts the two-component wavefields for the subsequent time step. The mathematical representation of this learning process is as follows:

\begin{array}{r} F o u t_{L} = R e L U (\begin{array}{c} W_{x, z} (F C 1 (v_{x (i - 1)} (x, z) + v_{z (i - 1)} (x, z))) + \\ F_{x, z}^{- 1} (R_{k_{x}, k_{z}} \cdot F_{x, z} (F C 1 (v_{x (i - 1)} (x, z) + v_{z (i - 1)} (x, z)))) \end{array}), \end{array}

(6)

where $i - 1$ represents the previous time step and $F C 1$ is a full connection layer that elevates the data to a higher-dimensional channel. Equation (6) reveals the data flow between the FNO time-space and time-wavenumber domains. The predicted wavefield at this time step i is:

\begin{array}{r} [v_{x (i)} (x, z), v_{z (i)} (x, z)] = F C 2 \cdot F o u t_{L}, \end{array}

(7)

where $F C 2$ includes the two full connection layers and a $R e L U$ function, and it make data back to the target dimension. Equations (6) and (7) can also be described as:

\begin{array}{r} v_{t} = O (v_{t - 1}), \end{array}

(8)

where O stands for a nonlinear map operator.

Figure 1.

Training and prediction processes by FNO (a) and U-FNO (c) and the architectures of Fourier layers (b) and U-Fourier layers (d).

Open in new tab Download slide

By comparing Equation (5) with Equation (8), it becomes clear that the method based on the FNO seeks to refine the nonlinear mapping operator O to capture the mathematical–physical characteristics of the function G. This objective can be attained with optimizing a loss function that evaluates the difference between the predicted $O (v_{t - 1})$ and true wavefields $G (v_{t - 1})$ through the L2 norm, a widely accepted metric for quantifying the difference between two datasets. This process allows the FNO to capture the complex interactions and dynamics within the seismic wavefield, thereby enhancing its predictive power and accuracy in solving the fractional viscoelastic equation. Unfortunately, when the FNO-based method solves multicomponent equations, crosstalk occurs between the two components, affecting the network's accuracy in solving equations.

2.3. The theory of U-FNO architecture for seismic modelling

U-FNO is introduced to enhance the optimization of FNO. It consists of the U-Fourier layer (Fig. 1d) and the Fourier layer (Fig. 1b). By incorporating a U-net architecture, the U-Fourier layer improves its capability for parameterizing high-frequency information in the wavefield through local convolutional kernels, distinguishing it from the traditional Fourier layer. This addition increases the flexibility and robustness of FNO, making it more effective in solving the fractional viscoelastic wave equation and in handling complex seismic wavefield modelling. The training process in wavefield modelling can be revised by employing the U-FNO-based method, as shown in Fig. 1c. Specifically, the two-component wavefield with partial time steps is sequentially processed through the FNO layer followed by the U-FNO layer. The process of U-FNO, which is aimed at obtaining the solution of the fractional viscoelastic equation, is as follows:

The U-FNO employs the wavefield snapshots from the previous several time steps (⁠ $N_{t}$ ⁠) as input, which is then elevated to a higher-dimensional channel domain through a fully connected layer (⁠ $F C 1$ ⁠), leading to a promoted snapshots of wavefield (V).
This promoted snapshots of wavefield (V) is further refined with some Fourier layers. In each Fourier layer, the promoted wavefield is transformed into the wavenumber domain by a sequence of operations: Fourier transform F, linear transform R, and inverse Fourier transform $F^{- 1}$ ⁠. Meanwhile, this promoted wavefield is linearly transformed by local transform (W) in the spatial domain. Moreover, the outputs of these two parts are combined by activation function $R e L U$ to obtain the output from this Fourier layer. Subsequently, this output is input into the next Fourier layer.
The final output from these Fourier layer (⁠ $F o u t_{L}$ ⁠) serves as the input for several U-Fourier layers. In each U-Fourier layer, the input is processed to obtain (⁠ $U F o u t$ ⁠) not only through two components of the conventional FNO but also via the U-net (⁠ $U N E T$ ⁠).
The final output, which is obtained by the last U-Fourier layer (⁠ $U F o u t_{N}$ ⁠), is then mapped back to the target dimension using a fully connected layer (⁠ $F C 2$ ⁠). This network includes two fully connected layers and a $R e L U$ function, which enables it to predict the wavefield at the current time.

Similar to Equation (6), the learning processes of FNO and U-FNO can be mathematically represented as follows:

\begin{array}{r} {\begin{array}{c} F o u t_{L} = R e L U (\begin{matrix} W_{x, z} (F C 1 (v_{x (i - 1)} (x, z) + v_{z (i - 1)} (x, z))) + \\ F_{x, z}^{- 1} (R_{k_{x}, z_{Σ}} \cdot F_{x, z} (F C 1 (v_{x (i - 1)} (x, z) + v_{z (i - 1)} (x, z)))) \end{matrix}), \\ UFou t_{N} = R e L U (W_{x, z} \cdot Fout_{L} + F_{x_{z} z}^{- 1} (R_{k_{x}, k_{z}} \cdot F_{x, z} \cdot Fout_{L}) + Unet \cdot Fout_{L}), \end{array} \end{array}

(9)

The final predicted wavefield at this time step (i) is.

\begin{array}{r} [v_{x (i)} (x, z), v_{z (i)} (x, z)] = F C 2 \cdot UFou t_{N} . \end{array}

(10)

Through training U-FNO with snapshots of solutions to this complex equation, the resulting U-FNO becomes a PDE solver capable of generating accurate solutions for Equation (1). By comparing Equations (9) and (6), it is demonstrated that the features of the wavefield snapshots extracted by the U-net in the U-FNO method can be used as regularization terms to mitigate the crosstalk problem between the two component datasets. The layer number of Fourier (L) and U-Fourier (N) is often tuned based on the characteristics and requirements of the specific problem being solved. After many experiments, we find that employing two Fourier layers followed by two U-Fourier layers can obtain the optimal performance for solving a fractional viscoelastic equation. The input and training approach utilized in this method play a critical role in the training process. Figure 2 offers a schematic depiction of the input and output within the U-FNO (taking snapshots $N_{t}$ = 3 with as an example). During training, we utilize $N_{t}$ true snapshots in a recursive method to generate the subsequent one snapshot. Meanwhile, in the prediction process, we employ $N_{t}$ snapshots, including snapshots of the true and predicted wavefields, to generate the subsequent wavefield snapshot. It is worth noting that in the recursive prediction process, the true wavefield snapshots are decreasing.

Figure 2.

Schematic description of input and output in training and prediction (number of snapshots is three).

Open in new tab Download slide

3. Numerical examples

In this section, the capability of U-FNO to predict seismic wavefield propagation in viscoelastic media is demonstrated by a homogeneous model and a partial Hess model. The SGPS-based method, in conjunction with PML boundary conditions, is employed to solve the fractional viscoelastic wave equation for making dataset. These methodologies allow us to generate datasets for training, validation, and prediction. Importantly, informed by insights from previous research and extensive experiments, we set the dimensions of the Fourier and U-Fourier layers to 12 × 20. Additionally, we use structural similarity (SSIM) and signal-to-noise ratio (SNR) (see Appendix A) to evaluate the quality of the predicted wavefield snapshots derived from both the homogeneous and partial Hess models.

3.1. The test on a homogeneous model

For the first test, a 2D homogeneous model is employed. This model is designed with a grid size of 100 × 100 and a grid spacing of 10 m. In Table 1, the parameters for velocity and the quality factor Q associated with P- and S-waves are detailed. We use a Ricker wavelet at 35 Hz with an interval of 1 ms, and a maximum sampling time of 0.24 s as the source. Figure 3 provides a visual representation of the distribution of training, validation, and prediction sets. This figure depicts the even distribution of 625 sources across the velocity model, each spaced at 40 m. From shots 100 to 500 (highlighted in yellow), the 314th shot, located at the model's centre and marked by a red star, is selected to evaluate the prediction performance. Here, 80% of the yellow region is randomly allocated to the training dataset, while the remaining 20% is allocated to the validation dataset. These datasets are compiled with a maximum time interval limited to 0.18 s, and particle velocity snapshots are resampled every 10 ms.

Figure 3.

A visual depiction of the distribution of the training, validation, and prediction sets.

Open in new tab Download slide

Table 1.

Open in new tab

Model parameters of the 2D homogeneous model.

Superscript	$V p$ (m/s)	$V s$ (m/s)	$Q p$	$Q s$
Parameter	3500	2500	40	30

Table 1.

Open in new tab

Model parameters of the 2D homogeneous model.

Superscript	$V p$ (m/s)	$V s$ (m/s)	$Q p$	$Q s$
Parameter	3500	2500	40	30

Respectively training by FNO-based and U-FNO-based approaches, we present their predictions for the $v_{x}$ and $v_{z}$ components in Figs. 4 and 5. Figures 4b, d, f, and h and 5b, d, f, and h demonstrate the predictive capabilities of both FNO-based and U-FNO-based approaches in simulating the reflected wavefield, direct wavefield, and the boundaries condition. Our research evaluates the impact of the number with wavefield snapshots on prediction. Through a comparison of Fig. 4b and f with 4d and h, a distinct improvement in P-wave amplitude accuracy emerges with the increment in the number of particle velocity inputs within both the FNO and U-FNO. Meanwhile, the difference (Fig. 4c, e, g, and i) between true wavefield snapshots and the corresponding predicted wavefield snapshots provides additional confirmation of this observation. During the subsequent analysis of Fig. 4b and d and 4f and h, employing an equal number of inputting wavefield snapshots, it becomes evident that the U-FNO amplifies prediction accuracy in comparison to the FNO method. Furthermore, it is worth noting that utilizing a separate wavefield snapshot as the input for the FNO method leads to inadequate prediction accuracy of P-wavefield snapshots. By contrast, U-FNO-based methods can achieve high-precision predictions of wavefields using just a single wavefield snapshot as the input (Fig. 4f). Figure 5 shows the prediction of particle velocity $v_{z}$ ⁠, similar with the earlier observations made regarding particle velocity. Simultaneously, through a comparison of the difference in Figs. 4 and 5, it becomes apparent that the predictive accuracy of $v_{z}$ is relatively lower when contrasted with $v_{x}$ ⁠, especially in the FNO-based method. This is because the $v_{z}$ component of particle velocity experiences more frequent event polarity reversals compared to the $v_{x}$ component, resulting in relatively poorer event continuity. Consequently, the $v_{z}$ component is more challenging to predict than the $v_{x}$ component. However, using the U-FNO-based method can further improve the prediction accuracy of the $v_{z}$ component.

$Snapshots of particle velocity ${{v}_x}$ in the homogeneous model: true particle velocity (a); predicted particle velocity by FNO with inputting one-particle velocity (b) and three-particle velocity (d); predicted particle velocity by U-FNO with inputting one-particle velocity (c) and three-particle velocity (e); the difference between (b) and (a), (c) and (a), (d) and (a), (e) and (a), respectively (f–i).$

Figure 4.

Snapshots of particle velocity $v_{x}$ in the homogeneous model: true particle velocity (a); predicted particle velocity by FNO with inputting one-particle velocity (b) and three-particle velocity (d); predicted particle velocity by U-FNO with inputting one-particle velocity (c) and three-particle velocity (e); the difference between (b) and (a), (c) and (a), (d) and (a), (e) and (a), respectively (f–i).

Open in new tab Download slide

$Snapshots of particle velocity ${{v}_z}$ in the homogeneous model: true particle velocity (a); predicted particle velocity by FNO with inputting one-particle velocity (b) and three-particle velocity (d); predicted particle velocity by U-FNO with inputting one-particle velocity (c) and three-particle velocity (e); the difference between (b) and (a), (c) and (a), (d) and (a), (e) and (a), respectively (f–i).$

Figure 5.

Snapshots of particle velocity $v_{z}$ in the homogeneous model: true particle velocity (a); predicted particle velocity by FNO with inputting one-particle velocity (b) and three-particle velocity (d); predicted particle velocity by U-FNO with inputting one-particle velocity (c) and three-particle velocity (e); the difference between (b) and (a), (c) and (a), (d) and (a), (e) and (a), respectively (f–i).

Open in new tab Download slide

To provide further validation for the effectiveness of our method, we enhance our analysis by vertical profiles of particle velocity snapshots. Figure 6 illustrates the particle velocity $v_{x}$ at CDP of 0.5 km, and Fig. 7 showcases the particle velocity $v_{z}$ at CDP of 0.3 km. After making a comprehensive comparison between Figs. 6 and 7, it becomes clear that both the U-FNO-based method and the number of inputting particle velocities enhance the precision in predicting particle velocities (⁠ $v_{x}$ ⁠, $v_{z}$ ⁠). Remarkably, even with a single input particle velocity, the predictions generated using the U-FNO exhibit a better similarity with the true wavefield. This shows the exceptional capacity of the U-FNO to provide accurate and reliable predictions.

$Comparison of the vertical profiles in particle velocity ${{v}_x}$ at 0.5 km: vertical profiles of true particle velocity and predicted particle velocity by FNO with inputting one-particle velocity (a) and inputting three-particle velocity (c); vertical profiles of true particle velocity and predicted particle velocity by U-FNO with inputting one-particle velocity (b) and inputting three-particle velocity (d).$

Figure 6.

Comparison of the vertical profiles in particle velocity $v_{x}$ at 0.5 km: vertical profiles of true particle velocity and predicted particle velocity by FNO with inputting one-particle velocity (a) and inputting three-particle velocity (c); vertical profiles of true particle velocity and predicted particle velocity by U-FNO with inputting one-particle velocity (b) and inputting three-particle velocity (d).

Open in new tab Download slide

$Comparison of the vertical profiles in particle velocity ${{v}_z}$ at 0.5 km: vertical profiles of true particle velocity and predicted particle velocity by FNO with inputting one-particle velocity (a) and inputting three-particle velocity (c); vertical profiles of true particle velocity and predicted particle velocity by U-FNO with inputting one-particle velocity (b) and inputting three-particle velocity (d).$

Figure 7.

Comparison of the vertical profiles in particle velocity $v_{z}$ at 0.5 km: vertical profiles of true particle velocity and predicted particle velocity by FNO with inputting one-particle velocity (a) and inputting three-particle velocity (c); vertical profiles of true particle velocity and predicted particle velocity by U-FNO with inputting one-particle velocity (b) and inputting three-particle velocity (d).

Open in new tab Download slide

Furthermore, Table 2 presents the computed SSIM and SNR values, providing additional support for the aforementioned observation. Simultaneously, a clear trend emerges from the information in Table 2 is that the prediction of $v_{z}$ poses a more complex prediction, compared with the prediction of $v_{x}$ within both the FNO-based and U-FNO-based methods. Figure 8 illustrates the contrast in training and validation losses between FNO and U-FNO, utilizing one (Fig. 8a) and three (Fig. 8b) input wavefield snapshots. Compared with the FNO-based method, the U-FNO-based approach reveals better convergence. Moreover, as the number of input wavefields increases, both the U-FNO and FNO methods exhibit substantial convergence enhancements for the uncomplicated homogeneous model.

Figure 8.

Training losses and validation losses by inputting one snapshot of particle velocity (a) and three snapshots of particle velocity (b).

Open in new tab Download slide

Table 2.

Open in new tab

SSIM and SNR of the prediction by FNO and U-FNO with the homogeneous model.

		FNO		U-FNO
Superscript		One snapshot	Three snapshots	One snapshot	Three snapshots
SSIM	$v_{x}$	0.3075	0.6504	0.6944	0.8805
	$v_{z}$	0.2924	0.6113	0.6373	0.8773
SNR	$v_{x}$	13.75	18.96	19.68	28.57
	$v_{z}$	9.71	16.19	16.62	25.10

		FNO		U-FNO
Superscript		One snapshot	Three snapshots	One snapshot	Three snapshots
SSIM	$v_{x}$	0.3075	0.6504	0.6944	0.8805
	$v_{z}$	0.2924	0.6113	0.6373	0.8773
SNR	$v_{x}$	13.75	18.96	19.68	28.57
	$v_{z}$	9.71	16.19	16.62	25.10

Table 2.

Open in new tab

SSIM and SNR of the prediction by FNO and U-FNO with the homogeneous model.

		FNO		U-FNO
Superscript		One snapshot	Three snapshots	One snapshot	Three snapshots
SSIM	$v_{x}$	0.3075	0.6504	0.6944	0.8805
	$v_{z}$	0.2924	0.6113	0.6373	0.8773
SNR	$v_{x}$	13.75	18.96	19.68	28.57
	$v_{z}$	9.71	16.19	16.62	25.10

		FNO		U-FNO
Superscript		One snapshot	Three snapshots	One snapshot	Three snapshots
SSIM	$v_{x}$	0.3075	0.6504	0.6944	0.8805
	$v_{z}$	0.2924	0.6113	0.6373	0.8773
SNR	$v_{x}$	13.75	18.96	19.68	28.57
	$v_{z}$	9.71	16.19	16.62	25.10

3.2. The test on a partial Hess model

To evaluate the adaptability of U-FNO to complex models, we select a partial Hess model for assessing the accuracy of the predicted particle velocities (⁠ $v_{x}$ ⁠, $v_{z}$ ⁠). Figure 9 displays the P- and S-wave velocities, along with the quality factors Q. This partial Hess model is divided into a 100 × 100 grid with an 8-m grid interval. A Ricker wavelet with main frequency of 35 Hz serves as the source wavelet. Its maximum sampling time and time interval are 0.25 s and 0.5 ms, respectively. To improve prediction accuracy, we set the inputting snapshot to 5. The particle velocity snapshots are taken every 3 ms for resampling. The settings of the training dataset and the validation dataset are similar to those of the homogeneous model (Fig. 3). These datasets have a maximum sampling time of 0.13 s.

Figure 9.

Partial Hess model: (a) V_p; (b) V_s; (c) Q_p; and (d) Q_s.

Open in new tab Download slide

Figures 10 and 11 provide comparisons of predicted particle velocity snapshots at 0.15 s, a time step that extends beyond the training period respectively. In Figure 10b and d, both methods reconstruct the particle velocity snapshots of the complex model. However, on comparing Fig. 10c with e, it becomes evident that the difference between the predicted particle velocity snapshots $v_{x}$ obtained through the U-FNO and the true particle velocity is reduced compared to the difference observed in the FNO method. This observation indicates that the U-FNO approach demonstrates enhanced accuracy in predicting the particle velocity $v_{x}$ ⁠. for complex models. This trend is similarly noticeable in the prediction of the particle velocity $v_{z}$ ⁠. On comparing Figs. 10 and 11, it becomes evident that predicting the particle velocity $v_{z}$ is more challenging than predicting $v_{x}$ for complex models. Nevertheless, U-FNO exhibits superior performance in predicting the particle velocity $v_{z}$ ⁠. To achieve a clear comparison of the accuracy for particle velocity snapshots within this complex model, we have extracted vertical profiles of particle velocity $v_{z}$ at 0.6 km and particle velocity $v_{x}$ at 0.7 km, as illustrated in Fig. 12. When Fig. 12 parts a–c are compared with Fig. 12 parts b–d, it becomes clear that the particle velocity snapshots (⁠ $v_{z}$ ⁠, $v_{x}$ ⁠) predicted by U-FNO bear a stronger resemblance to the true particle velocity snapshots (⁠ $v_{z}$ ⁠, $v_{x}$ ⁠) in comparison to the predictions made by FNO. We also provide a comparison of the training and validation losses for both FNO and U-FNO using the partial Hess model in Fig. 13. For complex models, U-FNO also demonstrates enhanced generalization and convergence capabilities in comparison to FNO.

$Snapshots of particle velocity ${{v}_x}$ in a partial Hess model: true particle velocity (a); predicted particle velocity by FNO-based method (b) and U-FNO-based method (c); the differences between (b) and (a), (c) and (a) (d–e).$

Figure 10.

Snapshots of particle velocity $v_{x}$ in a partial Hess model: true particle velocity (a); predicted particle velocity by FNO-based method (b) and U-FNO-based method (c); the differences between (b) and (a), (c) and (a) (d–e).

Open in new tab Download slide

$Snapshots of particle velocity ${{v}_z}$ in partial Hess model: true particle velocity (a); predicted particle velocity by FNO-based method (b) and U-FNO-based method (c); the differences between (b) and (a), (c) and (a) (d–e).$

Figure 11.

Snapshots of particle velocity $v_{z}$ in partial Hess model: true particle velocity (a); predicted particle velocity by FNO-based method (b) and U-FNO-based method (c); the differences between (b) and (a), (c) and (a) (d–e).

Open in new tab Download slide

$Comparison of the vertical profiles in true and predicted particle velocity ${{v}_x}$ by FNO (a) and U-FNO (b); comparison of the vertical profiles in true and predicted particle velocity ${{v}_z}$ by FNO (c) and U-FNO (d).$

Figure 12.

Comparison of the vertical profiles in true and predicted particle velocity $v_{x}$ by FNO (a) and U-FNO (b); comparison of the vertical profiles in true and predicted particle velocity $v_{z}$ by FNO (c) and U-FNO (d).

Open in new tab Download slide

Figure 13.

Training and validation losses of epochs for partial Hess model.

Open in new tab Download slide

The differences observed in Figs. 14c and f and 15c and f indicate that as seismic waves propagate, both methods experience errors after a long period of propagation. However, compared to traditional FNO-based methods, our proposed method maintains commendable predictive performance. Simultaneously, we further evaluate the training performance of our proposed method on wavefields with source locations distant from the training set. We select the wavefield snapshots from the 614th source location as the prediction set. As shown in Fig. 16, the wavefield of the 614th source is already distant from the training set, and its prediction performance can well illustrate the generalization ability of our method. From the wavefield snapshot predicted at 0.17 s for the 614th shot (Fig. 17), it can be observed that even when far from the training region, our proposed method demonstrates good prediction performance. This indicates that our method has better generalization. U-FNO-based method exhibits better predictive performance in component (⁠ $v_{x}$ ⁠, $v_{z}$ ⁠) than FNO-based method. From Table 3, SSIM and PSNR demonstrate that as seismic waves propagate over a long period, the prediction accuracy of both FNO and U-FNO decreases to varying degrees. However, the U-FNO-based method achieves better performance in prediction. In addition, the SSIM and SNR values at the 614th frame at 0.17s indicate that the U-FNO-based method has higher accuracy than the FNO-based method when away from the training data area. Observations in Table 3 indicate that our proposed method has better generalization ability and can make better predictions for data that is longer in duration and farther away from the training area.

$Snapshots of particle velocity in 314th shot at 0.17 s: true particle velocity ${{v}_z}$(a) and ${{v}_x}$(f), predicted particle velocity ${{v}_z}$(b) and ${{v}_x}$(g) by FNO-based method, the difference in ${{v}_z}$(c) and ${{v}_x}$(h) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity ${{v}_z}$(d) and ${{v}_x}$(i) by U-FNO-based method, the difference in ${{v}_z}$(e) and ${{v}_x}$(j) between true particle velocity and predicted particle velocity by U-FNO-based method.$

Figure 14.

Snapshots of particle velocity in 314th shot at 0.17 s: true particle velocity $v_{z}$ (a) and $v_{x}$ (f), predicted particle velocity $v_{z}$ (b) and $v_{x}$ (g) by FNO-based method, the difference in $v_{z}$ (c) and $v_{x}$ (h) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity $v_{z}$ (d) and $v_{x}$ (i) by U-FNO-based method, the difference in $v_{z}$ (e) and $v_{x}$ (j) between true particle velocity and predicted particle velocity by U-FNO-based method.

Open in new tab Download slide

$Snapshots of particle velocity in 314th shot at 0.19 s: true particle velocity ${{v}_z}$(a) and ${{v}_x}$(f), predicted particle velocity ${{v}_z}$(b) and ${{v}_x}$(g) by FNO-based method, the difference in ${{v}_z}$(c) and ${{v}_x}$(h) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity ${{v}_z}$(d) and ${{v}_x}$(i) by U-FNO-based method, the difference in ${{v}_z}$(e) and ${{v}_x}$(j) between true particle velocity and predicted particle velocity by U-FNO-based method.$

Figure 15.

Snapshots of particle velocity in 314th shot at 0.19 s: true particle velocity $v_{z}$ (a) and $v_{x}$ (f), predicted particle velocity $v_{z}$ (b) and $v_{x}$ (g) by FNO-based method, the difference in $v_{z}$ (c) and $v_{x}$ (h) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity $v_{z}$ (d) and $v_{x}$ (i) by U-FNO-based method, the difference in $v_{z}$ (e) and $v_{x}$ (j) between true particle velocity and predicted particle velocity by U-FNO-based method.

Open in new tab Download slide

Figure 16.

A visual depiction of the distribution of 614th shot prediction dataset.

Open in new tab Download slide

$Snapshots of particle velocity in 614th shot at 0.17 s: True particle velocity ${{v}_z}$(a) and ${{v}_x}$(f), predicted particle velocity ${{v}_z}$(b) and ${{v}_x}$(g) by FNO-based method, the difference in ${{v}_z}$(c) and ${{v}_x}$(h) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity ${{v}_z}$(d) and ${{v}_x}$(i) by U-FNO-based method, the difference in ${{v}_z}$(e) and ${{v}_x}$(j) between true particle velocity and predicted particle velocity by U-FNO-based method.$

Figure 17.

Snapshots of particle velocity in 614th shot at 0.17 s: True particle velocity $v_{z}$ (a) and $v_{x}$ (f), predicted particle velocity $v_{z}$ (b) and $v_{x}$ (g) by FNO-based method, the difference in $v_{z}$ (c) and $v_{x}$ (h) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity $v_{z}$ (d) and $v_{x}$ (i) by U-FNO-based method, the difference in $v_{z}$ (e) and $v_{x}$ (j) between true particle velocity and predicted particle velocity by U-FNO-based method.

Open in new tab Download slide

Table 3.

Open in new tab

SSIM and SNR of the prediction by FNO and U-FNO with the partial Hess model.


	SSIM				SNR

	$v_{x}$		$v_{z}$		$v_{x}$		$v_{z}$

Superscript	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO
314th at 0.15 s	0.8877	0.9366	0.8668	0.9145	32.81	37.37	29.1	34.06
314th at 0.17 s	0.8841	0.9357	0.8652	0.9131	30.1	35.87	26.59	32.96
314th at 0.19 s	0.8833	0.9342	0.8632	0.9127	28.13	34.91	24.59	31.33
614th at 0.17 s	0.6463	0.7248	0.6063	0.7163	26.85	29.47	23.73	27.3


	SSIM				SNR

	$v_{x}$		$v_{z}$		$v_{x}$		$v_{z}$

Superscript	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO
314th at 0.15 s	0.8877	0.9366	0.8668	0.9145	32.81	37.37	29.1	34.06
314th at 0.17 s	0.8841	0.9357	0.8652	0.9131	30.1	35.87	26.59	32.96
314th at 0.19 s	0.8833	0.9342	0.8632	0.9127	28.13	34.91	24.59	31.33
614th at 0.17 s	0.6463	0.7248	0.6063	0.7163	26.85	29.47	23.73	27.3

Table 3.

Open in new tab

SSIM and SNR of the prediction by FNO and U-FNO with the partial Hess model.


	SSIM				SNR

	$v_{x}$		$v_{z}$		$v_{x}$		$v_{z}$

Superscript	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO
314th at 0.15 s	0.8877	0.9366	0.8668	0.9145	32.81	37.37	29.1	34.06
314th at 0.17 s	0.8841	0.9357	0.8652	0.9131	30.1	35.87	26.59	32.96
314th at 0.19 s	0.8833	0.9342	0.8632	0.9127	28.13	34.91	24.59	31.33
614th at 0.17 s	0.6463	0.7248	0.6063	0.7163	26.85	29.47	23.73	27.3


	SSIM				SNR

	$v_{x}$		$v_{z}$		$v_{x}$		$v_{z}$

Superscript	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO	FNO	U-FNO
314th at 0.15 s	0.8877	0.9366	0.8668	0.9145	32.81	37.37	29.1	34.06
314th at 0.17 s	0.8841	0.9357	0.8652	0.9131	30.1	35.87	26.59	32.96
314th at 0.19 s	0.8833	0.9342	0.8632	0.9127	28.13	34.91	24.59	31.33
614th at 0.17 s	0.6463	0.7248	0.6063	0.7163	26.85	29.47	23.73	27.3

3.3. The test on a partial Marmousi model

To verify the applicability of the proposed method for complex models, we apply the partial Marmousi model in this section. Figure 18 presents the P- and S-wave velocities of the partial Marmousi model, along with the quality factor Q. All these models are divided into a 100 × 100 grid with grid intervals of 10 m. A Ricker wavelet, with a central frequency of 30 Hz, is used as the source wavelet. The maximum sampling duration and the time interval are 0.2 s and 0.5 ms, respectively. In this model test, we load the seismic source simultaneously on both the components (⁠ $v_{x}$ ⁠, $v_{z}$ ⁠) to verify the reason for the low prediction accuracy of the $v_{z}$ component. The input snapshots are set to 5. The particle velocity snapshots are captured every 3 ms for resampling. The settings for the training dataset and validation dataset are shown in Fig. 3. These datasets have a maximum sampling time of 0.13 s.

Figure 18.

Partial Marmousi model: (a) V_p; (b) V_s; (c) Q_p; and (d) Q_s.

Open in new tab Download slide

Figures 19 and 20 compare the predicted particle velocity snapshots at 0.17 s, which is a time step beyond the training period. From Fig. 19b and d, it can be observed that both FNO and U-FNO methods can be effectively applied to complex models. However, by comparing Fig. 19c and e, it is evident that the differences obtained using the U-FNO method are significantly smaller than those using the FNO method. This indicates that our proposed method has higher prediction accuracy. Similar observations can also be seen in Fig. 20. It is worth noting that the difference in the $v_{z}$ component of this model does not significantly increase compared to the $v_{x}$ component. This is because when the seismic source loads both the components (⁠ $v_{x}$ ⁠, $v_{z}$ ⁠) simultaneously, the continuity of events in the $v_{z}$ component is similar to that in the $v_{x}$ component, and there is no significant polarity reversal. In addition, we also use the SSIM and SNR in Table 4 to verify the superiority of our proposed method. As shown in Table 4, the SSIM and SNR of our proposed method for multicomponent prediction are both higher than those of the traditional FNO method. This indicates that our proposed method has higher prediction accuracy.

$Snapshots of particle velocity in 314th shot at 0.17 s: True particle velocity ${{v}_x}$(a), predicted particle velocity ${{v}_x}$(b) by FNO-based method, the difference in ${{v}_x}$(c) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity ${{v}_x}$(d) by U-FNO-based method, the difference in ${{v}_x}$(e) between true particle velocity and predicted particle velocity by U-FNO-based method.$

Figure 19.

Snapshots of particle velocity in 314th shot at 0.17 s: True particle velocity $v_{x}$ (a), predicted particle velocity $v_{x}$ (b) by FNO-based method, the difference in $v_{x}$ (c) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity $v_{x}$ (d) by U-FNO-based method, the difference in $v_{x}$ (e) between true particle velocity and predicted particle velocity by U-FNO-based method.

Open in new tab Download slide

$Snapshots of particle velocity in 314th shot at 0.17 s: true particle velocity ${{v}_z}$(a), predicted particle velocity ${{v}_z}$(b) by FNO-based method, the difference in ${{v}_z}$(c) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity ${{v}_z}$(d) by U-FNO-based method, the difference in ${{v}_z}$(e) between true particle velocity and predicted particle velocity by U-FNO-based method.$

Figure 20.

Snapshots of particle velocity in 314th shot at 0.17 s: true particle velocity $v_{z}$ (a), predicted particle velocity $v_{z}$ (b) by FNO-based method, the difference in $v_{z}$ (c) between true particle velocity and predicted particle velocity by FNO-based method, predicted particle velocity $v_{z}$ (d) by U-FNO-based method, the difference in $v_{z}$ (e) between true particle velocity and predicted particle velocity by U-FNO-based method.

Open in new tab Download slide

Table 4.

Open in new tab

SSIM and SNR of the prediction by FNO and U-FNO with the partial Marmousi model.

Superscript		FNO	U-FNO
SSIM	$v_{z}$	0.8490	0.8994
	$v_{x}$	0.8463	0.9012
SNR	$v_{z}$	23.51	29.67
	$v_{x}$	24.99	30.42

Table 4.

Open in new tab

SSIM and SNR of the prediction by FNO and U-FNO with the partial Marmousi model.

Superscript		FNO	U-FNO
SSIM	$v_{z}$	0.8490	0.8994
	$v_{x}$	0.8463	0.9012
SNR	$v_{z}$	23.51	29.67
	$v_{x}$	24.99	30.42

3.4. Computational performance

For a more comprehensive comparison of computational performance, we have detailed the computational costs in Table 5, with all training, testing, and prediction processes conducted on a GPU (GeForce RTX 3060). The computational performance of the traditional method is not included in Table 4 due to its implementation on the GPU using C code, rendering comparisons with CUDA-based Python implementations less meaningful without adequate context. Notably, the computational efficiency of the FNO method in generating wavefields has shown significant acceleration compared to traditional methods once the network is adeptly trained. This efficiency improvement is quantified as an enhancement of two to three orders of magnitude in computational speed. In our key comparison, we evaluated the computational performance of dataset prediction on the GPU for both FNO and U-FNO methods. We found that the computational cost for U-FNO is 0.61 times higher than that of the FNO method. This increase can be ascribed to the integration of an additional U-net layer in conjunction with the conventional Fourier layers. Nonetheless, U-FNO demonstrates a notable improvement in computational performance when contrasted with the traditional SGPS method.

Table 5.

Open in new tab

Computational performance of FNO and U-FNO.

Method	Number of trainable parameters	Time (s)
FNO	926 417	0.00 353
U-FNO	1 156 337	0.00 571

Table 5.

Open in new tab

Computational performance of FNO and U-FNO.

Method	Number of trainable parameters	Time (s)
FNO	926 417	0.00 353
U-FNO	1 156 337	0.00 571

4. Conclusions

Our research aims to develop a more accurate surrogate model to approximate the solution of viscoelastic wave equations while maintaining an acceptable computational cost. Compared with the traditional FNO-based method, our method introduces U-Fourier layers subsequent to the standard Fourier layers. This improvement effectively approximates the nonlinear mapping operator that governs solution of the equation through data-driven training, thereby markedly improving the accuracy solution of equations. Notably, our approach retains the benefits of the conventional FNO-based method, including obviating the need for the fractional Laplace operator and enhancing computational efficiency. Numerical examples show that our method exhibits better performance in approximating solutions to fractional viscoelastic equations compared to traditional FNO-based methods. Additionally, it can prove significant robustness and generalization ability by predicting solutions for multicomponent fractional viscoelastic equations, even beyond the confines of the dataset. The proposed method demonstrates significant advantages in approximating wave-equation solutions, showing potential for applications that require multiple iterations of simulation. In the future, by optimizing the network and integrating the velocity parameter model, this method could be further enhanced and more effectively applied to the field of parameter inversion.

Acknowledgements

Many colleagues have helped with suggestions for improving this papers.

Conflict of interest statement

The authors declared that they do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Funding

This work is supported by the National Natural Science Foundation of China (grant nos. 42274147 and 41874144).

Data availability

Data associated with this research are available and can be obtained by contacting the corresponding author.

Appendix A

SSIM function is a quantitative metric used to evaluate the similarity between the true snapshot (x) and the predicted snapshot (y). This evaluation is based on factors such as the local mean values (⁠ $μ_{x}$ and $μ_{y}$ ⁠), standard deviations (⁠ $σ_{x}^{2}$ and $σ_{y}^{2}$ ⁠), cross-covariances (⁠ $σ_{x y}$ ⁠) of the two models. $c_{1}$ and $c_{2}$ are regularization constants, with values of $c_{1} = 6.5025 \times 10^{- 8}$ and $c_{2} = 5.85225 \times 10^{- 7}$ ⁠, respectively. Meanwhile, SNR between the true snapshot (x) and the predicted snapshot (y) quantifies the ratio of signal power to noise power within the models. They are described as.

\begin{array}{r} S S I M (x, y) = \frac{(2 μ_{x} μ_{y} + c_{1}) (2 σ_{x y} + c_{2})}{(μ_{x}^{2} + μ_{y}^{2} + c_{1}) (σ_{x}^{2} + σ_{y}^{2} + c_{2})}, \end{array}

(A-1)

and

\begin{array}{r} S N R (y, x) = 10 \log_{10} \frac{x_{}^{2}}{y - x_{}^{2}} . \end{array}

(A-2)

Higher SSIM and SNR values indicate greater similarity between the true and predicted wavefield snapshots.

References

Alkhalifah

2000

An acoustic wave equation for anisotropic media

Geophysics

1239

–

Month:	Total Views:
October 2024	64
November 2024	25
December 2024	7
January 2025	112
February 2025	153
March 2025	125
April 2025	66
May 2025	4

Article Contents

Enhancing learning to solve multicomponent fractional viscoelastic equations with U-net Fourier neural operators

Abstract

1. Introduction

2. Methodology

2.1. The introduction of fractional viscoelastic equation

2.2. The review of FNO architecture for seismic modelling

2.3. The theory of U-FNO architecture for seismic modelling

3. Numerical examples

3.1. The test on a homogeneous model

3.2. The test on a partial Hess model

3.3. The test on a partial Marmousi model

3.4. Computational performance

4. Conclusions

Acknowledgements

Conflict of interest statement

Funding

Data availability

Appendix A

References

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only