This application relates to seismic exploration, particularly to an automatic seismic facies identification method based on combination of self-attention mechanism and U-shape network architecture, and more particularly to a seismic image semantic segmentation method combining Segformer self-attention segmentation network, U-shape network architecture, and Hypercolumn semantic segmentation, which can be applied to the automatic seismic facies identification and classification of seismic data.
With the increasing demand for oil and gas and the rapid development of artificial intelligence (AI), the intelligence and automation level of the oil-gas exploration technology has been continuously enhanced. At present, the oil-gas exploration is mainly dependent on the seismic exploration, in which the post-stack seismic data is collected through artificial seismic wave reflection, and the information of underground structure, lithology, and oil-gas potential is analyzed through multidisciplinary knowledge mining, thereby locating the distribution of underground petroleum reservoirs. Traditionally, the seismic facies classification is performed by manual interpretation or by semi-automatic feature extraction and seismic facies segmentation using some mathematical strategies. However, the manual interpretation is highly subjective and labor-consuming, and the semi-automatic methods are less accurate and time-effective, and thus fail to accurately locate the oil-gas reservoirs under complex underground structure and sedimentation conditions. Therefore, how to enable the efficient automatic identification of seismic facies with the help of computer resources has become a problem in the art that urgently needs to be solved. In order to improve the automatic seismic facies identification, considerable attempt has been made to investigate the seismic facies classification based on deep learning, in which a nonlinear mapping from seismic data to seismic facies labels is learned in an end-to-end manner based on the existing labeled seismic data and applied to new seismic data for seismic facies classification. This method can realize the end-to-end seismic facies classification with less labor consumption and improve the interpretation effect. Dramsch and Lüthje (2018) migrated the visual geometry group-16 (VGG16) network pre-trained by ImageNet to the manually-labeled seismic data, and identified the seismic facies in the center of a sliding window by means of a sliding window algorithm, thereby realizing the automatic seismic facies classification. Zhao (2018) employed a convolutional neural network (CNN) using encoder-decoder architectures for the seismic facies classification. Di et al. (2018) labeled 4 inline sections and adopted a U-Net-like network for automatic seismic facies identification. Although these deep learning-based seismic facies identification methods all have an improved identification accuracy, these methods still have the following disadvantages, and thus fail to reach the efficient and accurate seismic facies interpretation:
In view of the deficiencies in the prior art, this application provides an automatic seismic facies identification method based on combination of Self-Attention mechanism and U-shape network architecture, in which an encoding-decoding U-shape architecture is introduced based on the self-attention-based semantic segmentation network; the semantic segmentation network is used as an encoder module, and a patch expanding module based on the fully-connected (FC) layer for upsampling is introduced in the decoder; and the feature fusion is performed by Hypercolumn, thereby enabling the global attention-based seismic facies segmentation involving lower calculation amount and multi-scale feature extraction.
In a first aspect, this application provides an automatic seismic facies identification method based on combination of Self-Attention mechanism and U-shape network architecture, including:
In an embodiment, the step of “obtaining and preprocessing post-stack seismic data to construct a sample training and validation dataset” includes:
In an embodiment, the step of “building an encoder by using an overlapped patch merging module and a self-attention transformer module” includes:
sAtt(x)=MHSA(LN(x))+x (1); and
FFN(x)=L2(cv(L1 (LN(x))))+x (2);
In an embodiment, the step of “building a decoder by using a patch expanding module, the self-attention transformer module, and a skip connection module” includes:
In an embodiment, the step of “building a seismic facies identification model by using the encoder, the decoder, and a Hypercolumn module” includes:
{circumflex over (d)}
(i)=Linear[Ci,C](d(i)) (5);
{circumflex over (d)}
(i)=Upsample[2i×]({circumflex over (d)}(i)) (6);
d
f=Linear[5C,C](Concat({circumflex over (d)}(i))) (7); and
M=Linear[C,NC](dƒ) (8);
In an embodiment, step (e) includes:
In an embodiment, when training the seismic facies identification model, parameters of the seismic facies identification model are subjected to iterative updating and learning by using a batch stochastic gradient descent (SGD) algorithm; and after training the seismic facies identification model, the size of the seismic section image is adjusted to a multiple of 16 to be input into the trained seismic facies identification model to predict a type of the seismic facies.
In an embodiment, the iterative updating and learning of the parameters of the seismic facies identification model is performed through steps of:
Compared to the prior art, this application has the following beneficial effects.
Regarding the automatic seismic facies identification method provided herein, the self-attention mechanism and the U-shape architecture are combined, and the Hypercolumn is adopted for automatic seismic facies segmentation and recognition. Moreover, a hybrid loss function is introduced to make the model more concerned with the continuity of segmentation. Compared with the existing U-shape network and Segformer models, the seismic facies identification network model provided by this application not only allows a lower calculation amount, but also reaches a higher seismic facies identification accuracy. Furthermore, based on the accurate seismic facies identification results, the location and structure of the underground sedimentary environment can be predicted more effectively, thereby providing favorable technical support and reference for the oil-gas exploration.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
The disclosure will be further described in detail below in conjunction with the embodiments and
The seismic interpretation of post-stack seismic data can be used as semantic segmentation of a seismic image to divide the seismic image into different areas, thereby reflecting the underground sedimentary environments at different locations, so as to be applicable to the location of oil and gas reservoirs. However, it is difficult to arrive at satisfactory seismic facies classification results by directly using the image segmentation methods since these methods do not involve the correlation modeling between pixels of the seismic image and are less effective in depicting the boundary details of some seismic facies types. Therefore, the attention deep neural network model is used to learn the regional association and continuity of seismic images, so as to describe the boundary details of the seismic facies more accurately. The present disclosure provides an automatic seismic facies identification method based on combination of Self-Attention mechanism and U-shape network architecture for predicting the seismic facies corresponding to seismic data, so as to perform seismic interpretation.
This application provides an automatic seismic facies identification method based on combination of Self-Attention mechanism and U-shape network architecture, which includes the following steps.
In this embodiment, a collection equipment collects seismic waves to obtain the original seismic data. The original seismic data can be superimposed to form a post-stack seismic data V∈I×C×D. In seismic exploration, for facilitating observation and analysis, the post-stack seismic data will be processed to improve the resolution of the post-stack seismic data. The original seismic data is a type of seismic data received by a source excitation.
In this embodiment, the self-attention transformer module is a semantic segmentation network.
In this embodiment, the step of “obtaining and preprocessing post-stack seismic data to construct a sample training and validation dataset” includes the following steps.
Original seismic data is collected and preprocessed to obtain the post-stack seismic data, which is recorded as V∈I×C×D, where I,C,D represents the number of Inline, the number of Crossline, and the number of sampling points within a certain period, respectively. The amplitude of the post-stack seismic data is normalized to [0, 1]. The post-stack seismic data is equally divided into ten section blocks along the crossline direction. The first 70% of the sub-blocks of each of the ten section blocks in the crossline direction are configured as a training set, and the last 30% of the sub-blocks are configured as a validation set. The size of the seismic section image inside each sub-block is adjusted, and the height and width (H, W) of the seismic section image are transformed to a multiple of 16 by using the linear interpolation method, that is, the resolution is transformed into
Finally, the seismic section image obtained after right-and-left flipping and Gaussian noise transformation is used as the input sample dataset of the model.
In this embodiment, the step of “building an encoder by using an overlapped patch merging module and a self-attention transformer module” includes the following steps.
For a seismic section image x∈H×W with an input height of H and an input width of W, an encoder composite function ƒe=ƒe4∘ƒe3∘ƒe2∘ƒe1 is built to satisfy
For the feature map
a first subfunction
is built. Ci is the number of channels of a feature map output by an i-th subfunction of the encoder. As shown in
In each of subcode blocks, the overlapped patch merging module performs matrix multiplication on the input data by means of overlapped sliding windows, thereby realizing the linear embedding of the feature map, which can downsample 2×the spatial resolution dimension of the input data. The self-attention transformer module contains a self-attention submodule and a feedforward neural network (FNN) submodule to learn global features and fusion features between different positions in the seismic section image. For the input feature map x(i) at the i-th stage, the calculation formula of subcode block is expressed as:
x
(i+1)=ƒei(x(i))=(FFN⋅sAtt)2⋅Conv(x(i)) (10).
In the formula (10), ⋅ is an operational character of composite function. Conv is the overlapped patch merging module obtained by using convolution operation. sAtt is a calculation function of the self-attention submodule. FFN is a calculation function of the feedforward neural network submodule.
The calculation formulas of the self-attention submodule and the FNN submodule are respectively expressed as:
sAtt(x)=MHSA(LN(x))+x (1); and
FFN(x)=L2(cv(Li(LN(x))))+x (2).
In the above formulas, LN is a layer normalization function. MHSA is a multi-head self-attention calculation function. L1 and L2 are two fully-connected layer functions. cv is a convolutional layer function used for position encoding.
In this embodiment, the step of “building a decoder by using a patch expanding module, the self-attention transformer module, and a skip connection module” includes the following steps.
For the multi-stage encoding feature {x(i)}i=14 encoded by the encoder, a decoder function ƒd is built to satisfy ƒd(x(1), x(2), x(3), x(4))∈H×W. The decoding process is symmetrical with the encoding process, and the decoder function ƒd includes four second subfunctions ƒd, and the four second subfunctions ƒdi constitute four stages of the decoder. For an encoding feature map and a decoding feature map
and decoding feature map
is obtained. Among them, i={2,3,4}, d(4)=x(4); and concat represents a tensor concatenation operation along a channel dimension. A feature map of a last stage of the decoder is expressed as d(0)=ƒd1([d(1)])∈H×W×C.
Each of the four second subfunctions ƒdi consists of the patch expanding module based on fully-connected layer upsample and two self-attention transformer modules which is called a sub-decoding block. As shown in
In above formulas, Linear represents a fully-connected layer, and Reshape represents a dimension reshaping operation.
Stages of the encoder is correspondingly connected to stages of the decoder by the “skip connection” structure to splice the features of the encoder and decoder, so that the decoder receives features of the encoder from corresponding stages of the encoder for fusion, so as to fuse the different semantic features of shallow coarse grain and high-level fine grain.
In this embodiment, the step of “building a seismic facies identification model by using the encoder, the decoder, and a Hypercolumn module” includes the following steps.
A Hypercolumn structure is introduced to fuse output feature maps {d(i)}i=04 of the four stages of the decoder to obtain a fused feature map. Pixel-level seismic facies classification is performed on the fused feature map. In this way, features with multiple levels are comprehensively used to classify to improve the effect of intensive prediction tasks. The calculation formulas are expressed as:
{circumflex over (d)}
(i)=Linear[Ci,C](d(i)) (5);
{circumflex over (d)}
(i)=Upsample[2i×]({circumflex over (d)}(i)) (6);
d
f=Linear[5C,C](Concat({circumflex over (d)}(i))) (7); and
M=Linear[C,NC](dƒ) (8).
In above formulas, Upsample[2ix] represents a bilinear interpolation for 2i×upsampling. Concat(⋅) represents a concatenation operation along the channel dimension. Linear [C,NC] Is a linear mapping from a dimension C to a dimension NC. NC is the number of seismic facies types. As show in
In this embodiment, step (e) includes the following steps.
The pixel-level cross-entropy loss (CE) is selected as the main optimization goal of model training, and Dice loss is used to assist the model to investigate regional correlation. A calculation formula of the hybrid loss is expressed as:
Loss=0.7*CE+0.3*Dice (9).
In the formula (9), CE(y,p)=Σi,jΣk=1Cyi,j(k)log pij(k).
y is a real seismic facies label of a seismic image. p is a predicted mask of the seismic image. yi,j represents a one-hot coding label corresponding to a pixel at i,j of the seismic image. pij(k) indicates a probability that the pixel at the i,j of the seismic image is predicted to be a kth-type seismic facies. When training the model, the Adam optimizer with a batch size of 8, an initial learning rate of 1e−3 and a weight decay of 1e−4 is used to train and learn the model.
In this embodiment, regarding the automatic seismic facies identification method, when training the seismic facies identification model, parameters of the seismic facies identification model are subjected to iterative updating and learning by using the batch stochastic gradient descent (SGD) algorithm.
After training the seismic facies identification model, the size of the seismic section image is adjusted to a multiple of 16 to be input into the trained seismic facies identification model to predict a type of the seismic facies.
In this embodiment, the iterative updating and learning of the parameters of the seismic facies identification model is performed through steps of: calculating a gradient of the hybrid loss function; and updating the parameters of the seismic facies identification model along a negative direction of the gradient to achieve a continuous descent of the hybrid loss function.
When the model is trained, for any seismic section, the size of the seismic section image is adjusted to a multiple of 16 by using the linear interpolation method and input into the model for prediction. The size of the output tensor of the model is [H, W, NC]. The largest probability index is taken as the predicted type of the seismic facies in the third dimension. A seismic facies matrix with [H, W] is finally output. The value of each position represents the type of seismic facies of the corresponding position of the input seismic image.
First, a public seismic dataset, which is obtained from Block F3 of North Sea in Netherlands labeled by Alaudah and AlRegib (2016), is used to verify the validity of the present disclosure. The parts of Inline 300-700 and Crossline 300-1000 were selected as the training set, and the rest was selected as the test set. Preprocessing was carried out according to step (b) of the method in description, then trained and tested according to step (e), and the trained model was obtained and evaluated.
Finally, a post-stack seismic data obtained from Bohai Bay Basin in China was used to further verify the validity of the disclosure. For the post-stack seismic data with a main frequency of 29 Hz and a frequency band of 6-52 Hz, the HUSeg model was used to identify the seismic facies of all inline sections of the seismic body, and the results were stitched into the seismic facies, and the sedimentary facies were explained by stratigraphic slices.
Number | Date | Country | Kind |
---|---|---|---|
202210759364.2 | Jun 2022 | CN | national |
This application is a continuation of International Patent Application No. PCT/CN2022/108319, filed on Jul. 27, 2022, which claims the benefit of priority from Chinese Patent Application No. 202210759364.2, filed on Jun. 30, 2022. The content of the aforementioned application, including any intervening amendments thereto, is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2022/108319 | Jul 2022 | US |
Child | 18321632 | US |