The present invention is related to a method for reconstructing at least one trace in a seismic image of a common receiver and time domain, the image comprising traces in time domain with seismic data and one or more traces to be reconstructed.
A first aspect of the invention is a method that is characterized by a specific use of a convolutional neural network trained under an unsupervised learning approach with a modified receptive field.
A second aspect of the invention is a deblending method based on the use of a reconstructing method according to the first aspect of the invention applied to a denoising step of a deblending process allowing a very effective data acquisition while keeping a high quality output data sets after being processed according to the first and/or second aspects of the invention.
One of the technical fields with a more intensive development is in the field of seismic data acquisition as daily exploration costs are very high.
Seismic data are occasionally uniformly distributed and sampled. In marine acquisitions, the seismic data are often sparse in one direction and dense in the other. Irregularity in marine data spatial distribution is inevitable due to cable drifting.
In land seismic data acquisitions, a large number of defective traces are not unusual, and big data gaps are also common due to various acquisition constraints.
When standard seismic processing tools are directly applied on these data, severe aliasing and strong artifacts are expected. The resulting image-quality degradation further leads to deteriorating amplitude versus off-set (AVO) analysis, ray density imbalance in velocity tomography, and deviating gradients in full Waveform Inversion (FWI) update.
A first technical challenge is posed, which consists in the reconstruction of the collected data to densely and uniformly gridded system.
Machine learning based interpolation methods have been disclosed as being more accurate and requires fewer human intervention compared with traditional interpolation methods. Some of these methods are supervised learning methods that need labeled seismic datasets (wherein labeled seismic dataset is interpreted in this context as a dataset where for each gappy seismic data is known the true value of the missing traces) and do not generalize well to new decimated/gappy datasets (testing set) with different characteristics to the training dataset. The existing unsupervised learning methods train generative models from random noise which has the downside that the interpolation result is not invariant to the random noise initialization, and the learned interpolation knowledge is hard to transfer across different shot-gather images. That is, every decimated shot-gather image requires an individual random noise generator wherein shot is the well-known term used for a seismic source.
A single shot-gather interpolation without information from neighboring shots is a challenging test problem.
Additionally, in traditional seismic acquisitions, the responses to different seismic sources are collected separately. Interferences among seismic responses are avoided by using large time intervals, typically from a few seconds to 30 seconds, between consecutive shots. Such separate-source acquisition methods incur high operating costs due to prolonged survey time and efforts. The aim for improving efficiency motivated the development of simultaneous-source acquisition.
A denser blended-acquisition is achieved in a reduced survey time by firing multiple shots using a random time-dithering scheme within a relatively short time interval. However, deblending is needed to separate simultaneous-source acquisition data into single-source responses to facilitate post-acquisition analysis.
Deblending is an under-determined problem and a first approximation is based on employing a so-called pseudo-deblending to obtain a solution using the least-squares method. Unfortunately, the pseudo-deblending procedure is not effective in removing unwanted interferences caused by blending noises.
Recent works in deblending methods focused either on attenuating the blending noise using filter-based strategies or separating the sources directly by formulating it as an inversion problem.
Therefore, a further challenge is to develop an effective deblending method for decoupling simultaneous-source acquisition data.
The present invention is a method for reconstructing at least one trace in a seismic image solving the first posed problem and, it can be further used for deblending seismic data.
A first aspect of the invention is a method for reconstructing at least one trace in a seismic image of a common-receiver or, when receivers are uniformly spaced, common mid-point receiver or a common offset receiver; and, time domain, the image comprising traces in time domain with seismic data and at least one trace to be reconstructed. Common mid-point receiver is also known as common mid-point gather and common offset receiver is also known as common offset gather.
The method overcomes the identified drawback by carrying out the following steps:
deploying a convolutional neural network for predicting a trace, the convolutional neural network comprising at least one layer, wherein the at least one layer of the convolutional neural network comprises a kernel function with a blind-trace receptive field covering adjacent traces;
training the convolutional neural network using a plurality of traces of the seismic image, the blind-trace receptive field of the kernel function covering adjacent traces with data but not covering the trace located at the position of the trace to be reconstructed;
inputting the seismic image into the convolutional neural network predicting the value of the at least one trace to be reconstructed;
assigning the predicted value of the at least one trace to be reconstructed in the seismic image at the location of said trace to be reconstructed.
A seismic trace is a time series response to seismic sources recorded at a receiver position. According to the prior art, when the acquisition is carried out with separate-sources, the seismic data in the time domain can be represented as a multidimensional array Pijkb=(xis, xjr, tk); where xis is the ith source position, xjr is the jth receiver position and tk is the kth time sample.
The shape of the array P is ns× nr× nt where ns is the number of unblended shots, nr is the number of receivers and nt is the length of the time series.
The order of the indices in the P matrix can be taken in different order, the important thing is that one index is related to the receivers, another index is related to the sources and the remaining index is related to time. Any other order will result in operations that must take into account the order of the indices. Operations that lead to the same result by exchanging this order but are consistent with the chosen order are considered equivalent.
The P matrix can be represented as a three-dimensional volume where for example the time axis is vertical. From this matrix it is possible to define at least two set of 2D images, those corresponding to the values determined by planes parallel to the time and receiver number axes and those corresponding to the values determined by planes parallel to the time and source number axes. The first set is identified as a common-shot and time domain and the second set is identified as common-receiver and time domain. Additionally, it is possible to define two set of 2D images, those corresponding to the values determined by planes parallel to the time axis and constant sums of the source and receiver numbers, and those corresponding to the values determined by planes parallel to the time axis and constant differences of the source and receiver numbers. In the previous sentence, the first set is identified as common-midpoint and time domain and the second set is identified as common-offset and time domain. These two planes may be defined when receivers are uniformly spaced.
According to the first aspect of the invention, given a common-receiver and time domain image extracted from an acquired data set comprising traces in time domain with seismic data and at least one trace to be reconstructed, the method provides a reconstructed image.
According to the first step, a convolutional neural network is deployed intended for predicting a trace to be reconstructed. For instance, a trace that is not available from the data set acquired from a seismic survey. The convolutional neural network comprises at least one layer and the at least one layer comprises a kernel function with a blind-trace receptive field covering adjacent traces excluding the trace whose receptive field is being considered.
The image values are discrete values that correspond to a discrete time value and a discrete value that identifies either the receiver position number or the source number. From now on we will use either the matrix value or the pixel value, understanding that its implementation is by means of an image.
The reception field of a trace of the at least one layer will be a subset of the input image if the layer is the first layer or, if it is an intermediate layer or the last layer, then it will be a subset of a feature map obtained at the output of the immediately preceding layer. In any case, a data series of the image or the feature map according to the time axis will be deemed as a trace. Along the description, the feature map will be treated as an image comprising pixel values.
The kernel function is a function that may be expressed mathematically and, for a given reference pixel, evaluates an expression as a function of the pixel values of a bounded environment by providing a scalar value, the value that results in the pixel of the output feature map at the position of the reference pixel. The domain formed by the pixels involved in the function argument, the bounded environment, defines the receptive field.
According to the first step, the argument of the receptive field does not comprise pixels of the trace located at the reference pixel and therefore only values of adjacent traces. According to this feature, the receptive field is identified as a blind-trace receptive field.
The convolutional neural network is trained using a plurality of traces of the seismic image, the blind-trace receptive field of the kernel function covering adjacent traces with data but not covering the trace located at the position of the trace to be reconstructed. As a result, the learning method may be classified as unsupervised since the convolutional neural network is trained without the need of labeling preselected traces or images.
Once the convolutional neural network has been trained, the seismic image of a common-receiver or, when receivers are uniformly spaced, common mid-point receiver or a common offset receiver and, time domain comprising at least one trace to be reconstructed is inputted into it predicting the value of said trace. The predicted value of the trace to be reconstructed is assigned in the seismic image at the location where previously said trace to be predicted was located.
Although the kernel function can be expressed mathematically by means of a scalar function with its arguments, this function expresses the way in which the information is processed and combined from the information stored in the previous layer. For this reason, this kernel function could be instantiated in a computer program by means of an executable function but it could also be configured in a device by establishing the appropriate connections and logic gates that physically combine the information from one layer to the next.
According to an embodiment, it will be disclosed an example wherein the convolutional neural network is a U-net. The U-net comprising downscaling layers providing channels which allow the network to obtain context information that is subsequently propagated to higher resolution layers in the upscaling layers. Additionally, U-net based neural networks are chosen since the input image and the output image have the same size.
Directly applying a U-net based neural network as in the prior art for optimizing the unsupervised training loss will lead to a local minimum solution.
The strategy according to the present embodiment prevent the local minimum since at least one layer is a trace-blind layer.
According to an embodiment, the U-net comprises at least a down-sampling layer, wherein the at least one down-sampling layer is the layer with a kernel function with the blind-trace receptive field.
As a result, the blind-trace receptive field is used when the U-net is generating context information and, this context information is propagated to the high-resolution layer in order to predict traces to be reconstructed with a high accuracy. It has been proved that using the blind-trace receptive field in down-sampling layers the predicted trace is not distorted at high resolution and accurately predicted.
According to an embodiment, the blind-trace receptive field of the kernel function is determined by the following processing steps:
limiting the receptive field to cover one side according to the direction of the trace;
duplicating the image to be inputted into the receptive field resulting into a first copy of the image and a second copy of the image, the second copy of the image being rotated 180° with respect to the first copy of the image;
the first copy and the second copy of the image are inputted into the receptive field of the U-net and the resulting images being combined into a single image.
According to this embodiment, the inputted image is duplicated, one of the two images rotated 180° with respect to the other image. Since the receptive field has been limited to cover one side, the first image allows processing one side of the original image and the second image allows processing the other side of the original image in a very efficient manner.
In a preferred embodiment, the receptive field is limited at the side corresponding to the transversal direction of in which the data is stored. If matrices are stored by rows then the receptive field is limited at the upper part.
Once the first copy and the second copy of the image are inputted to the modified U-net resulting in two new n-channel output feature maps; these are combined into a single image comprising the predicted traces. According to a preferred embodiment, the two re-channel output feature maps are a 32-channel output feature maps.
According to a preferred embodiment, a one row offset is applied to both output feature maps to ensure a blind-trace receptive field before rotating them back and combining them. According to a preferred embodiment, the two images are combined by first concatenating the two output feature maps into a 2n-channel feature map followed by two 1×1 convolution steps. According to the preferred embodiment wherein the two n-channel output feature maps are a 32-channel output feature maps, said two output feature maps are concatenated into a 64-channel feature map.
According to a more preferred embodiment, in the first step we use a 32-filters 1×1 convolution followed by a ReLU activation layer resulting in a 32-channel feature map; in the last step we use a single-filter 1×1 convolution followed by a leaky ReLU activation layer resulting in a single-channel output image comprising the predicted traces.
According to an embodiment, the at least one layer with a blind-trace receptive field is a down-sampling layer and wherein any layer of the convolutional neural network further comprises an output for outputting a feature map and, wherein:
before inputting a feature map, output of a previous layer or the image if the current layer is the first layer, into the current layer, the feature map is padded by adding rows of zeros at the end of the feature map located at one side of the trace;
carrying out the convolution operation;
cropping out the same number of rows previously added wherein the cropping of the output feature map is carried out on the side opposite to the side on which rows were previously added.
In this particular embodiment, the blind-trace is implemented by a padding process.
According to this process, the image or the feature map is padded by adding rows resulting in a larger image with the pixels shifted in the column direction.
The convolution operation is executed over the padded image or feature map and then, the resulting image is cropped out removing the same number of rows previously added on the side opposite to the side on which rows were previously added recovering a feature map with the size of the initial image.
According to an embodiment, the training process of the U-net uses a converging criterion based on an approximation error estimation Es for measuring the interpolation loss when predicting the reconstructed trace and said approximation error estimation Es being determined as a linear combination of a misfit loss and a regularization loss, the regularization loss being determined by the following steps:
determining the main energy area of the non-reconstructed image;
calculating the norm of the reconstructed image in the frequency domain limited to the area not being part of the main energy area, according to a predetermined norm.
If the original complete image is y∈t×n, the image with no traces to be reconstructed, where t and n represent the total sampling time and number of traces respectively, and the observed decimated image (the image with traces to be reconstructed) is represented by x∈t×n; then the set of indices of the decimated traces is represented by m where the cardinality of the set represent the number of missing traces or traces to be reconstructed. The decimation ratio may be written as r=|m|/n.
Given the set m, observed image x can be obtained by:
where σm(y) is the mask operation mapping from complete image to decimated image: y→x. On the other hand, the reconstruction process may be expressed as an interpolation function mapping decimated image to complete image, fθ(x): x→y where fθ represent the neural network parameterized by θ. The unsupervised training loss is then set up as:
L
θ(m)=∥x−σm(fθ(x))∥
where ∥⋅∥ is the normalized l1 norm.
the optimal solution can be obtained by θ*=arg minθLθ(m)
Blind-trace network as disclosed prevents the identity mapping in the unsupervised learning task. But without extra steps, it fails to reconstruct regularly decimated seismic data. There is limited variance in the trace missing patterns for regularly decimated image, and the patterns between training and reconstructed traces are very different. In this description the term decimated will be interpreted as drop/miss data when referring to a trace and, regularly decimated will be interpreted as drop/miss data shows a regular pattern. This ill-posed interpolation problem is mitigated by adding a regularization guidance using an automatic spectrum suppression in the f-k domain.
If (x) denotes the 2D Fourier transform, then (x) is the decimated seismic data sorted in shot-gathers in the f-k domain. The main energy of the complete data (including first arrivals, ground roll and reflected energy) will be a fan with one or more apparent velocities, the energy being bounded in a certain area S from the f-k domain, a suppression mask M∈t×n can be created satisfying Mi=0 if i∈S and Mi=1 if i∉S.
The error estimation Es is determined as a linear combination of a misfit loss and a regularization loss. According to a further embodiment, the linear combination only comprises a single parameter for the regularization loss that may be expressed as:
where ⊙ denote the entry-wise multiplication and α is the coefficient that balance the misfit loss and regularization loss.
After determining the main energy area of the non-reconstructed image, the regularization loss factor is estimated. Then, the next steps is determining the regularization loss by calculating the norm of the reconstructed image in the frequency domain limited to the area not being part of the main energy area, according to a predetermined norm.
Then the training process of the U-net is in an iterative process with a converging criterion based on an approximation error of Es estimated as the misfit loss plus the regularization loss multiplied by factor α. Factor α is a positive value and according to some numerical experiments, the value of α has been chosen in the range [10−4, 10−2]. An appropriate α value can be determined by evaluation on a validation set. The validation set can be a group of undecimated traces that are not used in computing the misfit loss, i.e. these traces are masked out on purpose just for identifying a and will be used in the network training after a is determined.
According to an embodiment, the misfit loss is determined as the difference between the non-reconstructed image and the reconstructed image after removing the reconstructed trace/traces.
This miss fit may be expressed as x−σm(fθ(x)) measure by a predetermined norm.
According to a specific embodiment, the reconstructing method according to the first aspect of the invention is used for deblending seismic data.
As it has been disclosed, the seismic data in the time domain can be represented as a multidimensional array Pijkb=(xis,xjr,tk) when the survey has been carried without any blending conditions.
For blended acquisition σl is defined as the lth source group containing a subset of source positions Within each σl the shots are fired with relatively short and random delay times τli (dithering).
Therefore, according to this embodiment, a survey is carried out in which the data acquired in the receivers are blended because the sources are fired in groups in such a way that within each group it is not allowed to wait long enough for the acoustic signal of the previous fire to have disappeared.
The method according to this embodiment is as follows:
deploying a plurality of ns acoustic sources in the upper surface of the reservoir domain wherein the ns acoustic sources are grouped in B groups σl, l=1 . . . B, of acoustic sources, each acoustic source being only in one group σl of sources and at a location xis, i=1 . . . ns and, deploying a plurality of nr, acoustic receivers in the upper surface of the reservoir domain at a location xjr, j=1 . . . nr;
for each group σl, l=1 . . . B of acoustic sources, each acoustic source is shot with a random delay time τli and the response in the acoustic receivers stored in a data structure that may be represented by Pljkb=(xis,xjr,tk−Tli); wherein tk is the kth time sample in the time domain;
calculating, for each group σ1, l=1 . . . B, the Fourier-transform Πb(:,:,ωk)=F{Pljkb} wherein ωk is the kth frequency and “:” denoting variables depending in index i or index j;
for each frequency ωk determining æLS(:,:,ωk)=Γ*Πb(:,:,ωk) wherein Γ* is
Γ*=ΓkHD
being D a diagonal matrix and ΓkH the conjugate transpose of Γk the blending matrix that may be calculated from the random delay times τli as
calculating an Inverse Fourier Transform of F−1(ΠLs(:,:,ωk))=Pljk;
the shot-gather ordering in the output Pljk is sorted to get the image Ij of the trace data in the common-receiver or the common-midpoint receiver or the common-offset receiver and, time domain;
for each trace carrying out a deblending step by reconstructing the coherent signal of the traces using a reconstructing method according to any of those previously disclosed.
After acquiring the blended seismic data, for each group, the blended seismic data corresponding to such group is processed independently of the other groups. According to a very short survey, there is a single group of blended data and therefore all the seismic data is blended.
Step c) calculates the Fourier-transform wherein now the time variable is transformed to the frequency variable. Since this problem is overdetermined, a least square method is applied by calculating ΠLS(:,:,ωk)=Γ*Πb(:,:,ωk). Γ* is the result of the multiplication of a diagonal matrix by the conjugate transpose of the blending matrix which shift common-receiver and time domain images in the time variable by multiplying by factor e−√{square root over (−1)}ω
Once the data has been shifted and the least square solution calculated, a set of 77,, images are generated by selecting planes in the gather domain such as in the common-receiver domain from Pljk and then for each trace carrying out a deblending step by reconstructing the coherent signal of the traces using a reconstructing method according to any of those previously disclosed.
Steps c) and the following steps are expressed explicitly in the Transformed Fourier space because the formulation is very clear in order to obtain pseudodeblended data; however, the same result may be obtained by operating in time-domain by applying a time shifting process. Any method providing the same pseudodeblended data will be interpreted as being equivalent.
The diagonal matrix used for calculating Γ*=ΓkHD applies an scaling factor to each row of ΓkH which is determined as D=(ΓkΓkH)−1; nonetheless, this calculations are avoided in order to not scale the coherent information. In this case, the diagonal matrix may be represented as D=I. In this case; we will name the deblended data as pseudo-deblended data.
Additionally, if the seismic survey is carried out firing each source only once then the condition σm∩σn=Ø is satisfied and rows of Γk are orthogonal.
These and other features and advantages of the invention will be seen more clearly from the following detailed description of a preferred embodiment provided only by way of illustrative and non-limiting example in reference to the attached drawings.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied method that may be implemented as a computer program product, at least those parts manipulating the acquired seismic data.
In order to show an specific embodiment of the disclosed methods in a detailed manner, let represent the 1-D Fourier transformation along the time axis, that is, for what has been named a trace, −1 its inverse and fθ(⋅) is a denoising convolutional neural network (CNN) with parameters θ. Further, let C denote the array concatenation operator which takes arrays and stacks them into a single higher-dimensional array, e.g. Π=CΠk. With no unblended data available, a model that minimizes the unsupervised blending loss function is trained, being the loss function
and ={x(1), . . . , x(N)}, where the input x=−1CΓmHΠkb, is the pseudo-deblended data in the common receiver domain. fθ(x) gives the prediction of the deblended data. The prediction of the model is blended again by the known blending matrix Γ. This result is the blended prediction: Γm(fθ(x(k)))m. The term −1CΓmHΓm(fθ(x(k)))m is the pseudodeblended output of the model's prediction. The mean absolute error is taken from the pseudodeblended output of the model's prediction and the original pseudo-deblended data, i.e. the input x. The action of the blending followed by the pseudodeblending operator −1CΓmHΓm((⋅))m can be implemented efficiently and directly in the time domain by first combining the traces according to the blended-acquisition time dithering code, then each bended shot-gather is copied a number of times equal to the corresponding source group size, and finally each of these copies is time-shifted (dithering decoded) to undo the delays introduced in the field. By adopting this blending loss, the deblending task is accomplished in a unsupervised way with the blended data and the blending matrix.
Direct minimization on this blending loss with a traditional deep CNN leads to identity mapping. It will produce the pseudo-deblended result which is exactly the least square solution. This phenomenon could be observed in
To avoid this local minimum, a strong constraint for the model by making it trace-wise blind is proposed. According to this proposal, the coherent signal of the traces are essentially reconstructed from their adjacent traces, while not looking at themselves in the input. Thus for a specific trace, there would be no chance for its blending noise to be mapped from input to output. For those noises in its adjacent area, the model automatically neglect them due to their discontinuity and the convergence of the blending noise.
The illustration is shown in
According to this embodiment, the blind-trace convolutional neural network is constructed for the deblending task.
The network structure is shown in
In the blind-trace U-net, the receptive field of each layer is strictly restricted to the upper half area for each row so that the model can extract the coherent features for each trace based on its left or right adjacent area in the original input respectively.
After the blind-trace U-net, the two output feature maps are cropped at the bottom and padded at the top such that the target traces can be excluded from its receptive field to fulfill the “blind-trace” purpose.
Then, they are rotated back (270° and 90° accordingly) and concatenated, followed by two consecutive 1×1 convolutional layers that integrate the feature maps and squeeze the channel size to 1.
In
According to a preferred embodiment that may be applied to any of the disclosed examples, before the padding and cropping operations, the trace is buried under the patches, and at the end of the process, the trace will be excluded from the patches. For the edges, there is only one side of the patch that is informative and zeros are padded on the other side.
The image is processed only once and the network gives the predictions for all traces at the same time.
Specifically, to constrain the receptive field within the upper half for all rows in the blind-trace U-net, the convolutional layers and down-sampling layers are changed as shown in
Convolution: The feature maps are padded with zeros before each convolutional layer. Given the size of the filter k×k, └k/2┘ lines are padded on the top of the feature maps, and crop the └k/2┘ lines at the bottom after the convolution.
Max-pooling: The feature maps are padded on the top with one line and crop one at the bottom before max-pooling. There is no need to change the up-sampling layers since it does not affect the receptive filed after the modification for the down-sampling layers.
With the aforementioned changes, for a single trace in the feature maps, it takes the information purely from itself and the area above in the input. No information below it can leak into the result.
According to this embodiment, the rotated inputs are projected to 32 feature maps in the first layer. There are 4 contracting blocks and 4 expanding blocks in the blind-trace U-net, each of which consists of two consecutive modified convolutional layers and a max-pooling/up-sampling layer. The feature maps are doubled in the last convolutional layer of each contracting block and halved correspondingly in the expanding blocks.
Each blind-trace convolutional layer in the encoder is replaced by a blind-trace residual block (i.e. a residual block with all convolutional layers modified). Batch Normalization is adopted before each activation in the decoder.
The last two 1×1 convolutional layers project the concatenated 64 feature maps to 32 and 1 respectively. It has been used ReLU activation in the intermediate convolutional layers and leaky ReLU for the last 1×1 convolutional layer.
The proposed blind-trace network with blending loss combines the merits of both the conventional filter-based method and the inversion-based method.
The large number of the weights in the U-Net endows the network to achieve a complex non-linear filter to extract information from the coherent signal, meanwhile, for the low-SNR area where filters could not obtain any coherent information, the minimization of the blending loss reconstructs the coherent signal underneath through nonlinear inversion. This is a one-stage deblending framework and does not require much exhaustive and meticulous hyperparameter tuning.
Regarding the experiment, three different types of blending schemes are used to demonstrate the performance of the proposed method. The synthetic data examples can be seen in
“alternate” blending. In this blending strategy, two consecutive shots are blended using short and random dithering time. After pseudo-deblending, the noise will cover the whole common receiver gather as shown in
200 unblended shots are collected with the size (1600(receiver numbers)×750(time samples)). Random delays in the range of (0, 1]s are added to even shots and blended with the odd shots. The sampling period is 6 ms.
“half” blending. The second half of the shot gathers are shifted with relatively long delays and added to the first half. Noises in the pseudo-deblended common receiver gathers concentrate in the two corners of the image as shown in
“continuous” blending. Simultaneous source of streamer acquisition is simulated in this experiment. A BP2004 velocity model is used for generating 200 consecutive unblended shots.
(BP2004 denotes the 2004 British Petroleum Velocity Benchmark. https://software.seg.org/datasets/2D/2004_BP_Vel_Benchmark/eage_abstract.pdf)
The shots have been recorded every 5 to 6 seconds continuously. There are 959 receivers for each record and 3334 samples along the time axis (dt=6 ms, hence T≈20 s). Approximately 4 shots are blended at different locations in the common receiver gathers. The noise level, in this case, is much higher than the other blending schemes as shown in
Results of One-Stage Deblending
In order to avoid massive matrix multiplication in the f-x domain in loss computation, the blending and pseudo-deblending is performed in the time-space by applying dithering codes to the shots, followed by summation and cropping. In the first stage, the weights and bias are randomly initialized and trained with the Adam optimizer.
“Half” blending scheme introduces more challenges for deblending models. Due to the concentration of the noises and long dithering time, the weak amplitudes in late arrivals are severely contaminated by strong noises from the early arrivals. As can be seen in
In “continuous” blending scheme, more unblended shots are collected within one blended shot. The noisy area enlarges to the whole image, and the signal is covered by multiple layers of noises. Less coherent information can be attained by the model.
However, the one-stage deblending can still give a very good result as demonstrated in
Number | Date | Country | Kind |
---|---|---|---|
21382674.6 | Jul 2021 | EP | regional |