Embodiments of the subject matter disclosed herein generally relate to seismic exploration of underground formations in which data processing uses a deep neural network (DNN) for attenuating seismic interference noise.
Exploration geophysics uses physical methods, such as seismic, gravitational, magnetic, electrical and electromagnetic to measure physical properties (such as seismic wave propagation velocity, its asymmetry, etc.) inside a underground formation thereby creating a 3D image of the explored underground formation. The value(s) of one or more physical properties enable detecting the presence and position of economically useful geological deposits, such as rock with sought-after minerals, hydrocarbons, water, etc.
In seismic exploration, seismic data is acquired during a seismic survey. Seismic sensors detecting seismic waves traveling through the explored underground formation record the seismic data. Besides the seismic waves carrying information about the underground formation, the seismic sensors also detect unwanted acoustic energy known as seismic interference (SI) noise. SI, which is typically coherent noise (i.e., bearing a systematic phase relation between adjacent traces), is a significant problem and having a negative impact on seismic data processing operations such as ghost elimination (deghosting), multiple elimination (demultiple), velocity estimations and amplitude versus offset (AVO) analysis (as discussed, for example, in the 2004 article “Seismic interference noise attenuation” by Gulunay et al., published in SEG Technical Program Expanded Abstracts, pp. 1973-1976). The angles of incidence for SI noise may differ from sail line to sail line within one survey depending on the relative placement of the external sources to the receivers. Likewise, the amplitudes of SI noise can also vary greatly depending on the relative distance between the sources and the receivers. As SI noise is generated by powerful dedicated sources for seismic exploration, it tends to be well preserved over large distances and may overlap with reflections from sub-surface layers which have significantly lower amplitudes.
SI noise is typically observed at different arrival times in each seismic shot recording. Therefore, the conventional algorithms for SI noise attenuation often resort to obtaining a more incoherent distribution of the SI noise in the common receiver or common offset domain. Data transformations (e.g., T-p transform, where p is slope of dt/dx with t being arrival time and x source receiver distance known as “offset,” and T is intercept time obtained, for example, by projecting the slope to x=0) are also commonly adopted in an attempt to discriminate the noise via differences in dips/curvature from underlying signal. After transforming the data, a filtering-based denoising (e.g., the f-x prediction filters, where f is frequency and x is space) may be applied to remove SI noise. However, removing SI noise with different moveout (dip and/or curvature) for an entire dataset requires different sets of parameters be manually tested and selected for applying a conventional algorithm. This process can be laborious. In addition, in some situations (such as when SI noise coming from two directions or with similar dip to the wanted signals is recorded), the conventional algorithms are inadequate leaving noise residual and or loosing signal (i.e., seismic waves carrying information about the underground formation).
In recent years, encouraged by successful applications in conventional image processing, seismic data processing started employing DNNs (deep neural networks) in various seismic data processing operations including SI noise attenuation. Most studies confirmed the advantage of DNN-based approaches in saving time and labor cost and/or improving processing quality by comparing such approaches with a a conventional approach. However, seismic data processing using DNNs have not been widely deployed in actual seismic processing projects because when compared with a production workflow integrating a series of conventional methods of signal processing, the DNN approaches can easily fail to accurately extract the signal.
It remains therefore of interest, in seismic data processing for natural resource exploration in underground formation, to develop DNN-based approaches able to replace high-computing cost conventional algorithms while yielding at least same result quality.
Methods and devices according to various embodiments perform seismic exploration using a DNN to attenuate/remove SI noise. In order to overcome the problem of potentially inadvertently failing to accurately separate the signal from the noise, the DNN is trained using an SI model, SI free shots and simulated random noise.
According to an embodiment, there is a seismic exploration method that includes selecting a representative subset of a seismic data acquired during a survey and extracting an SI model from the representative subset using a conventional method. The seismic exploration method further includes combining the SI model with SI free shots and simulated random noise to generate a training dataset, training a DNN using the training data, to output the SI model and a sum of the SI free shots and the simulated random noise, and using the trained DNN to process the seismic data in order to generate an image of subsurface formation for detecting presence and/or location of sought-after natural resources.
According to another embodiment, there is a seismic data processing apparatus having an interface configured to obtain seismic data acquired during a survey; and a data processing unit connected to the interface. The data processing unit is configured to select a representative subset of a seismic data acquired during a survey and to extract an SI model from the representative subset using a conventional method. The data processing unit is further configured to combine the SI model with SI free shots and simulated random noise to generate a training dataset, to train a DNN using the training data so that the DNN to output the SI model and a sum of the SI free shots and the simulated random noise, and to use the trained DNN to process the seismic data in order to generate an image of a subsurface formation for detecting presence and/or location of sought-after natural resources.
According to yet another embodiment, there is a computer readable recording medium non-transitorily storing executable codes that when executed by a computer make the computer perform a seismic exploration method. The seismic exploration method includes selecting a representative subset of a seismic data acquired during a survey and extracting an SI model from the representative subset using a conventional method. The seismic exploration method further includes combining the SI model with SI free shots and simulated random noise to generate a training dataset, training a DNN using the training data, to output the SI model and a sum of the SI free shots and the simulated random noise, and using the trained DNN to process the seismic data in order to generate an image of subsurface formation for detecting presence and/or location of sought-after natural resources.
The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate one or more embodiments and, together with the description, explain these embodiments. In the drawings:
The following description of the exemplary embodiments refers to the accompanying drawings. The same reference numbers in different drawings identify the same or similar elements. The following detailed description does not limit the invention. Reference throughout the specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with an embodiment is included in at least one embodiment of the subject matter disclosed. Thus, the appearance of the phrases “in one embodiment” or “in an embodiment” in various places is not necessarily referring to the same embodiment. Further, the particular features, structures or characteristics may be combined in any suitable manner in one or more embodiments.
The embodiments described in this section relate to the attenuation of noise due to a seismic source other than a primary seismic source. In some cases, the other seismic source is deployed as part of the same data acquisition system (either on the same or another vessel), the noise being called “cross-talk noise.” In these cases, the actuation time and position of the seismic source causing the noise are known. In other cases, the seismic source causing the noise is not part of the seismic data acquisition system, the noise may be called seismic interference (SI) noise. In this case the actuation timing and position of the second source are likely unknown. SI noise has typically a lower amplitude than the cross-talk noise, as the noise causing source is likely farther away. The seismic data processing methods and devices described in this section use DNNs for attenuation of SI noise and/or cross-talk noise, while preserving the wanted signal.
SI model 130 is then combined with SI-free shot data 140 (i.e., corresponding to data acquired without SI noise in the survey area, the SI-free shot data being obtained by simulation or a result of conventional SI-removal) and simulated random noise 150. The resulting blended data is used as input for training and validation 160 of the DNN 180. Injecting additional random noise into the input makes the DNN to better focus on learning the high-level features of the input data. DNN 180 learns to predict two outputs: a first one corresponding to the SI noise and a second one corresponding to the SI-free shot and the random noise (as indicated by the arrows emerging from DNN 180 box and turning back to SI model 130 and the box surrounding SI-free shot data 140 and simulated random noise 150, respectively). In other words, DNN 180 may be trained so its first channel output to match SI model 130 and its second channel output to match the sum of SI-free shot data 140 and simulated random noise 150. Note that here the term ‘channel’ is used in the machine learning sense to represent a certain component of an image. DNN 180 may be output any or both these channels as well as other similar user-defined channels.
Optionally (as indicated by the dashed line of box 170), DNN training and validation may also use adjacent shots on both sides (location-wise or time-wise) of the reference shot are added to the DNN’s input (as described, for example, in the 2020 article “Attenuation of marine seismic interference noise employing a customized U-Net” by Sun et al. published in Geophysical Prospecting, vol. 68, no. 3). When adjacent shots are used, although the input data slices are in shot domain, the DNN may then also use of information in the common receiver domain since seismic events from consecutive shots are correlated and continuous whereas the SI noise tends to be uncorrelated and discontinuous.
Once the DNN is adequately trained (i.e., 190), it can be used to process the entire acquired dataset 110 without injecting additional random noise. The output of the processing using DNN is then a dataset without SI noise.
Method 200 further includes extracting an SI model from the representative subset using a conventional method at 220. Here, the “conventional method” may be an inversion-type method that optimizes the subsurface structure’s model, a transformation-type method (e.g., a data transform followed by a filter-based denoising) or direct noise identification in a subset of the survey data without a primary signal (e.g., during in the line turns, from noise records before the start of a line or after shooting a line, while transiting to infill lines, etc.).
Method 200 then includes combining the SI model with SI free shots and simulated random noise to generate a training dataset at 230. The SI free shots may be simulated or may be the result of applying a conventional method. Then, at 240, a DNN is trained (using as input the training dataset) to output the SI model and/or a sum of the SI free shots and the simulated random noise.
Method 200 may include splitting a dataset resulting from combining the SI model with SI free shots and simulated random noise into a validation dataset and the training dataset, and after the DNN is trained using the training dataset, DNNs accuracy is tested using the validation dataset.
Method 200 uses the trained DNN to process the entire seismic dataset acquired during the survey to produce an image of subsurface formations in the surveyed area at 250. The image represents attribute values (e.g., seismic wave propagation velocity, reflectivity, etc.) within the subsurface enabling detection of sought-after natural resources (e.g., oil and gas, geothermal water, minerals, etc.).
Besides attenuating noise due to external sources (other than the ones yielding the primary signal), this method may be applied to attenuate turn noise, shear-like noise caused by scattering off sea-floor heterogeneities (known as “Vz noise”), cable strum noise, multiples, ghosts, bubble noise, etc.
DNN is a term used to describe hardware such as network nodes and software such as machine learning algorithms. DNNs have an input layer, an output layer and few hidden layers in-between. DNNs have a hierarchical organization similar to neurons in human brains. DNNs consist of neurons, synapses, weights, biases, and functions. DNNs are trained to yield desired outputs by supervised learning with an input/training dataset and one or more target output(s) (in our case, (i) the SI model and/or (ii) a sum of the SI free shots and the simulated random noise) through back propagation.
SI noise attenuation using DNNs as described above was tested using synthetic (simulated) data illustrated in
The simulated primary-source signals and blending noise both have events of hyperbolic curvatures as if being acquired from a source-over-streamer blended seismic data acquisition (e.g., using TopSeis system described in the 2017 article “Shooting over the streamer spread: a novel approach in seismic marine acquisition and imaging” by Vinje et al. published in 79th Annual Conference and Exhibition, EAGE, Extended Abstracts, pp. 1-5). For a shooting interval of 3.0 s with a shot-to-shot dithering of maximum ± 500 ms, a blended data batch of 300 blended shots (primary-source signal plus blending noise) have been simulated. The shots in this batch have different numbers of linear events with different dips and amplitudes. Other thirty blended shots were generated using a slightly different shooting interval, 24 of which were then used as DNN training data and the other 6 were for validation.
Jittered gathers obtained by applying a random dither on each trace may be used as test data.
Three experiments for testing the use of DNNs for the attenuating SI noise were conducted.
The first experiment used the unjittered signals (
In the second experiment, random noise, which lacks learnable high-level features, was added to first training data set (i.e., input for training was
In this second experiment, the DNN was trained to output the three coherent components free of random noise and jitter (i.e., as in
Experiment 3 builds on experiment 2 retaining the random noise along with the wanted signal in its corresponding output channel in addition to injecting random noise into the training input. The random noise was uniform random noise, but may also be gaussian white, pink, black, brownian, or cauchy noise. The goal was to maintain the benefits of predicting coherent noise through the reconstruction pattern, so that a more complete prediction can be achieved based on a better extraction of the high-level features, while avoiding loss of the desired signal (e.g., the local jitters on the events). Note that real field seismic data do not always have dramatic low-level features like these artificially introduced local jitters, either on the desired signal or the coherent noise, but it is still important to preserve information of the raw data as complete as possible during the processing.
The results (primary signal, blending noise and linear noise) output by the DNN trained in experiment 3 are illustrated in
These three experiments with their respective training and using DNN patterns lead to improving and nuancing the use of DNNs for removing SI noise. In experiment 1, since the DNN simply decomposes the random noise-free input seismic to the different components, loss of data features occurs in each of them but their summation is approximately equal to the raw input. In experiment 2, in which random noise is injected into the training inputs and the DNN learns to reconstruct output channels free of random noise, DNN is more focused on learning the high-level features of data, but some low-level features (e.g., the local variance of amplitude and time-shift) were overlooked. Experiment 3 showed that if one output (here, the primary-source signal) is expected to keep not only one coherent component but also the random noise, the DNN tends to reconstruct the other random noise-free changes first. The channel with random noise may then be obtained by subtracting the reconstructed channels from the raw data. This pattern better preserves signal fidelity of the coherent component in a selected channel at the expense of adding jitter from the other channels interpreted as random noise.
The above three processing patterns could be selectively used for different purposes based on their own trade-offs. If the goal is to achieve good signal fidelity and preserve primary information, it is advantageous to inject random noise into primary signal data used as input and the output channel for training the DNN.
Following up on experiment 3, applied to a real data test, to train the DNN, SI contaminated data was produced by randomly blending SI-free shots with records containing almost pure SI noise originating from different directions as illustrated in
The DNN was trained with data in which identical simulated random noise was injected into the training (and validation) inputs. After being independently trained, this DNN was applied on the same set of field data contaminated by SI noise during acquisition. Compared to the reference case (
The main advantage of using DNNs is seismic data processing is significantly decreasing the processing time. Training the DNN with data prepared as discussed (i.e., extracting an SI model from the representative subset using a conventional method and then combining the SI model with SI free shots and simulated random noise) yield accurate results both for simulated and real seismic data.
The above-discussed methods may be implemented in a computing device 1000 as illustrated in
Computing device 1000 suitable for performing the activities described in the exemplary embodiments may include a server 1001. Server 1001 may include a central processor (CPU or GPU) 1002 coupled to a random-access memory (RAM) 1004 and to a read-only memory (ROM) 1006. ROM 1006 may also be other types of storage media to store programs, such as programmable ROM (PROM), erasable PROM (EPROM), etc.
Processor 1002 may communicate with other internal and external components through input/output (I/O) circuitry 1008 and bussing 1010 to provide control signals and the like. The I/O circuitry 1008 is configured to obtain seismic data acquired during a survey. Processor 1002 carries out a variety of functions as are known in the art, as dictated by software and/or firmware instructions. The processor is configured to select a representative subset of a seismic data acquired during a survey, to extract an SI model from the representative subset using a conventional method, to combine the SI model with SI free shots and simulated random noise to generate a training dataset, to train a DNN using the training data so that the DNN to output the SI model and a sum of the SI free shots and the simulated random noise, and to use the trained DNN to process the seismic data in order to generate an image of subsurface formation for detecting presence and/or location of sought-after natural resources.
RAM 1004 may be configured to store executable codes that when executed by a computer make the computer perform a seismic exploration method such as method 200 and its variations described in this section.
Server 1001 may also include one or more data storage devices, including hard drives 1012, CD-ROM drives 1016 and other hardware capable of reading and/or storing information, such as DVD, etc. In one embodiment, software for carrying out the above-discussed steps may be stored and distributed on a CD-ROM or DVD 1016, a USB storage device 1018 or other form of media capable of portably storing information. These storage media may be inserted into, and read by, devices such as CD-ROM drive 1014, disk drive 1012, etc. Server 1001 may be coupled to a display 1020, which may be any type of known display or presentation screen, such as LCD, plasma display, cathode ray tube (CRT), etc. A user input interface 1022 is provided, including one or more user interface mechanisms such as a mouse, keyboard, microphone, touchpad, touch screen, voice-recognition system, etc.
Server 1001 may be coupled to other devices, such as sources, detectors, etc. The server may be part of a larger network configuration as in a global area network such as the Internet 1028, which allows ultimate connection to various computing devices.
The embodiments described in this section set forth methods and devices configured to attenuate/remove SI noise fast and accurately using a DNN. It should be understood that this description is not intended to limit the invention. On the contrary, the exemplary embodiments are intended to cover alternatives, modifications and equivalents, which are included in the spirit and scope of the invention. Further, in the detailed description of the exemplary embodiments, numerous specific details are set forth in order to provide a comprehensive understanding of the invention. However, one skilled in the art would understand that various embodiments may be practiced without such specific details.
Although the features and elements of the present exemplary embodiments are described in the embodiments in particular combinations, each feature or element can be used alone without the other features and elements of the embodiments or in various combinations with or without other features and elements disclosed herein.
This written description uses examples of the subject matter disclosed to enable any person skilled in the art to practice the same, including making and using any devices or systems and performing any incorporated methods. Other examples that occur to those skilled in the art are intended to be within the scope of the disclosed inventions.
Number | Date | Country | |
---|---|---|---|
63252895 | Oct 2021 | US |