In order to get as much information as possible about the properties of a formation surrounding a borehole, modern sonic-logging technology has developed over the last few decades. For example, an acoustic source or a transmitter, e.g., a monopole, dipole, quadrupole, etc. source, emits an acoustic wave inside the borehole. The resulting acoustic wave is then recorded with an acoustic array after it has reached the borehole walls and traveled through the formation via elastic waves. Data analysis on the recorded waveforms provides useful information about the geophysical properties of the borehole. Such information is composed of the slowness of the different types of waves traveling through the formation, as well as the dispersion curves when the slowness is frequency dependent.
Sonic tools have been designed in order to operate simultaneously with the drilling of the borehole. Such tools are called logging-while-drilling (“LWD”) tools and are linked to a bit within a bottom hole assembly (“BHA”). Processing sonic data obtained from LWD is a challenging task because the recorded waveforms experience a low signal to noise ratio (“SNR”) due to the drilling noise, for example, any noise generated by the drilling operation such as the sound of the bit grinding the rock and/or any shocks of the BHA against the borehole walls. The noisy data acquired by LWD sonic logging requires robust techniques to extract useful information.
Described herein are robust automated techniques to estimate first-arrival times of compressional waves (“P-waves”) on Drilling and Measurement (“D&M”) acoustic monopole data. Estimating first-arrival times can be a challenging task because D&M data experience a very low signal to noise ratio, making the conventional first-motion detection algorithms unreliable. Hence several signal processing tools are described for use together in order to build a robust algorithm. First, after using a first-motion detection algorithm, outliers are detected and discarded. The remaining outputs allow for windowing of the signal around the P-wave arrivals, and then calculating the time delays between the waveforms. These time delays are useful information for two reasons. First, the time delays can be used to greatly reduce the error on the first-arrival estimations. Secondly, the time delays can provide an approximate value of the P-slowness. Knowing the P-slowness, the data can be filtered, for example, using either two-dimensional (“2-D”) Fourier transform or the Radon transform of all the waveforms at once, reducing efficiently the noise. Then, repeating this three steps scheme for few iterations provides reliable estimations of first-arrival times. Two example applications for the first-arrival time estimates are also described. First, the P-slowness deduced from these techniques is very close to the P-slowness calculated by Slowness Time Coherence (“STC”) on both wireline and D&M data. Another application may be the borehole diameter and/or mud slowness estimations.
An example method for estimating a time of flight (“TOF”) for an acoustic wave generated by an acoustic source for each of a plurality of acoustic receivers in an array can include recording data corresponding to the acoustic wave received at each of the acoustic receivers, estimating a respective TOF for the acoustic wave for each of the acoustic receivers, and discarding one or more of the respective estimated TOFs. Each of the discarded TOFs can be a statistical outlier. The method can also include calculating a plurality of time delays for the acoustic wave using a plurality of non-discarded estimated TOFs, and updating the respective estimated TOFs for the acoustic wave for each of the acoustic receivers using the calculated time delays. Each of the time delays can be a difference between respective estimated TOFs for the acoustic wave for at least two of the acoustic receivers. Additionally, the method can optionally further include determining a compressional wave (“P-wave”) slowness value based on the calculated time delays, filtering the recorded data using a two-dimensional filtering transform based on the P-wave slowness value, and re-performing, using the filtered recorded data, the steps of estimating the respective TOF for the acoustic wave for each of the acoustic receivers, discarding one or more of the respective estimated TOFs, calculating the plurality of time delays and updating the respective estimated TOFS using the calculated time delays.
Optionally, the two-dimensional filtering transform can be an F-K transform or a Radon transform, wavelet transform, curvelet, or any other transformation allowing to represent the data in two dimensions. Alternatively or additionally, the TOF for the acoustic wave for each of the acoustic receivers can optionally be estimated using a first-arrival time detection algorithm. For example, the first-arrival time detection algorithm can be an entropy algorithm, a threshold based algorithm, or a statistical algorithm based. Alternatively or additionally, a plurality of time delays for the acoustic wave can optionally be calculated using a high-order statistical algorithm. For example, the high-order statistical algorithm can be a bicoherence correlation algorithm, a coherence-correlation algorithm, a cross-correlation algorithm, bispectral-algorithm.
Additionally, one or more of the respective estimated TOFs can be discarded by performing a least-squares regression on the estimated TOFs, calculating a standard deviation of differences between the estimated TOFs and respective values of the least-squares regression for the acoustic wave for each of the acoustic receivers, and if an absolute value of a difference between an estimated TOF for the acoustic wave for a given acoustic receiver and a value of the least-squares regression for the acoustic wave for the given acoustic receiver is greater than the standard deviation, discarding the estimated TOF for the acoustic wave for the given receiver.
Optionally, the method can further include, upon discarding one or more of the respective estimated TOFs, windowing respective portions of the recorded data corresponding to the acoustic wave for each of the acoustic receivers around each of the non-discarded estimated TOFs and disregarding data outside each of the respective windowed portions. For example, disregarding data outside each of the respective windowed portions can include, but is not limited to, setting the data outside each of the respective windowed portions to zero or a flag value. Accordingly, the data outside each of the respective windowed portions can be identified and not used (e.g., ignored, disregarded, etc.) in the remaining calculations.
Alternatively or additionally, the method can include determining a P-wave slowness value based on the updated TOFs for the acoustic wave for each of the acoustic receivers. For example, determining the P-wave slowness value for each of the acoustic receivers can include performing a least-squares regression on the updated TOFs, and calculating a slope of a best-fitting line obtained by the least-squares regression.
Optionally, the acoustic source and the array can be located in a borehole. Additionally, the method can further include inverting a diameter of the borehole and/or a mud slowness value based on the updated TOFs for the acoustic wave for each of the acoustic receivers.
An example system for estimating a TOF for an acoustic wave can include an acoustic tool and a control unit. The acoustic tool can include at least one acoustic source configured to generate an acoustic wave and an array including a plurality of acoustic receivers. The control unit can include at least one processor and a memory. In addition, the control unit can be configured to record data corresponding to the acoustic wave received at each of the acoustic receivers, estimate a respective TOF for the acoustic wave for each of the acoustic receivers, and discard one or more of the respective estimated TOFs. Each of the discarded TOFs can be a statistical outlier. The control unit can also be configured to calculate a plurality of time delays for the acoustic wave using a plurality of non-discarded estimated TOFs, and update the respective estimated TOFs for the acoustic wave for each of the acoustic receivers using the calculated time delays. Each of the time delays can be a difference between respective estimated TOFs for the acoustic wave for at least two of the acoustic receivers.
Additionally, the control unit can be further configured to determine a P-wave slowness value based on the calculated time delays, filter the recorded data using a two-dimensional filtering transform based on the P-wave slowness value, and re-perform, using the filtered recorded data, the steps of estimating the respective TOF for the acoustic wave for each of the acoustic receivers, discarding one or more of the estimated TOFs, calculating the plurality of time delays and updating the estimated TOFS using the calculated time delays.
Optionally, the two-dimensional filtering transform can be an F-K transform or a Radon transform. Alternatively or additionally, the TOF for the acoustic wave for each of the acoustic receivers can optionally be estimated using a first-arrival time detection algorithm. For example, the first-arrival time detection algorithm can be an entropy algorithm an entropy algorithm. Alternatively or additionally, a plurality of time delays for the acoustic wave can optionally be calculated using a high-order statistical algorithm. For example, the high-order statistical algorithm can be a bicoherence correlation algorithm.
Additionally, one or more of the respective estimated TOFs can be discarded by performing a least-squares regression on the estimated TOFs, calculating a standard deviation of differences between the estimated TOFs and respective values of the least-squares regression for the acoustic wave for each of the acoustic receivers, and if an absolute value of a difference between an estimated TOF for the acoustic wave for a given acoustic receiver and a value of the least-squares regression for the acoustic wave for the given acoustic receiver is greater than the standard deviation, discarding the estimated TOF for the acoustic wave for the given receiver.
Optionally, the control unit can be further configured to, upon discarding one or more of the respective estimated TOFs, window respective portions of the recorded data corresponding to the acoustic wave for each of the acoustic receivers around each of the non-discarded estimated TOFs and disregard data outside each of the respective windowed portions. For example, disregarding data outside each of the respective windowed portions can include, but is not limited to, setting the data outside each of the respective windowed portions to zero or a flag value. Accordingly, the data outside each of the respective windowed portions can be identified and not used (e.g., ignored, disregarded, etc.) in the remaining calculations.
Alternatively or additionally, the control unit can be configured to determine a P-wave slowness value based on the updated TOFs for the acoustic wave for each of the acoustic receivers. For example, determining the P-wave slowness value for each of the acoustic receivers can include performing a least-squares regression on the updated TOFs, and calculating a slope of a best-fitting line obtained by the least-squares regression.
Optionally, the acoustic source and the array can be located in a borehole. Additionally, the control unit can be further configured to invert a diameter of the borehole and/or a mud slowness value based on the updated TOFs for the acoustic wave for each of the acoustic receivers.
It should be understood that the above-described subject matter can be implemented as a computer-controlled apparatus, a computer process, a computing system, or an article of manufacture, such as a computer-readable storage medium.
Other systems, methods, features and/or advantages will be or may become apparent to one with skill in the art upon examination of the following drawings and detailed description. It is intended that all such additional systems, methods, features and/or advantages be included within this description and be protected by the accompanying claims.
The components in the drawings are not necessarily to scale relative to each other. Like reference numerals designate corresponding parts throughout the several views.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. Methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure. As used in the specification, and in the appended claims, the singular forms “a,” “an,” “the” include plural referents unless the context clearly dictates otherwise. The term “comprising” and variations thereof as used herein is used synonymously with the term “including” and variations thereof and are open, non-limiting terms. The terms “optional” or “optionally” used herein mean that the subsequently described feature, event or circumstance may or may not occur, and that the descriptiop includes instances where said feature, event or circumstance occurs and instances where it does not. While implementations will be described for estimating a TOF for an acoustic wave for each of a plurality of acoustic receivers in an array located in a borehole, it will become evident to those skilled in the art that the implementations are not limited thereto, but are applicable for estimating a TOF for an acoustic wave for each of a plurality of acoustic receivers in an array located in other environments such as land surface, marine streamer, horizontal and vertical wells, cased and open hole .
Borehole Sonic Logging
Example Environment
Referring now to
The acoustic source 106 can be configured to excite a monopole waveform, for example, a spherical acoustic wave where energy is emitted equally in all directions and the wavefront is perpendicular to the direction of wave propagation. Although monopole waveforms are described in the examples below, this disclosure contemplates that the acoustic source 106 can be configured to excite other types of waveforms, including but not limited to dipole or quadrupole waveforms. It should be understood that the acoustic source 106 is configured to transmit energy (e.g., acoustic waves) into the formation 110. The energy can be characterized by its frequency and wavelength. Optionally, the acoustic source 106 can transmit broadband energy at frequencies between 0.5 and 20 kHz, for example. The transmitted energy can excite compressional, shear, Stoneley, flexural and/or quadrupole waves in the formation 110. Additionally, the array of acoustic receivers 108 is configured to detect the compressional, shear, Stoneley, flexural or quadrupole waves travelling in the drilling fluid 101, for example. It should be understood that the energy transmitted by the acoustic source 106 can be reflected and/or refracted from the fluid-formation interface. The array of acoustic receivers 108 can optionally include a plurality of acoustic receivers (e.g., receivers 108A). In addition, the acoustic receivers can be equally spaced along the borehole axis. The waveform emitted by the acoustic source 106 can be recorded at each of the acoustic receivers of the array of acoustic receivers 108. By arranging acoustic receivers in an array with different spacing from the acoustic source 106, it is possible to improve signal quality and extract various borehole signals over a broad frequency band. In addition, it should be understood that the borehole 102, as well as the acoustic source 106 and the array of acoustic receivers 108, are provided only as examples and are not intended to be limiting.
The acoustic tool (e.g., the acoustic source 106 and the array of acoustic receivers 108) can be operably connected with a control unit 120. It should be understood that the control unit 120 can optionally be located above, on and/or below the surface of the formation 110. Alternatively or additionally, the control unit 120 can be integrated with the acoustic tool and arranged in the borehole 102. The control unit 120 can optionally be configured to control the acoustic source 106 and/or the array of acoustic receivers 108, as well as receive, process and store sonic data (e.g., the acoustic data detected, collect, recorded, etc. by the acoustic receivers 108). In its most basic configuration, the control unit 120 typically includes at least one processing unit and system memory. Depending on the exact configuration and type of control unit 120, system memory may be volatile (such as random access memory (RAM)), non-volatile (such as read-only memory (ROM), flash memory, etc.), or some combination of the two. The processing unit can be a standard programmable processor that performs arithmetic and logic operations necessary for operation of the control unit 120.
For example, the processing unit can be configured to execute program code encoded in. tangible, computer-readable media. Computer-readable media refers to any media that is capable of providing data that causes the control unit 120 (i.e., a machine) to operate in a particular fashion. Various computer-readable media may be utilized to provide instructions to the processing unit for execution. Example tangible, computer-readable recording media include, but are not limited to, an integrated circuit (e.g., field-programmable gate array or application-specific IC), a hard disk, an optical disk, a magneto-optical disk, a floppy disk, a magnetic tape, a holographic storage medium, a solid-state device, RAM, ROM, electrically erasable program read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices.
In addition, the control unit 120 can have additional features/functionality. For example, the control unit 120 may include additional storage such as removable storage and non-removable storage including, but not limited to, magnetic or optical disks or tapes. The control unit 120 may also contain network connection(s) that allow the device to communicate with other devices. The control unit 120 may also have input device(s) such as a keyboard, mouse, touch screen, etc. Output device(s) such as a display, speakers, printer, etc. may also be included. The additional devices may be connected to the bus in order to facilitate communication of data among the components of the control unit 120. All these devices are well known in the art and need not be discussed at length here.
Wave Equations
For an homogeneous medium, Newton's second law of motion is shown by Eqn. (1).
where is the mass density, u is the particle displacement, τ is the stress tensor, and fi is the sum of the external body forces. In the absence of body forces, the homogeneous momentum equation is shown by Eqn. (2).
The displacements induced by seismic waves are very small. Assuming that the medium is isotropic, then the linear stress-strain relationship shown in Eqn. (3) is used.
τij=λδij∇·u+2μeij, (3)
where λ and μ are the Lame parameters. The strain tensor is shown by Eqn. (4).
By substituting Eqn. (3) into Eqn. (2), Eqn. (5) is obtained.
In Eqn. (5), the right hand side involves the gradients of the Lame parameters. These gradients are nonzero as longus the medium is inhomogeneous. However, by solving Eqn. (5) on several layers, each of them being inhomogeneous, it is possible to ignore these terms. Then, the solution in the whole medium is calculated using transmission and reflection coefficients. Another reason for discarding these terms is that their modulus will tend to zero at high frequencies. Within this approximation, Eqn. (5) becomes Eqn. (6).
Eqn. (6) is referred to as the seismic wave equation.
Now, taking the divergence of Eqn. (6), the wave equation corresponding to the compressional waves, or P-waves, shown in Eqn. (7) is obtained.
where α, which is shown in Eqn. (8), is the wave speed.
The particle motion induced by P-waves is parallel to the wave propagation direction, as it is for acoustic waves in fluids or gas. For example, the particle motion induced by P-waves is shown in
where β, which is shown in Eqn. (10) is the S-wave speed.
The particle motion induced by S-waves is perpendicular to the propagation direction. For example, the particle motion induced by S-waves is shown in
Ray Tracing and Snell's Law
Vm is the acoustic wave speed in the borehole mud, VP is the P-wave speed and VS is the S-wave speed. It should be understood that VP>VS, and hence, P-waves are always detected first at acoustic receivers. Generally, VS>Vm, and particular media where Vm>VS are called slow formations.
To investigate the path followed by acoustic waves through both the mud and the formation, it is helpful to use ray tracing. This technique provides good approximation of the real wave propagation path when the wavefronts can be approximated as planes, or when the wavelength is'much smaller than the borehole diameter. A ray is a line which is parallel to the wave propagation direction and which represents the fastest path between two points. At the interface between the borehole mud and formation, Snell's law, which is shown by Eqn. (11), determines the propagation direction of the refracted P- and S-waves.
When the refraction angle is equal to 90°, the refracted waves propagate parallel to the mud-formation interface. According to Huygens principle, every point of the interface, when excited by a P- or S-wave, acts as a secondary source for P- and S-waves, in both the borehole and the formation. The resulting waves are called head-waves.
The last arrival generated by a monopole source and detected by the receivers are acoustic waves which are generated by the Stoneley waves traveling through the formation. Stoneley waves are surface waves that propagate along the mud-formation interface, and whose amplitude decreases exponentially away from the interface (evanescence property). This decay is frequency-dependent, high frequencies decay faster than low frequencies. Also, whereas P- and S-waves in isotropic media are non-dispersive, Stoneley waves are slightly dispersive, which means that their velocity depends on the frequency. Amplitude and speed of low-frequency Stoneley waves are sensitive to the permeability of the formations and to the fractures. Hence, the study of these waves provides information about these two properties of the formation in the neighborhood of the interface.
First Arrival Time-Picking
Provided below are techniques for automatic estimation of first-arrival times on the waveforms recorded by the sonic tools (e.g., the sonic tool described with regard to
When the signal to noise ratio (“SNR”) is high, as it may be the case in wireline sonic logging, estimating times of flight by visual inspection may be an easy task and may provide highly accurate estimations. As well, many automated algorithms have been put forward during the last decades, in order to avoid intervention of any human operator, hence processing rapidly the thousands of waveforms recorded. These automated algorithms can provide as well highly accurate times of flight estimations, as long as the SNR remains high. However, the robustness of these algorithms decreases heavily in the presence of high amplitude noise, or when the waveforms are corrupted with arrivals of undesired waves, e.g., under low SNR conditions such as when data is recorded during LWD operations. Accordingly, the examples below focus on the data recorded by sonic tools during a LWD operation. The waveforms recorded in such a situation experience a very low SNR. The drilling noise may have an amplitude nearly equal to the P-wave amplitude, and some high amplitude tube waves (similar to Stoneley waves) can be generated by some shocks experienced by the sonic tool against the borehole walls. Another undesired wave is the one generated by the transmitter and which travels directly through the collar until the receiver array. Hence, it has a high velocity and arrives before the P-wave, interfering with it and making accurate TOF estimations a difficult task.
Conventional First-Arrival Time Detection Algorithms
The most widely used techniques to pick up first-arrival times use the ratio of energy of the signal within two distinct windows. For example, it is possible to take, at any time, the ratio between the energy of the signal a forward window and a backward window as shown in Eqn. (12).
where xi are the signal samples and nw is the length of the window. The maximum of the energy ratio provides an estimation of the first-arrival time. This technique captures well the first arrival time as long as the SNR is high. However, for low SNR, it may provide biased estimation of arrival times. Hence, several other methods using energy calculations have been put forward.
For example, the short time average, long time average (“STA/LTA”) algorithm computes the ratio of the energy between a short time window and a long time window as shown in Eqns. (13) and (14).
where ns and nl are the sizes of the short and long time windows. Then, as shown in Eqn. (15), a ratio between the STA and LTA is defined.
Finally, the numerical derivative of r is calculated as shown by Eqn. (16).
d
i
=r
i+1
−r
i. (16)
The index for which d is maximum provides an estimation of the first arrival time. This method is one of the most widely used among the seismic processing community.
A modified energy ratio technique is now provided. After calculating the energy ratio between a forward and backward window, a modified energy ratio is calculated as shown by Eqn. (17).
{tilde over (r)}
i=(|xi|·ri)3, (17)
and the peak is closed to the first arrival time.
Another class of algorithms uses autoregressive representations of the signal in order to determine the arrival time. First, a set of data points(x1 . . . xN) which contains the onset of the noise free signal is needed. Then, this sequence is divided in two intervals, one preceding and one including the onset. In each interval, the data is fit with an autoregressive representation of order M as shown in Eqn. (18).
with t=1, . . ., M, for interval 1, and t=N−M+1, . . . , N, for interval 2. The process η is assumed to be white Gaussian, with E((ηt(i))2)=σi2. An arbitraryinteger K such that M+1<K<N−M−1 is then chosen. Since it is assumed that η is white Gaussian, the maximum likelihood function using data from intervals [M+1,K] and [K+1,N−M] can be expressed as shown in Eqn. (19).
where Ωi=(a1(i), . . . , aM(i), σi2), and p1=M +1, p2=K+1, q1=k, q2=N−M, n1=K−M, n2=N−M−K. Maximizing the log-likelihood is shown in Eqn. (20).
The solution is then shown in Eqn. (21).
Going back to Eqn. (19) and substituting these values, the following formula for the log likelihood as a function of K is obtained, which is shown in Eqn. (22).
where C is a constant. The value of K for which the log likelihood function (Eqn. (19)) is maximized provides an estimation of the first-arrival time.
A similar technique first provides an approximate estimation of the first-break. It has been shown to perform very well on high quality data. There are several drawbacks however. First, it needs to provide arbitrary order for the AR, i.e., autoregressive process, processes modeling within the two windows. Second, this algorithm requires the estimations of all the AR parameters, which can be accurately calculated only if enough samples within the window of interest are available. This will not be the case with sonic logging field data, which contain 256 samples on the full signal length.
An entropy method may be used for picking first-arrivals. In the entropy method, the entropy of the signal x, given a time window of length nw is defined by Eqn. (23).
The entropy will be rapidly varying when the time window moves from a noise only part to a signal plus noise part. As a result, it is expected the first-break should occur nearly the maximum of the entropy function derivative. In order to prevent high variability of the entropy, while it preserves enough information about the noise free signal, the size of the time window should be equal to few periods of the signal.
Edge Preserving Smoothing Method
The edge preserving smoothing (“EPS”) method is a powerful tool in order to better estimate the maxima of a numerical derivative, which is needed in both the STA/LTA and entropy methods. Indeed, because of the noise and the irregularity of the signals, the derivatives of either entropy or STA/LTA ratio are highly variable, leading to maxima which do not necessarily correspond to any arrivals.
Consider the EPS algorithm applied to the signal (yi)i=1,N. The only parameter to set is the length of the window, nw. At each point i, consider the nw windows as shown by Eqn. (24).
window(i)j=(yi−n
The value EPS(i) is then chosen as the mean of the window which standard deviation is the smallest as shown by Eqn. (25).
The result of the edge preserving smoothing procedure on the entropy function, together with its numerical derivative and the signal, is shown in
Example Application to Field Data
First-arrival time detection algorithms are applied on the signals recorded at an acoustic receiver array of a sonic tool such as the sonic tool described with regard to
One of the difficulties shared by all the automated first-arrival time detection algorithms is to properly pick up the first break without mistaking with eventual later arrivals. Indeed, in borehole sonic logging, not only are P-arrivals recorded, but also S and Stoneley arrivals. Because S-waves and Stoneley waves are most of the time much stronger than P-waves, first-break algorithms may sometimes output these arrivals instead of the low amplitude P-wave onset. It is possible to reduce or eliminate this problem by dividing the energy ratio, or the entropy function and STA/LTA derivatives, by the energy of the signal within a window of size 2 . . . ne around each point. Actually, given the noisy field data, the best results were obtained by dividing by the power ¼ of the energy. Given for example the entropy function (Hi), the first-arrival time estimations can be obtained as shown by Eqn. (26).
Among the numerous tests performed on the field data, the entropy method exhibits good ability in picking approximately the arrival times corresponding to the P-waves arrivals rather than some random high amplitude noise. Hence, using the entropy method algorithm is a first step toward more robust and accurate times of flight estimations.
Finding and Discarding Outliers
After estimating TOFs, the next step is to find and discard one or more of the TOF estimates that are missing P-wave arrivals, e.g., either detecting some noise before or missing it due to the low amplitude of compressional waves. At each depth, the sonic tool is composed of an array of equally-spaced acoustic receivers, for example, and hence the sonic tool is capable of recording a plurality of waveforms at each of the acoustic receivers. Since the spacing between each receiver is constant, it is expected that the time of flight of the P-wave between the acoustic source and each acoustic receiver should be linear with the index of the acoustic receiver, as long as the hypothesis that the borehole formation has a uniform P-wave velocity along the array length is approximately true. Given this a priori knowledge, the following steps can be taken in order to discard the time of flight estimates that have missed P-wave arrivals.
First, the first-break (first-arrival time, TOF, etc.) at each acoustic receiver can be calculated, for example, using the entropy method algorithm, where the numerical derivative of the entropy function is divided by the energy of the signal as expressed in Eqn. (26). Then, a least-square regression of the first break estimations as a function of the acoustic receiver index can be performed. This regression fits the best line to the entropy algorithm output.
Next, the standard deviation of the difference between the algorithm output and the linear regression can be calculated. For each acoustic receiver, i, if the absolute value of the difference between the estimated first break and the value of the least-squares regression in i is greater than the standard deviation, the corresponding first-arrival estimation is discarded. Then, a new linear regression is performed using the remaining (e.g., non-discarded) first-break estimations.
Next, the entropy algorithm is again performed, with two modifications. First, the entropy calculations are windowed within a window around the values given by the least-squares regression. The window must be small enough to avoid any later arrivals (e.g., S-wave and Stoneley arrivals). Additionally, the numerical derivative of the entropy is not divided by the energy since the risk of estimating a later high amplitude arrival has been reduced. Then, a new linear regression is performed on the results.
Next, the standard deviation of the difference between the new entropy algorithm output and the linear regression can be calculated. For each acoustic receiver, i, if the absolute value of the difference between the estimated first break and the value of the least-squares regression in i is greater than the standard deviation, the corresponding first-arrival estimation is discarded. Then, a new linear regression is performed using the remaining (e.g., non-discarded) first-break estimations. The final result is given either by the remaining values of the entropy method output, or by the linear regression for the waveforms for those TOF estimations were found to be irrelevant along the procedure.
Accordingly, using an outlier detection technique after having performed first-arrival time estimations, e.g., using any known first-arrival time detection algorithm, helps to improve the reliability of the results. By discarding outliers, it is possible to exclude some values which do not correspond to the P-wave arrivals. Then, additional techniques can be applied to the recorded data to achieve better accuracy on the first-arrival time estimations. As described in detail below, one technique is to estimate time delays between the signals recorded at two or more of the acoustic receivers. Since time delay estimation methods may be more robust to the presence of noise, it is possible to improve the accuracy on the times of flight estimations.
Time Delay Estimations
Let x1(t) and x2(t) be the measurement of a signal at two spatially separated sensors such as acoustic receivers, for example. Mathematically, these signals can be represented by Eqn. (27).
where s(t) is a stationary unknown signal and n1(t) and n2(t) are stationary noises at each sensor. s(t) is assumed to be uncorrelated with n1(t) and n2(t), but no assumption is made on the correlation between n1(t) and n2(t). h1(t) and h2(t) represent the impulse response functions of the medium. D is the time delay between the signals at the two sensors. As described above, estimating the time delay D is the objective. Assuming that the medium properties are the same, which is generally the case for seismic and sonic data, Eqn. (27) can be simplified to Eqn. (28).
Cross-Correlation and Generalized Cross-Correlation
An example technique in order to estimate the time delay D is to compute the cross-correlation function, for example, as defined by Eqn. (29).
R
x
x
(τ)=E(x1(t)x2(t+τ)) (29)
where E denotes the expectation. Since it is assumed that s, n1, and n2 are stationary, the cross-correlation function Rx
Practically, assuming that the processes x1 and x2 are ergodic, an expectation can be calculated by taking the time average along a single path as shown by Eqn. (30).
̂Rx
The cross-correlation function can also be calculated using Fourier analysis. Indeed, from the Wiener-Khintchine theorem,
R
x
x
(τ)=E(x1(t)x2(t+τ))=−1(Px
where Px
P
x
x
(w)=E(X1*(w)X2(w)), X1(w)=(x1), X2(w)=(x2), (33)
with F the Fourier transform operator, (f)(w)=∫−∞+∞f(t)e−iwtdt.
Using the signal model described in Eqn. (27), the cross-spectrum can be decomposed in order to see how it is affected by the different terms involved in Eqn. (27),
It can be seen that the estimation of the time delay is colored by the propagation media, the source as well as the correlation of the noise. This latter term may lead to a biased estimation of the time delay, while the two first affect the resolution on the maximum of the cross-correlation function. The effect of the correlation of the noises can also be seen directly on the time domain in the case of identical impulse response functions (equation (29)), since,
R
x
x
(τ)=E(x1(t)x2(t+τ))=Rss(τ−D)+Rn
where it can be seen that, although Rss(τ−D) reaches its maximum in τ=D, the last term Rn
Improving the Resolution of the Maximum
In order to improve the accuracy on the peak of the cross-correlation function, it is desirable to first filter the signals x1(t) and x2(t) prior the integration, with some filtering function G1(w) and G2(w). The functions obtained after filtering, y1 and y2 lead to the calculation of the generalized cross-correlation function Ry
R
x
x
(τ)=−1(E(X1*(w)X2(w))), (36)
and then it is possible to write the generalized cross-correlation function in terms of G1(w) and G2(w),
R
y
y
(τ)=−1(E(G1*(w)X1*(w)G2(w)X2(w))). (37)
The function,
Ψ(w)=H1*(w)H2(w); (38)
denotes the general frequency weighting. Several frequency weightings are known in the art and a few are described in detail below. It should be understood that the frequency weightings described below are provided only as examples and that other known frequency weightings can be used with the techniques described herein.
The smoothed coherence transform (SCOT)
The SCOT method uses the following weighting function,
The complex coherence function γx
where the function U(w) depends on the signal-to-noise ratios Ux
The inverse Fourier transform of the complex coherence function γx
The Roth Processor
The general frequency weighting is now defined by,
The phase transform (PHAT)
The PHAT method uses the weighting,
The Eckart Filer
The Eckart method uses the weighting,
The HT Processor
The HT processor was shown to be a Maximum Likelihood estimator for the time delay.
While all these weighting functions lead to a better resolution of the peak of the generalized cross-correlation function, it is not clear whether they make this peak less or more sensitive to noise.
Higher-Order Statistics
An alternative approach in order to avoid a biased estimation of the time delay due to the correlation of the noises is to use higher order statistics. These, methods are based on the calculation of higher order cumulants. The definition of the joint cumulant function corresponding to a set of random variables X1, . . . , Xn is shown by Eqn. (48),
and its Taylor expansion, using Einstein's summation convention,
The cumulants are then defined by the coefficients of the Taylor expansion,
Calculating analytically the coefficients of the Taylor expansion provides,
where π describes all the possible partitions of the set (1, . . . , k), and |π| is the cardinal of π.
Now, the cumulants functions of a stationary random process (x(t))t∈R are defined by,
As shown by Eqn. (52), cumulants are linked to the moments of a random process. The n-th order moment of a stationary random process x(t), k∈R is defined by,
m
n
X(τ1, τ2, . . . , τn−1)=E(x(t)x(t+τ1) . . . x(t+τn−1)). (53)
For a zero-mean process, the following equations defining the cumulants of order 2, 3 and 4 are obtained.
c
2
x(τ)=m2x(τ)
c
3
x(τ1, τ2)=m3x(τ1, τ2)
c
4
x(τ1, τ2, τ3)=m4x(τ1,τ2, τ3)−m2x(τ1)m2x(τ3−τ2)−m2x(τ2)m2x(τ3−τ1)−m2x(τ3)m2x(τ1−τ2) (54)
Going to the Fourier domain, the definitions of the power spectrum, the bispectrum, and the trispectrum are shown by Eqn. (55).
The power spectrum, the bispectrum and the trispectrum can also be expressed in terms of the Fourier transform of the random process x, X(w)=F(x)(w),
Ĉ
2
x(w)=E(X(w)X*(w)),
Ĉ
3
x(w1,w2)=E(X(w1)X(w2)X*(w1+w2)),
Ĉ
4
x(w1,w2,w3)=E(X(w1)X(w2)X(w3)X*(w1+w2+w3)). (56)
An important function, the bicoherence, which is the normalized bispectrum, is defined by Eqn. (57).
When dealing with two or three different random processes, useful functions are the cross-spectrum, the cross-bispectrum and the cross-bicoherence. Let x, y and z be three stationary random processes, these functions are defined (in the same order as mentioned) by Eqn. (58).
The main advantage of using higher order statistics is that cumulants of order higher than 2 are equal to zero when the random process has a symmetric probability density function. Hence, dealing for example with the important case of Gaussian noise, the use of higher order statistics greater than 2 may allow to avoid the biased delay estimation due to the correlated noise.
Time delay estimation with third order bicoherence-correlation
It is possible to deal with the problem of time delay estimation between two signals x1 and x2. For example, it is possible to being with the definition of the cross-bicoherence, which is the normalized cross-bispectrum, as shown by Eqn. (59).
The method for estimating the time delay using this function is known in the art. First, the bicoherence ratio Γx
Then an integration is performed along the real axis in the second variable, as shown by Eqn. (61).
Λx
Then, the inverse Fourier transform is applied to obtain the bicoherence correlation τx
τx
The time delay estimation is then provided by the time at which the bicoherence correlation reaches its maximum, as shown by Eqn. (63).
{circumflex over (D)}=argmaxττx
As done previously for the cross-spectrum function and the complex coherence function, it is possible to decompose the bicoherence ratio, as shown by Eqn. (64).
As for the complex coherence function, the effect of the media are normalized. Moreover, if the two measurements are performed in similar noise environments, then U(w1)=1, and the resolution of the peak will be acceptable. Thus, using higher order statistics, the term corresponding to the correlation of the noises has vanished.
Parametric Bispectrum Method
Described below is another interesting method, which uses third order statistics but without the need to compute the bispectra.
From Eqn. (28),
s(n−D)=x1(n−D)−n1(n−D), (66)
and then,
x
2(n)=x1(n−D)−n1(n−D)+n2(n), (67)
which can be written in a more aeneral form.
where P is an integer which is greater than D. Multiplying Eqn. (68) by x1(n) and x1(n+) and taking expectation, Eqns. (69) or (70) is obtained.
E(x1(n)x2(n+τ)x1(n+ρ))=Σi=−PaiE(x1(n)x1(n−i+τ)x1(n+ρ))−E(x1(n)n1(n+τ−D)x1(n+ρ))+E(x1(n)n2(n+τ)x1(n+ρ)), (69)
or,
Assuming that the signals and the noises n1 and n2 are independent, and since the noises are Gaussian, the functions Rx
Writing Eqn. (71) for several values of g and r, a linear over-determined system as shown by Eqn. (72) is obtained.
Rx
and the least squares solution of this linear system is,
A=(Rx
Then, the time delay D can be chosen as the value of i for which |ai| is maximum.
Time Delay Estimations Between N Signals
Now, N different signals, which are delayed versions of a source signal s plus an additive noise which is uncorrelated with s, are given by Eqn. (74).
The goal is to find the time delays between the signals x2, . . . ,xN and the reference signal Accordingly, the goal is to find the time difference between the reference signal x1 and at least two of the other signals. In other words, the goal is, to find the N−1 delays (D1, . . . , DN−1).
Basic Technique
The most straightforward way to achieve this challenge is to estimate the delays between two consecutive signals using one of the techniques described above. Hence, as soon as the delays d1=delay(x1,x2), . . . , dN−1=delay(xN−1,xN) are obtained, it is easy to deduce the values (D1, . . . , DN−1). Indeed, trivially,
Beamforming
A different approach from the ones described above is to use a technique called beamforming. This technique includes applying a set of time shifts to the waveforms array then summing the waveforms. The time shifts which lead to the sum which has the highest L2 norm provides an estimations of the times delays (D1, . . . , DN'11). Indeed, the L2 norm is maximum when the waveforms are aligned. Beamforming has been shown to be an optimum procedure when the noises which are added to the delayed source signal are white Gaussian.
Referring to {(Δτ0(θ), . . . , ΔτN(θ)), θ∈A, A ⊂ } as the set of time shifts that apply to the waveforms array, with Δτ1(Θ)=0, then the desired value of Θ is obtained by Eqn. (76).
Then ({circumflex over (D)}1, . . . , {circumflex over (D)}N)=(Δτ2({circumflex over (θ)}), . . . , ΔτN({circumflex over (θ)})), is obtained. By expanding the inner sum, it may be interpreted as varying the parameters 0 to shift and sum the cross-correlation between all possible pairs of waveforms,
where rl,j is the cross-correlation between waveforms l and j,
There are two disadvantages to the beamforming technique. First, it is computationally expensive, since the set A in which the parameter Θ belongs to may be very large. For example, if, for each waveform, it is desired to try integer shifts varying in [−K,K], then card(A)=(K+1)N−1, which means that the program has (K+1)N−1 L2 norms to calculate. Secondly, this technique may lead to biased delay estimations.when the noises are not white Gaussian.
Linear Regression
This technique consists in first calculating the delays between all the possible pairs of waveforms. Hence, for an array of N waveforms,
delays are calculated. If eij is the delay between the ith and jth waveforms, this delay is calculated by finding the maximum of the cross-correlation function,
From this estimation of the pairwise delays êij, an estimation of the parameters Δτl corresponding to the problem is desired, e.g., as shown by Eqn. (80).
x
l(t)=s(t−Δn)+nl(t), l=1, . . . , N. (80)
As described above, (Dl−1=τl, (D0=0) was defined. The linear relationship between the parameters eij and Δτl can be determined as shown by Eqn. (81).
ê
ij=Δτj−Δτi. (81)
Writing the vector e=(e12,e13, . . . ,e1N,e2N, . . . ,e(N−1)N), and the vector Δτ=(Δτ1, . . . , ΔτN), the following linear equation is obtained.
e=AΔτ, (82)
where,
An estimation of vector Δτ is obtained by solving this over-determined system by the least squares method,
Δτ=(ATA)−1ATe. (84)
Linear regression is much faster than beamforming all the waveforms. However, an error on a delay estimation between a pair of waveforms will propagate, which leads to biased estimation of all the delays (Δτ1, . . . , ΔτN). Pairwise time delay estimation via maximization of the cross-correlation function is efficient when the noises are uncorrelated. Nevertheless, when noises appear to be correlated, it is better to use high order statistics, as described above. Hence, instead of calculating cross-correlations, the estimations êij are obtained by finding the maximum of the bicoherence-correlation function defined by Eqn. (85).
Computing the bicoherence-correlation function is longer than computing the cross-correlation function, but it leads to better estimation of delays as long as the noises are symmetrically distributed.
Basic Technique Versus Linear Regression
The time delay estimations of a set containing N waveforms were considered as a random experiment. Hence, the error of each method was calculated by calculating the variance of the time delay estimations. Calling X the random variable which gives the result of a pairwise time delay estimator, without consideration of the method, it is possible to use either cross-correlation or bicoherence-correlation. Assume that the variance of X, Var(X), is independent of the true time delay which is equal to E(X).
The goal is to estimate the variance of each component of the time delays vector. D=(D1,DN−1) . The basic method uses the simple linear relationship,
where, from the assumption, Var(d i)=Var(X), ∀i. Hence, it is possible to calculate,
The above method may not be acceptable since the uncertainty on the time delays with respect to the reference waveform is proportional to the waveform index. Accordingly, for comparison, the variance of the vector D is calculated with the linear regression method described above.
In order to calculate the variance of each component, Var(Dk), k=1, . . . , N−1 ,the diagonal terms of the matrix (ATA)−1 are calculated. Careful calculation provides,
With the linear regression method, the uncertainty on each component of D is independent of the waveform number k. Additionally, this uncertainty is much smaller as compared to the basic method. Thus, the linear regression method can optionally be preferred when trying to estimates the N−1 delays relative to the first waveform.
Hybrid Regression-Beamforming
As described above, the linear regression method is computationally faster than beamforming all the waveforms. However, any biased estimations in the pairwise correlations may propagate leading to biased estimations of the delays Δτ. In order to get better robustness with results which are less sensitive to such errors at the first step of the algorithm, it may be desirable to use more time delay estimations between waveforms before inverting the over-determined system. Thus, instead of considering the time delays between all pairs of waveforms, the delays of all subset of waveforms of size P can be calculated. To calculate the delays of each of the
subsets of waveforms, the beamforming technique described above can be used. Accordingly, this method is referred to as “Hybrid regression-beamforming”.
Let k∈(1, . . . , NP), beamforming each subset of size P provides us the delays Δτp={(Δτ1(k), . . . , (ΔτP(k)), k∈[[1,NP]]}, and then, an over-determined system can be solved where the matrix AP has known entries,
The larger the subsets are, the more robust is the method, while increasing the computational complexity. Hence, a hierarchy of algorithms is possible. A way to achieve a proper time delay estimation is then to start with the simplest procedure, the linear regression with cross-correlation calculations, and evaluate the model error.
Performance Analysis Using Synthetic Data
In order to analyze the performance of the different techniques described above, Monte-Carlo simulations were performed on synthetic data. For each run, the uncorrupted delayed signal at eight acoustic receivers remains constant, while the noise is generated randomly, but always with the same correlation properties.
The Uncorrupted Waveform
First, an array of 8 identical waveforms (e.g., waveforms N=1, 2 . . . 8) was created, where each of the waveforms was composed of 256 samples. The reference waveform is a Ricker wavelet convolved with a FIR filter. The Ricker wavelet is defined by,
r(t)=(1−2(πft)2)e(−πft)
Generation of Spatially and Temporally Correlated Noise
N stationary noises (n1, . . . ,nN) are then generated in order to add them to the initial waveform described above. The computation is performed as follows.
First, the two following vectors were created,
a=(1,−0.7, 0.4, 0.1), (94)
b=(b1,b2,b3,b4,b5), (95)
where the elements bi are random variables, independent and Gaussian with zero mean and unity variance. The correlated noise is then generated via an auto-regressive-moving-average (“ARMA”) process,
a
1ν(1)(n)+a2ν(1)(n−1)+a3ν(1)(n−2)+a4ν(1)(n−3)=b1s(1)(n)+b2s(1)(n−1)+b3s(1)(n−2)+b4s(1)(n−3)+b5s(1)(n−4), (96)
where the process (s(n))n∈N is a white Gaussian noise. It is clear that the process v est zero-mean since there is no deterministic component on the right hand side of the equation. Moreover, the roots of the polynomial P(X)=a1+a2X+a3X2+a4X3 are (r1,r2,r3)=(−5.6, 0.8+1.1i, 0.8−1.1i) which the modulus are all strictly bigger than 1, |ri|>1. It follows that the process v is stationary.
Then, a set of three other identical ARMA processes using vectors are generated,
c=(1,−1.75, 1.4,−0.6), (97)
d=(d1, . . . , d10), (98)
where the elements di are random variables, independent and Gaussian with zero mean and unity variance.
where s1, s2 and s3 are independent white Gaussian noises. Again, the processes η1(1), η2(1), and n3(1) are clearly zero-mean. Moreover, the roots of the polynomial P(X)=c1+c2X+c3X2+c4X3 are (r1,r2,r3)=(1.1, 0.6+1.1i, 0.6−1.11) which the modulus are all strictly bigger than 1, |ri|>1. It follows that these processes are stationary. Finally, the first noise process is obtained by Eqn. ('100).
A linear combination of stationary processes is also stationary, which means that n1 is stationary.
To get the other noises nl, l=2, . . . ,N by calculating v(l),
a
1ν(t)(n)+a2ν(l)(n−1)+a3ν(l)(n−2)+a4ν(l)(n−3)=b1s(l)(n)+b2s(l)(n−1)+b3s(l)(n−2)+b4s(l)(n−3)+b5s(l)(n−4), (101)
and η1(l), η2(l), η3(l) from η1(l−1), η2(l−1), η3(l−1)
then, nl is calculated from,
Again, it is clear that all the processes nl, l=2, . . . , N are all stationary.
Results
1. Basic method using cross-correlations (i.e., “correlation” in
2. Basic method using bicoherence correlation (i.e., “bicoherence” in
3. Linear regression with cross-correlation (i.e., “linear with correlation” in
4. Linear regression with bicoherence(i.e., “linear with bicoherence” in
5. Hybrid regression-beamforming with P=2 (i.e., “linear with beamforming” in
In particular, 500 Monte-Carlo simulations were run in order to calculate the mean and the standard deviation of the time delays estimations. The experiments were performed for SNR=3, 7 and 9 dB.
In the case SNR=9 dB, very acceptable time delays estimations were obtained using techniques 1 to 4 above. As shown in
pairwise delays failed to provide consistent results. The reason is due to the spatial correlation of the noises which entails that the L2 norms calculated in Eqn. (76) do not reach their maximum when the waveforms are shifted by their exact time delays. In the case SNR=7 dB, acceptable results are also obtained, as the relative error for techniques 1 to 4 remain below 0.3. Again, similar to
Moreover, the techniques using higher order statistics (e.g., basic with bicoherence (technique 2 above) and linear regression with bicoherence (technique 4 above)) perform better than techniques using cross-correlation (basic with cross-correlation (technique 1 above) and linear regression with cross-correlation (technique 3 above)), as it was expected from the theoretical studies which showed that cross-correlations based techniques are sensitive to the spatially correlated noises, while high order statistics based techniques are not.
From the plots of the mean time delays estimations, it is not clear whether the techniques using linear regression perform better than basic methods which just add the pairwise delays between two adjacent waveforms. Indeed the means of linear regressions are approximately equals to the means of basic methods. To investigate deeper the performance of each method, the standard deviation of the results obtained with the different techniques can be plotted. For example,
In order to improve the accuracy of the results, it is desirable to reduce the size of the part of the signal on which the above techniques are applied. Once the window size and position are made, the values of the waveforms which are outside the window can be set to zero or flag value.
Accordingly, the data of the waveforms which are outside the window can be disregarded, ignored, or otherwise not considered in the calculations. For example,
Delays Between P-Waves
The following description focuses on the waveforms recorded during borehole sonic logging as opposed to synthetic data. As described above, each acoustic receiver in an array may highlight several arrivals. In a fast formation, it may be possible to distinguish most of the time P-wave, S-wave, and Stoneley wave arrivals. Since different types of waves have different speeds in the formation, they all have different delays at the acoustic receivers. However, the time delay techniques described herein are only capable to estimate a single delay between two respective waveforms. It is therefore desirable to find some time windows where the signal at each receiver is mainly composed of a single type of wave. The automated arrival-time detection algorithms described above provide an approximate P-wave arrival times. Hence, choosing a time window around P-wave arrival times ensures that the waveform within that window corresponds to the P-waves. Practically, the time length of the entire waveform can be preterved, but data outside the window is set to zero or a flag value, similar as discussed above. Once the waveform at each acoustic receiver is properly windowed, it is possible to apply any of the techniques providing time delay estimations between N signals. As described above, the linear regression with bicoherence-correlation is optionally preferred due to its better accuracy in presence of correlated noise.
Improving TOF Estimations Using Time Delays
Time delays and times of flight are linked by the fact that the delay between P-waves at two receivers is equal to the difference of the first-arrival times at these receivers. Hence, this link can help us in improving the accuracy on times of flight estimations, all the more that time delay estimations are more robust to noise than any arrival-time detection technique.
As used herein, TOF=(TOF1, TOFN) are the true times of flight at acoustic receivers 1,N, and ̂TOF=̂(TOF1,̂,TOFN) are the times of flight (TOF) calculated by the first-arrival time detection algorithms described above. Also, D=(D1,DN−1) are the actual time delays between the reference signal (or waveform) at acoustic receiver 1 and the signals (or waveforms) recorded at the receivers i=2,N, and D=(D1,̂,DN−1) are the time delays calculated using the beamforming algorithm described above. Thus, Di=TOFi+1−TOF1, i=1N−1, which can be written in a matricial form,
D=A·TOF. (104)
The estimated times of flight TOF and delayŝD verify the following equations,
where ξ1 and ξ2 are the errors on the algorithm estimations. As previously encountered when estimating N delays from all the possible pairwise delays, an over-determined system is encountered, e.g., with more equations than unknowns. The difference is that the accuracy on each equation is not uniform since time delay methods are more accurate than first-arrival time detection algorithms. This over-determined system can be solved by finding the most probable solution given some distribution on the error terms.
The joint a posteriori probability distribution of the times of flight TOF, knowing the algorithms outputŝTOF ard D, is given by,
Since the function P(D|TOF) is independent of TOF,
P(TOF|TÔF, {circumflex over (D)})∝P({circumflex over (D)}|TOF, TÔF)P(TOF|TÔF), (107)
Assuming that the error ξ1 is a vector of N−1 independent identically distributed Gaussian random variables, with zero mean and variance σ1, it is possible to calculate the first term of the product in the right hand side of Eqn. (107) as shown by Eqn. (108).
Making the same assumption for the error ξ2, i.e. it is a vector of independent identically distributed Gaussian random variables, with zero mean and variance σ2, it is possible to calculate the second term of the product as shown by Eqn. (109).
Then, maximizing the function as shown by Eqn. (110)
will provide the best estimation of the times of flight TOF, given the output of both the first arrival time-detection algorithm and the time delay algorithm. Finding the maximum of f is equivalent to find the minimum of,
The optimal times of flight TOF must satisfy the first derivative necessary condition of local minimum,
This is a linear system which can be solved analytically,
Hence, the solution to the first derivative necessary conditions is unique, providing a unique candidate as a global minimum. Since the function h is coercive, it is possible to deduce that this is indeed the global minimum. The two unknown parameters σ1 and σ2 can be assessed by evaluating the variance of the results of the first arrival time-detection and time delay algorithms on a sufficiently large set of field data.
From the tests, it appeared that the time delay algorithm is much more robust than the first-break algorithm. Hence, in order to highlight the improvement on times of flight estimations thanks to the time delays, it is possible to consider the simpler problem wherêD=D,
where ξ is a vector of independent identically distributed Gaussian variables with zero mean and variance σ. It is desirable to maximize,
under the constraint Di=TOFi+1−TOF1, . . . , i=1, . . . , N−1. Substituting the constraint in Eqn. (116), Eqn. (117) is obtained.
Maximizing P(TOF↑TOF,D) as a function of the single variable TOF1 provides,
This expression is very natural since TOF1 is calculated as a mean over all the estimated time of flights delayed by the true delays, so that all the random variableŝTOF1 and ̂TOF{circumflex over (−)}Di−1, i=2,N, have the same mean, the true time of flight TOF1, and the same variance. Accordingly, it is possible to deduce that the variance of the estimato{tilde over (r)}TOF1 is reduced by a factor N compared to the first arrival time-detection algorithm. It is also possible to find this expression by setting σ1=0 in the right hand side of Eqn. (114).
Two-Dimensional (2-D) Filtering
Optionally, the waveforms recorded by the array of acoustic receivers (e.g., the array of acoustic receivers 108 shown in
For all these reasons, filtering each waveform with a band-pass frequency filter is not the best solution for helping in capturing times of flight. Although it may enable to increase the signal to noise ratio, it is however inefficient in improving the ability of either a human operator or an existing algorithm to estimate accurately first-arrival times. Hence, we need to find a better way to filer the waveforms, which can better discriminate.P-waves from drilling noise and other unwanted waves. As described above, one of the difference between P-waves and other waves (e.g., collar waves, S-waves, random tube waves, etc.) is their slowness. Hence, a filter which could be able to perform a band-pass filtering on the slowness rather than on the frequencies may be much more efficient than the basic frequency filtering. But considering the waveforms one by one does not provide any information about the slowness of the recorded waves. However, by using the whole set of waveforms provided by the array of acoustic receivers, it is possible to obtain information about their slowness. Accordingly, a more efficient filtering based on slowness can be to consider a set of waveforms as a two dimensional (“2-D”) time-space data array. In order to perform such a two-dimensional filtering, two example methods based on some 2-D dimensional transformations and their inverse are presented. However, it should be understood that this disclosure should not be limited to using F-K filtering or the Radon transform and that other known 2-D filtering algorithms can be used.
F-K Filtering
The classic frequency decomposition for a plane progressive wave is the following,
P(t, x)=∫−∞+∞A(w)ei(wt−kx)dw (119)
where A is the amplitude of each frequency component, w is the pulsation (w=2πf), and k is the wave-number. Substituting such an expression in the wave equation provides a relation between the pulsation and the wave-number,
where vφ is called the phase speed of the wave. Defining the slowness as the inverse of the phase speed,
A common 2-D transformation where this particular relationship between wave-number and pulsation is the two-dimensional Fourier transform, also called the F-K transform, which can be defined as the following,
{circumflex over (P)}(w, k)=∫−∞+∞∫−∞+∞P(t,x)e−i(wt−kx)dkdw (122)
and the corresponding inverse 2-D Fourier transform,
P(t,x)=∫−∞+∞∫'1∞+∞{circumflex over (P)}(w,k)ei(wt−kx)dkdw (123)
Identification between the 2-D inverse Fourier transform and the progressive wave frequency decomposition provides,
{circumflex over (P)}(w, k)=A(w)δ(k−sw), (124)
where s is the slowness of the wave. Hence, on the F-K plane, each different type of wave will be located on the curve k=sw. If the waves are non-dispersive, then their slowness is independent of w, and the non-zero Fourier coefficients are located on straight lines which slope is equal to the waves' slowness. All these remarks are valid for either the collar-wave, P-wave, S-wave or any tube wave generated by some shock of the tool against the borehole walls. However, considering drilling noise as a random process without other assumptions, it may be spread on the entire F-K plane. It is then possible to perform an efficient filtering by first zone around the line k=Sp·w. It is then possible to set to zero all the Fourier coefficients outside this predefined zone. It is then possible to perform an inverse 2-D Fourier transform in order to go back to the time-space domain and get the filtered waveform set. This procedure should discard all the undesirable waves while preserving all the information about the P-wave, as well as it may remove most part of the drilling noise.
There are, however, a few limitations which make 2-D filtering less efficient than what the above theory was predicting. The first limitations are due to the fact that the waveforms are finite length, sampled signals, rather than some signals defined continuously on the entire R2 plane. Hence, what is computed using the 2-D fast Fourier transform is an approximation to the 2-D Fourier transform of such an ideal signal. Hence, there may be some aliasing in the wave-number,domain due to the spacing of the acoustic receivers which is not small enough. Additionally, the finite length nature of the recorded waveform entails that each wave in the F-K plane is not located exactly on the straight lines k=sw, but rather in a certain zone around such lines. Indeed, multiplying a signal defined for all times by a time window makes its Fourier transform convolved by the Fourier transform of the window. Hence, choosing the width of the filtering zone in the F-K plane is a matter of balance between keeping most of P-wave information while discarding other waves and drilling noise.
Radon Filtering
Another two dimensional transform which is also widely used in seismic processing as well as image processing, is called the Radon transform. In the specific case of seismic processing, it is sometimes referred to the taup transform. It is defined as the following,
V(τ, p)=∫−∞+∞P(t=τ+hp, x)dx (125)
As its mathematical definition shows, the Radon transform is the calculation of the sum of the signal amplitude along lines which slopes are equal to h. Since a waveform with a specific slowness at each acoustic receiver will be approximately equal to the waveform recorded at the first receiver shifted from a time length proportional to the space between the acoustic receivers, it should be understood that the Radon transform is an appropriate 2-D transform to highlight the presence of non-dispersive waves by looking at the amplitude of the Radon coefficients. Moreover, since there exists an inverse Radon transform, it is also possible to perform the same kind of slowness filtering as was performed using the F-K transform. Again, similar to F-K transform, the computation is performed on finite length sampled signals, which is an approximation of the Radon transform, called the Discrete Radon Transform. As well, since the Radon coefficients are calculated for a finite range of slopes h, it is only possible to compute an approximation to the inverse Radon transform.
The filtering process is similar to the F-K filtering. First, the Radon transform of the whole waveform set is computed. Hence, a rectangular zone defined by h∈[Sp−α,Sp+α] is defined and all the Radon coefficients outside this predefined zone are set to zero or a flag value. Then, an inverse Radon transform is performed to get the filtered waveforms set.
P-Wave Slowness Determination
Either F-K filtering or Radon filtering need to define a zone where most of the P-wave information is located in the F-K or Radon domains, respectively. As described above, these locations are related to the P-wave slowness. Hence, to perform these 2-D filtering techniques, the approximate P-wave slowness should be known/determined. P-wave slowness can be determined from the time delays calculated using the techniques described above. For example, the time delays relative to the waveform recorded at the first receiver are linked to the P-slowness by, D(x)=Sp·x, (126), where x is the position of each acoustic receiver in the acoustic array. Hence, from the time delay estimations, it is possible to deduce an approximate P-slowness by performing a least-square regression on the time delay estimations and then calculating the slope of the linear fit.
Example Operations
Referring now to
Referring now to
At 2308, a plurality of time delays for the acoustic wave can be calculated using the non-discarded estimated TOFs, for example, using a high-order statistical algorithm such as a bicoherence correlation algorithm described above. As described above, the time delays are delays between P-wave arrivals at each of the acoustic receivers in the acoustic array relative to,the first acoustic receiver in the acoustic array. Optionally, the high-order statistical algorithm can be applied after windowing the recorded data around each of the non-discarded estimated TOFs. By windowing the recorded data, it is possible to ignore data corresponding to S- and Stoneley wave arrivals, for example. Then, the estimated TOFs for the acoustic wave at each of the acoustic receivers in the acoustic array can be updated based on the calculated time delays. This can be accomplished, for example, by solving Eqn. (114) described above. In addition, it is possible to determine P-wave slowness based on the updated TOFs for the acoustic wave at each of the acoustic receivers in the acoustic array. This can be accomplished by performing a least-squares regression on the updated TOFs and calculating a slope of a best-fitting line as described above. At 2310, a determination is made as to whether an absolute value of the difference between the P-wave slowness value for a current iteration (i.e., Sp(i+1) in
Referring now to
Application of the TOF Estimates
Two example applications for the times of flight estimations are described below. The first one is the determination of the P-wave slowness, and the second one is the estimation of the borehole diameter.
P-Wave Slowness Estimation
As described above, the times of flight are linear as a function of each receiver position,
TOF(x)=TOF(0)+Sp·x. (127)
Hence, from the times of flight estimations provided by the algorithm output, it is possible to deduce the P-wave slowness by performing a least-square regression and calculating the slope of the linear fit.
For example, this technique was applied on example D&M data from a well in Australia.
Mud Slowness and Borehole Diameter Estimations
Times of flight estimations (e.g., according to the techniques described herein) can be used in order to determine the borehole diameter. An example technique is described below. Consider a borehole of diameter d filled with a fluid in which acoustic waves propagate with the slowness sf. The borehole is surrounded by few cylindrical layers, each layer i having a thickness Hi and a P-wave slowness Si. Using the ray tracing theory described above, it is possible to calculate the minimum transmitter-receiver spacing xi for a P-wave to be able to reach the receiver after having traveled into the formation until the layer Hi.
where s is the standoff, which is the distance between the sonic tool and the borehole walls. It is linked to the borehole diameter d by,
Hence, at a receiver located in Xi such that xi<Xi<xi+1, i P-wave arrivals will be recorded, each having a time of flight,
The first arrival time TOF recorded at this receiver corresponds to the P head wave which has the shortest time of flight, TOF=min(TOFl, l=1,i). Radial profiling can provide estimations of the layer thickness Hi and the corresponding velocities Vi. Noting m=(sf,d) and TOF=(TOF1,TOFN), then consider the times of flight as a function of these only two variables, TOF=G(m). An equation which involves both the times of flight estimations and the borehole diameter can be found. Actually, this is a set of equations which are all linked. However, a second unknown, the mud velocity sf, makes this system unsolvable if no further information involving the borehole diameter or the mud slowness is known. For the mud slowness, an approximate value provided by the mud supplier can be used. As well, the borehole diameter should be approximately equal to the bit-size. The ray tracing equation, together with the mathematical transcription of the last statement, can be summarized in, a non-linear system of three equations for two unknowns.
where ŝf is the mud slowness provided by the supplier and ξs
Accordingly, there are three independent equations for only two parameters. There are also uncertainties on all the estimated parameters involved in each equation. In order to get the solution to this inverse problem, (sf,d), which fits the best with all the known data, i.e., the bit size, the mud approximate mud slowness ŝf and the estimated time of flights at each receivers TOF, it is possible to solve this problem by maximizing the joint a posteriori distribution of the model (sf,d), to fit with the available data. This statistical optimization problem is similar to the one solved above to improve the accuracy on times of flight estimations using the estimated delays by solving an over-determined system. For example,
where independence between some variables is taken into account. The objective is to maximize this function in the variables sf and d. Since the denominator does not depend on these two parameters, the object is to maximize,
f(sf, d)=P(TOF|sf, d)P(sf|ŝf)P(d|{circumflex over (d)}), (133)
The calculation of the three terms in the right hand side of Eqn. (133) is described below.
Expression of the Likelihood Function
The first term P(TOF|sf,d) can be calculated in assuming that the error vector on the estimation TOF is composed of independent identically distributed Gaussian variables, with zero mean and variance σ. Hence, similarly as described above,
Expression of the Mud Slowness Probability Distribution
A log-normal distribution was suggested for the mud slowness probability distribution,
The parameters μ and λ can be found from the mean and the variance of the mud slowness distribution, respectively equal to ŝf and σs
Expression of the Diameter Probability Distribution
A Gaussian distribution was suggested for the diameter probability distribution.
Such a distribution may seem to be inappropriate since the borehole diameter cannot be smaller than the bit sized. The easiest way for avoiding this paradox is to simply maximize the function f on the constrained set
Maximization of the function f is equivalent to look for the maximum of,
Finding the suitable couple (s{tilde over (f)},d) cannot be performed analytically, since the function h is not a simple quadratic function. However, many algorithms are known to solve such a constrained (d≧d) optimization problem.
Validation
First, it is desirable to verify whether the techniques for estimating borehole diameter and mud slowness described above are valid. Particularly, it is desirable to verify if the solution to the optimization problem described above results in relevant estimations of the borehole diameter or mud slowness S. Hence, the problem can be solved using synthetic times of flights, which are calculated by ray tracing with the formula shown by Eqn. (130). For the mud slowness, the mean ŝf is chosen, whereas for the borehole diameter, the mean value provided by the two calipers used during wireline logging of the well is chosen.
Solutions to the optimization problem for a given range of depth are shown in
Application to Field Data
Described below are comparisons between the borehole estimations according to the technique described above and other known techniques for estimating the borehole diameter.
The techniques described herein managed to achieve high accuracy and robustness in estimating times of flight on D&M acoustic monopole data experiencing a very low signal to noise ratio. This have been possible due to some procedures such as the outliers detection, as well as more complex signal processing tools such as the bicoherence-correlation function. As described above, each step of the technique may be requirement for the following step of the technique to process the data. Outlier detection enables the ability to discard the first-arrival estimations which missed P-wave arrivals, which is aids in properly windowing the waveforms and be sure to process only P-waves and then calculate the relevant delays. While delays are useful to improve times of flight estimations thanks to-the robustness to correlated noise of the bicoherence-correlation, the delays also provide an estimation of the P-wave slowness. This value can be used to perform a two dimensional slowness filtering by the use of either Fourier or Radon transform. This type of filtering can be very useful, as it increases greatly the signal to noise ratio and makes highly accurate times of flight detection possible.
Deducing compressional slowness from times of flight is straightforward, and the techniques described above managed to provide a log very close to ones obtain using the standard Slowness Time Coherence algorithm. This highlights the reliability of the techniques since such accurate slowness estimations could not be performed if the first-arrival times were not calculated with high accuracy.
Example applications for the estimated TOFs according to the techniques described herein, e.g., the borehole diameter and mud slowness determinations, are also possible. Borehole diameter and mud slowness estimations were conducted on real data, which demonstrated that the combination of the accurate and robust first motion detection techniques described herein coupled with a Bayesian inversion allows to provide a reliable estimate of the borehole diameter as well as mud slowness. Application to real data and comparison with other tools used to estimate hole diameter illustrated the efficiency and robustness as described above.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
This application claims priority to U.S. Provisional Patent Application Ser. No. 62/025,479, which was filed on Jul. 16, 2014, and is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2015/001195 | 7/16/2015 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62025479 | Jul 2014 | US |