The present invention relates generally to methods and systems for inverting seismic data to compute physical properties of the earth's subsurface, and in particular methods and systems for performing phase-only full waveform inversion to compute velocity models from seismic data.
Subsurface exploration, and in particular exploration for hydrocarbon reservoirs, typically uses methods such as migration of seismic data to produce interpretable images of the earth's subsurface. In areas where the subsurface is complex due to faulting, salt bodies and the like, traditional migration methods often fail to produce adequate images. Additionally, traditional migration methods require a reasonably accurate velocity model of the subsurface; such velocity models may also be determined from the seismic data but may be very expensive in both expertise and computational cost.
There are many conventional methods for computing velocity models from seismic data, including NMO velocity analysis, migration velocity analysis, tomography, and full waveform inversion. Some methods, such as full waveform inversion, are very computationally expensive and have only recently become practical as computing power has increased. Conventional full waveform inversion is done in the time domain or in a transform domain such as the temporal Fourier transform domain or the Laplace transform domain. These methods often fail due to the lack of low frequencies, typically less than 3 Hertz, in seismic data. As one skilled in the art will appreciate, a velocity model is a low frequency model so it is difficult to invert for it from the seismic data that lacks the low frequency information.
Traditional methods of determining velocity models and using them for migration to produce images of the earths subsurface are expensive and fraught with difficulties, especially in complex areas. As the search for hydrocarbons moves to these complex areas, it is necessary to find better ways to process the seismic data and improve velocity models.
According to one implementation of the present invention, a computer-implemented method of determining properties of a subsurface region is disclosed. The method includes obtaining actual seismic data representative of the subsurface region and an initial earth property model for the subsurface region, performing forward modeling using the initial earth property model to create modeled seismic data with similar acquisition specifications as the actual seismic data, transforming the modeled and actual seismic data to a temporal Fourier frequency domain to create frequency domain modeled and actual seismic data wherein the frequency domain modeled and actual seismic data include an amplitude portion and a phase portion, measuring the misfit between the frequency domain modeled seismic data and frequency domain actual seismic data to produce frequency domain residual seismic data, performing phase unwrapping of the phase portion of certain observed frequency components of the frequency domain residual seismic data to create an unwrapped residual phase portion, and inverting the unwrapped residual phase portion to determine desired properties of the subsurface region of interest, wherein the inverting minimizes an objective function defined to measure the misfit.
The method may also include phase extrapolation. Additionally, the method may perform a second inverting step using the properties determined from the first inverting step as an initial model.
In another embodiment, a system for determining properties of a subsurface region is disclosed. The system includes a data source, a user interface, and a processor configured to execute computer modules designed to perform the method.
In another embodiment, an article of manufacture for determining properties of a subsurface region is disclosed. The article of manufacture includes a computer readable medium having a computer readable code embodied therein, the code being configured to execute the method.
The above summary section is provided to introduce a selection of concepts in a simplified form that are further described below in the detailed description section. The summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Furthermore, the claimed subject matter is not limited to implementations that solve any or all disadvantages noted in any part of this disclosure.
These and other features of the present invention will become better understood with regard to the following description, pending claims and accompanying drawings where:
The present invention may be described and implemented in the general context of a system and computer methods to be executed by a computer. Such computer-executable instructions may include programs, routines, objects, components, data structures, and computer software technologies that can be used to perform particular tasks and process abstract data types. Software implementations of the present invention may be coded in different languages for application in a variety of computing platforms and environments. It will be appreciated that the scope and underlying principles of the present invention are not limited to any particular computer software technology.
Moreover, those skilled in the art will appreciate that the present invention may be practiced using any one or combination of hardware and software configurations, including but not limited to a system having single and/or multiple computer processors, hand-held devices, programmable consumer electronics, mini-computers, mainframe computers, and the like. The invention may also be practiced in distributed computing environments where tasks are performed by servers or other processing devices that are linked through a one or more data communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.
Also, an article of manufacture for use with a computer processor, such as a CD, pre-recorded disk or other equivalent devices, may include a computer program storage medium and program means recorded thereon for directing the computer processor to facilitate the implementation and practice of the present invention. Such devices and articles of manufacture also fall within the spirit and scope of the present invention.
Referring now to the drawings, embodiments of the present invention will be described. The invention can be implemented in numerous ways, including for example as a system (including a computer processing system), a method (including a computer implemented method), an apparatus, a computer readable medium, a computer program product, a graphical user interface, a web portal, or a data structure tangibly fixed in a computer readable memory. Several embodiments of the present invention are discussed below. The appended drawings illustrate only typical embodiments of the present invention and therefore are not to be considered limiting of its scope and breadth.
The present invention relates to computing physical properties of the earth's subsurface and, by way of example and not limitation, can compute a velocity model using phase-only full waveform inversion.
To begin the explanation of the present invention, first consider the basic full waveform inversion method 100 illustrated in the flowchart of
In step 12, the initial model of earth properties is used by a seismic modeling engine to generate modeled seismic data. In general modeling can be performed in either the time domain or the frequency domain (temporal Fourier transform) with no penalty, depending on various factors like the size/extent of the modeling domain and the amount of memory available. Large 3D surveys typically require time-domain modeling because frequency domain modeling is extremely memory intensive for large numbers of model parameters. One significant advantage of frequency domain modeling is that one directly has access to both amplitude and phase, and this allows the use of “phase only” approaches that can be geared to be dominated by kinematics instead of amplitudes.
In step 14, we compute an objective function that will measure the misfit between the recorded seismic data and the modeled seismic data. The most widely used objective function for conventional full waveform inversion is simple least squares: the sum of the squares of the differences between the observed data and the modeled data for all sources, receivers and recorded time samples. However, this is not meant to be limiting; other objective functions can be used, including correlation, the L1 norm, and hybrid or long-tailed norms. The objective function may be constructed in the time domain or in a transform domain such as the frequency domain.
In the time domain, the least squares objective function may take the form:
E=½ΣsΣrΣt[ψobs(t,r,s)−ψmod(t,r,s)]2 Eqn. 1
where E is the objective function, s are the sources, r are the receivers, t is time, ψobs is the recorded data, and ψmod is the modeled data. This objective function suffers from the critical flaw that seismic data is bandlimited. Differencing of bandlimited signals introduces the possibility of “cycle skipping”, where the wave shapes of the modeled and observed data are similar enough to cause a small difference, but are misaligned in an absolute sense by (at least) one wave cycle. This, together with the local nature of full waveform inversion, leads to the likely possibility that the nonlinear optimization will fail and converge to a local minima rather than the global solution.
One way to change the characteristics of the problem is to change the objective function. If we transform to the frequency domain we can consider objective functions at one or more frequency components individually (monochromatically). In the time domain, we cannot consider a single time sample because of dependence on earlier times. In the frequency domain, the response at different frequencies is uncoupled: the solution at one frequency does not depend on the solution at any other frequency. We can also, importantly, treat amplitude and phase differently. Taking the temporal Fourier transform of Eqn. 1, the objective function becomes:
E(ω)=½ΣsΣr|Aobs(ω,r,s)eiφ
where Aobs(ω,r,s) is the amplitude of the observed data at receiver r, from source s, at temporal frequency ω, φobs(ω,r,s) is the phase of the observed data, Amod(ω,r,s) is the amplitude of the modeled data, and φmod(ω,r,s) is the phase of the modeled data.
In the frequency domain, we can consider the phase portion independently of the amplitude portion. For the phase-only case of full waveform inversion, by way of example and not limitation, the least squares objective function becomes:
E(ω)={square root over (1/2)}ΣsΣr|φobs(ω,r,s)−φmod(ω,r,s)|2. Eqn. 3
The modeled data in Eqns. 1-3 may be generated in the time or the frequency domain. The objective functions of Eqns. 1-3 measure the mismatch between the observed and modeled data and are decreased at each iteration. The inversion may be done as a phase-only inversion in either the time or frequency domain, as long as the mismatch can be measured directly or indirectly in terms of the phase of one or more frequency components.
Once the objective function is computed in step 14 of
The calculation of the search direction becomes more clear if we treat the modeled data as the action of a nonlinear seismic modeling operator on the earth property model. Using the example of velocity (v) as the earth property, the operator being nonlinear means that a linear change in velocity does not necessarily result in a linear change in the modeled data.
Using the symbol N to represent the nonlinear seismic modeling operator that maps velocity models into seismic data, and the action of this operator on the current velocity model as N(v), we can rewrite Eqn. 1:
E={square root over (1/2)}ΣsΣrΣt[ψobs(t,r,s)−N(v)]2 Eqn. 4
so the derivative with respect to velocity becomes:
Eqn. 5 shows that the derivatives used to update the earth property model depend very importantly on the modeling operator, the derivatives of the modeling operator with respect to velocity, and the current seismic data residual.
The nonlinear problem of full waveform inversion is solved by successive linearization. For the example of inverting for velocity, at iteration k, this is done by linearizing around the velocity vk, and seeking an update to the velocity δv, such that the updated model is: vk+1=vk+δv. We need the linearization in order to compute the search direction. Given the general linear least squares system:
E=∥y−Ax∥
2 Eqn. 6
The gradient or search direction can be written:
Where Af is the adjoint (conjugate transpose) of the linear operator A. For our nonlinear problem of full waveform inversion, we have the nonlinear operator N, and we need the adjoint of the linearized operator in order to compute a gradient. We use L for the linearized operator, and Lf for the adjoint of the linearized operator. The operator L maps a vector of velocity perturbations into a vector of wavefield perturbations, and the adjoint operator Lf maps a vector of wavefield perturbations into a vector of velocity perturbations (Eqn. 8).
Lδv1=δψ1
Lfδψ2=δv2 Eqn. 8
Once the search direction is computed, we need to determine how large a step to take in that direction, which is how the earth properties model is updated in step 18 of
The majority of published conventional approaches employ steepest descent or preconditioned steepest descent for nonlinear optimization. Once the search direction is estimated, these approaches forget about the current linear problem and use a nonlinear line search to estimate the best “step size” to take in the search direction. If we use δv for the search direction (usually the gradient of the objective function with respect to the velocity parameters), and α for the step size, we can express the nonlinear line search as:
min∝{{square root over (1/2)}ΣsΣrΣt[ψobs(t,r,s)−N(v+∝δv)]2}. Eqn. 9
One serious shortcoming of a nonlinear line search is taking such a large step that the modeled data becomes cycle skipped with respect to the observed data. This could result in a smaller residual and lead to convergence to a local minimum rather than the true global solution.
An alternative to using a nonlinear line search is to solve the linear problem at each successive linearization of the nonlinear evolution. Solving the linear problem obviates the need for a line search as the step size selection is implicit in the machinery of linear optimization, as in for example the conjugate gradient method. Solving the linear problem requires accurate machinery of the linearization: forward and adjoint linearized operators that pass the adjoint test. This often requires significant work, but can result in significant improvements in convergence. Using the linearized operators L and Lf described above, we can solve the linear system using, by way of example and not limitation, conjugate gradient on the normal equations. The linear system we want to solve is:
min∥Lδv−δψ∥2 Eqn. 10
where δψ is the current residual δψ=ψobs−N(vk).
After the earth property model has been updated, the process loops back to step 12 where the updated model is used to generate modeled seismic data. Step 14 is performed and, if the difference between the modeled seismic data and the recorded seismic data is large, steps 16 and 18 are also performed and looped back to step 12, until the difference at step 14 is sufficiently small or the number of loops or iterations reaches a predefined number.
When attempting a conventional full waveform inversion, method 100 of
Another serious limitation of conventional full waveform inversion is the bandwidth limitation. There is a direct relationship between the temporal bandwidth of data used to generate a gradient (search direction) and the spatial bandwidth of the gradient obtained by evaluation of Eqn. 5. Low temporal frequencies in the data produce long spatial wavelengths in the gradient. Consider
Examples of the importance of the initial earth properties model for a conventional full waveform inversion can be seen in
In
Based on method 100 of
An embodiment of the present invention is described by method 500 in
Phase unwrapping ensures that all appropriate multiples of 2π have been included in the phase portion of the data, meaning that the phase is continuous rather than jumping by 2π. There are methods for phase unwrapping but many fail for even moderate frequencies such as those greater than 2 Hz. Due to this, the inventors have developed a new method for phase unwrapping to prepare frequency domain data for inversion. The new method uses a particular type of left preconditioning that de-weights the influence of large phase jumps. Either the observed phase and modeled phase may be unwrapped individually or their difference, the residual phase, may be unwrapped. The latter is preferred since the phase differences between adjacent data points will be smaller.
The procedure we use for phase unwrapping is inspired by a fundamental theorem of vector calculus, also called the Helmholtz Decomposition. The Helmholtz Decomposition can be used to decompose a vector field into a curl-free component and a divergence-free component. We are interested in the curl-free component only, so we do not require a precise Helmholtz decomposition. The curl-free component is the gradient of a scalar potential, and is a conservative field. A conservative field is a vector field for which line integrals between arbitrary points are path independent. We identify unwrapped residual phase with the scalar potential whose gradient is the conservative field of a Helmholtz decomposition.
We start by taking the gradient of the input wrapped phase, and adjusting by adding or subtracting 2π so that the result lies in the range [−π,+π]. This “adjusted phase” is also known as the “principal value” of the phase. Here “gradient” means the numerical derivative along the directions of source and receiver, respectively. We can write the projection of the adjusted gradient of phase onto a conservative field as follows:
∇φres=g Eqn. 11
where φres is the unwrapped residual phase and g is the adjusted gradient of the wrapped phase, as explained above.
To calculate unwrapped phase, we discretize the gradient operator with respect to source and receiver coordinates and solve the overdetermined system shown in Eqn. 12 by least squares. In one embodiment, we find that a sparse QR factorization is a particularly effective method for solving this system of equations.
min∥∇φres−g∥2 Eqn. 12
This approach of projection onto a conservative field for phase unwrapping has difficulty at moderate frequencies much greater than 1 Hz. For ns sources and nr receivers, the system of equation 12 will have ns*nr rows for the adjusted gradient with respect to source coordinates, and ns*nr rows for the adjusted gradient with respect to receiver coordinates. It is therefore twice overdetermined.
We found that failures of the system are related to large magnitudes of the entries of the adjusted gradient, and by weighting these large magnitude entries down, which has the effect of de-emphasizing their importance in the system of equations, we can significantly improve robustness. In an embodiment, the application of a diagonal left preconditioner whose entries are inversely proportional to the magnitude of the adjusted gradient greatly improves the performance of phase unwrapping at higher frequencies. Other types of preconditioners may also be used and fall within the scope of the present invention.
The new system is shown in equation 13, where the kth element of the left preconditioner W is inversely proportional to the magnitude of the components of the kth element of the adjusted gradient raised to the power α.
min∥W[∇φres−g]∥2
Wk,s=|gk,s|−∝
Wk,s=|gk,r—∝ Eqn. 13
In one embodiment, this user-defined positive power a may be set to 2.5. Using this embodiment, examples of phase unwrapping with and without the preconditioner can be seen for data at 0.5 Hz in
We note that this phase unwrapping approach does not require integration or the specification of boundary conditions in order to obtain unwrapped phase from the principal value of the gradient of wrapped phase.
In another embodiment, phase unwrapping may be used in a nonlinear line search where the search direction for velocity update has been pre-determined. There are at least two alternatives. In one alternative, a conventional objective function is used, but data whose residual phase magnitude exceeds it is excluded. This implies that the line search is only sensitive to data that is not cycle skipped. In another alternative, the objective function for the nonlinear line search is replaced with the least squares sum of the unwrapped residual phase. This means that the line search will correctly handle cycle skipped data. This results in an objective function very similar to that shown in equation 3, but with unwrapped residual phase (φres) as shown in equation 14. We further note that unwrapped residual phase could be used as an objective function for stochastic or Bayesian inversion in order to correctly handle cycle skipped data.
E(ω)=½ρsΣrφres(ω,r,s)2. Eqn. 14
Although the present method of phase unwrapping with a preconditioner has been explained in terms of preparing seismic data for inversion, this is not meant to be limiting. One skilled in the art will appreciate that unwrapped seismic data may be useful in other processing flows such as horizon flattening, homomorphic deconvolution, refraction statics, and residual alignment; and that other types of data, such as synthetic aperture radar, could benefit from this method of phase unwrapping with a preconditioner.
Referring again to
In an embodiment, as we iterate through the phase-only full waveform inversion, we can improve our ability to recover long spatial wavelengths, such as those for velocity, by using a continuation approach to regularize successive iterations and constrain them to low wavenumber updates. The continuation approach is application of homotopy to smoothing regularization for nonlinear optimization. Homotopy here means starting with large magnitude for smoothing regularization and gradually decreasing the magnitude of the smoothing regularization over the course of the nonlinear evolution.
Smoothing regularization can implemented by adding rows to the linear system to penalize roughness in the model that is optimized. There are numerous other ways to implement roughness penalties. In one embodiment, the continuation approach may use analytic derivatives of polynomials representing slowness. A change of basis to smooth functions, for example radial basis functions, also works. Other possibilities include but are not limited to the spatial Fourier basis with a right preconditioner that scales with wavenumber, and 1st or 2nd numerical derivatives, either centered or not. In yet another embodiment, roughness penalties may be applied by application of 1st forward numerical differences to pixelized models. These examples are not meant to be limiting; one skilled in the art will appreciate that there are many more possible regularization operators that may be used in the context of the continuation approach which fall within the scope of the present invention.
Expanding on the idea of smoothing regularization by the use of derivative penalties using 1st order numerical differences, let us begin with a simple 3×3 pixelized velocity model. In two-dimensional space, the 9 velocities (vx,z) would appear as:
Writing this velocity model as a column vector, we get:
We can apply horizontal derivative penalties (a roughness penalty in the X direction) by penalizing the difference of adjacent velocities, e.g. (v1,1−v1,2). Note that the formal forward numerical derivative is written
but we can clear the denominator. This results in the matrix of horizontal derivative penalties shown:
and a similarly constructed matrix of vertical derivative penalties:
Note there are fewer rows than columns because the derivatives only involve horizontally or vertically adjacent pixels.
These horizontal and vertical derivative matrices can also be written as:
λxDxv=0
λzDzv=0 Eqn. 15
where v is the column vector of velocities, Dx is the matrix of horizontal derivatives, Dz is the matrix of vertical derivatives, and λx and λz are Lagrange multipliers.
The continuation approach starts with the Lagrange multipliers λx and λz large, and therefore initial solutions in the first “continuation step” are very smooth. Clearly this can aid in recovering the long spatial wavelengths of velocity. As the nonlinear evolution proceeds, we take additional continuation steps and the magnitudes of λx and λz are decreased. As the magnitude of the penalties is decreased, successively shorter spatial wavelengths are allowed in the velocity model.
There are many possible options for setting the initial λx and λz values. If chosen sufficiently large, only very long spatial wavelengths are allowed in the model, and the nonlinear evolution effectively becomes independent of the initial model. If chosen too small, the problem will not be regularized enough and independence from the starting model is lost. One embodiment for the initial values of these parameters is to normalize them by the operator norm of the linearized operator at each successive linearization. If, at the beginning of the nonlinear problem in the first linearization, we have the linear system Ax=y, we set λx and λz to be scaled by the operator norm ∥A∥. ∥A∥ can be obtained, for example, using the power method.
The phase-only full waveform inversion performed in the present invention may also include more accurately solving the linear problems at each iteration. If, at each successive linearization, we solve the Gauss-Newton problem to obtain the model update, rather than employ the combination of steepest descent and a line search, we get an improved result.
For the nonlinear problem of full waveform inversion, we linearize around the velocity at iteration k (vk), and seek to obtain an update to the velocity δv such that the updated model is:
v(k−1)=v(k)+δv. This is successive linearization. The application of derivative penalties to the linear problem implies that we want the update to the model to be smooth, as shown here:
λxDxδv=0
λzDzδv=0 Eqn. 16
A more desirable approach is to regularize the nonlinear problem. This implies we want the updated model to be smooth:
λxDx(vk+δv)=0
πzDz(vk+δv)=0 Eqn. 17
This requires a non-zero right hand side, but the right hand side is easily obtained by application of the derivative operators Dx and Dz to the current velocity:
λxDxδv=−λxDxvk
λzDzδv=−λzDzvk Eqn.18
In another embodiment of the present invention, the model generated by the phase-only full waveform inversion may be used as an initial model for conventional full waveform inversion. This is demonstrated in
The present method of phase extrapolation uses the relationship between linear phase shift and traveltime:
φf
where φf1 is the phase at frequency f1 and t is the traveltime. To extrapolate the phase to another frequency f2 and assuming that the traveltime does not change, we solve for t and substitute it:
In this embodiment, the phase is extrapolated to lower frequencies than those observed and conventionally usable. Conventionally usable frequencies are typically greater than 2 Hz. This is done by linearization of the unwrapped phase as a function of frequency and may be applied to the observed phase, the modeled phase, or the residual phase. The extrapolated data is then inverted using some objective function defined to measure phase mismatch. The method is applicable for any case when the phase is linear in frequency.
One skilled in the art will appreciate that there are many other possible uses of phase extrapolated data. By way of example and not limitation, synthetic aperture radar (SAR) data may be obtained, phase unwrapped using a preconditioner, and phase extrapolated prior to SAR imaging methods. Additionally, data that has been phase unwrapped using a preconditioner and phase extrapolated may then be used to evaluate a cost function. One example is the use of unwrapped phase to compute an objective function for stochastic or Bayesian optimization, with the advantage that the cost function would correctly handle cycle-skipped data.
Although the embodiments above have been explained in terms of two dimensional models, the methods are easily extended into three dimensions and multi-parameter earth models. The methods for phase unwrapping, phase extrapolation, and phase-only full waveform inversion disclosed in the present invention may be extended into multiple dimensions and remain within the scope of the present invention.
A system 1200 for performing the method is schematically illustrated in
While in the foregoing specification this invention has been described in relation to certain preferred embodiments thereof, and many details have been set forth for purpose of illustration, it will be apparent to those skilled in the art that the invention is susceptible to alteration and that certain other details described herein can vary considerably without departing from the basic principles of the invention. In addition, it should be appreciated that structural features or method steps shown or described in any one embodiment herein can be used in other embodiments as well.