The invention relates generally to the field of geophysical prospecting, and more particularly to a method for processing geophysical data. Specifically, the invention is a method for highlighting regions in one or more geological or geophysical datasets such as seismic, that represent real-world geologic features including potential hydrocarbon accumulations without the use of prior training data, and where the desired physical features may appear in the unprocessed data only in a subtle form, obscured by more prominent anomalies.
Seismic datasets often contain complex patterns that are subtle and manifested in multiple seismic or attribute/derivative volumes and at multiple spatial scales. Over the last several decades, geologists and geophysicists have developed a range of techniques to extract many important patterns that indicate the presence of hydrocarbons. However, most of these methods involve searching for either known or loosely defined patterns with pre-specified characteristics in one data volume, or two volumes at the most. These “template-based” or “model-based” approaches often miss subtle or unexpected anomalies that do not conform to such specifications. These approaches will not be discussed further here as they have little in common with the present invention except that they address the same technical problem. It is therefore desirable to develop statistical analysis methods that are capable of automatically highlighting anomalous regions in one or more volumes of seismic data across multiple spatial scales without a priori knowledge of what they are and where they are.
PCT Patent Publication WO 2010/056424 discloses a method to perform such statistical analysis to highlight anomalies automatically in multi-volume seismic analysis using a single moving window on the data to gather statistics. See also PCT Patent Publication WO 2011/139416. Both of these patent application publications are incorporated by reference herein in all countries that allow it. However, the single-windowed approach has some limitations arising from a lack of adaptivity, which biases the results towards prominent detection of obvious and dominant anomalies with weaker or suppressed response for subtle anomalies. Accordingly, there is a need for a method that mitigates these limitations, and the present invention satisfies this need.
In one embodiment, the invention is a method for inferring presence of hydrocarbons from a seismic data volume representing a subsurface region, using local statistical distributions of seismic data values, comprising:
(a) using two moving windows of user-selected size and shape, one being a pattern window and the other a sampling window larger than the pattern window, wherein the sampling window moves to different locations in the seismic data volume to sample the seismic data volume, and at each location of the sampling window the pattern window moves about within the sampling window;
(b) for each sampling window location, computing a statistical distribution of seismic data values for all pattern windows contained within the sampling window;
(c) from the statistical distribution within a sampling window, computing an outlier probability or residue for each sampling window; and
(d) interpreting the outlier probabilities or residues for the sampling windows for indications of hydrocarbon presence within the subsurface region;
wherein at least one of (a)-(d) is performed using a computer.
The present invention and its advantages will be better understood by referring to the following detailed description and the attached drawings in which:
The invention will be described in connection with example embodiments. To the extent that the following description is specific to a particular embodiment or a particular use of the invention, this is intended to be illustrative only, and is not to be construed as limiting the scope of the invention. On the contrary, it is intended to cover all alternatives, modifications and equivalents that may be included within the scope of the invention, as defined by the appended claims.
The present invention is a method for detecting anomalous patterns in multi-volume seismic or other geophysical data (for example, electromagnetic data) across multiple spatial scales without the use of prior training data. The inventive method may use Windowed Statistical Analysis, which involves the following basic steps in one embodiment of the invention:
A particularly convenient embodiment of the invention involves a combination of Windowed Principal Component Analysis (“WPCA”), Residual Analysis, and Clustering Analysis which will be described in detail below. However, anyone of ordinary skill in the technical field will readily appreciate how other statistical analysis techniques may be used or suitably adapted to achieve the same goals.
A useful generalization of Principal Component Analysis (“PCA”) is a method known as Independent Component Analysis (“ICA”), which is preferable when the data strongly differ from the standard multi-dimensional Gaussian distribution. In this case, the present inventive method may be correspondingly generalized to use Windowed ICA (“WICA”), followed by a generalization of Residual Analysis, termed Outlier Detection. In one embodiment, the present invention uses PCA on moving windows, followed by computation of inner products and data residuals from the Principal Components (“PCs”), which is believed to be advantageously applicable not only in seismic applications, but across the broader field of multi-dimensional data processing. This includes the fields of image, speech, and signal processing.
Principal Component Analysis (“PCA”) is a well-known classical technique for data analysis, first proposed by Pearson (“On Lines and Planes of Closest Fit to Systems of Points in Space,” Philos. Magazine v. 2, pp. 559-572 (1901)) and further developed by Hotelling (“Analysis of a Complex of Statistical Variables Into Principal Components,” Journal of Education Psychology v. 24, pp. 417-441 (1933)). What is believed to be the first known application of principal component analysis to seismic data occurred in the form of the Karhunen-Loeve transform, named after Kari Karhunen and Michel Loeve (Watanabe, “Karhunen-Loeve Expansion and Factor Analysis,” Transactions of the Fourth Prague Conference, J. Kozesnik, ed., Prague, Czechoslovakia Academy of Science (1967)). This method uses PCA to describe the information content in a set of seismic traces, with the form of the input dataset being entire seismic traces, not multi-dimensional windows of variable size. Watanabe's primary application was to decompose entire seismic traces, and use the first several principal component traces to reconstruct the most coherent energy, thereby filtering out non-geologic noise.
PCA is most commonly used in seismic analysis to reduce the number of measurement characteristics to a statistically independent set of attributes (see, e.g., Fournier & Derain, “A Statistical Methodology for Deriving Reservoir Properties from Seismic Data,” Geophysics 60, 1437-1450 (1995); and Hagen, “The Application of Principal Components Analysis to Seismic Data Sets,” Geoexploration 20, 93-111 (1982)). The seismic interpretation process often generates numerous derivative products from the original data. Since these attributes correlate to varying degrees, PCA has been an elegant way to reduce the number of attributes, while retaining a large amount of information.
PCT International Publication WO 2011/139416 (Nov. 10, 2011) is a recent publication describing a moving window-based statistical outlier detection technique devoted to finding geologic features of interest on a scoping and reconnaissance basis in geological and geophysical data. Moreover, such techniques have been applied to specific subsets or domains of seismic data for specialized signal processing or reservoir characterization applications. Key and Smithson (“New Approach to Seismic Reflection Event Detection and Velocity Determination,” Geophysics 55, 1057-1069 (1990)) apply PCA on 2D moving windows in pre-stack seismic data, and ratio the resultant eigenvalues as a measure of signal coherency. No use is made of the principal components themselves to detect features in the prestack seismic data. Sheevel and Payrazyan (“Principal Component Analysis Applied to 3D Seismic Data for Reservoir Property Estimation,” Society of Petroleum Engineers Annual Conference and Exhibition (1999)) calculate trace-based principal components using small, 1D moving vertical windows, and input those PCs that look most geologic into a classification algorithm that predicts reservoir properties away from well calibration. Once again, this 1D, single dataset approach, makes no attempt to automatically identify anomalies or outliers in the data. Cho and Spencer (“Estimation of Polarization and Slowness in Mixed Wavefields,” Geophysics 57, 805-814 (1992)) and Richwalski et al. (“Practical Aspects of Wavefield Separation of Two-Component Surface Seismic Data Based on Polarization and Slowness Estimates,” Geophysical Prospecting 48, 697-722 (2000)) use 2D windowed PCA in the frequency domain to model the propagation of a pre-defined number P- & S-waves.
The goal of Wu et al. (“Establishing Spatial Pattern Correlations Between Water Saturation Time-Lapse and Seismic Amplitude Time-Lapse,” Petroleum Society's 6th Annual Canadian International Petroleum Conference (56th Annual Technical Meeting) (2005)) is to optimally correlate single seismic or time-lapse seismic volumes to flow simulation data in a reservoir model to estimate actual saturation time-lapse values of spatial patterns. Their approach is to perform point-to-point comparisons, not on the original data volumes, but on the projection of these data onto the first principal eigenvector from PCA analysis. Thus, their objective is correlation of seismic data to a known model instead of identification of anomalous patterns in the seismic data.
U.S. Pat. No. 5,848,379 to Bishop (“Method for Characterizing Subsurface Petrophysical Properties Using Linear Shape Attributes,” (1998)) discloses a method to predict subsurface rock properties and classify seismic data for facies or texture analysis, not to identify geologic features of interest on a scoping and reconnaissance basis which is the technical problem addressed by the present invention. Bishop performs statistical analysis using PCA to decompose seismic traces into a linear combination of orthogonal waveform bases called Linear Shapes within a pre-specified time or depth interval. A Linear Shape Attribute (LSA) is defined as the subset of weights (or eigenvalues) used to reconstruct a particular trace shape. Also, Bishop does not disclose overlapping windows, simultaneously analyzing multiple data volumes, or use of a statistical distribution to detect anomalous data regions. Both of these features are used in at least some embodiments of the present inventive method.
Other approaches for statistically analyzing geological and geophysical data have used methods such as Artificial Neural Networks, Genetic Algorithms, and multipoint statistics, but not with the goal of automatic detection of anomalous patterns. In addition, these methods have generally had limited success, since their inner workings are often obscure, and they often require, and are highly dependent on, large amounts of training data.
As stated previously, PCA and ICA are methods that are commonly used to separate high-dimensional (i.e., multi-variable or -attribute) signals into statistically uncorrelated (i.e., independent) components. Windowed PCA and ICA may be used in the present invention to apply component analysis to a dataset that is derived from the original data by representing each point in the original data as a collection of points in its neighborhood (i.e., window). To illustrate this concept for a single window as in PCT International Publication No. 2011/139416, the implementation of WPCA on a single, 3-dimensional, data volume using a fixed window size is outlined next. The same procedure or its ICA equivalent could be applied to 2D data, or simultaneously to multiple 2D or 3D data volumes. Consider a 3D seismic volume of size Nx×Ny×Nz, and select a window shape (e.g., ellipsoid or cuboid) and size (e.g., radius r, nx×ny×nz)
Each voxel in the 3D seismic volume, Ii,j,k, may be represented as an nx×ny×nz dimensional vector {right arrow over (I)}i,j,k, that contains voxel values within each voxel's windowed neighborhood. Next, compute the mean and covariance matrix of all n-dimensional vectors (n=nx×ny×nz){{right arrow over (I)}i,j,k}(N=(Nx−nx)×(Ny−ny)×(Nz−nz) of them) as follows:
Then, compute the correlation matrix as
where t and k are two indices of the vector I and thus represent two different sets of spatial coordinates in three dimensions. Then, calculate the eigenvalues (Principal Values) {λ1>λ2> . . . λn} and eigenvectors (Principal Components) {v1, v2, . . . , vn} of Ŵ. Alternatively, eigenvalues of the covariance matrix may be computed; they will differ from the eigenvalues of the correlation matrix only by a scaling factor. These eigenvectors will be nx×ny×nz in size, and when reshaped from their vector form back to window form, represent the various (independent) spatial patterns in the data, ordered from most common to least common. The corresponding eigenvalues represent how much of the original data (i.e., amount of variance) that each eigenvector accounts for.
One or more of the following partial volumes of seismic or attribute data may then be generated, which are then examined for anomalies that may not have been apparent from the original data volume:
(1) Projection: The portion of the original data that can be recreated using each Principal Component or groups of Principal Components (chosen, for example, from clustering analysis). This is achieved by taking the inner-product of the mean-centered and normalized seismic volume on each Principal Component or groups of Principal Components. Thus, the projection of vector A onto vector B means proj(A)=(A·B)/|B|2 and is a vector in the direction of B.
(2) Residual: The remaining signal in the original volume that is not captured by the first k−1 (i.e., most common) Principal Components. In a preferred embodiment of the invention, this is achieved by projecting the mean-centered and normalized seismic volume onto the sub-space spanned by {vk, vk+1, . . . , vn} so that
where R is a user-defined threshold between 0 and 1. Alternatively, one could add projections bottom-up, but this would be computationally more burdensome in most cases.
(3) Outlier: The residual analysis of item (2) is the way the “degree of anomaly” of each voxel is determined in one embodiment of the invention. The attribute data volumes of (1) and (2) are not needed in an alternative way of computing the “degree of anomaly” of each voxel, which will be denoted as R′ (since it is related to, but not the same as, the residue R defined above), and is given by the following formula:
R′i,j,k=(Ii,j,k−Ī)TŴ−1(Ii,j,k−Ī).
Using this measure of degree of anomaly, a partial data volume may be developed. This measure also picks “outliers” that lie in the space spanned by the first few eigenvectors, but can be more computationally intensive than the above two steps in some cases. However, it may be noted that in this the step above of calculating the eigenvalues and eigenvectors can be skipped, or simply replaced by a Cholesky decomposition of the correlation matrix, which enables faster evaluation of R′.
There are variants of the above basic approach that employ different data normalization schemes. The method can be extended to an arbitrary number of seismic volumes. The adjustable parameters that the user can experiment with are (1) window shape, (2) window size, and (3) threshold, R, of residual projection.
Generalizations and Efficiencies in the Construction of Canonical Patterns
The following sections describe improvements to the windowed principal component analysis that enable more convenient applicability through reduced computation, and better use of results through interpretation of Principal or Independent Components and their selective retention or removal.
Computational Efficiency: The straight-forward method of computing the covariance matrix above is computationally burdensome for large datasets, both in memory and processor requirements. An alternative method is therefore disclosed herein that exploits the fact that the individual vectors of the PCA are windows moving across the data. Consider, for example, a 1-D dataset with values {I1, I2, . . . , In}. To evaluate the covariance matrix of windows of size K<N, the mean and second moment of the entries can be computed as follows:
It may be noted that this method only involves taking averages and inner products of sub-vectors of the data (sub-matrices in higher dimensions), and hence avoids storing and manipulating numerous smaller-sized windows derived from the original data. This modification of the computational method thus allows object-oriented software with efficient array indexing (such as Matlab and the use of Summed-Area Tables, a data structure described by Crow in “Summed-Area Tables for Texture Mapping,” Computer Graphics 18, 207 (1984)) to compute the covariance matrices with minimal storage and computational effort.
Nested Double Window
U.S. Provisional Patent Application No. 61/453,809, which is incorporated herein by reference in all countries that allow it, discloses a method, suitable for automation, for texture segmentation using double-windowed clustering analysis, which adds a “sampling window”, within which the first “pattern window” moves to gather local statistics. Used in conjunction with an outlier identification method, for example PCA (Principal Component Analysis) in the case of Gaussian distributions (see WO 2010/056424), this double-windowed approach can detect both obvious and subtle anomalies in multi-dimensional, multi-volume seismic data in a more adaptive manner compared to the single-windowed version.
The present invention applies the double-windowed approach to outlier detection in residual data volumes. In at least some of its embodiments, the present inventive method is described in the flowchart of
x=vector of values in pattern window (e.g. 9-d for a 3×3 window on 2-d image)
The outlier likelihood is then computed as the Residue (14):
Residue(Si)=(xc−Mean((Si))T·[Covariance((Si)]−1·(xc−Mean((Si))
where xc is the pattern window at the center of the sampling window Si. In the single window approach described previously above, the mean and covariance are computed only once over the whole data by moving the window within the data.
At step 15, a data volume of residue values may be computed and assembled, with a residue value for each voxel in the original seismic data volume. The residue value for a given voxel is determined by the sampling window centered on that voxel. Then, the residue volume may be displayed and visualized to interpret for features indicative of hydrocarbons. For voxels near the data volume boundary, for which the sampling window would not entirely fit within the data volume, one may use the distribution from the closest sampling window that entirely fits within the data volume. The residue volume may be thresholded, using a user-selected threshold, to better highlight the anomaly locations.
The above-described double windowed implementation offers several key advantages over single-windowed approaches:
(1) Residue, and hence notion of anomaly, become “local”—obvious anomalies are less likely to mask out subtle anomalies;
(2) Anomalies have less data sensitivity—local anomaly remains so regardless of structure of data far away from it;
(3) Fully parallelizable, unlike single-windowed approaches.
The difference in performance between the single- and double-windowed approaches is shown in
The significant enhancement of a number of subtler seismic features in
The foregoing application is directed to particular embodiments of the present invention for the purpose of illustrating it. It will be apparent, however, to one skilled in the art, that many modifications and variations to the embodiments described herein are possible. All such modifications and variations are intended to be within the scope of the present invention, as defined in the appended claims.
This application claims the benefit of U.S. Provisional Patent Application 61/660,477, filed Jun. 15, 2012, entitled SEISMIC ANOMALY DETECTION USING DOUBLE-WINDOWED STATISTICAL ANALYSIS, the entirety of which is incorporated by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
5416750 | Doyen et al. | May 1995 | A |
5444619 | Hoskins et al. | Aug 1995 | A |
5539704 | Doyen et al. | Jul 1996 | A |
5586082 | Anderson et al. | Dec 1996 | A |
5677893 | de Hoop et al. | Oct 1997 | A |
5848379 | Bishop | Dec 1998 | A |
5852588 | de Hoop et al. | Dec 1998 | A |
5892732 | Gersztenkorn | Apr 1999 | A |
5940777 | Keskes | Aug 1999 | A |
6052650 | Assa et al. | Apr 2000 | A |
6226596 | Gao | May 2001 | B1 |
6236942 | Bush | May 2001 | B1 |
6295504 | Ye et al. | Sep 2001 | B1 |
6574565 | Bush | Jun 2003 | B1 |
6618678 | Van Riel | Sep 2003 | B1 |
6625541 | Shenoy et al. | Sep 2003 | B1 |
6725163 | Trappe et al. | Apr 2004 | B1 |
6735526 | Meldahl et al. | May 2004 | B1 |
6754589 | Bush | Jun 2004 | B2 |
6757614 | Pepper et al. | Jun 2004 | B2 |
6804609 | Brumbaugh | Oct 2004 | B1 |
6847895 | Nivlet et al. | Jan 2005 | B2 |
6941228 | Toelle | Sep 2005 | B2 |
6950786 | Sonneland et al. | Sep 2005 | B1 |
6957146 | Taner et al. | Oct 2005 | B1 |
6970397 | Castagna et al. | Nov 2005 | B2 |
6988038 | Trappe et al. | Jan 2006 | B2 |
7206782 | Padgett | Apr 2007 | B1 |
7222023 | Laurent et al. | May 2007 | B2 |
7243029 | Lichman et al. | Jul 2007 | B2 |
7248539 | Borgos et al. | Jul 2007 | B2 |
7266041 | Padgett | Sep 2007 | B1 |
7295930 | Dulac et al. | Nov 2007 | B2 |
7411903 | Jang et al. | Aug 2008 | B2 |
7453767 | Padgett | Nov 2008 | B1 |
7463552 | Padgett | Dec 2008 | B1 |
7697373 | Padgett | Apr 2010 | B1 |
8027517 | Gauthier et al. | Sep 2011 | B2 |
8363959 | Boiman et al. | Jan 2013 | B2 |
8380435 | Kumaran et al. | Feb 2013 | B2 |
8385603 | Beucher et al. | Feb 2013 | B2 |
20080270033 | Wiley et al. | Oct 2008 | A1 |
20110139416 | Bernard | Jun 2011 | A1 |
20110272161 | Kumaran et al. | Nov 2011 | A1 |
20110307178 | Hoekstra | Dec 2011 | A1 |
20120090001 | Yen | Apr 2012 | A1 |
20120197530 | Posamentier et al. | Aug 2012 | A1 |
20120197531 | Posamentier et al. | Aug 2012 | A1 |
20120197532 | Posamentier et al. | Aug 2012 | A1 |
20120197613 | Vu et al. | Aug 2012 | A1 |
20120234554 | Kumaran | Sep 2012 | A1 |
20120257796 | Henderson et al. | Oct 2012 | A1 |
20130064040 | Imhof et al. | Mar 2013 | A1 |
20130085715 | Lakshminarayan et al. | Apr 2013 | A1 |
20130336091 | Song et al. | Dec 2013 | A1 |
Number | Date | Country |
---|---|---|
WO 9964896 | Dec 1999 | WO |
WO 2010056424 | May 2010 | WO |
WO 2011139416 | Nov 2011 | WO |
Entry |
---|
Cho, W.H. et al. (1992), “Estimation of polarization and slowness in mixed wavfields,” Geophysics 57(6), pp. 805-814. |
Fournier, F. et al. (1995), “A statistical methodology for deriving reservoir properties from seismic data,” Geophysics 60(5), pp. 1437-1450. |
Hagen, D.C. (1982), “The application of principal components analysis to seismic data sets,” Geoexploration 20, pp. 93-111. |
Hotelling, H. et al. (1933), “Analysis of a complex of statistical variables into principal components,” Journal of Education Physchology 24, pp. 417-441. |
Key, S.C. et al. (1990), “New approach to seismic-reflection event detection and velocity determination,” Geophysics 55(8), pp. 1057-1069. |
Pearson, K. (1901), “On Lines and Planes of Closest Fit systems of Points in Space,” Philos. Magazine 2, pp. 559-572. |
Richwalski, S. et al. (2000), “Practical aspects of wavefield separation of two-component surface seismic data based on polarization and slowness estimates,” Geophysical Prospecting 48, pp. 697-722. |
Watanabe (1967), “Karhunen-Loeve Expansion and Factor Analysis,” Transactions of the 4th Prague Conference, J. Kozesnik, ed., Prague, Czechoslovakia Academy of Science, pp. |
Wu, J. et al. (2005), “Establishing Spatial Pattern Correlations between Water Saturation Time-Lapse and Seismic Amplitude Time-Lapse,” Petroleum Society's 6th Annual Canadian Int'l. Petroleum Conf., pp. 1-9. |
Scheevel, J.R. et al. (2001), “Principal Component Analysis Applied to 3D Seismic Data for Reservoir Property Estimation,” SPE Reservoir Evaluation & Engineering, 64-72. |
Watanabe, S. (1967), “Karhunen-Loeve Expansion and Factor Analysis Theoretical Remarks and Applications.” Transactions of the Fourth Prague Conference, pp. 1-26. |
Number | Date | Country | |
---|---|---|---|
20130338927 A1 | Dec 2013 | US |
Number | Date | Country | |
---|---|---|---|
61660477 | Jun 2012 | US |