This disclosure relates to a process for classifying seismic data into common waveform responses. An analysis window within the seismic data may be of constant time or depth duration or variable as defined by one or two interpreted horizons. This disclosure is particularly applicable to 3D seismic data volumes and to 2D seismic lines, and by natural extension, to microseismic events.
Geologic modeling is known. The accurate modeling of a subsurface domain, such as a reservoir under investigation for possible petroleum or oil and gas content, or in more general terms a geologic basin, is critical to the ongoing investigation of that domain. Drilling exploratory wells is an expensive undertaking, as is a full-scale seismic or magnetic survey, and accurate decision-making requires accurate geological mapping.
Information about the geologic horizons present in such a reservoir is clearly an important first step. Knowledge of the type and thickness of sedimentary strata provides a geologist with key information for visualizing the subsurface structure. In most areas, however, strata are cut with numerous faults, making the analytical task considerably more complicated. Geologic mapping requires that the faults be identified and that the amount of the slippage along the fault plane be quantified. The amount of slippage, or “throw”, can range from little to no actual movement in the case of a fracture, to a distance of hundreds of kilometers along a major fault zone such as the San Andreas Fault of California.
A three dimensional (“3-D”) model of a geologic domain would be a highly useful tool for geologists and exploration planning managers. That technology lies at the intersection between geology, geophysics, and 3-D computer graphics, and several inherent problems need to be overcome in such a product. First, data are often incomplete. The volumes in question range from the earth's surface down many thousands of feet, and data are generally difficult to obtain. Moreover, for the data that are available, often in the nature of seismic survey results and well log data, are subject to considerable processing and interpretation. Second, a large measure of professional judgment goes into the rendering of any such analysis, so that the goal of any analytical tool cannot be a complete result, but rather should be aimed at assisting the geologist to bring her judgment to bear in the most of efficient and effective manner possible.
A further difficulty stems from the inherent complexity of the problem. A typical petroleum reservoir, for example, may consist of many lithology variations, various diagenic overprints, and complicated fault and fracture regimes. Understanding the presence, mechanics, and distributions of the reservoir characterisitics is vital to optimizing the discovery, development, and ultimate hydrocarbon extraction.
Reflection seismic methods have long been used to image the geologic structure and stratigraphy of the earth. This is particularly true in the exploration for and development of hydrocarbon bearing strata. Differences in seismic signatures are functions of differences in geologic character. Interpretation of spatial patterns of similar and varying seismic waveforms may lead to interpretation of the associated geologic spatial variations, which, in turn, may lead to better exploration and development.
Many seismic waveform classification techniques are known, in which the classification waveforms are derived through complex statistical processes, which may be intuitively obscure and computationally expensive. The results of these techniques are often highly starting condition dependent, causing different results for different starting points, and the global variability in waveform responses may not be sampled or modeled. Also, the results may be heavily dependent upon the choice of statistical modeling algorithm.
The system and method are described below with respect to a waveform classification system for prospecting and subsequent development of oil and gas reserves and may be used as a tool by geoscientists and engineers in the prospecting and subsequent development of oil and gas reserves. However, the system and method has broader application since the system and method can be used in other near surface seismic imaging (e.g., ground penetrating radar use in civil engineering and archeology), to categorize microseismic events associated with hydraulic fracturing and the like and it is understood that the disclosure covers each of the applications of the system and method. The system may also be applied to microseismic data, often collected in association with hydraulic fracturing process in oil and gas development. Classifying microseismic event signatures may be useful in understanding spatial clustering of similar events and, further, to modeling source mechanisms. Understanding source mechanisms are helpful in interpreting fracture orientation and stress states.
An aspect of the disclosure involves a system and method for classifying seismic waveform to assist an analyst to rapidly and accurately identify commonalities and inter-relationships, which may be related to similar geologic conditions, within a collection of seismic waveform traces. The seismic waveforms correspond to, for example, seismic traces. A seismic trace is a time series curve recorded at a location on the earth's surface. The time series curve corresponds to echoes of sound or elastic waves from geological features in the subsurface. Investigating the spatial nature of these waveform commonalities and relationships is important for understanding geologic complexities.
Implementations of the present disclosure involve a system and/or method for classifying waveforms. More specifically, the disclosure describes a seismic waveform classification system (SWCS) directed to extraction and delineation of areal trends in seismic response. These trends may be directly correlated with geologic trends that may be related to a variety of investigative earth studies. Identification and interpretation of these trends is a common activity of geoscientists and engineers in the prospecting and subsequent development of oil and gas reserves, although the system and method can also be used in other near surface seismic imaging (e.g., ground penetrating radar use in civil engineering and archeology) or it may also be used to categorize microseismic events associated with hydraulic fracturing.
According to one aspect, the SWCS analyzes a large number of seismic traces collected at a particular location at the earth's surface and extracts the greatest diversity of waveform responses directly from the seismic traces as the final or starting classification waveforms. Because of this direct extraction of greatest diversity, the subset of traces for analysis may often only need to be one percent or less of the total number of traces. No choice of complicated or obscure statistical algorithm is needed. Additional conditioning without overly modifying the waveforms can be done. Further, the classification waveforms may then be ordered in terms of overall significance and of gradational similarity. Finally, because only the first classification waveform requires an exhaustive search of the sample subset of traces and because all subsequent classification waveforms are compared only to the previously derived waveforms, the system is computationally fast which permits implementation on a computer in a highly interactive and interpretive design.
The system and method allows for improved mapping of seismic waveform commonalities and inter-relationships using a straightforward, easily explained approach (i.e., no in-depth mathematical or statistical knowledge is required to understand the method). It ensures sampling of the greatest diversity of waveform responses with information on waveform hierarchy in terms of significance and similarity. Additionally, it is fast enough to be implemented in highly interactive and interpretive computer software.
This system may directly sample waveforms from seismic traces based upon the following scheme: 1) find the waveform that has a highest aggregate similarity to all traces in the set; 2) iteratively find the remaining desired number of waveforms from the trace set as those which are least similar to the previously identified; 3) optionally condition the waveforms found in steps 1 and 2 using a statistical “training” method such as self-organizing maps; 4) order the waveforms based on significance and similarity for interpretive purposes; and 5) compare the final classification waveforms to each seismic trace and assign the index of the one with the highest similarity to that location, producing a final classification map.
The system is configured to iteratively determining a set of waveforms that optimally represent the variability within the overall set of waveforms. That is, this set of waveforms are inherently as dissimilar to each other while collectively being as similar as possible to full set of waveforms. Once these representative waveforms have been determined, they may be optionally conditioned by the full set of waveforms. These final “classification waveforms” are then ordered according to similarity to each other and overall significance to the full set of waveforms.
The initial classification waveforms may be determined directly from the subset of seismic traces. The first classification waveform is the trace waveform that is most similar to all other trace waveforms in the subset. Thereafter, the additional classification waveforms are dependent upon the previously determined classification waveforms. That is, the next classification waveform is the trace waveform least similar (most dissimilar) to the ones already determine. The second is the least similar trace waveform to the first one found, the third is the least similar to the first two, the fourth least similar to the first three, and so on. This ensures that the initial classification waveforms (presuming the additional training step) represent the most common seismic waveform response and then most varied responses after that. Also, there is an extreme efficiency in increasing the number of classification waveforms as comparisons are only made between the number of traces in the subset and the number of previously generated classification waveforms without compromising maximum diversity in the determined waveforms.
Unlike other conventional approaches, two sets of ordering are easily obtained. The use of a separation index permits an ordering based upon significance. This permits interpretation of the number of classifications necessary to explain seismic waveform variability while minimizing redundancy (as indicated by slope changes in the index vs number of classifications plot). A second ordering based on similarity gradation may be simultaneously computed with the separation index. As the significance order is computed, a second list may be kept where the next significant classification waveform is inserted according to maximum similarity to adjacent classifications. Plotting the solution maps by similarity sorting permits interpretation of the granularity of detail desired to explain appropriate geologic detail.
The workflow illustrated in the
An example seismic volume is shown in
A subset of 528 traces from a 10×10 coarse grid was used for the analysis. The resulting CSI versus waveform index is shown in
Solution maps based on selecting the two and four most significant waveforms are shown in
Solution maps based on selecting eleven and twenty waveforms are shown in
The general sequence of operations for the system and method have been broken up into a number of processes as shown in
In a first process of the method, user input parameter governing the system operation are received (900.) For example, the parameters may include:
1) a maximum number of desired classification waveforms;
2) an analysis window that defines the trace waveform at each trace location, which may be a constant time or depth or may be variably defined by a single interpreted horizon or by two bounding interpreted horizons;
3) a statistical measure that defines “similarity” to be used in waveform to waveform comparisons. Examples of commonly used measures include the L1 norm (sum of absolute-value differences) and L2 norm (sum of squared differences);
4) a “domain” in which the statistical comparison will be made. The most common domain uses waveform sample amplitudes, although attributes such as peak frequencies from time-frequency domain are also useful. Complex waveform attributes such as magnitude, instantaneous phase, and instantaneous frequency may also be collectively used.
5) a number of traces to be used in the initial search for the classification waveforms. For 3D seismic volumes, this number may be defined by inline and crossline increments for a coarser grid than defined by the full volume. For 2D seismic lines, this number may be defined by a trace increment. It may also be defined by a random walk through either the 3D volume or through a collection of 2D lines.
The method may then determine the maximum number of samples for the waveforms (910) as found from the subset trace windowing controlled by the parameters specified above.
Process 920
The method may then find the most representative waveform to all other waveforms in the subset (920). This may be done by statistically comparing (using the statistical measure specified above) each waveform with all the other waveforms and selecting the waveform with the largest aggregate measure which becomes the first classification waveform. Computationally, this is the most intensive step in the full sequence of operations.
For example, process 920 may be performed using the following pseudocode:
Process 930
Once the most representative waveform/trace has been identified, the method may find the remaining classification waveforms by iteratively looping through the subset of waveforms and finding the waveform that is aggregately least similar (again using the similarity measure specified) from the classification waveforms previously found (930). As each such waveform is found, it is added to the list of classification waveforms until the maximum specified number is attained.
For example, process 930 may be performed using the following pseudocode:
Process 940
Once the waveforms are identified, the classification waveforms may be trained/conditioned (940). For example, because a subset of the traces were used in processes 920 and 930 above, the method may optionally “train” or condition the classification waveforms found with more waveforms from the full set of traces. Any number of “training” algorithms may be used, but the Kohonen self-organizing map is suggested. This approach randomly selects waveforms from the full set and updates the classification waveforms. The amount of conditioning is based upon a weighting scheme, where weight is a function of the similarity measure found between the random trace waveform and each classification waveform (i.e., the more similarity the more weight assigned). The weight is further scaled as the training proceeds (i.e., earlier traces in the random walk have more weight than later ones).
For example, process 940 may be performed using the following pseudocode:
Process 950
The method may then determine the order of significance among the final classification waveforms (950). This may be accomplished by finding the classification waveform that is aggregately least similar to all the other classification waveforms. This is most commonly the waveform derived from the one found in process 920, and this waveform is deemed the most significant. The next most significant classification waveform is the one least similar to the most significant one. Now that the first two significant waveforms have been identified, the remainder of the classification waveforms needs to be ordered. A cluster similarity index is used to determine this ordering. Many such indices exist in the literature, but the Cluster Separation Index (CSI) is recommended. For example, the Davies-Bouldin, Bezdek, Dunn, Xie-Beni, Gath-Geva, etc, indices may also be used.
It is defined as the ratio of the minimum distance among the clusters (or in this case, waveforms) to the maximum distance among the clusters (i.e., the distance between the first two most significant waveforms). The third significant waveform is the one that produces the smallest CSI when combined with the first two significant waveforms. The fourth significant is the one that produces the smallest CSI with the first three significant waveforms, and so on. A plot of final CSI values versus waveform index is useful for interpreting significance (example in
For example, process 950 may be performed using the following pseudocode:
Process 960
The method may then determine the order of similarity among the final classification waveforms (960). The end waveforms in this ordered list are the first two significant waveforms found in process 950 above. By definition, they are the two least most similar classification waveforms. Then iteratively find the ordering of the remaining waveforms. Determine the waveform aggregately least similar to the waveforms not yet assigned to the ordering. Then determine the insertion position in the list. This is found by finding the index between the two waveforms to which the waveform to be inserted is most similar. The ordering is complete when all classification waveforms have been assigned. Plotting the waveforms in order of similarity is useful for determining variation and graduation in waveform response (example in
For example, process 960 may be performed using the following pseudocode:
Processes 970 and 980
The method may then determine the optimal number of classification waveforms to be used in the final classification step (970). This determination may be based upon investigating the variation and detail within the solution maps generated from selecting the number of classification waveforms and color rendered based either on significance or similarity (examples in
For example, process 980 may be performed using the following pseudocode:
System Implementations
In some implementations, each of the processes shown in
According to another aspect, as depicted in
The processing device 1202B may also include a graphical user interface (or GUI) application 1214, such as a browser application, to generate a graphical user interface not (shown) on the display 1206B. The graphical user interface enables a user of the processing device 1202B to view seismic trace data, and/or map data. The graphical user interface 120 also enables a user of the processing device 1202B to interact with various data entry forms to view and modify settings data or preferences data (e.g., number of waveforms to be classified).
The analysis device 1206 is configured to receive data from and/or transmit data to one or more processing device 1202 through the communication network 1208. Although the analysis device 1206 is depicted as including an analysis device 1206, it is contemplated that the SWCS 1201 may include multiple analysis devices 1206 (e.g. multiple servers) in, for example, a cloud computing configuration. The communication network 1208 can be the Internet, an intranet, or another wired or wireless communication network. For example, communication network 1208 may include a Mobile Communications (GSM) network, a code division multiple access (CDMA) network, 3rd Generation Partnership Project (3GPP), an Internet Protocol (IP) network, a Wireless Application Protocol (WAP) network, a WiFi network, or an IEEE 802.11 standards network, as well as various combinations thereof. Other conventional and/or later developed wired and wireless networks may also be used.
The embodiments of the invention described herein are implemented as logical steps in one or more computer systems. The logical operations of the present invention are implemented (1) as a sequence of processor-implemented steps executing in one or more computer systems and (2) as interconnected machine or circuit engines within one or more computer systems. The implementation is a matter of choice, dependent on the performance requirements of the computer system implementing the invention. Accordingly, the logical operations making up the embodiments of the invention described herein are referred to variously as operations, steps, objects, or engines. Furthermore, it should be understood that logical operations may be performed in any order, unless explicitly claimed otherwise or a specific order is inherently necessitated by the claim language.
While the foregoing has been with reference to a particular embodiment of the invention, it will be appreciated by those skilled in the art that changes in this embodiment may be made without departing from the principles and spirit of the disclosure, the scope of which is defined by the appended claims.
This application claims the benefit of and priority to, under 35 USC 119(e) and 120, U.S. Provisional Patent Application Ser. No. 61/722,147 filed on Nov. 3, 2012 and titled Seismic Waveform Classification Systems and Methods”, the entirety of which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61722147 | Nov 2012 | US |