Computer-implemented techniques for modeling physical properties of geological strata are known. Such techniques can model physical properties such as temperature, petroleum saturation, and pore pressure. Known models can compute such physical properties for each cell in an imposed grid of hundreds of millions of cells. A simple model that tracks one hundred million cells, with four bytes of data per cell, may yield 0.4 gigabytes of raw output data for each output event and physical quantity. Thus, a typical model of twenty-five output events and one hundred different physical quantities can output data on the order of one terabyte. Although it is possible to set up a data storage environment that can handle output data of this order of magnitude, significant efforts are needed to achieve this.
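The arithmetic behind these figures can be checked with a short sketch. It assumes decimal units (one gigabyte taken as 10^9 bytes) and uses illustrative variable names that are not part of the disclosure.

```python
# Back-of-envelope output size, using the figures quoted in the text.
# Assumption: decimal units (1 GB = 1e9 bytes, 1 TB = 1e12 bytes).
cells = 100_000_000      # cells in the imposed grid
bytes_per_cell = 4       # e.g., one 32-bit value per cell
events = 25              # output events
quantities = 100         # physical quantities per output event

bytes_per_snapshot = cells * bytes_per_cell
total_bytes = bytes_per_snapshot * events * quantities

print(bytes_per_snapshot / 1e9, "GB per output event and quantity")
print(total_bytes / 1e12, "TB in total")
```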
Embodiments of the present disclosure may include systems, methods, and computer-readable media for compressing topological grid data representing, e.g., an output from a physical property model of geological strata. In one embodiment, a method may employ a computerized system that obtains output data from such a model. The method may interpolate data for cells that represent strata “pinch-outs”. The method may then transform the interpolated topological grid data and truncate ordered lists of the resulting frequency domain coefficients to achieve reduced-size data with a known error rate. The method may capitalize on the relative smoothness of the physical property values in the model.
This summary is provided to introduce some of the subject matter described below and is not to be considered limiting.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments of the present teachings and, together with the description, serve to explain the principles of the present teachings. In the figures:
The following detailed description refers to the accompanying drawings. Wherever convenient, the same reference numbers are used in the drawings and the following description to refer to the same or similar parts. While several embodiments and features of the present disclosure are described herein, modifications, adaptations, and other implementations are possible, without departing from the spirit and scope of the present disclosure. Accordingly, the following detailed description does not limit the present disclosure. Instead, the proper scope of the disclosure is defined by the appended claims.
Disclosed herein are techniques for compressing the data output from physical property models of geological strata. Such techniques may capitalize on strong correlations among physical properties of spatially close cells and provide efficient compression schemes particularly suited for the outputs of physical property models of geological strata. Example process techniques can include the steps of obtaining topological grid data, interpolating data values for inactive cells, transforming the data, and truncating coefficients to obtain compressed data. These and other steps are discussed in detail below.
The dominating geometrical structures considered by geological strata models (e.g., temperature models) are geological layers or “strata”. A stratum can be defined as rock deposited during a specific geological time span (e.g., the Jurassic period, 200-145 million years ago).
In some embodiments, the model or the embodiment groups cells into larger blocks. For a fixed block size M, a model or an embodiment can divide the grid into blocks of M³ cells each. For example, for M=16, and for data with 1024×1024×96 cells total, the data may be quantized into 64×64×6 blocks. Such quantization can be used to simplify and break up modeling and/or compression calculations.
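The block partition can be sketched as follows. The function names are hypothetical, and the sketch assumes that every grid dimension is an exact multiple of the block size M; how partial blocks would be handled is not addressed here.

```python
def block_counts(shape, m):
    """Number of blocks along each axis for a grid of the given shape.

    Assumption: each grid dimension is an exact multiple of the block size m.
    """
    if any(n % m for n in shape):
        raise ValueError("grid dimensions must be multiples of the block size")
    return tuple(n // m for n in shape)


def locate_cell(cell, m):
    """Split a cell index into (block index, offset of the cell within its block)."""
    block = tuple(i // m for i in cell)
    offset = tuple(i % m for i in cell)
    return block, offset


counts = block_counts((1024, 1024, 96), 16)
cells_per_block = 16 ** 3
```

With the example dimensions from the text, `counts` is the 64×64×6 block layout, and each block holds 16³ = 4096 cells.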
Note that, in general, specific strata do not exist everywhere. For example, it can happen that layers have not been deposited at some locations, or might have been eroded at earlier times. The lack of a particular layer at a particular location is referred to as a “pinch-out”.
The imposed grid depicted in
Cells that represent pinch-outs are referred to as “inactive” and may not have physical property value(s) initially estimated for them by the model. However, models may interpolate physical values for inactive cells based on values from the surrounding cells as discussed below (e.g., in reference to block 504 of
At block 502, the method obtains topological grid data for a layered geological structure. The topological grid data may be output from a model of physical properties of the geological strata. The grid data can include, for example, physical quantities for each cell defined by the topological grid.
The data may be obtained in various ways. For example, an embodiment may be included as part of an implementation of the model itself, such that the model provides the data to the embodiment within an electronic communications channel. In such configurations, the model may produce blocks of raw data, and the embodiment may be applied block-wise to compress the data. As another example, the data may be obtained from a model that is logically and/or physically disjoint from the embodiment. In such configurations, the embodiment may obtain the data from the model over a network such as a local area network or the internet. As yet another example, the topological grid data may be obtained from a source other than the output of a model of physical properties of geological strata.
At block 504, the method interpolates data values for pinch-out cell representations. In general, inactive cells, which represent pinch-out locations, have undefined values for some or all modeled physical quantities. Because the next block in the method of
An example suitable interpolation technique for block 504 is a Laplace interpolation scheme. Laplace interpolation may use a mapping of the interpolation problem to a Laplace equation, where all defined values are taken as boundary conditions. The Laplace equation may be solved on the topological grid (e.g., illustrated in
After block 504, the resulting interpolated values are relatively smooth, which is advantageous for the subsequent steps of blocks 506 and 508. Smooth values are also advantageous for obtaining good compression ratios. The output of block 504 may be referred to as interpolated topological grid data.
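A minimal sketch of a Laplace-style fill is shown below. It is not the disclosed implementation: it assumes a two-dimensional grid held as a nested list, marks inactive cells with `None`, and relaxes the discrete Laplace equation with a fixed number of Gauss-Seidel sweeps. The function name `laplace_fill` and the iteration count are illustrative choices.

```python
def laplace_fill(grid, iterations=500):
    """Fill None entries of a 2-D nested list by relaxing the Laplace equation.

    Defined entries act as fixed boundary conditions. Each undefined entry is
    repeatedly replaced by the mean of its in-grid neighbours.
    Assumption: the grid contains at least one defined value.
    """
    rows, cols = len(grid), len(grid[0])
    unknown = [(r, c) for r in range(rows) for c in range(cols)
               if grid[r][c] is None]
    known = [v for row in grid for v in row if v is not None]
    start = sum(known) / len(known)  # initial guess: mean of the defined values
    out = [[start if v is None else float(v) for v in row] for row in grid]
    for _ in range(iterations):
        for r, c in unknown:  # Gauss-Seidel sweep: values are updated in place
            nbrs = [out[rr][cc]
                    for rr, cc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                    if 0 <= rr < rows and 0 <= cc < cols]
            out[r][c] = sum(nbrs) / len(nbrs)
    return out


# Three inactive cells between two defined values are filled with a smooth ramp.
filled = laplace_fill([[0.0, None, None, None, 4.0]])
```

Because each filled value is an average of its neighbours, the result contains no new extrema, which is the smoothness property the subsequent blocks rely on.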
At block 506, the method transforms the interpolated topological grid data from the spatial domain to the frequency domain. (Although the term “frequency domain” can refer to units of quantity per time, those of skill in the art understand that it can also refer to the output of a transform such as a Fourier transform, regardless of the type of units used for the input data.) A Fourier transform may be used for block 506 to transform the data values to the frequency domain by calculating according to, by way of non-limiting example, the following.
bk
In Equation (1) above, the term bk
In Equation (2), cos represents the trigonometric cosine function, and M represents block size as discussed above in reference to Equation (1).
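One common cosine transform consistent with the symbols named above (coefficients b, the cosine function, and block size M) is the unnormalized type-II discrete cosine transform applied along each of the three block axes. The sketch below assumes that form; the normalization and index conventions are assumptions and may differ from Equations (1) and (2). The name `dct3` is hypothetical.

```python
import math
from itertools import product


def dct3(block):
    """Unnormalized 3-D DCT-II of an M x M x M nested list.

    Assumed form: b[k1,k2,k3] = sum over (i1,i2,i3) of
    a[i1][i2][i3] * c[k1][i1] * c[k2][i2] * c[k3][i3],
    with c[k][i] = cos(pi / M * (i + 1/2) * k).
    """
    m = len(block)
    c = [[math.cos(math.pi / m * (i + 0.5) * k) for i in range(m)]
         for k in range(m)]
    coeffs = {}
    for k1, k2, k3 in product(range(m), repeat=3):
        coeffs[k1, k2, k3] = sum(
            block[i1][i2][i3] * c[k1][i1] * c[k2][i2] * c[k3][i3]
            for i1, i2, i3 in product(range(m), repeat=3)
        )
    return coeffs


# A perfectly smooth (constant) 2 x 2 x 2 block has a single nonzero coefficient.
constant_block = [[[2.0, 2.0], [2.0, 2.0]], [[2.0, 2.0], [2.0, 2.0]]]
coeffs = dct3(constant_block)
```

The direct summation costs on the order of M⁶ operations per block and is written for clarity only; a practical implementation would apply a fast one-dimensional transform along each axis in turn.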
Block 506 is not limited to this particular transformation. Depending on the input data, other transformations such as other discrete cosine transformations or discrete sine transformations may be used. Discrete wavelet transformations using, e.g., Haar, Daubechies, or Cohen-Daubechies-Feauveau wavelets are also suitable transformations.
Once the data is transformed, the output of block 506 may be referred to as frequency domain interpolated topological grid data.
At block 508, the method truncates the transformed coefficients to reduce the size of the data. In general, after applying block 506, only relatively few, mostly low-frequency, components have significant weight. Accordingly, truncating the coefficients to eliminate those that represent negligible components can efficiently compress the data without losing significant information.
Thus, block 508 may first sort or re-order the coefficients using a suitable ordering. The chosen ordering may be fixed for all blocks, but may differ for different physical quantities or events. The ordering may be selected so that, for the most part, the absolute value of the reordered coefficients is decreasing. Because, generally, only the low-frequency components have significant weights, the ordering may move the low-frequency components to the front and the high-frequency ones to the back. As an example, a three-dimensional reordering may proceed according to the pattern established by the following example sequence: b000, b100, b010, b001, b200, b110, b101, b020, b011, b002, etc.
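The example sequence can be reproduced by sorting index triples first by the sum of their indices and then, within each sum, in descending lexicographic order. That tie-breaking rule is an inference from the ten terms listed, not a rule stated in the text, and the function name is hypothetical.

```python
from itertools import product


def coefficient_order(m):
    """Order index triples (k1, k2, k3) from low to high frequency.

    Primary key: k1 + k2 + k3 (lower total frequency first).
    Secondary key: descending lexicographic order of the triple, an
    assumption chosen to match the example sequence in the text.
    """
    return sorted(product(range(m), repeat=3),
                  key=lambda k: (sum(k), tuple(-x for x in k)))


first_ten = coefficient_order(3)[:10]
```

Because the ordering depends only on the block size, it can be computed once and reused for every block of a given physical quantity.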
After the coefficients are re-ordered, block 508 truncates them to retain the first N coefficients; all coefficients beyond N in the ordering are discarded. The error introduced by this compression can be calculated by applying the inverse transformation and comparing the results to the input data. Block 508 can select a suitable value for N such that the maximum error introduced by the compression scheme is below a specific error limit (e.g., a percentage). That is, embodiments may prompt a user to enter an error limit, and the method can determine the smallest value of N that satisfies that limit.
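The selection of N can be sketched in one dimension. The sketch assumes an unnormalized type-II discrete cosine transform, the natural low-to-high frequency ordering, and an error limit expressed as a maximum absolute error rather than a percentage; all function names are hypothetical.

```python
import math


def dct(values):
    """Unnormalized 1-D DCT-II."""
    m = len(values)
    return [sum(v * math.cos(math.pi / m * (i + 0.5) * k)
                for i, v in enumerate(values))
            for k in range(m)]


def idct(coeffs, m):
    """Inverse of dct for m samples; coefficients beyond len(coeffs) count as zero."""
    return [coeffs[0] / m + 2.0 / m * sum(
                b * math.cos(math.pi / m * (i + 0.5) * k)
                for k, b in enumerate(coeffs) if k > 0)
            for i in range(m)]


def smallest_n(values, max_abs_error):
    """Return (N, kept coefficients) for the smallest N that meets the error limit."""
    m = len(values)
    coeffs = dct(values)
    for n in range(1, m + 1):
        approx = idct(coeffs[:n], m)
        if max(abs(a - v) for a, v in zip(approx, values)) <= max_abs_error:
            return n, coeffs[:n]
    return m, coeffs  # limit not reachable in floating point: keep everything


ramp = [float(i) for i in range(8)]
n_loose, kept_loose = smallest_n(ramp, 0.5)
n_tight, kept_tight = smallest_n(ramp, 1e-9)
```

A looser error limit never requires more coefficients than a tighter one, so the limit gives direct control over the trade-off between compressed size and accuracy.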
The output of block 508 may be referred to as truncated frequency domain interpolated topological grid data, or simply “compressed data”.
At block 510, the method stores the compressed data. That is, for each block, the first N of the coefficients are stored. Other relevant data (e.g., data representing the topological grid) may be stored as well. The data may be stored in persistent memory such as hard disk, flash memory, tape drive, or any other electronic storage technique that can accommodate bulk data. The data may be transmitted over a network such as a local area network or the internet prior to storage.
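As an illustration of storing the retained coefficients, the sketch below packs one block as a count followed by N little-endian 32-bit floats. This record layout is purely hypothetical and is not specified by the text; it only shows that the stored size scales with N rather than with the number of cells in the block.

```python
import struct


def pack_block(coeffs):
    """Serialize a block as: uint32 count, then that many float32 coefficients.

    Hypothetical layout, little-endian with no padding.
    """
    return struct.pack(f"<I{len(coeffs)}f", len(coeffs), *coeffs)


def unpack_block(buf):
    """Read back the coefficients written by pack_block."""
    (n,) = struct.unpack_from("<I", buf, 0)
    return list(struct.unpack_from(f"<{n}f", buf, 4))


record = pack_block([1.0, 0.5, -2.0])
restored = unpack_block(record)
```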
Collectively,
For the compression illustrated by
Numerous alterations and extensions of the described compression techniques are possible. For example, embodiments may employ a transformation other than a discrete cosine transformation for block 506 of
Furthermore, embodiments may consider and utilize dimensions greater than three. For example, geological property models of geological strata usually consider many different hydrocarbon components (e.g., up to 100), such as methane, ethane, propane, etc. The different hydrocarbon components can be enumerated, for instance, 0 for methane, 1 for ethane, and so on. For any hydrocarbon component-related output, e.g., the mass of a specific hydrocarbon component per cell, the component index can be considered as an additional dimension, as opposed to values mapped from a three-dimensional cell. This means, for example, instead of mapping a particular cell (i1, i2, i3) to the mass of a particular hydrocarbon component (with index j) ai
When both time and hydrocarbon component are taken into account as dimensions, the input data can be five-dimensional, represented symbolically as (i1, i2, i3, i4, i5), where (i1, i2, i3) enumerates the topological grid cell, i4 the hydrocarbon component, and i5 the time-step. In such an embodiment, a five-dimensional discrete cosine transform can be utilized. Using this technique can substantially increase the compression ratios obtained by the considered algorithm.
The processor system 800 may also include a memory system, which may be or include one or more memory devices and/or computer-readable media 804 of varying physical dimensions, accessibility, storage capacities, etc. such as flash drives, hard drives, disks, random access memory, etc., for storing data, such as images, files, and program instructions for execution by the processor 802. In an embodiment, the computer-readable media 804 may store instructions that, when executed by the processor 802, are configured to cause the processor system 800 to perform operations. For example, execution of such instructions may cause the processor system 800 to implement one or more portions and/or embodiments of the method of
The processor system 800 may also include one or more network interfaces 808. The network interfaces 808 may include any hardware, applications, and/or other software. Accordingly, the network interfaces 808 may include Ethernet adapters, wireless transceivers, PCI interfaces, and/or serial network components, for communicating over wired or wireless media using protocols, such as Ethernet, wireless Ethernet, etc.
The processor system 800 may further include one or more peripheral interfaces 806, for communication with a display screen, projector, keyboards, mice, touchpads, sensors, other types of input and/or output peripherals, and/or the like. In some implementations, the components of processor system 800 need not be enclosed within a single enclosure or even located in close proximity to one another, but in other implementations, the components and/or others may be provided in a single enclosure.
The memory device 804 may be physically or logically arranged or configured to store data on one or more storage devices 810. The storage device 810 may include one or more file systems or databases in any suitable format. The storage device 810 may also include one or more software programs 812, which may contain interpretable or executable instructions for performing one or more of the disclosed processes. When requested by the processor 802, one or more of the software programs 812, or a portion thereof, may be loaded from the storage devices 810 to the memory devices 804 for execution by the processor 802.
Those skilled in the art will appreciate that the above-described componentry is merely one example of a hardware configuration, as the processor system 800 may include any type of hardware components, including any necessary accompanying firmware or software, for performing the disclosed implementations. The processor system 800 may also be implemented in part or in whole by electronic circuit components or processors, such as application-specific integrated circuits (ASICs) or field-programmable gate arrays (FPGAs).
The foregoing description of several possible embodiments has been presented for purposes of illustration only. It is not exhaustive and does not limit the present disclosure to the precise form disclosed. Those skilled in the art will appreciate from the foregoing description that modifications and variations are possible in light of the above teachings or may be acquired from practicing the disclosed embodiments.
For example, the same techniques described herein with reference to the processor system 800 may be used to execute programs according to instructions received from another program or from another computing system altogether. Similarly, commands may be received, executed, and their output returned entirely within the processing and/or memory of the processor system 800. Accordingly, neither a visual interface command terminal nor any terminal at all is strictly necessary for performing the described embodiments.
Further, processor system 800 may implement a physical property model of geological strata as referred to herein. That is, the same processor system that implements the compression scheme for output data from a physical property model of geological strata may also implement the physical property model itself. In such an arrangement, block 402 of
The steps described need not be performed in the same sequence discussed or with the same degree of separation. Various steps may be omitted, repeated, combined, or divided, as necessary to achieve the same or similar objectives or enhancements. Accordingly, the present disclosure is not limited to the above-described embodiments, but instead is defined by the appended claims in light of their full scope of equivalents.
This application claims priority to U.S. Provisional Patent Application Ser. No. 61/775,952, filed on Mar. 11, 2013. The entirety of this provisional application is incorporated herein by reference.
Number | Date | Country
---|---|---
61775952 | Mar 2013 | US