This application claims the benefit of French Patent Application No. FR2200737 entitled “Detection and Correction of False Positive Classifications from a Product Sand Detection Tool,” filed Jan. 28, 2022, the disclosure of which is incorporated herein by reference in its entirety.
Timeseries classification plays an important role in the oil and gas industry as well as many other disciplines such as speech recognition, finance and medicine. Generally, timeseries classification techniques can be divided into feature-based and distance-based approaches.
In feature-based approaches, a feature extraction procedure is performed before a classification phase, whereas distance-based approaches have no feature extraction phase due to defined suitable distances through which the classification phase is carried out.
In distance-based approaches, distances can be computed over a raw or reduced representation or over decomposed coefficients (e.g. Fourier transform) of a timeseries. However performance of the distance-based approaches strongly depends on a quality of the timeseries alignment. Product sand detection tool (PSDT) signals have a low structural characteristic because sand inject events and a number of impacting sands, as well as many other factors in a downhole environment, are quite random. Consequently, distance-based approaches may not work effectively for detecting downhole sand entry occurrences.
In feature-based approaches, features such as, for example, mean, variance, maximum, minimum, entropy, power spectrum density, Fourier coefficients, autocorrelation function, etc., which capture statistics of signals that identify a certain class can be analyzed. A main advantage of the feature-based approaches is compact representation of a signal. However, because real-world signals tend not to be stationary due to a number of unpredictable factors, many more features may be required to capture informative content. Therefore, feature formulation and selection is very important when using feature-based approaches.
Use of a wavelet transform is another approach for exploiting time structure features. By using a wavelet transform, a timeseries waveform can be separated into “signal” and “noise” components, which can be used to obtain more informative features for classification purposes. Because PSDT waveforms at a sand inject entry point include specific patterns, the wavelet transform may be used to formulate some features.
Using a contemporary approach of deep learning for timeseries classification, which could be considered as an automatic learning feature-based approach, may be a good approach for detecting downhole sand entry points. However, this approach would require a large and well labeled data set and supporting hardware and software computational resources.
A classification model can easily be found in literature such as, for example, k-nearest neighbor, support vector machines, decision trees, random forest, logistic regression, and deep neural networks. Although these methods may perform differently, selection of these methods for a feature-based approach is mostly a grid search.
Embodiments of the disclosure may provide a method for detecting downhole sand entry points. A computing device receives a sand detection output of a product sand detection tool and a raw timeseries waveform corresponding to an input to the product sand detection tool. The computing device detects at least one downhole sand entry point at a logging depth based on the sand detection output of the product sand detection tool. In response to the detecting of the at least one downhole sand entry point, the computing device extracts a subset of features based on the raw timeseries waveform. The computing device determines whether the detecting of the at least one downhole sand entry point is a true positive or a false positive based on the extracted subset of the features and a trained Random Forest classifier. In response to determining that the detecting is the true positive, a remedial action is performed regarding the at least one downhole sand entry point.
In an embodiment, the method may include training a Random Forest classifier to produce the trained Random Forest classifier. The training of the Random Forest classifier includes the computing device randomly selecting the features based on the raw timeseries waveform to produce the subset of the features. The computing device determines which paired features of the subset of features have a higher average detection probability than others of the paired features based on using a training set of the features and known sand entry point outcomes. The computing device constructs the trained Random Forest classifier based on multiple decision trees, each of which is based on a respective pair of the paired features of the subset of features having the higher average detection probability.
In an embodiment, the method may include the computing device eliminating, as candidates for the subset of features, the features with a single unique value, the features with a correlation magnitude greater than 0.9 with respect to another of the features, and the features that do not contribute to a cumulative importance of at least 0.9. The randomly selecting of the features based on the raw timeseries waveform to produce the subset of the features includes randomly selecting the subset of features from the features not eliminated as the candidates for the subset of features.
In an embodiment, the method may include the computing device determining a probability of sand entry point detection based on decision trees formed from each feature of the subset of features paired with another feature of the subset of features. The computing device determines an average probability of sand entry point detection of each of the decision trees that pairs a same one of the subset of features with each different respective feature of the subset features. The computing device then may determine which of the decision trees that pairs the same one of the subset of features with each different respective feature of the subset of features has a highest average probability of the sand entry point detection. A pair of the subset of features is selected for the decision trees of the trained Random Forest classifier from the same one of the subset of features and the each different one of the subset of features for the decision trees having the highest average probability of the sand entry point detection.
In an embodiment of the method, the output of the product sand detection tool is more likely to report a false positive regarding detection of the downhole sand entry point than a true positive.
In an embodiment, the method may include the computing device creating a wavelet transform of the raw timeseries waveform. A noise portion of the wavelet transform is extracted and at least some of the features are extracted based on the noise portion of the wavelet transform.
In an embodiment of the method, the extracted features may include frequency domain features, basic features, and wavelet-based features.
Embodiments of the disclosure may also provide a computing system for detecting downhole sand entry points. The computing system includes at least one processor and a memory connected with the at least one processor. The memory includes instructions for configuring the computing system to perform operations. According to the operations, a sand detection output of a product sand detection tool and a raw timeseries waveform corresponding to an input to the product sand detection tool are received. At least one downhole sand entry point is detected at a logging depth based on the sand detection output of the product sand detection tool. In response to the detecting of the at least one downhole sand entry point, a subset of features are extracted based on the raw timeseries waveform. Based on the extracted subset of the features and a trained Random Forest classifier, a determination is made whether the detecting of the at least one downhole sand entry point is a true positive or a false positive. A remedial action regarding the at least one downhole sand entry point is performed in response to the determining that the detection of the at least one downhole sand entry point is the true positive.
Embodiments of the disclosure may further provide a non-transitory machine-readable medium having instructions recorded thereon for a processor of a computing device to perform operations. According to the operations, a sand detection output of a product sand detection tool and a raw timeseries waveform corresponding to an input to the product sand detection tool are received. At least one downhole sand entry point at a logging depth is detected based on the sand detection output of the product sand detection tool. In response to the detecting of the at least one downhole sand entry point, a subset of features are extracted based on the raw timeseries waveform. Whether the detecting of the at least one downhole sand entry point is a true positive or a false positive is determined based on the extracted subset of the features and a trained Random Forest classifier. In response to the determining that the detecting of the at least one downhole sand entry point at the logging depth is the true positive, a remedial action regarding the at least one downhole sand entry point is performed.
It will be appreciated that this summary is intended merely to introduce some aspects of the present methods, systems, and media, which are more fully described and/or claimed below. Accordingly, this summary is not intended to be limiting.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments of the present teachings and together with the description, serve to explain the principles of the present teachings. In the figures:
Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings and figures. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be apparent to one of ordinary skill in the art that the invention may be practiced without these specific details. In other instances, well-known methods, procedures, components, circuits, and networks have not been described in detail so as not to unnecessarily obscure aspects of the embodiments.
It will also be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first object or step could be termed a second object or step, and, similarly, a second object or step could be termed a first object or step, without departing from the scope of the present disclosure. The first object or step, and the second object or step, are both, objects or steps, respectively, but they are not to be considered the same object or step.
The terminology used in the description herein is for the purpose of describing particular embodiments and is not intended to be limiting. As used in this description and the appended claims, the singular forms “a,” “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will also be understood that the term “and/or” as used herein refers to and encompasses any possible combinations of one or more of the associated listed items. It will be further understood that the terms “includes,” “including,” “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. Further, as used herein, the term “if” may be construed to mean “when” or “upon” or “in response to determining” or “in response to detecting,” depending on the context.
Attention is now directed to processing procedures, methods, techniques, and workflows that are in accordance with some embodiments. Some operations in the processing procedures, methods, techniques, and workflows disclosed herein may be combined and/or the order of some operations may be changed.
In the example of
In an example embodiment, the simulation component 120 may rely on entities 122. Entities 122 may include earth entities or geological objects such as wells, surfaces, bodies, reservoirs, etc. In the system 100, the entities 122 can include virtual representations of actual physical entities that are reconstructed for purposes of simulation. The entities 122 may include entities based on data acquired via sensing, observation, etc. (e.g., the seismic data 112 and other information 114). An entity may be characterized by one or more properties (e.g., a geometrical pillar grid entity of an earth model may be characterized by a porosity property). Such properties may represent one or more measurements (e.g., acquired data), calculations, etc.
In an example embodiment, the simulation component 120 may operate in conjunction with a software framework such as an object-based framework. In such a framework, entities may include entities based on pre-defined classes to facilitate modeling and simulation. A commercially available example of an object-based framework is the MICROSOFT® .NET® framework (Redmond, Washington), which provides a set of extensible object classes. In the. NET framework, an object class encapsulates a module of reusable code and associated data structures. Object classes can be used to instantiate object instances for use by a program, script, etc. For example, borehole classes may define objects for representing boreholes based on well data.
In the example of
As an example, the simulation component 120 may include one or more features of a simulator such as the ECLIPSE™ reservoir simulator (Schlumberger Limited, Houston Texas), the INTERSECT™ reservoir simulator (Schlumberger Limited, Houston Texas), etc. As an example, a simulation component, a simulator, etc. may include features to implement one or more meshless techniques (e.g., to solve one or more equations, etc.). As an example, a reservoir or reservoirs may be simulated with respect to one or more enhanced recovery techniques (e.g., consider a thermal process such as SAGD, etc.).
In an example embodiment, the management components 110 may include features of a commercially available framework such as the PETREL® seismic to simulation software framework (Schlumberger Limited, Houston, Texas). The PETREL® framework provides components that allow for optimization of exploration and development operations. The PETREL® framework includes seismic to simulation software components that can output information for use in increasing reservoir performance, for example, by improving asset team productivity. Through use of such a framework, various professionals (e.g., geophysicists, geologists, and reservoir engineers) can develop collaborative workflows and integrate operations to streamline processes. Such a framework may be considered an application and may be considered a data-driven application (e.g., where data is input for purposes of modeling, simulating, etc.).
In an example embodiment, various aspects of the management components 110 may include add-ons or plug-ins that operate according to specifications of a framework environment. For example, a commercially available framework environment marketed as the OCEAN® framework environment (Schlumberger Limited, Houston, Texas) allows for integration of add-ons (or plug-ins) into a PETREL® framework workflow. The OCEAN® framework environment leverages .NET® tools (Microsoft Corporation, Redmond, Washington) and offers stable, user-friendly interfaces for efficient development. In an example embodiment, various components may be implemented as add-ons (or plug-ins) that conform to and operate according to specifications of a framework environment (e.g., according to application programming interface (API) specifications, etc.).
As an example, a framework may include features for implementing one or more mesh generation techniques. For example, a framework may include an input component for receipt of information from interpretation of seismic data, one or more attributes based at least in part on seismic data, log data, image data, etc. Such a framework may include a mesh generation component that processes input information, optionally in conjunction with other information, to generate a mesh.
In the example of
As an example, the domain objects 182 can include entity objects, property objects and optionally other objects. Entity objects may be used to geometrically represent wells, surfaces, bodies, reservoirs, etc., while property objects may be used to provide property values as well as data versions and display parameters. For example, an entity object may represent a well where a property object provides log information as well as version information and display information (e.g., to display the well as part of a model).
In the example of
In the example of
As mentioned, the system 100 may be used to perform one or more workflows. A workflow may be a process that includes a number of worksteps. A workstep may operate on data, for example, to create new data, to update existing data, etc. As an example, a workstep may operate on one or more inputs and create one or more results, for example, based on one or more algorithms. As an example, a system may include a workflow editor for creation, editing, executing, etc. of a workflow. In such an example, the workflow editor may provide for selection of one or more pre-defined worksteps, one or more customized worksteps, etc. As an example, a workflow may be a workflow implementable in the PETREL® software, for example, that operates on seismic data, seismic attribute(s), etc. As an example, a workflow may be a process implementable in the OCEAN® framework. As an example, a workflow may include one or more worksteps that access a module such as a plug-in (e.g., external executable code, etc.).
The process may begin with a computing device receiving a raw timeseries waveform (act 202), which may also be provided as input to the PSDT. The raw time series waveform may be provided by sensors located at a downhole logging depth The PSDT may analyze the raw timeseries waveform and may provide an output signal, which may be received by the computing device (act 204) and may indicate whether a downhole sand entry point is detected at the logging depth.
The computing device may determine whether the received output signal from the PSDT indicates that the downhole sand entry point is detected at the logging depth (act 206). If the computing device determines that the received output signal indicates that no sand was detected, then the process may indicate that no sand was detected (act 207) and the process may be completed.
Otherwise, if the computing device determines that the received output signal indicates that a downhole sand entry point was detected, the procedure may determine whether the detection of the downhole sand entry point was a true positive or a false positive by extracting a number of features based on the raw timeseries waveform (act 208) and using a binary classifier such as, for example, a trained Random Forest classifier (RFC), based on at least a subset of the extracted features, to determine whether the detection of the downhole sand entry point is the true positive or the false positive (act 210). If the binary classifier detects the sand entry point at the logging depth, then the computing device may indicate that the detection is the true positive (act 214). Otherwise, the computing device may indicate that the downhole sand entry point is the false positive (act 212). The process then may be completed.
If the true positive is determined, a remedial action may be taken. Remedial actions may include injecting artificial tackifying chemicals (e.g. agglomerants) or binders (conglomerants) into a well to stabilize formation material while maintaining sufficient permeability to enable production, or plugging of the well, as well as other remedial actions.
A set of features may be derived from the raw timeseries waveform received by the computing device, its wavelet-based noise-extracting version, and its frequency domain analysis.
Basic features of the raw timeseries waveform may include:
Some basic features are illustrated in
To amplify a difference in a pulse shape of a raw timeseries waveform, a wavelet transform may be adopted to extract noise from the raw timeseries waveform. A standard form of a tunnel-jet sand peak shows exponential decay. A mother wavelet function “db2”, shows a similar exponential decay as illustrated in
In addition to power spectral density (PSD) features, which are computed for the raw timeseries waveform and the wavelet-based extracted noise waveform, features in a frequency domain may be calculated based on a fast Fourier transform (FFT), an autocorrelation function (ACF), and a partial autocorrelation function (PACF) of the raw timeseries waveform. These are presented in this specification as follows:
As discussed above, there are many features that may be extracted based on a raw timeseries waveform. In various embodiments, some features may be eliminated as candidates for a subset of features that may be considered for forming decision trees of a Random Forest classifier. Based on a data set that includes expert labels for true and false downhole sand entry point detection, collinear and low importance features may be removed from consideration for use with decision trees of an RFC according to a following criteria:
To implement criterion (ii), a Pearson correlation score is used to cluster groups of features based on an Agglomerative Hierarchical Clustering algorithm with a magnitude correlation threshold of 0.9. Then, only one representative feature with a highest correlation to a target label is selected from each group and remaining features from the each group are removed from consideration for use with the decision trees of the RFC.
To implement criterion (iii), a simple RFC is used to train with the data set. An importance score based on a Gini impurity measure is used for removing features that do not contribute to a cumulative importance of 0.9.
The process may begin with the computing device eliminating features from being candidates for the subset of features to be considered for use in forming decision trees for a RFC, as previously discussed (act 602). Next, a subset of remaining features not eliminated as being the candidates may be randomly selected (act 604). Next, the computing device may determine, for each pair of features from the randomly selected subset of features, a probability of detecting a downhole sand entry point, given that the PDST provided a true outcome with respect to detection of the downhole sand entry point. The probability may be determined based on using training data and expert labels indicating known sand entry point detection outcomes (act 606). An average probability of detecting a downhole sand entry point may be calculated for each group of decision tree classifiers that use a same feature of the subset of features paired with another feature of the subset of features (act 608). Following act 608, the computing device may form an RFC based on a group of the decision tree classifiers having a highest average probability of detecting downhole sand entry points with respect to other groups of decision tree classifiers (act 610). In some embodiments, decision tree classifiers may be limited to a depth of 4.
Looking at
In some embodiments, the methods of the present disclosure may be executed by a computing system.
A processor may include a microprocessor, microcontroller, processor module or subsystem, programmable integrated circuit, programmable gate array, or another control or computing device.
The storage media 806 may be implemented as one or more computer-readable or machine-readable storage media. Note that while in the example embodiment of
In some embodiments, computing system 800 contains one or more sand entry point detection modules 808. In the example of computing system 800, computer system 801A includes the sand entry point detection module(s) 808. In some embodiments, a single sand entry point detection module 808 may be used to perform some aspects of one or more embodiments of the methods disclosed herein. In other embodiments, a plurality of sand entry point detection modules 808 may be used to perform some aspects of methods herein.
It should be appreciated that computing system 800 is merely one example of a computing system, and that computing system 800 may have more or fewer components than shown, may combine additional components not depicted in the example embodiment of
Further, the steps in the processing methods described herein may be implemented by running one or more functional modules in information processing apparatus such as general purpose processors or application specific chips, such as ASICs, FPGAS, PLDs, or other appropriate devices. These modules, combinations of these modules, and/or their combination with general hardware are included within the scope of the present disclosure.
Computational interpretations, models, and/or other interpretation aids may be refined in an iterative fashion; this concept is applicable to the methods discussed herein. This may include use of feedback loops executed on an algorithmic basis, such as at a computing device (e.g., computing system 800,
The foregoing description, for purpose of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or limiting to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. Moreover, the order in which the elements of the methods described herein are illustrated and described may be re-arranged, and/or two or more elements may occur simultaneously. The embodiments were chosen and described in order to best explain the principles of the disclosure and its practical applications, to thereby enable others skilled in the art to best utilize the disclosed embodiments and various embodiments with various modifications as are suited to the particular use contemplated.
Number | Date | Country | Kind |
---|---|---|---|
FR2200737 | Jan 2022 | FR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/011388 | 1/24/2023 | WO |