The present invention is directed to systems, devices, and methods for using remote hyper-spectrum scanning to trace and/or track refined oil products back to an original source. Measurements obtained are used to determine the origin of a particular oil and may include, without limitation, particle size, cone penetration, dropping point, steel mesh oil separation, moisture, and PQ concentration.
Refined oils, including gasoline, diesel, and lubricating oil, are products with special traceable attributes. Due to various technical factors and deviations in the refining process at all refineries, such as equipment used, equipment assembly, process flow, time, temperature, and the like, the composition of each production batch of refined oil will have small, but detectable differences. For example, each composition will have a different degree of sulfur and olefin content, octane and cetane content, or have difference in additives and contaminants caused by non-standard transportation.
In some instances, it has been found that used lubricating oils are placed back on the market as new oil after being recycled and filtered. In another report, fuel tankers are alleged to have been used to transport food-grade liquids (e.g., cooking oil, syrups, etc.) without sufficient cleaning between uses, resulting in both contaminated food liquids and contaminated fuel (see, China Food Safety Scandal: Cooking Oil Carried In Same Trucks As Fuel,” by Mccarthy, Simone and Jiang, Joyce, CNN Business, Jul. 10, 2024). With rapid economic growth and an ever increasing demand for refined oil, retailers such as gas stations, lubrication oil suppliers, and other enterprises providing an important channel for oil and fuel sales, the quality of oil products being sold is critical. These business must be able to not only follow the oil refining process standard, but also represent that a first priority for the refinery sales channel is to provide safe, reliable, and quality oil products for consumers.
Unfortunately, the quality of refined oil can only be examined at a properly equipped laboratory. Further, such a laboratory is often located remotely, which requires sending samples to the laboratory. This process takes time to test and requires professional personnel to get involved to run the complex diagnostic instruments. Accordingly, it is not practical to perform lab testing on each batch of refinery oil as it arrives for retail use by businesses.
Technology has made it possible to have portable, battery-powered lubricant analysis tools with capabilities comparable to equipment used in laboratories. Such a device is disclosed in U.S. Pat. No. 9,791,386 B2 to Henning et al. (the '386 patent) and assigned to Spectro Scientific, Inc. of Chelmsford, MA (see https://www.sprectrosci.com/the-latest/press-releases/fieldlab-58-portable-fluid-analysis-system-from-spectro-scientific-boosts-performance-with-new-x-ray-fluorescence-xrf-engine/). The '386 patent is hereby incorporated by reference. The '386 patent disclosed device integrates four analytical technologies, including X-Ray Fluorescence (XRF) for elemental analysis, a filter particle qualifier (FPQ) pore blockage particle counter, an infrared (IR) spectrometer. and a kinematic viscometer (40° C.). Through its four manual operations, the device can generate test results in 5-7 minutes. However, such a device typically weighs about 15 kg (about 33 pounds) and requires extensive training, if not expertise, to operate.
What is needed is a system, device and method which can combine hyperspectral data processing and a Cloud-based element spectrum database to generate quantitative and qualitative analysis in less than a minute with laboratory-comparable results. Such an invention would give owners and managers in the refined oil business a more convenient and effective instrument for tracing the origin of refined oil at the time of delivery. The inventive instrument should be capable of operation by a less skilled person with high throughput. Further, samples (less than 3.5 ml) should be repeatedly inspected, stored, and traced by the system and method.
Use of a 400 nm to 1,000 nm spectral band hyperspectral reflectance technique to quantitatively detect an amount of metal dust and chemical components in an oil, breaks with conventional emission spectrum analysis methods (atomic spectrometer technique), and significantly simplifies detection equipment, making it easily portable at about 1 kg (about 2.5 lbs). Detection is rapid, achieving acquisition of all detection results in only 3-6 seconds. Operation of the device is simple too and does not require specialized personnel. And, both operation and maintenance are inexpensive due, in part, to the use of very few consumables, with daily startup calibration requiring no consumables.
However, prior art devices and systems which use a hyperspectral reflectance technique to detect quantities of metal dust and chemical components in oil present major challenges to those of skill in the art.
For example, limited by the range of the spectral bands of the collected reflected light, the effort to identify and quantitatively detect the amounts of the metal dusts and chemical components contained in an oil sample through the existing spectral bands does not produce enough information. We can learn about a set of spectral bands corresponding to each of the detected elements, which is helpful for the separation thereof from the collected hyper-spectrum. It is also possible to establish a model according to the oil sample to be detected and the distributions of the contents of the detected components to make calculations for the oil sample to be detected and provide a statistical and inference regression algorithm. However, as a result of the interference by non-detection (random) components in an oil sample, the impact of temperature and environment on the optical components, and the impact of the operation on the instrument's structural design, detection errors might be introduced directly into the testing process. In order to make the technology of quantitative detection of oil components by hyperspectral reflectance technology practical in the application thereof, it is crucial to improve the overall accuracy of the algorithm model, the components of the device, and the structural design of the device
As to prior art methods for analyzing oil components, at present atomic emission spectrometry and the Raman method are widely used by those of skill in the art. Atomic emission spectrometry is based on optical emission spectroscopy (OES) using a rotating disk electrode (RDE). The method employs a huge potential difference in discharge present between the RDE and an external rod electrode in an oil sample. This achieves vaporization and plasma conversion of the oil sample under action of an arc excitation source. The characteristic spectra corresponding to the various elements contained in the plasma-converted oil sample being tested are obtained through the excitation. The optical system of the spectrometer then collects, distinguishes, and quantifies the excited emission spectra.
An atomic emission spectrometer (AES) comprises three modules: 1) an excitation source, which applies external energy to the oil sample being tested and excites the elements in the oil sample so that they emit characteristic spectra; 2) an optical system to distinguish and identify the characteristic spectral lines corresponding to specific elements in the emission spectrum; and 3) data processing and a display, to distinguish and measure the intensity of the characteristic spectral lines of each measured element generated after beam splitting by the spectral system, and to convert the detection results into quantified results in specific units which are directly displayed to the operator.
The excitation process of the excitation source consists of causing an electric arc (or high-temperature spark) generated by the discharge to directly act on an oil sample to vaporize it and release energy. This process requires a dark chamber, and the characteristic spectrum produced by the vaporized (plasma converted) element is a three-dimensional random space (the distance from the light source to the receiving probe surface) relative to the spectrum receiving surface (probe). Using an AES, the results have an average error of about ten percent (10%), even when continuously testing an identical oil sample. Cleaning of the excitation source and oil sample dark chamber, and single-use consumables for the rotating disk electrode and the oil sample container increase cost per test.
The optical system in an atomic emission spectrometer has a detection capability that includes all spectral ranges used to characterize the spectral lines of detected elements. Since the characteristic spectral lines emitted by many elements lie outside the visible light spectrum, a spectral range of 400 nm to 2,300 nm is required for receiver identification and detection. Most light in the far-ultraviolet spectral range (i.e., 1,000 nm to 2,300 nm) will be absorbed (attenuated) when transmitted through the air. In order to collect and analyze these characteristic spectral lines in the far-ultraviolet spectral range, the optical system must be placed in a specially provided vacuum chamber. Alternatively, a special inert gas that has no absorbing effect on the ultraviolet spectrum can be injected into the optical system to ensure that the emitted characteristic spectral lines reach the optical grating system. Following refraction and diffraction, the light is projected onto a photoelectric conversion device (PMT). A sealed vacuum dark chamber, vacuum pump, and gas supply system (pressurizing pump) are consequently essential parts of a spectrometer. These components have direct implications for the volume, weight, and complexity of the spectrometer.
Due to the complexity of an atomic emission spectrometer (AES), data processing needs to perform a power-on calibration of the system daily. Calibration is done using standard oil samples of known elemental composition. Generally, three different standard oil samples are required for calibration when turning a unit on each day—as much as 40 minutes are required just to warm up the AES each time it is turned on. The equipment also requires different standard oil samples (i.e., different elemental concentrations and distributions) for standardization on a regular basis. As a result, spectral equipment relies on availability of standard oil samples, increasing the complexity of equipment operation and maintenance. In addition, because the AES must be operated in a vacuum dark chamber, the size of the instrument is massive.
Finally, an atomic emission spectrometer is only suitable for the detection of metal components. It does not have the ability to detect chemical components, flash points, viscosity, particle size, or the like, giving it a significantly limited usefulness.
The following Table shows just some of the deficiencies of comparable analytic devices:
1Other includes chemical composition, particle size, viscosity
Likewise, the Raman method has limitations and downsides. The Raman method uses a light source to irradiate light onto the oil. When the light refracts, a very small amount (about 1%) of the light is absorbed by the oil and changes frequency. The components in the oil are identified by detecting this portion of the light. While the Raman method is pollution-free, it can only analyze macromolecular information in oil. Due to the small amount of information (about 1%) and the high-precision requirements of the equipment, a single unit is expensive, and operation and maintenance costs are high.
The present disclosure describes methods, systems, and devices for simultaneously obtaining the detection results of metal components, particle size, viscosity, and chemical components by means of the spectral characteristics of an oil sample, oil sample sampling point information, and machine learning in a single operation. The disclosed invention achieves the effect of simplifying operation, economizing on consumables, and real-time portable detection. Further, it allows one to take the laboratory into the field and eliminates the need for operation by specialized personnel.
In the process of spectral analysis, multiple spectral bands having the main characteristics within a spectral band range are usually selected for model calculation to detect the target element components and contents. Hyperspectral oil analysis comprises a set of optical systems and algorithmic models. The accuracy and reliability of the optical system and the acquisition of algorithmic models based on the spectral bands of main characteristics-such as elements, ion groups, particle size, and viscosity-permit algorithmic rapid linear regression (convergence), and the resolving of multiple correlations and consistency to determine the quantitative capability of hyperspectral oil analysis.
Since characteristic spectral bands of no two distinct elements completely overlap in nature, the element spectral bands can be extracted to identify and quantitatively analyze elements being detected. When more than one element being detected is present in an oil sample, a series of spectral lines with various wavelengths corresponding to individual elements will appear in the spectral plot, generally within a range of several tens of spectral bands. These spectral bands will be intermixed with the spectral bands of other elements and may even overlap. These spectral bands must be separated, and the target element spectral bands must be extracted to identify and quantitatively analyze the elements.
Until the invention of the present application, these and other problems in the prior art went either unnoticed or unsolved by those skilled in the art. The present invention provides a system and methods using remote hyperspectral sensing technology to produce a relatively lightweight testing instrument. As a result, the disclosed invention achieves notable improvements in testing with the associated device without sacrificing portability, ease of use, and accuracy of results.
There is disclosed herein an improved system and method for analyzing refined oil products to verify source and confirm product is contaminate free, which avoids the disadvantages of prior devices while affording additional structural and operating advantages.
Generally speaking, the method is used for verifying a source of an oil product. As used herein. the term “oil” generally refers to both crude/petroleum oil and food oil. Likewise. the term “refined oil” shall include both refined crude oils as well as refined food grade oils. including canola, coconut, cotton seed. grapeseed, lard, palm. safflower. peanut (and other nuts), sunflower. olive, and vegetable oils. The oil refining process is broadly referenced and may simply be the result of gravity (e g., settling). filtration, addition of chemicals. separation. or any other known oil refining process.
The method comprises obtaining a plurality of samples for each of a plurality of oil products from a specific refinery/source; scanning each of the plurality of samples with a hyperspectral scanner to produce a hyperspectral image for each of the plurality of samples, associating a produced hyperspectral image with each of the plurality of oil products, creating a spectral characteristics model database using the hyperspectral images produced for the plurality of oil products. scanning at least one sample of a delivered oil product from an unverified refinery/source with a hyperspectral scanner to produce a hyperspectral image for the delivered oil product, comparing the hyperspectral image for the delivered oil product against the hyperspectral images in the spectral characteristics model database, and determining if a matching hyperspectral image from the spectral characteristics model database matches the hyperspectral image of the delivered oil to verify the unverified refinery/source of the delivered oil product.
In specific embodiments, the hyperspectral images are in the range of from 400 nm to 1,000 nm. Further, scanning each of the plurality of samples with a hyperspectral scanner to produce a hyperspectral image for each of the plurality of samples comprises at least one of varying at least one ambient condition of the scanning process, and alternating between at least two different hyperspectral scanners. The ambient conditions may include temperature, pressure, humidity, lighting, and the like.
In other specific embodiments, the method includes rejecting the delivered oil product if no matching hyperspectral image can be found in the spectral characteristics model database. Further, the scanning at least one sample of a delivered oil product from an unverified refinery/source is completed in less than one minute per sample, preferably less than 30 seconds per sample, and most preferably less than 10 seconds per sample.
It is preferred in most embodiments that the spectral characteristics model database is stored to a Cloud-based server.
In preferred embodiments, the hyperspectral scanner comprises a light emitter for directing light into a sample, a light receiver to receive reflected light from the sample, a photoelectric converter for converting incremental bands of the reflected light into sample reflectance numbers, and a wireless transceiver for communicating to the Cloud-based server, the server comprising processing software to match the hyperspectral image of the delivered oil to a hyperspectral image in the database.
These and other aspects of the invention may be understood more readily from the following description and the appended drawings.
For the purpose of facilitating an understanding of the subject matter sought to be protected, there are illustrated in the accompanying drawings, embodiments thereof, from an inspection of which, when considered in connection with the following description, the subject matter sought to be protected, its construction and operation, and many of its advantages should be readily understood and appreciated.
While this invention is susceptible of embodiments in many different forms, there is shown in the drawings and will herein be described in detail at least one preferred embodiment of the invention with the understanding that the present disclosure is to be considered as an exemplification of the principles of the invention and is not intended to limit the broad aspect of the invention to any of the specific embodiments illustrated.
With reference being made to
As shown in
A 400 nm-1000 nm halogen light source is used for analysis. The oil sample to be analyzed is loaded into a rectangle cuvette 14 with a volume of preferably about 3.5 ml, having light transmittance on both sides of about +90% (wavelength of 350 nm-2000 nm). The cuvette 14 is inserted into a dark chamber to obtain the spectrum of a specific wavelength (also known as “hyperspectral characteristic bands”) through the reflected light path. The reflected light is generally expressed with the characteristic spectral bands of ion clusters (molecules). It is characterized in that its full width at half maximum (FWHM) is wide, and the element (atomic) spectral band (which is very narrow and looks like a line) intrudes into the spectral band. Because there is no complete overlapping of the characteristic spectral bands of two different elements in the world, the spectral bands of the elements are extracted to achieve the identification and quantitative analysis of the detected elements. When more than one detected element is present in the oil sample to be detected, a series of spectral bands of various wavelengths corresponding to each element appear in the spectrum, usually in a range of dozens of spectral bands. These spectral bands are mixed or even overlap with those of the other elements. These spectral bands must be separated to extract the spectral bands of the target element in order to achieve identification and quantitative analysis of the element. In the process of the spectral analysis, multiple spectral bands with main characteristics within the range of the spectral bands are usually selected for model calculation to achieve detection of the components of the target elements and their content. Even so, the hyperspectral oil analysis system consists of a set of optical systems and algorithm models. The accuracy and reliability of the optical system and the acquisition of the algorithm models according to the main characteristic spectral bands of the elements result in a fast linear regression (convergence) for the algorithm, and the solutions of the multiple correlations and consistencies determine the ability of quantitative detection of the hyperspectral oil analysis method.
Based on the output of sampling and the optical system, reflectance and DN values (radiance values) of the oil sample being tested are obtained. Assuming the relationship between the emission spectrum of the element being detected and the intensity of each spectral band corresponding to an element concentration is known, a numeric label for the concentration level (ppm) of the detected element in the oil sample being tested can be calculated. If a sufficient number of oil sample component densities (component densities of all detected elements) and their corresponding spectra with different metal element concentrations have been stored in a database-for example, a model of a number of oil samples for 100 hours and 200 hours of machinery operation-the oil samples being tested between these can be calculated by the principle of partial least squares by adopting the spectral band n as an independent variable {x1, . . . , xp} to calculate the test index p as a related dependent variable {y1, . . . . vn}.
Based on the statistical relationship between a dependent variable and an independent variable, the parameters of the oil sample being tested (the model of the oil sample) among multiple known oil sample points in the system database are observed, and data tables of the independent variable and the dependent variable, X={x1, . . . , xp} and Y={y1, . . . . vnn}, are formed. Partial least squares regression separately extracts the first components t1 and u1 in X and Y based on the indexes of the oil samples being tested and the corresponding spectral bands based on the capacity afforded by independent variable components to analyze dependent variable components (where the test indexes correspond to known spectral bands). Partial least squares regression implements the regression of X on t1 and the regression of Y on u1, respectively. If the regression equation achieves a satisfactory result (maximization of accuracy or change trend), the algorithm terminates. If a satisfactory result is not achieved, a second round of component extraction is performed using the remaining information after X has been interpreted by t1 and the remaining information after Y has been interpreted by u1. The process is repeated until satisfactory accuracy is achieved.
For example, using a spectrum (reflectance and DN values) obtained from an oil sample being tested after running for 160 hours, if m component t1, t2, . . . , tm bands are ultimately extracted from spectrum X, a partial least squares regression will be conducted by regressing yk (the index of some element component) for the t1, t2, . . . , tm bands in an inversion calculation to obtain a certain element index of the oil sample being tested.
The data processing and quantitative calculation algorithm inputs the spectrum (the split reflectance and DN value) of the oil sample being tested based on spectral model parameters, and quantitatively calculates a detection result by partial least squares regression inversion. In practice, combined with the detection objectives of application scenarios, the actual collection of oil samples and the corresponding laboratory test results are used for one-time calibration of the model.
In the present invention, different models are combined to separate, calculate, and analyze the reflectance and DN values of the characteristics of the spectral bands of oil samples that have been detected once, to separately obtain corresponding detection results for different models. Examples of this are the metal components, particle size, viscosity, and other chemical components. This reduces the detection operation and increases testing speed.
As will be described in detail below, the spectral models are constructed from a set of oil samples that reflect actual application scenario oil changes and correspond to their spectral characteristics. The oil samples cover the entire life cycle of an oil in, for example, manufacturing equipment, such as the entire cycle of lubricating oil in rotating equipment from the replacement of the oil to a subsequent oil change. Based on the specific application scenario, 20 to 30 oil samples are generally selected for modeling.
In practical application, testing accuracy is achieved by obtaining high-quality oil samples or by diluting and mixed modeling of standard oil and actual oil samples. The present disclosure focuses on describing how to apply the algorithm models established based on different test index categories to adapt machine learning to base oil samples. By means of a single detection operation, the detection effect of cross-index types can be achieved.
The proper operation of heavy equipment is critical to a manufacturer. The health of the equipment is evaluated, managed, and maintained through active operation and maintenance to achieve normal operation. The analysis of oil in mechanical equipment is a key link and technique for evaluating the health of equipment. Oil testing complements vibration analysis, thermal imaging, and other predictive maintenance techniques to monitor, diagnose, and assess the health of equipment. However, oil testing is a complex physical and chemical process that mostly still depends on on-site collection for laboratory testing. Unfortunately, mechanical conditions can change significantly within the time required for a laboratory to return oil sample results, such as for aircraft engines. Under certain circumstances, the detected indexes relate to metal components, particle size, viscosity, and chemical components. Examples are lubricating oils and hydraulic oils. Hyperspectral oil testing technical equipment affords on-site oil analysis and real-time detection intended to eliminate long waiting times and achieves comprehensive detection to allow timely decision-making for machine operations.
As previously noted, the principle of the hyperspectral oil analysis technique is based on reflectance and DN values obtained from oil samples tested by photoelectric sensors. Model algorithms are used to infer the composition of the oil being tested. Models are established based on the components of oil samples tested using the results of known oil laboratory test components. The same (400 nm-1,000 nm) spectrum, reflectance and DN values from identical oil samples being tested are subjected to different models and corresponding algorithms to obtain corresponding detection results. The results indicate that for a single detection operation, the above mechanisms can be integrated to complete different tests by the following operation process (see
1) Establishing relationships between sampling points and test indexes. The sampling points are fixed relative to the equipment and the oil it uses. Only the sampling time changes. The component content of the oil samples collected varies depending on the time of collection. The test index refers to the algorithm model corresponding to the hyperspectral oil testing equipment. The algorithm model varies depending on the operating equipment (sampling point), oil brand, grade, and type of test index (such as metal components, particle size, viscosity, and chemical components). The test index and type are fixed relative to the sampling point. By means of hyperspectral oil testing equipment system software (application software), a corresponding relationship is established between the operating device, the sampling point, the oil being tested, and the tested indexes.
2) The oil sample being tested is bundled with the sampling point. The sampling point and the oil sample collection container (for example, a cuvette) are bundled during the collection operation. The sampling point is fixed, and the oil sample container is random. It therefore suffices to establish a relationship between the number, name, or a unique identification (UID) number of the collection container and the sampling point during collection. The sampling point is fixed relative to the equipment and the oil being tested and can naturally be bundled with the oil algorithm model. This operation is based on establishing a relationship between the sampling point and the oil sample model to be tested through the application software during initialization. It is not hard to understand that a sampling point can correspond to multiple model algorithms, and this means that the sampling point must be tested for different types of components. For example, metal components and chemical components may be simultaneously required. The application software runs on a cloud server or on an offline computer. It performs differentiated management, device management (of hyperspectral oil testing equipment), data management (report generation, trend changes, tracing) and the like for the brands, manufacturers, and types of oil samples being tested.
3) The oil being tested is transferred from the collection container to a cuvette, and an oil sample UID number is assigned. The cuvette containing the oil sample is placed in the cuvette dark chamber of a hyperspectral oil testing device. The sampling point is selected through a human-computer interactive page, the number of the oil sample to be collected (optional) is selected/input, and the test operation is executed. The optical system of the hyperspectral oil testing device generates the reflectance and DN values of the oil sample. The reflectance and DN values are bundled with the sampling point and oil sample identification number (or test time if decided not to input identification number) and uploaded to the algorithm model server. After a few seconds, the required test results for that sampling point are returned and displayed. The test results can be retrieved, traced and sorted by number (or test time). The oil sample number is bundled with the sampling point and collection time. Based on the time, the detection results of all of the collected data of the sampling point can be sorted and displayed based on a time axis. The trends of the changes of the oil at the sampling point are clear at a glance. The following describes in detail how the system pushes oil samples to different model algorithms based on sampling points.
4) The model algorithms vary depending on oil performance, brand, grade, operating equipment, and the components being detected. Depending on the application scenario, the sampling point may sometimes be quite sensitive and require a special algorithm to “correct” the existing model or to train the model by machine learning to adapt to the base oil sample (performance, viscosity, contamination, brand, grade). This method will be discussed in detail in the next section. It can be understood here that the model is a parameter matrix A,
Herein, n denotes a 400 nm to 1,000 nm spectral reflection band, with 300 discrete spectral lines, and m denotes the number of components of the oil sample being tested, such as 24 metal components. For an algorithm model Y=A⊙X, Y is a mixed reflection spectrum vector, X is a test index vector, A is a model matrix, and ⊙ is an algorithm. Models and algorithms vary depending on the different test components they calculate, while a mixed reflectance spectral vector does not vary. Test index (category) vectors, calculation results, and algorithms and model matrices differ and are distinguished. A hyperspectral oil testing equipment system will establish an algorithm model library based on oil performance, brand, grade, operating equipment, detection components, and sampling points. The application software “connects” (bundles) different model algorithms based on sampling points. When a detection operation selects a sampling point, the application software determines which model algorithms need to be called for the reflectance and DN values obtained by the collection operation based on the settings.
5) The application software spawns threads based on different model algorithms, and simultaneously pushes the reflectance and DN values of the oil sample being tested to each thread for model calculation. Since the algorithms of each model are independent, the amount of data for unified input of the reflectance and DN values of the oil samples being detected are limited, thereby resulting in independent vertical technology permitting parallel operation. Therefore, the calculation time will not increase due to an increase in the test types at a certain sampling point. For the user, the detection operation of the hyperspectral oil testing equipment is unrelated to the number (types) of detection categories. The user clicks on the execution button on the human-computer interactive page of the device, and the detection results can be displayed within 5-6 seconds.
The application software focuses on cross-model detection (multi-models). When calling the model (detection) for the first time, the model must be based on a sample of the base oil. Adaptive learning improves detection accuracy. The first call (based on a sampling point) can be assigned to device initialization. The adaptive learning logic is discussed in detail in the next section.
6) In the detection data summary, the result report generates an operation process containing distribution calculations and a synchronized result data summary. Computation synchronization monitors the running of all threads to ensure that the operation of the last algorithm model thread has been completed and results have been generated. Then the results of all thread operations are summarized, and the results are pushed to the front end application based on the data and the report format. When a certain thread is still being calculated, all results are summarized and pushed to a front end application. The results of threads whose operations have not yet been completed become indeterminate data or erroneous results. The delay caused by synchronizing the operation results of each thread should be on the order of microseconds or milliseconds, which will not affect the displaying of results or the operational experience of the end customer.
When multiple threads run independently at the same time and the start and completion times differ, synchronization (a software-specific function) must be used to ensure that all threads are completed before fetching the results (or continuing to the next stage of operation). Otherwise, the results will be incomplete (software-defined results at this point may not be useful). Here, ensuring that the last algorithm model thread operation has ended (which is random) refers to the software function of synchronously monitoring completion of the threads.
In general, specific oil samples are used for equipment modeling. The model that is established is extremely sensitive to the base oil manufacturer, brand, grade, viscosity, and degree of (particle) contamination. Cross-model testing of identically collected oil samples, such as the detection of metal components, chemical components, and oil viscosity, cannot guarantee that the base information of the oil sample being tested (such as No. 0 oil) will match the modeled oil sample, and this inevitably interferes with the model and affects accuracy. The model must be re-learned based on the sampling point (oil sample) to achieve relative detection accuracy. The method of adaptive relearning is a key link in the implementation of automated cross-model detection. The method adopted is folded (subset) interleaving to predict responses. In this “prediction,” the partial least squares modeling prediction (detection) described in the technical principles of this invention is adopted. Some of the subsets are used for observation, the number of subsets is related to the distribution gradient of the oil sample group being modeled, and the result “response” is measured by the mean square error (MSE).
Here, k denotes the number of the model prediction learning cycle, which is determined by the distribution gradient of the oil sample group; yi denotes the ith observation response: and f(xi) denotes the ith prediction learning result (detection value). The better the model learns to predict the observed value, the smaller the MSE.
The detection model requires the establishment of the distribution of multiple oil sample points through the entire life cycle of the oil. The oil sample batches provided by customers are often in a certain area (point) in the oil sample life cycle distribution. Diluted (oil sample) modeling accordingly becomes necessary. A “representative” oil sample is selected in the batch of oil samples to be diluted with the base oil (no. 0 oil) to establish a distribution gradient and constitute an oil sample group for modeling. The subset refers to the sum of the prediction and training sets in the folded subset interleaved prediction response method, which depends on the distribution gradient and must be smaller than the number in the sample group of the oil being modeled.
The reason why this depends on the distribution gradient (correlation) is that each dilution point (point of the distribution gradient) can be understood as a model parameter matrix point. A continuous area is formed between the matrix points by fitting (by the partial least squares method). Therefore, a known oil sample (with test results) and a base oil are used for dilution to form distribution gradient points (training and prediction sets). The results for each point are known. It is used in this manner to train and improve existing models. The accuracy of the distribution gradient points (known) depends on the specific gravity volume dilution method.
Embodiments of the present invention use the following principles to calculate the MSE for a given model:
Repeating the above steps k times, each time using a different training and test set (different gradients for modeling oil samples). Depending on the number of iterations the model is trained on, the predicted value will approach ever closer to the output of the model training set.
The overall test MSE is calculated as the average of k test MSEs, which is shown in
In practice, the following disclosed procedure is used to calculate the MSE for a given model:
Actual sampling point oil samples with laboratory test results are diluted by the volume specific gravity method and combined with the base oil (no. 0 oil) based on a concentration gradient to obtain a group of oil samples of known distribution. The oil sample group is generated by a hyperspectral oil analysis device to generate a set of reflectance and DN values. Among these, based on the spectrum, each group of DN values corresponds to a reflectance group,
R(λ)DN is the standard plate DN value (obtained when the equipment is turned on and calibrated every day), R(Ki)DN is the DN value of the oil sample being tested at the distribution gradient Ki position, and R(Ki) is the reflectance at the distribution gradient Ki position of the oil sample being tested. The dark current is the DN value (also known as the background noise of the equipment in the dark chamber, which is obtained when the equipment is turned on and calibrated every day) obtained by the hyperspectral oil testing equipment without any light source illumination. Ki is the test set, and the series of reflectance and DN values other than Ki are the training set.
As illustrated in
The halogen light source and the probe for collecting the reflectance spectrum are located on one side of the dark chamber, and the standard light plate is fixed on the other side of the dark chamber. Without a cuvette inserted, the light source can be projected to the standard light plate, and the reflected light can be directly received by the probe. The setting of the structure provides the operation needed for the calibration of the hyperspectral oil analysis device upon its being turned on once a day.
The cuvette 14 loaded with an oil sample to be tested (<3 ml) is inserted into the dark chamber and a “sealing” lid is closed to prevent pollution by the light from outside the chamber. Then, the light source can be directly projected horizontally (laterally) to the cuvette 14 and pass through the light transmittance surface of the cuvette, which has a transmittance rate of more than 90% (350 nm-2000 nm), then to the oil sample, with a certain level of penetration and reflection. Depending on the incident angle, the reflectance spectrum passes through the light surface of the cuvette 14 and goes directly into the probe.
The optical system transmits the reflectance spectrum of 400-1000 nm that has entered the probe to the diffraction grating. The grating divides the reflectance spectrum into 300 discrete (non-continuous) characteristic lines with the accuracy of 2 nm of spectrum band, which is similar to the monochromatic light of a single wavelength (1 nm). A photoelectric conversion of characteristic spectral lines is conducted via the electrically coupled complementary metal oxide semiconductor (CMOS), a charge coupled device (CCD), or a photomultiplier tube (PMT) to produce a group of real radiance values LDN with the wavelength (λ) as a variable, which can lay a foundation for subsequent data processing and quantitative analysis. This group of radiance values is also called DN values (digital number-intensity); see
The conversion of the remote sensing reflectance rate is subject to interference by many factors, including the intensity of the light source, the integration time, the CMOS saturation efficiency, and the temperature. These factors keep changing even in a fixed environment and vary from device to device. Through the calculation and conversion of the remote sensing reflectance rate, the interference by these factors can be corrected or eliminated to achieve real changes that reflect the components of the oil sample to be analyzed. The reflectance rate of the tested oil sample fλ is calculated as follows:
Ideally, the value of reflectance of a standard white plate should be close to the total reflectance, and the value of the reflectance of a black plate should be close to zero. The value of reflectance of an oil sample should be between the values of reflectance of a white plate and a black plate. Therefore, the reflectance rate fλ should be in a range of 0 to 1.0. The reflectance rate of the standard plate can be used to calibrate the deviation caused by interference factors.
The reflectance rate represents the intensity of the spectral lines obtained in that band, which is also known as the optical density value. The optical density value is directly used to calculate the concentration grade (PPM) of the content of each metal in the oil sample being analyzed.
The reflectance rate and the DN value of the oil sample are obtained according to the output of the acquisition and optical system. Should the emission spectrum of the detected element and the relationship between the intensity of each spectral band and the concentration of the corresponding element be known, the numerical label of the concentration grade (PPM) of the detected element in the oil sample can be calculated. If in the system database there is a sufficient amount of density (the concentrations of content of all detected elements) distributions of oil samples with varied concentrations of different metal elements and their corresponding spectra—for example, the models of several oil samples in the 100-hour and 200-hour run time interval—the oil samples between them can be calculated by using the principle of the partial least squares regression, with the spectral band n taken as the independent variable {x1, . . ., xp} to calculate, with the detection index p as a dependent variable {y1, . . . , yn}. According to the statistical relationship between the dependent variable and the independent variable, the parameters of the tested oil sample (from the model of the oil sample) in the middle of several known oil sample points in the system database are observed, from which the data table of the independent variable and the dependent variable is established: X={x1, . . . , xp} and Y={y1, . . . yn}. The method of the partial least squares regression is used in X and Y respectively to extract the first component t1 and u1 according to the indicators of the oil sample to be detected and its corresponding spectral band, based on the analytical ability of the independent variable component vs the dependent variable component (the detection indicators are corresponding to the known spectral bands). Partial least squares regression implements the regression of X on t1 and the regression of Y on u1, respectively. If the regression equation achieves a satisfactory result (reflected by the maximization of accuracy or trend of change), the first round of components are calculated. The remaining information after X is interpreted by t1 and after Y is interpreted by u1 is used for the extraction of the second round of components. The process is repeated until a satisfactory accuracy of each component is achieved. For example, for the spectrum (the reflectance and the DN value) obtained from the tested oil sample running for 160 hours. If eventually m component t1, t2, . . . , tm bands are extracted for spectrum X, partial least square regression will be carried out through the implementation of yk (the indicator of a certain element) to conduct an inversion calculation for the regression of the t1, t2, . . . , tm bands, to obtain the indicator of a certain element of the oil sample.
In the data processing and quantitative calculation algorithm, according to the spectral model parameters, the spectrum of the oil sample (the reflectance frequency and the DN value after being split) is entered, and the results of detection are quantitatively calculated with the method of partial least square regression inversion. In practice, the actual oil samples of the application scenario and the laboratory test results are combined to conduct a single standard comparison. However, the disclosed invention provides a method for ensuring that the reflectance frequency and the DN value of the oil sample are not affected by the detection device, and a method in which they can be automatically calibrated by data preprocessing.
As previously described, the accuracy of the hyperspectral oil reflection analysis technology depends on the consistency of the device, the workability of the technical principle, and the accuracy of the calculation method. The latter can be achieved by establishing a model of the standard oil in combination with actual oil samples. Features of this invention focus on describing a method of design and verification of device consistency. The results of repeated testing of the same oil sample by the device system reflect the repeatability of the device. The results of the same oil sample tested on different devices reflect the consistency of the device. The consistency (i.e., both consistency and repeatability) of the device is the basis for the device to reach a high level of accuracy. The factors that affect device consistency include light source stability, stability of grating splitting and photoelectric conversion, the consistency of structural design and the associated operations, and the impact of environment (e.g., temperature, humidity) on the optical path and grating circuit.
The consistency of device components is related to the light source and the grating separation and the photoelectric conversion circuit. The controllable parameters are exposure time (an integral value) and gain. The results are expressed with the reflectance rate and the DN value according to the band. With the length of use and environmental factors, the deviation of the parts of the device can be reflected by the changes of the reflectance rate and the DN value. The structural design and installation of the device can cause inter-device differences, the results of which are also reflected by the changes of the reflectance rate and the DN value. During operation of the device, a slight difference in the position and angle of the insertion and extraction of the cuvette will directly affect the reflectance angle of the optical path, the results of which are also reflected by the changes of the reflectance rate and the DN value. The quality of the light transmission surface of the cuvette falls under the subject of material selection for the cuvette. The cleanliness of the light transmission surface of the cuvette can be controlled by operation procedures, so it is not within the scope of this discussion.
To sum up the problems described above, consistency can be reflected by the changes of the reflectance rate and the DN value. If a relationship can be established between a) changes in the reflectance rate and in the DN value and the prostration of the properties of parts and components with use over an extended period of time, b) changes in the environment, c) variations in the structural design, and d) influence of human operations, so that the changes can effectively be calculated (measured), then device consistency can be ensured by correcting the measurement deviations (with the methods of reverse change or compensation).
With reference to
The halogen light source and the probe grating splitter are fixed by the dark chamber structural component of the device. Thus, a projection light path angle and reflectance angle are also determined. The structural component of the dark chamber of the cuvette establishes a fixed relationship with the light path (i.e., the light source and probe grating splitter) through the center of the light-transmitting lens. A fixed standard light plate is installed on the standard light plate slot of the structural component of the dark chamber of the cuvette, so that the light source is projected onto the standard light plate and the optical path reflected off the grating splitter of the probe is determined. Assuming little to no light pollution (i.e., light is not leaked) in the dark chamber of the cuvette, a special coating on the standard light plate makes it possible that the reflectance rate and intensity (the DN value) are not affected by the ambient temperature and humidity.
After the device is assembled, a measurement should be made and the reflectance rate and the DN value recorded as the “initial state” of the device. Therefore, the reflectance rate and the DN value obtained when the device is tested at any different time periods (e.g., after it is turned on for the first time every day), will be different to those recorded at the initial state. This difference indicates that deviation from consistency exists with the device. Deviations may be caused by environmental factors and/or functional attenuation of the light source and the probe grating splitter after use over time.
The design and processing of the structural parts of a device, the assembly of the halogen light source and the probe grating splitter, as well as the assembly of numerous devices, will certainly result in slight differences between devices. Such difference can be exhibited in reflectance rate and DN value by way of the optical path. Therefore, deviation consistency between devices needs to be measured.
To measure deviation consistency between devices, a standard module light plate is custom made according to the size of the cuvette—i.e., at least one side of the cuvette is equipped with a standard light plate. The standard module light plate is of the same size with the cuvette, and it is inserted into the cuvette hole in order to measure the difference between the operations of extracting and inserting a cuvette on different devices. The method of measurement is to first set a device as the “benchmark” for all other devices.
A standard optical surface of the standard module optical plate facing the optical path (the direction of the see-through lens) is inserted into the cuvette hole, and the reflectance rate and the DN value are tested after the device is turned on and warmed up. Values are then recorded as the “initial state” of the benchmark machine. The reflectance rate and the DN value are stored on a spectral model server (i.e., a dedicated server) and bound to the specific model built with the benchmark machine. For other devices, testing with a standard module optical plate will be conducted after being assembled or periodically (e.g., at times of maintenance or repair), and their reflectance rates and the DN values recorded. Each time a device carries out an oil analysis, the reflectance rate and the DN value data of the device are bound to the reflectance rate and the DN value of the standard module optical plate, and the data is uploaded to the spectral model server. Before recording and storing the measured reflectance rate and the DN value of the oil analysis data, the spectral model will first analyze the reflectance rate and the DN value of the standard module light plate of the uploading detection devices and compare these with the reflectance rate and the DN value of the modeling benchmark machine. From this comparison, measurement error, if any, can be obtained. The spectral model will then correct the reflectance rate and the DN value of the oil analysis data according to the measurement error of each band, so that the test results are consistent with those of the benchmark machine.
The oil sample to be analyzed is first injected into the cuvette 14, and the cuvette 14 is then inserted with its transparent surface facing the light path (i.e., in the direction of the see-through lens) into the dark chamber. The dark chamber is closed to prevent external light from leaking into the dark chamber, and then operation of the device is commenced.
Although the entire analysis operation takes only a few seconds (i.e., the testing itself will take about 5 to 6 seconds), inserting and extracting the cuvette containing the oil sample are independent and random events for both the device and the spectral model in the server. Therefore, the results of the continuous/repeated analysis of the same cuvette in the cuvette hole should be consistent (repeatable), as will any results of analysis. From the point of structural design, the implementation of the insertion and extraction actions require a space between the cuvette and the cuvette hole. Strictly speaking, any gap in the cuvette hole will introduce an error (angle) to the light path, resulting in a difference between two independently operated events, even though the same cuvette containing the same oil is being analyzed.
Regarding structural design of the dark chamber, the action of inserting and extracting the cuvette 14 containing the oil sample must be considered. It should not only be easy to operate, but also ensure that the structure of the dark chamber can hold the cuvette seamlessly. For example, the insertion into the transparent surface of the cuvette must not result in a fine angle in the light path.
The structure of a preferred embodiment of the dark chamber consists of a cuvette hole 9, a light-transmitting lens bracket and a see-through lens 2, a cylindrical rod for fixing the cuvette—namely, rotating shaft 7—and the cylindrical rod mechanism of the cylindrical rod for fixing the cuvette—namely, the rotating shaft seat 5, which ensures that the rotating shaft 7 moves in only one direction, as indicated by the arrow in
Installation begins by placing the rotating shaft seat 5 on the lower cover 8, then inserting the rotating shaft 7. Once the rotating shaft seat cover 6 is closed, attachment of bolt and installation of spring 1, to connect the rotating shaft 7 with the rotating shaft seat 5, follows. Finally, the see-through lens 2 and the standard light plate 4 are placed in the convex lens slot and the standard plate slot on the lower cover 8.
When the cuvette 3 is inserted into the slot, pressure exerted by the cuvette 3 pushes the rotating shaft 7 to rotate horizontally. As the cuvette 3 reaches the bottom, and the spring 1 pulls the rotating shaft 7, thereby pressing against the cuvette 3 and fixing the cuvette 3 to prevent it from shaking in the cuvette hole 9.
The cuvette hole 9 is designed according to the size of the cuvette 4, and sufficient space is necessary to ensure the smooth insertion and removal the cuvette. The distance and parallelism between the transparent surface of the cuvette 4 and the light transmitting lens 2 are guaranteed by the two rotating shafts 7 via spring 1 and rotating shaft seat, which are used to fix the cuvette 4. Because the rotating shaft is inserted into the cuvette 4, it is subjected to a force causing it to move in the opposite direction of the transparent surface of the cuvette 4. Such movement causes tension in the spring 1, which in turn causes the cylindrical rod 7 to press on the transparent surface of the cuvette 4 to stabilize it. The assertion of a force does not hinder the operation of inserting and extracting the cuvette 4. An elastic “soft” material is preferably used to make the rotating shaft 7, so that the transparent surface of the cuvette 4 will not be worn out as a result of repeatedly being inserted and extracted.
Consistency testing is for the testing and verification of the structural design and assembly quality of the dark chamber. As previously noted, one-time testing is preferably carried out and recorded after the device has been manufactured to establish an “initial state” of each device. The consistency calibration method is based on the record of the initial state of a device (i.e., reflectance rate and the DN value). In the course of use, additional reflectance rate and the DN value data are obtained through an efficiency test of the device when it is turned on for the first time every day. This data is then compared with the initial state record for calibration to be carried out and ensure consistency of the device.
Real-time operations can be thought of abstractly as the process of inserting, testing, and extracting the cuvette. The cuvette dark chamber structure needs to verify each insertion, close the cover of the dark chamber, start the test, and maintain the parallel position and distance between the cuvette and the light-transmitting lens. To describe this by another method, the reflectance rate and the DN value projected by the optical path to the same cuvette should be independent of the operation. Therefore, the priority of the consistency testing method is to measure any error caused by the front and back surfaces of the cuvette (because it is randomly inserted, both sides must be considered). The transparent surface of the cuvette is measured by the optical path and the transmittance of a given spectral segment. Strictly speaking, there are differences between the front and back transparent surfaces of the cuvette. Furthermore, the dust in the environment, non-standard operation or hand touch will lead to an error to a large extent.
So, error in the consistency of the device analysis is computed as:
The consistency testing method only considers differences between results of each test and does not judge the accuracy of each test result. Since the principle of hyperspectral reflectance is used, the randomness of the analysis depends entirely on the structural design and assembly of the dark chamber. Compared with an atomic emission spectrometer, with which the atomic emission spectrometric method is used to gasify and plasmatize an oil sample by means of an arc excitation source, the disclosed method is more stable and predictable. According to experimental results, the consistency of the hyperspectral reflectance technology is in the range of 1-2% error.
The calculated error for reflectance rate and DN value on each device should be used as a calibration (compensation) coefficient. Referring to
The data processing and quantitative calculation algorithm inputs the spectrum (reflection frequency and DN value after splitting) of the oil sample being tested based on the spectral model parameters, and quantitatively calculates a detection result by partial least squares regression inversion. In practice, one-time model calibration is performed based on the actual oil sample of the application scenario and the laboratory test results. The present invention provides an efficient and implementable modeling method for spectral models, including benchmarking work for the model and actual oil types.
A spectral model is built from a set of oil samples that reflect changes in the oil in actual application scenarios. Oil samples used to build the spectral model should cover the entire life cycle of the oil in the equipment, such as the entire cycle of lubricating oil in rotating equipment from oil addition to oil replacement. Based on specific application scenarios, 20 to 30 oil samples are generally selected for modeling. In practical applications, this is the most effective and convenient way to achieve high-quality oil sample modeling and achieve test accuracy. But this approach is impractical. The present invention establishes a spectral model on the basis of the ideal state of a standard oil, assesses system accuracy independently of the oil samples and the spectral model, and achieves equipment accuracy under ideal conditions. Based on combined modeling of standard oil (components) and a target base oil (e.g., brand, manufacturer, grade), spectral models established with actual oil samples can be gradually increased. This is a workable method of achieving a targeted accuracy in a controllable, correctable, and iterative manner. Using this scheme, even if there is a degree of deviation in the detection results, a corresponding degree of correction can be achieved through mixed model. The framework and steps of the preferred process are set forth in the flow chart of
Forming a closed loop of oil sample modeling, deviation correction, and improved accuracy can be used as a complement to the ideal approach (lacking practicality). The framework and steps of the scheme are shown in
The lubricant oil analysis method based on reflected hyperspectrum disclosed herein is used for combining a model and actual oil types and comprises several critical steps.
S1: First, it is necessary to model standard oils with identical concentration distributions having multiple indices. By “standard oil” it is meant oil samples specially made by a company, typically in a lab, and certified by Officials. The oil samples are created by accurately controlling the concentration of an element (e.g., iron) put in each oil sample. These standard oil samples are used to calibrate test equipment in the company lab before the equipment is used to test actual oil samples obtained in the field. Standard oil for the present system and device were obtained from SCP Science (https://www.scpscience.com/en).
Specifically, a webpage for such oil samples can be found using the following link: https://scpscience.com/en/products/categories?=581&name=metallo-organic-standards.in-fuel-matrices.
This step is achieved by selecting a standard oil to be analyzed and obtaining, through dilution and calibration operations, a distributed standard oil group covering a preset spectral band, with a component distribution (e.g., ppm of metal) of the distributed standard oil group corresponding to different spectra. Then, it is necessary to establish multiple hyperspectral bands for the distributed standard oil group and a spectral model for a single known standard oil component index, with the spectral model being a parameter matrix.
The spectral model must then be tested, taking a hyperspectral band detected for the distributed standard oil group as an independent variable, incorporating the parameter matrix, and using the partial least squares method to achieve convergence and “test” output a dependent variable. The dependent variable would be the standard oil component index of the distributed standard oil group, which is capable of reflecting the distribution of components.
S2: Secondly, indices (i.e., the material in oil samples being analyzed) need to be specified and standard oils with different concentration distributions (i.e., ppm) need to be modeled. This is achieved by selecting a standard oil and-based on the known concentration and composition of the standard oil-using the dilution and calibration operations mentioned above to establish a spectral model covering the distributed standard oil group. Due to the different distribution of the component indices of the standard oil, the parameter matrix of this step will have one more component index dimension than the parameter matrix of the above step. Then, it is necessary to establish a standard test template based on the dilution method and the covered distribution, with the standard test template being a set of standard oil sample groups, and the index being close to the distribution of components in the actual oil sample. It is important to establish and test a hyperspectral model and spectral models of multiple known standard oil component indices of the hyperspectral model's standard test template.
Finally, an overall assessment of the accuracy of the spectral models, algorithms, and equipment is necessary, so that different equipment can meet the accuracy standards of the test template.
S3: Next, modeling the standard oil with different concentration distributions of different indices is required. This is done by selecting a standard oil, adding the indices and concentrations of the actual oil sample distribution in the application scenario, and repeating creation of different concentrations through dilution and calibration operations, as described above, to establish a distributed standard oil group covering the application scenario. Based on the distributed standard oil group created, multiple spectral models of known standard oil component indices can be created and tested. The application scenarios of the present invention include detecting metal components in oil, as well as macromolecular chemical (such as phosphorus and boron) components and physical characteristics (flash point, viscosity, particle size, and soot) in oil.
S4: Once the spectral models are created, it is necessary to sample the test results for comparison to laboratory produced atomic emission spectrometer detection results for corresponding oil samples (i.e., same indices and concentration). The detection results are then adjusted by nonlinear data fitting, and the calculated deviation of above created spectral models are adjusted based on the data fitting. Ultimately, the device detection results are fitted to the laboratory atomic emission spectrometer test results.
S5: At this point, modeling a target detection oil brand and the manufacturer's base oil combined with a specified index standard oil is required. Accordingly, a standard oil and a base oil in actual use are selected. Then, the dilution method described above is repeated to establish a distributed standard oil group, which is then conformed to the actual oil sample distribution in the application scenario, and a spectral model based on the distributed standard oil group is established and tested. Then, it is necessary to customize the standard oil by adding particles, mixing the same with an oil sample being tested, establishing a standard oil group with an actual oil sample distribution of samples to which particles have been added, and establishing and testing a spectral model based on the distribution of the standard oil group. The impact of particles on the spectral model (i.e., interference) can be evaluated and a corresponding anti-interference scheme devised. Ultimately, a spectral model database library can be created by establishing a series of spectral models for different base oils and standard oils of specified indices for different manufacturers and brands in conjunction with application scenarios.
S6: The accuracy of the library and system can be improved by comparing oil samples with the same base oil in laboratory test results.
S7: Finally, the actual oil samples are modeled using the base oil of an oil sample and a collected oil sample having the most concentrated components during an actual oil change. Using the dilution method described above, a distributed oil sample group is established which will cause the distributed oil sample group to conform to the actual oil sample distribution in application scenarios. A spectral matrix model is established and tested, comparing the oil samples with laboratory test results, as shown in TABLE 1 below (also see chart in
In all instances, the “Absolute Deviation” (i.e., Test result-Lab Result) is less than required by ASTM D5185, satisfying lab ICP-AES equipment test results. ASTM D5185 is used as a verification standard as it is an international standard for lab equipment.
Once a model for the application scenario is established, and the oil sample being tested is input to the system, field testing can be performed. The first step is to select the oil type of the oil sample being tested—i.e., the target brand manufacturer's base oil—and determine a hyperspectral model. The oil sample to be tested is collected and manipulated by a hyperspectral oil detector to form a set of reflectivity and DN value spectra, which are inputted for detection. Spectral band splitting and processing segments are also required. The spectral bands of a specified element in the oil sample being tested are calculated based on the precise spectral segment of a characteristic spectral band of the specified element provided by the hyperspectral model library as an objective function. Generally, over the entire element spectrum (400 nm to 2,300 nm) several spectral bands (generally between 400 nm and 1,000 nm) are selected, based on experience. The number of spectral bands corresponds one-to-one with the spectral models. Upon accurately acquiring a spectral λ band (set), the value of the band (denoting an elemental component density) will necessarily form a corresponding relationship with a specified element spectral band of the hyperspectral model for the oil type. The partial least squares method is used to analyze the statistical relationship between the dependent variable and the independent variable. The stepwise process of the partial least squares method is as follows:
If data tables X and Y are subjected to m components ultimately extracted for X, substituting tk=*k1x1+*k2x2+ . . . +*knxn (k=1,2, . . . ,m) into Y=t1β1+t2β2+ . . . +tmβm to obtain the partial least squares method regression equation of p dependent variables:
The objective function is:
Usually, the dependent variable Y is the detected index (element component) that needs to be inverted (calculated or reconstructed). The partial least squares method makes it possible to calculate multiple detection indices. Here Y can be multi-dimensional detection index data. For example, it can be the amount of metal dust and chemical components in the oil being tested. A rigorous relationship is established between the detected index of the inversion result and the corresponding hyperspectrum. The detected index and the actual index concentration of the detected oil are calibrated to quantitatively estimate the element component content of the oil.
The partial least square regression algorithm described above is used to build a model and compute using independent variables (e.g., reflectance and DN value) and dependent variables (e.g., indices). The fold interleaved method is used to re-learn the inputs (reflectance and DN value) based on the existing model with a subset of known samples based on dilution distribution points (i.e., each sample created by gravity method) to improve the system accuracy.
The description of the algorithm is part of the folded interleaved verification method (for re-learning of the model purpose). The partial least square convergence of the above steps (for building the model) is to use the maximum eigenvalue of the matrix at each iteration (as described beginning at above). These steps are preferably repeated “i” times, 1≤i<K, K; the number of oil sample dilution distribution points (starting from 0), each time using a different training set and test set (different gradients of oil sample dilution distribution). Depending on the number of iterations the model is trained on, the predicted value will approach the output of the model training set.
The overall test mean square error (MSE) is calculated as the average of K test MSEs, or:
Ordinarily, the larger the number of iterations used in the K-iteration cross-refinement model is, the lower the deviation of the observed test MSE is (the difference between the predicted value and the model training set) but the higher the variance is. The “variance” is folded into the model by the system's own interference (noise). Conversely, the fewer iterations used, the higher the deviation, but the lower the variance. In practice, the selection is made based on the gradient variation of the oil sample being modeled and the characteristics of the actual sampling point. This choice has been shown to provide the best balance between deviation and variance, thereby providing a reliable estimate of the test MSE, enabling re-learning of the model and improving the relative accuracy of the test.
The specific steps of the volume specific gravity method are as follows:
Using the method of simply weighing and diluting the oil, the base oil and the test oil sample with laboratory test results are diluted based on a predetermined concentration gradient. This is characterized by simple operation, no need for laboratory equipment or environment, rapidity, accuracy (no cumulative error), no consumption of test oil samples (or base oil), and the like. It is an important link in implementation of the folded interleaved verification method, as well as in satisfying the accuracy of the training set and the test set. The operation process requires only auxiliary equipment: an electronic balance (specification: Max=200 g, e=0.01 g, d=0.001 g), two test tubes with 10 mL markings (including a stand for the test tubes to stand upright), and a pipette. The operation process assumes that the cuvette and cuvette holder required by the hyperspectral oil testing equipment to detect oil samples are present.
The primary method is as follows:
Wherein “xoil sample tested” is the weight of the oil sample being tested to be introduced based on the target dilution concentration, Wcuvette is the weight of the oil sample to be charged to the cuvette (obtained from step 2 above), and the target dilution concentration is the dilution point of the oil sample relative to the accompanying laboratory test results (for example, the relative targeted component is diluted from 200 ppm to 20 ppm). The component being tested is the target component (for example, iron Fe) in the sample being tested in the accompanying laboratory test results.
The weight of the base oil to be added to the cuvette: Wcuvette−xoil sample tested.
An electronic balance is employed. The cuvette is placed in a holder, which is then placed on the balance to obtain the net weight. Using a pipette, the test oil sample and the base oil are introduced based on the weights of the test oil sample and the base oil calculated in step 3 to reconstruct a test oil sample at the dilution point.
In some embodiments, the specific steps and material used for creating oil samples are as follows:
S1: Modeling of standard oil with the same concentration distribution of different indices. Taking the standard oil produced by SCP Science as an example, the specifications are as follows:
Through dilution and calibration operations, establishing a distributed standard oil set ranging from 0 to 900 ppm. Modeling and testing. Measuring the accuracy that can be achieved by the equipment and advancing to the next step once the absolute error, accuracy, relative error, and so on have been satisfied the specified conditions.
Through dilution and calibration operations, establishing a distributed standard oil set ranging from 0 to 200 ppm by using Fe concentration as the reference. Modeling and testing. The standard test template, including dilution method and coverage distribution, performs a general assessment of the accuracy of the spectral model, algorithm, and equipment that satisfies step S1 above. The absolute error is required to be less than 10% (accuracy 2 ppm), and the relative error is within 2%.
Combined with the actual oil sample distribution in the application scenario, a distributed standard oil group is established ranging from 0 to 200 ppm. Modeling and testing.
By combining the actual oil sample distribution in the application scenario, a distributed standard oil group is established by the dilution method, ranging from 0 to 200 ppm. Modeling and testing. Adding particles (can also be introduced through customized standard oil), mixing with the oil sample to be tested (if standard oil is used, step S3 is repeated, and Blank Oil is replaced by base oil), and performing modeling and testing. Evaluating the interference of particle size on the spectral model and the corresponding anti-interference scheme. Standard oil is modeled through different base oils and specified indices (combined with application scenarios to detect components), and a model library combined with application scenarios is established. The system software automatically retrieves and switches models in conjunction with its operating logic. The goal is to render the operation as simple as possible for users, such as operators. For example, there is no need to know the manufacturer or brand of the lubricating oil sample being tested.
In the figure, the detection value (solid line curve) is the modeling test result in step S5 above. The assay value (dashed curve) is the actual sampling laboratory test result, and the arrows indicate the calibration targets.
As shown in Table 2 below; compared to prior art the advantages of the present invention are numerous, including being lighter, more versatile, faster, and less wasteful.
1Other includes chemical composition, particle size, viscosity
The present invention can establish multiple models for metal components or macromolecular chemical components and can simultaneously detect metal components and macromolecular chemical components.
The product of the present invention does not require a vacuum dark chamber, extracts atomic spectral bands by mathematical methods, and simplifies the structural design, process, and consumables of the atomic excitation light source. By modeling the oil range to be detected and inferring a measurement algorithm, the requirements for the spectral band range are reduced, dependence on the extreme ultraviolet spectral range is avoided, and the complexity of the spectral system is greatly reduced. It is portable, low-cost, real-time, and intelligent, and is thus compact and consumables-free.
The present invention simplifies the operating sequence under specified application scenarios (having a spectral model), and obtains detection results conveniently, quickly, on the spot, and in real time.
U.S. Pat. No. ______ (application Ser. No. 17/396,986), previously incorporated by reference, describes an early conceptual version of the system, aspects of which are relevant to the preferred embodiment of the present system. That earlier conceptual embodiment is described with reference to
Using the portable instrument 11, the system can calibrate and match data by a hyperspectral model and output data corresponding to a composition of any material in a liquid sample (e.g., metal elements). The sample testing can be done onsite with results in a relatively short period of time. The output data can be formatted as a report providing diagnostic information, recommendations, and/or merely calling attention (i.e., alerts) to the sample and providing application scenarios.
The system is primarily comprised of instrument 11, which connects to the Cloud-based server 122. The instrument consists of acquisition peripherals, hyperspectral acquisition, processing and transmission, and result display. The Cloud-based server 122 consists of an information platform, calibration and processing, hyperspectral model matching, application driven expert system, measure result and diagnosis.
In addition to the instrument 11, acquisition peripherals include equipment such as a sample container 13 with an NFC chip to hold about 1.6-2.0 ml lubricant oil sample and its electronic unique ID (UID), a black and white standard reflection board for calibration, an acquisition base (i.e., create a dark environment) to support the system during acquisition, and a lens' hood 12. The system registers the sample container UID in a database and binds the container with a point of inspection (engine or rotary equipment) where oil type is known through a QR code sweep gun (not shown). The instrument 11 is able to connect the oil sample with the Cloud-based server 122 during test operation, so the right Hyperspectral Model can be used to match, and results can be transmitted to the instrument 11, and stored in the database.
The hood 12, as shown in
With reference to
Acquisition Peripheral 21. This component provides equipment such as a sample container, with NFC chip to hold lubricant oil sample and its ID, a black and white standard reflection board for calibration, an acquisition base (dark environment) to support the instrument during acquisition, a lens' hood to make sure the system produces consistent acquisition data independent from every operation.
Hyperspectral Acquisition, Processing and Transmission 22. This process describes the functions provided by the instrument. For example,
Information Platform 23. This component responds to setup a connection channel between an instrument and the Cloud-based server which facilitates an application driven platform dedicated for the end user.
Calibration and Processing 24. This component responds to measure the instrument and acquisition environment and compares to its initialization setting, use difference to generate compensation value for each band, applies them during each acquisition to offset the system errors and make sure the acquisition data consistent and stable.
Hyperspectral Model Matching Processing 25. This feature is comprised of two distinct procedures. First, the process is tasked with building a Hyperspectral Model based on a given number of oil samples with laboratory test results. A proprietary data processing method is used as well as a Hyperspectral Library to build the Hyperspectral Model (see detail illustrated in
Application Driven Expert System 26. This component uses application domain knowledge applied to the test results and provides meaningful information to less skilled onsite users to obtain mission critical maintenance diagnosis and recommendation in seconds. It is based on data accumulation and lubricant oil information to reconstruct a new (or updated) Hyperspectral Model for precision improvement and measurement expansion.
Measure Results and Diagnosis 27. This component responds to store, display, and trace the results. It also provides data management and authorization for distribution.
As previously noted, the hyperspectral sensing instrument 11 produces a uniform and smooth emission line with a characteristic wavelength (band) of 400-1000 nm. The composition of any dissolved material, metal elements, in the lubricant oil sample will have a different reflectivity of light at different wavelengths (bands) between 400 and 1000 nm. The reflectivity is detected by the instrument. Each element can be represented by a reflection value and a digital number (DN), as a function of the different wavelength bands. The reflection value and DN are as follows;
The detector 15 on the instrument 11 forms an angle with the light source 14 to maximize reflection acceptance. A hyperspectral splitter 16 after the detector 15 is used to segment the acquired spectrum with about 3 nm resolution or band widths. As a result, the splitter 16 divides the spectrum into about 200 to 300 distinct bands. Each individual band runs through a photoelectric converter to generate the reflection and DN values. By plotting the results of all the individual bands, two curves are formed based on the formulas above. Using broadband cellular network (4G or greater), the two curves are transmitted to a dedicated Cloud-based server 122. To summarize the process of Hyperspectral acquisition above, each acquisition operation emits hyperspectral light to the substance, receives reflection spectrum, splits the spectrum into distinct bands, converts the reflectance into two numbers, generates two curves based on the two numbers at each band and broadcasts the two curves to the Cloud-based server for storage.
The information platform 23 indicated in
The calibration and processing 24 of
Preferably, calibration is conducted periodically by user applying the necessary calibration procedures to generate calibration curves according to the application. However, the role of calculating compensation curves and applying correction to acquisition data is that of the calibration and processing component in the Cloud-based server 122.
Hyperspectral Model Matching is another component of the system 11 which is part of the Cloud-based server 122. This component takes acquisition data from a lubricant oil sample as input, after calibration of the two curves, then outputs quantitative analysis elements for the lubricant oil sample, such as iron (Fe) and copper (Cu) content (in mg/L). The hyperspectral model matching component consists of a Hyperspectral Library in which a collection of element spectrum is placed, such as spectral extraction, spectral discrimination, and spectrum matching processing components.
The Hyperspectral Model Matching has two tasks. The first task, based on a limited number of laboratory oil sample test results, which statistically cover entire subject lubricant application lifecycle distribution and acquisition data of these oil samples, is to build a Hyperspectral Model. The second task, based on the Hyperspectral Model, is to calculate the element from input acquisition data (two curves) in a lubricant oil sample within its distribution. This is described in further detail below:
In order for the system to quantitatively measure elemental contents in the lubricant oil sample, it needs to build Hyperspectral Model based on the same type of subject lubricant oil. Such a process is described in detail above.
The Hyperspectral Model 57 indicates the relationship between each element content corresponding to reflection and DN values of bands for a type of lubricant. Experimental results suggest that Hyperspectral Model 57 can hold multiple types of lubricants independent from the engine or rotating equipment to which it is applied.
It is easy to understand that a Hyperspectral Model 57 binds a type of lubricant or an application scenario. The Hyperspectral Model 57 can be assigned ID which can be associated with the lubricant oil sample container ID. In another words, the instrument obtains the lubricant oil sample container ID through near field communication (NFC) protocol, the system is able to pair the Hyperspectral Model to measure its acquisition data (two curves).
Acquisition inputs include dedicated data for calibration. For example, fresh lubricant oil sample reflection and DN values (i.e., clean oil before use) based on bands, and standard black and white optical plate reflection and DN values can be used as baselines (see
The same procedure can be used to measure plates at “power on” for the instrument prior to each testing. Any differences recorded over time will reflect degradation of the instrument. However, the system can use the measured difference to calculate a compensation value for the reflection and DN values of the acquisition data in real time. Accordingly, calibration is a processing unit of the system to measure and calculate the compensation needed to obtain correct and consistent reflection and DN values for each band.
Understanding measurement equipment and determining the elements in an oil sample to measure will help the Hyperspectral Model matching procedure. For example, it can help in the diagnosing of the subject engine or equipment runtime condition by knowing characteristics of the engine, equipment, or system (see
With the procedures disclosed above, comparable laboratory test results of a lubricant oil sample can be obtained using the disclosed hyperspectral sensing instrument 10 and system, in as little as a few seconds. The instrument 11 is lightweight, preferably handheld, compact enough to fit any specific application scenario, and easy enough to operate by maintenance personnel that it does not require a dedicated technician.
The instrument 11 provides at least two opportunities for better maintenance and service, including 1) providing a direct diagnosis of the “health status” of equipment as a clinic physical exam report rather than merely providing element contents in the oil sample that would require dedicated personnel to interpret, and 2) keeping the instrument independent from the specific application scenario and the Hyperspectral Model independent from the instrument, which allows the Hyperspectral Model to leverage big data self-learning and improve the precision and sample interval of the lubricant. An Application Driven Expert System (see
The Application Driven Expert (ADE) System is a self-sufficient container (i.e., as in software terminology, not a physical container), automatically deployed by the system based on an application that can run in the Cloud-based server. The ADE System offers an end user access to the system. It corresponds to at least one instrument by binding its ID. It provides an application scenario to input the way a skilled technician and/or scientist using test results of an oil sample to diagnosis or analysis the “health condition” of a machine, wind turbine, vehicle, ship, or a jet engine, etc., and to make a recommendation based on the analysis. For example, a certain level of iron (Fe) content in a lubricant oil sample from a wind turbine would mean the wind turbine paddle bearings are worn out. As a result, a maintenance procedure may be recommended. Such a threshold level can be set into the “container” to trigger an alert. Since the instrument 10 binds to the application (via ID), it can be operated by a less skilled worker on site to obtain the same diagnosis and recommendation in seconds.
Based on various applications, the system can deploy appropriate self-sufficient containers. Each software container corresponds to an application scenario, while each application corresponds to a Hyperspectral Model. The more application scenarios deployed, the greater the number of Hyperspectral Models in the system to be built (see
In a situation where there are different applications for the same type of lubricant, then multiple Hyperspectral Models create overlap in data and provide more detection area for the lubricant. The greater data allows the system to update/reconstruct the Hyperspectral Models, whereby precision becomes much better for the overlapping area, and the detection range may even increase.
For example, using a wind turbine analysis for iron (Fe) content, two Hyperspectral Models (e.g., different customers) might correspond to 2 megawatt (MW) and 4MW wind turbine applications. Both turbines use the same lubricant in the paddle bearing. Iron (Fe) content ranges between 0-1300 mg/kg in the Hyperspectral Model of the 2MW wind turbine, while the Fe content range is between 300-1800 mg/kg in the Hyperspectral Model of the 4MW turbine. With data from both models, the Hyperspectral Models of both the 2MW and 4MW wind turbines can be reconstructed/updated by the system. This process increases precision as a result of the increase in sample size. It also expands the analysis range for the wind turbines when the iron (Fe) content increases beyond the original modeling area. Iron, as well as other materials, can be quantitatively measured and exceeding thresholds can trigger an alert when anything potentially catastrophic happens in the bearings. As a result, the instrument improves its measure area and precision by self-learning.
The system includes a database which stores data, including the measure results, diagnosis, and any recommendations according to the acquisition time stamp. It is herein referred to as the “Measure Result & Diagnosis” component in the Cloud-based server 122 (see
The beneficial effects of the present invention are numerous. For example, the method is suitable for obtaining detection results for metal components, particle size, viscosity, and chemical components simultaneously in a single operation. This simplifies operation, economizes consumables, and achieves the effect of portable real-time detection, thereby eliminating the need for specialized operators involved.
As used herein, the word “preferred” means serving as an example, instance, or illustration. Any aspect or design described herein as “preferred” is not necessarily to be construed as advantageous over other aspects or designs. Rather, the use of the word “preferred” is intended to present concepts in a specific manner. The term “or” as used in this application is intended to mean an inclusive “or” rather than an exclusive “or.” That is, unless it is specified otherwise or is clear from context, “X employs A or B” is meant to naturally include either one of the permutations. That is, “X uses A or B” is satisfied in any of the following examples: X uses A; X uses B; or X uses both A and B.
Although the present disclosure has been revealed and described with respect to one implementation mode, equivalent variations and modifications will occur to those skilled in the art based on a reading and understanding of this description and the drawings. The present disclosure includes all such variations and modifications and is limited only by the scope of the appended claims. In particular, with respect to the various functions performed by the components set forth above (such as elements), the terms used to describe such components are intended to correspond to any component that performs the specified function of the component (that is, which is functionally equivalent) (unless otherwise indicated), even if not structurally equivalent to the disclosed structures that perform the functions of the exemplary implementation modes of the present disclosure shown herein.
Furthermore, although particular characteristics of the present disclosure have been disclosed with respect to only one of several implementation modes, such characteristics may be combined with one or other characteristics of other implementation modes as may be desirable and advantageous for a given or particular application combination. Moreover, to the extent that the terms “including.” “having,” “containing,” or variations thereof are used in the detailed description or the claims, such terms are intended to include in a manner similar to the term “comprising.”
Each functional unit in the embodiment of the present invention may be integrated into a single processing module, or each unit may exist physically alone, or several or more units may be integrated into one module. The above integrated modules may be implemented in the form of hardware or may be implemented in the form of functional software modules. If the integrated modules are implemented in the form of functional software modules and sold or used as independent products, they may also be stored in a computer-readable storage medium. The above storage medium may be a read-only memory, a magnetic disk, an optical disc, and the like. The above devices or systems may execute the storage methods in corresponding method embodiments.
In sum, the above embodiment is an implementation mode of the present invention, but implementation modes of the present invention are not limited by the embodiment. Any other changes, modifications, substitutions, combinations, and simplifications that do not deviate from the spirit or principle of the present invention should all be construed as equivalent substitutions that are contained within the protective scope of the present invention.
As previously noted, a unique benefit of the disclosed device and system is the ability to get accurate and useful results quickly. In tackling the problem of contamination of refined oils at delivery, this benefit is extremely useful. Business owners who retail fuel, diesel, lubricating oil, and similar refined oil products, can check product quality upon delivery to determine the product origin before accepting.
The equipment to be used has the characteristics of non-professional on-site operation, extremely low cost, and provides real-time collection and analysis. A refinery can input data from oil samples for each and every individual production batch to be stored onto a cloud-based spectral characteristics database. Then, businesses, such as oil depots and gas stations, can test delivered product to easily trace the source of the refined oil back to not only the refinery from where it was produced, but to the exact batch of refined oil produced as well, simply by using the same type of hyperspectral oil analysis equipment (system registered) disclosed. This allows the business to verify the quality and origin of the refined oil product in real time. It also helps businesses avoid accepting refined oil from the factory when there has been contamination during transportation, a cross-shipment, and unpredictable links.
The present invention uses a principal component analysis algorithm (PCA) and K Nearest Neighbor (KNN)—a data classification and regression algorithm—to establish a model database of eigenvectors and eigenvalues for refined oil samples. The traceability of refined oil products is determined by obtaining the classification of its eigenvectors and eigenvalues in a geometric space constructed based on the model database.
The hyperspectral oil analysis equipment converts any inspected refined oil into a spectral characteristic dataset of about 300 spectral bands. It includes reflectivity and DN values as shown in
In order to solve this problem, spectral characteristics dimensionality reduction techniques are adopted, including feature selection and/or feature extraction. Principal Component Analysis (PCA) is an unsupervised dimensionality reduction technique commonly used. PCA is used to find a set of orthogonal vectors (i.e., principal components) that best explain the variance of a dataset. It establishes a multi-dimensional eigenvector and eigenvalue model vectors. The number of dimensions of vectors is directly related to their accuracy. The higher the number of dimensions, the higher the accuracy to represent the original spectral characteristics dataset. The model database requires the participation of all incoming refined oil spectral characteristics. Any of the refinery product eigenvectors and eigenvalues involved in the modeling have their own locations in the model database geometry space. The modeling of a large number of refined oil samples requires computing power and memory space, which can be solved by using cloud servers.
Customer-side tracing and modeling (feature vectors and eigenvalue model database) are not on the same timeline (always behind refinery), so there is no problem with calculating all participation of refinery oils.
Due to the deviation between different hyperspectral oil analysis equipment, differences in any exam environment (e.g., temperature, humidity, etc.), and the influence of time and transportation on the refined oil itself, the spectral characteristic eigenvectors and eigenvalues of the product will be different from the same eigenvectors and eigenvalues used in modeling in the “geometric space” of the model database, resulting in a “neighborhood” effect. Therefore, it is necessary to use the K-Nearest Neighbor (KNN) algorithm to classify and regress the dataset training.
Categorizing data points is useful by separating them into different classes. This classification is done according to the similarity score of different testing equipment and acquisition time of the same type of refined oil and the score of different refined oil. Dataset training refers to the generation of spectral eigenvectors and eigenvalues of the same refined oil through input greater than or equal to K times (operating at the same or different hyperspectral oil analysis equipment, different time), which are projected to the eigenvectors and eigenvalues modeling geometric space as points for classification. For the dataset training, calculate the distance between points in the modeling geometric space for the refined oil product classification and the spatial location of the remaining refined oil eigenvectors and eigenvalues in the model database. The model category of the closest K (odd) points is selected based on the distance or the function used, and the class attributes are determined according to the weights.
The present disclosure is based on the technology, principle, equipment and system of hyperspectral oil analysis equipment disclosed above.
The following is a detailed description of how to establish a model database of eigenvectors and eigenvalues of refined oil products according to their spectral characteristics obtained from hyperspectral oil analysis equipment and using KNN algorithm to classify: There are two algorithms required to achieve the purpose of traceability, combined with the technical scheme of using hyperspectral oil analysis equipment to solve the problem of traceability of refined oil products.
Principal component analysis (PCA) is a widely used algorithm for dimensionality reduction of data, including spectral data. In the context of spectral variables, PCA can help identify the most important spectral bands or wavelengths by reducing the dimensionality of the data while preserving most of its variance.
The first step is Spectral Data Initialization 281 which is used to establish the matrix X of the refinery oil sample and its spectral variables. For example, an oil sample spectral characteristics dataset might include n oil samples, each with ρ spectral variables. Spectral data can be represented as matrix X with a dimension of ρ×n, where each row represents a different sample, and each column represents a different spectral variable. If we have 16 oil samples, each with 300 spectral bands (refer to above), the dimension size of X is 300×16. For each additional oil sample, the X matrix adds a dimension with 300 spectral variables:
Next step is to establish a covariance matrix 282 which is a mean-centered data matrix S. The first step in PCA is to center the data by subtracting the average of each variable from the spectral variables. This ensures that the mean value of each variable is zero and that all spectral variables contribute equally to the analysis in the equation:
Next, we calculate the covariance matrix Sx for the mean-centered data Xmean. The covariance matrix provides information about the relationship between different spectral variables. The covariance matrix (Sx) is represented by the equation:
Next. we find the eigenvectors and eigenvalues of the covariance matrix 283 which is also known as eigenvectors decomposition. Returning to the original problem of linearly transforming the original X matrix using the relation Y=PX, the goal is to find a orthonormal matrix P where
The next step in
Finally, to establish a spectral characteristics model database 285, a choice is made to transform matrix X, by choosing the row of P to be eigenvectors of S. Thus, substituting this into the derived expression for the covariance matrix Y gives as follows below. The original spectral data is projected onto the selected principal components Y to obtain a dimensionality reduction. The final matrix is represented as:
PCA can significantly reduce the dimensionality of oil sample spectral characteristics variables, while retaining most of the important physical signature information, making them easier to visualize, cluster, and analyze. According to the above, PCA effectively reduces the dimension size of the original spectral variable X from 16×300 to the dimension size of Y, or 16×3. It is important to interpret the principal components in the context of the original spectral variables in order to understand their physical significance. In simple terms, the principal components must correspond to a linear relationship of the original spectral variables: i.e., principal component eigenvectors that are orthogonal to each other, indicates that there is no relationship between the eigenvectors (covariance=0). At the same time, the importance of principal component eigenvectors decreases, which indicates that the first principal component eigenvector is the most important, and the eigenvectors of the first few dimensions represent most of the characteristics of the original spectral variables. The present disclosure uses PCA to establish a spectral characteristic model database to achieve tracing the source of oil samples only.
The K-Nearest Neighbor (KNN) algorithm is used to classify all known spectral signature points stored in the spectral characteristics model database (multi-dimensional space). These data points are constructed by a group of eigenvectors and eigenvalues generated by PCA in the model database. This classification is done based on the distance scores of the nearest spectral signature points in the model database and the new spectral signature points in a space defined by the eigenvectors and eigenvalues computation. The determination is based on the K number of closest points rather than any human factors.
KNN is a supervised machine learning algorithm, where “K” refers to the number of neighbors of the traced points that are considered after classifying and separating the known n oil sample groups. “K” is less than or equal to (≤) the number of model training times for a sample oil. The algorithm learns at each step and iterating to converge, eliminating the need or empirical threshold setting at any particular learning stage. The classification is based on the majority scores of a traced point adjacent to those points (K points) composed of modeling training points in the multi-dimensional space.
Here's how KNN solves the classification problem:
The oil samples were traced in the 2-D spectral eigenvector space 304 using the KNN algorithm.
The K in the KNN algorithm depends on the repeat number of the training oil sample and is generally defined as an odd number that is less than the training sample number. The KNN algorithm has the function of self-improvement (machine learning) to improve the accuracy of traceability.
The principal component analysis algorithm (PCA) and K-nearest neighbor (KNN) data classification and regression algorithm were used to establish a model database of eigenvectors and eigenvalues of refined oil, and the spectral characteristics generated by refined oil in the spectrum between 400 nm-1000 nm were traced to the refined oil. This method uses machine learning to train models and provide refined oil traceability services. As the sample volume increases, self-study improves. Simple operation, no need for professional involved.
An embodiment of the present invention is further described below with reference to
As used herein, the term “oil” generally refers to both crude/petroleum oil and food oil. Likewise, the term “refined oil” shall include both refined crude oils as well as refined food grade oils, including canola, coconut, cotton seed, grapeseed, lard, palm, safflower, peanut (and other nuts), sunflower. olive, and vegetable oils. The oil refining process is broadly referenced and may simply be the result of gravity (e.g., settling), filtration, addition of chemicals, separation, or any other known oil refining process.
Stage 1 (Proof of Concept): With two different oil samples, a simple operation embodiment can be used to classify the samples according to their spectral characteristics.
Using the disclosed hyperspectral oil analysis equipment and two different refined oil samples. One oil sample (A) was used to establish model database of eigenvector and eigenvalue, another oil sample (B) was used to distinguish different types.
The hyperspectral oil analysis equipment uses a 400 nm-1000 nm halogen light source for analysis. Oil sample A is placed into a cuvette. which is inserted into the darkroom of the hyperspectral oil analysis equipment, and the spectral of a specific wavelength 321 is obtained through the reflected light path, also known as the hyperspectral characteristic reflection ratio. Oil sample A was inserted into a different hyperspectral oil analysis equipment to obtain spectral of specific wavelengths 322. The two curves 321, 322 are very closely related. as they should be.
The repeated operation of obtaining the spectra of a specific wavelength of oil sample A actually establishes a database and trains the model. This is best illustrated with reference to
Stage 2 (Validation): A group of sixteen (16) lubricating oil samples with known different compositions is used to build a model database which can then be used to trace individual samples back to the group.
Using the hyperspectral oil analysis equipment and the 16 lubricating oil samples with known different compositions establishes the spectral characteristic model database shown in
In the test conducted, the 16 lubricating oil samples with known different composition are labeled as D01-D16. The disclosed hyperspectral oil analysis equipment is used to repeatedly test each oil sample associated with their respective labels. The tests are repeated at different environments, different times, and using different cuvette containers. to promote machine learning and training models. The results are used to establish the model database illustrated in
Then, a randomly selected oil sample from the 16 samples was inserted into the disclosed hyperspectral oil analysis equipment, and the system software accurately displayed information of the oil sample matching the label on the randomly selected oil sample. In this simple way it can be demonstrated that the disclosed system and methods are accurate in sample traceability.
Stage 3 (Practical Application): A refinery builds a spectral characteristics model database with each batch of refined oil produced before it leaves the refinery. Any business, such as an oil depot. gas station, or similar entity can use hyperspectral oil analysis equipment to trace incoming refinery oil and to monitor the quality of oil shipments by verifying its origin.
Each refinery uses hyperspectral oil analysis equipment to “record” each batch of all refined oil products before shipping to a customer. It is recommended to test and record at least five (5) oil samples for each batch of refined oil, which is not excessive in light of not requiring a lab environment nor professional personnel to perform testing. It takes approximately 5-10 minutes to complete testing for each batch—most of the time is spent adding refinery oil into the cuvette. Using five oil samples (in the same refinery batch), the basic requirements for machine learning. training models, and building a model database are met. The model database can be updated with each record added—the cloud server will recalculate the spectral signature model database. The spectral signature model database can be classified according to the type of refined oil (e.g., gasoline, diesel, lubricating oil), which helps to reduce the computational effort (PCA) required to build the spectral signature model database each time.
Once the database is completed, gas stations and oil depots can use hyperspectral oil analysis equipment to sample refined oil deliveries for real-time verification of source and non-contamination. Any oil products that are contaminated during transportation, or that are not specified (labeled) from the refinery, will be different from the spectral signature model database. Further, it is statistically impossible to trace a random or contaminated oil sample back to the cluster space generated by the modeling of a specified batch of refined oil produced at a refinery. This feature serves as a way to provide certainty as to the source and quality of refined oil for businesses in real time, on site, and without undue costs.
The matter set forth in the foregoing description and accompanying drawings is offered by way of illustration only and not as a limitation. While particular embodiments have been shown and described, it will be apparent to those skilled in the art that changes and modifications may be made without departing from the broader aspects of applicants' contribution. The actual scope of the protection sought is intended to be defined in the following claims when viewed in their proper perspective based on the prior art.
The present disclosure is a Continuation-In-Part (CIP) of, and claims a filing priority to, U.S. patent application Ser. No. 18/165,477, filed on Feb. 7, 2023, and titled “Hyperspectral Sensing System And Method For Qualitative Analysis of Fluids,” which is a Continuation-In-Part (CIP) of U.S. patent application Ser. No. 17/396,986 also titled “Hyperspectral Sensing System and Method For Qualitative Analysis of Fluids” and filed on Aug. 9, 2021, now U.S. Pat. No. 11,650,145. The '477 application and the '145 Patent are both hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 18165477 | Feb 2023 | US |
Child | 18893458 | US | |
Parent | 17396986 | Aug 2021 | US |
Child | 18893458 | US |