The objects, features and advantages of the present invention will be apparent from the following detailed descriptions of the disclosed aspects of the invention in conjunction with reference to the following drawings, where:
The present invention relates to a space-invariant independent component analysis and electronic nose for detection of selective chemicals in an unknown environment, and more specifically, to an approach to analysis of sensor responses to mixtures of unknown chemicals by an electronic nose in an open and changing environment.
The following description, taken in conjunction with the referenced drawings, is presented to enable one of ordinary skill in the art to make and use the invention and to incorporate it in the context of particular applications. Various modifications, as well as a variety of uses in different applications, will be readily apparent to those skilled in the art, and the general principles, defined herein, may be applied to a wide range of embodiments. Thus, the present invention is not intended to be limited to the embodiments presented, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein. Furthermore, it should be noted that unless explicitly stated otherwise, the figures included herein are illustrated diagrammatically and without any specific scale, as they are provided as qualitative illustrations of the concept of the present invention.
In the following detailed description, numerous specific details are set forth in order to provide a more thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without necessarily being limited to these specific details. In other instances, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring the present invention.
The reader's attention is directed to all papers and documents that are filed concurrently with this specification and are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference. All the features disclosed in this specification, (including any accompanying claims, abstract, and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
Furthermore, any element in a claim that does not explicitly state “means for” performing a specified function, or “step for” performing a specific function, is not to be interpreted as a “means” or “step” clause as specified in 35 U.S.C. Section 112, Paragraph 6. In particular, the use of “step of” or “act of” in the claims herein is not intended to invoke the provisions of 35 U.S.C. 112, Paragraph 6.
The present invention provides a space-invariant topology to enable an Independent Component Analysis (ICA) to solve chemical detection from two unknown mixing chemical sources. The specific system architecture enables the space-invariant ICA to achieve a robust independent signal source separation in an unknown open environment. Additionally, the system architecture provides an optimal topology for hardware-friendly implementation and an effective architecture for reduced data source requirements.
The space-invariant topology along with the techniques of maximum entropy information and natural gradient descent demonstrates that it is effective to separate the two mixed unknown chemical sources with unknown mixing levels to the array of two original sources under insufficient sampled data. From separated sources, they can be identified by projecting them on 11 known chemical sources to find the best match for detection.
Simulations have shown that 100% correct detection could be achieved under two cases: a) under-completed case where the number of input mixtures is larger than number of original chemical sources; and b) regular case where the number of input mixtures is the same as the number of original chemical sources. The time-invariant topology approach may face obstacles with an over-complete case, insufficient data and cumbersome topology.
Four sets of unknown paired mixture sources are collected via an electronic nose, which, in one non-limiting example, is a JPL 16-ENose sensor array (made at the Jet Propulsion Laboratory, Pasadena, Calif.). The Enose collects the mixture sources in the unknown environment with, at most, 12 samples' data collected. Per time-invariant aspect, this appears to be an over-complete case in ICA where the number of outputs (32) is larger than the number of inputs (16).
In one non-limiting example, the approximation of the set up topology of the electronic nose array is shown in
Due to the small separation between the sensors themselves, the input of each sensor resistance is assumed to be uniquely distributed. The sensory data can be modeled as follows:
Where fi is the unknown non-linear activation, αj is the unknown mixing coefficient of chemical source j, and i is the index of sensor number and N as number of sensor and sj is original source.
From equation (1), using the first order of a Taylor expansion, it is rewritten:
And soj is an operating point of the source sj. For each sampling data point in time for the same sensor i, xi fluctuates around its operating point αi and it can be considered as a common bias for xi(t) with tε[t, t+kΔt]. From this argument, Equation (2) can be simplified to:
Where Y is unbiased mixture data, Γ is the unknown mixing matrix and S is the chemical source signal.
The learning rule based on the maximum entropy algorithm is given by:
y=g(u)=g(Wx); (5)
where g is a non linear function e.g the logistic function or hyperbolic tangent function. The update weight can be calculated as:
ΔW=W
−T+Φ(u)xT; (6)
where W−T is an inverse transport of the NXN weight matrix W, xT is a mixing input vector (observed vector), and
To simplify equation (5) using a natural gradient descent technique, the learning rule can be:
ΔW=(I+Φ(u)uT)W (8)
With u=Wx
Space-Invariant Architecture
The most common ICA approach is that the number of variables and the number of sources are the same. However, in this study there are two obstacles: 1) there are 12 or less samples (mixing chemical compounds) from each sensor and the total number of sensor is 16 and they do not have sufficient data set (at least 16 data samples required); and 2) the number of variables is 16 as number of sensors while the number of compounds in a mixture is 2 and it is considered over complete case.
For the time-invariant approach, the data that will flow orthogonal 402 with the time-invariant direction as shown in
Instead of using a time-invariant approach, the space-invariant approach 404 allows for more data points and enables the square mapping matrix (the dimension of mixing sources and sensors are the same). This approach is feasible due to the mathematical model based on equations (2)-(4). The architectures are shown in
From laboratory set up, a set of single spectra of 11 chemicals using 16 elements is collected in the ENose sensor array; it is averaged and shown in
In this embodiment, the number of sensors used is 16 and mixing chemical sources is 2; four sets of data will be examined, as shown in table of
Simulation Results
Based on the data available provided in
Experiment 1
In this case, the four subsets of the sampled data in
Data Set 1
For data set 1, there are 12 mixtures of chemicals 1 and 7 and the data are shown in
Using the space-invariant ICA approach, the recovered signal sources (chemical 1 and 7 sources) are shown in
To confirm its performance, the separated sources 1 and 7 were projected by ICA technique on the known 11 chemical sources shown in
As can be inferred from
Data Set 2
Data set 2, a mixture of chemicals 3 and 10, is plotted in
For the mixture from data set 2, the performance of space-invariant ICA has demonstrated its effective capability to separate the mixture of chemicals 3 and 10 as shown in
Table IV in
Data Set 3
Data set 3, a mixture of chemicals 2 and 13, is plotted in
For the mixture of chemicals 2 and 13, space-invariant ICA has demonstrated its capability to separate the mixture of chemical 2 and 13 as shown in
The table in
Data Set 4
Data set 4, a mixture of chemicals 2 and 5, is plotted in
For mixture from data set 4 shown in
Table VI of
Experiment 2
In this experiment, data set 1 (16×12) is paired in columns to obtain the data set 96×2. From this conversion, data values in a single row are the data from the same sensor with consecutive sampling times (t+i*Δt and t+(i+1)*Δt); this new data set allows the same number of mixing sources and of original sources.
Using this new data set, space-invariant ICA has produced the results that were validated with the 11 known chemicals. Classification was 100% correct, based on the projection on 11 classes shown in
To simplify the results, the mean and standard deviation of its projection were tabulated (separated sources of chemical 1 and chemical 7) on each single chemical source, and the results are summarized in the table in
Appendix A, incorporated herein by reference, further includes a series of flow chart diagrams providing the order in which data is processed and analyzed in the space-invariant independent component analysis.
Discussion
To separate two (2) mixing sources from a sixteen (16) element sensory data array, known as the over-complete case, poses a challenges for mathematical model and network topology. The non-overlapped paire-wise (i.e. sensor i and sensor i+1) or overlapped pair-wise (i.e. (sensor i and sensor i+1) and (sensor i+1 and sensor i+2) so on) may face cumbersome and ineffective techniques.
As shown above, the mathematical model has demonstrated space-invariant ICA to be an effective topology to overcome insufficient data samples and the over-complete case. Moreover, the chemical data itself is fuzzy and inconsistent. The simulation demonstrated that the chemical source separation problem can be solved effectively with complete time sampling data (k=12) (under complete case) and two consecutive sampling data (k=2). Optimal topology may require a model of noise in order to determine the size of the sampling input. Moreover, space-invariant ICA governed by equation (4) is only valid when the sampling time is sufficiently small. Hence, the sampling time also plays an important role to ensure that the model approach holds.
Conclusions
A successful mathematical model to enable the space-invariant ICA topology from which Infomax and natural gradient descent technique can be applied has been demonstrated, and simulation has confirmed that the modeling is effective and sufficient to perform chemical source separation to enable the smart ENose to detect mixtures of chemicals in an open environment.
The present application is a non-provisional utility application, claiming benefit of priority of U.S. Provisional Patent Application No. 60/715,353, filed Sep. 7, 2005, titled “Smart Enose for Chemical Detection in the Open Environment.”
The invention described herein was made in the performance of work under a NASA contract and is subject to the provisions of Public Law 96-517 (35 USC 202) in which the Contractor has elected to retain title.