The problem of parameter estimation for electromagnetic signals remains a central one for geolocation, tracking, pattern recognition, medical imaging, threat detection and for the detection of gestures and other time varying behavior. Most geolocation techniques used rely on phase and angle measurements or line of bearing measurements which are known to be sub-optimal.
Recently, however, techniques have arisen which do not take the information lossy step of only saving time of arrival or angle of arrival, but instead consider the entire channel. These techniques have been extended for the pure geolocation application in the presence of interference. However, there are additional missing considerations for these techniques to achieve their full potential.
First and foremost, an adequate model of the channels in question is needed, together with an understanding of the model errors and non-idealities. Indeed, previous techniques are potentially very sensitive to calibration errors, and furthermore they do not address near field channels, which need to be modeled in order to handle near field applications such as gesture modeling, shape modeling, medical imaging and security applications. These simple channel models are also inadequate to handle radar, especially in the near field.
There is also a need to marry these channel-based techniques with the extensive literature and novel techniques in the field of Bayesian estimation, since Bayesian estimation allows one to naturally obtain error estimates and is well suited for applying machine learning models for pattern recognition. In particular one or more embodiments of the invention look at models of Gaussian mixtures and solve them using techniques similar to Gaussian Sum particle filters. Gaussian sum filters have been used in part for the tracking problem, but using GPS to obtain geolocation. Techniques provided herein will apply to both the near and far field and be more accurate due to the removal of information lossy steps.
Adding some novel techniques for obtaining solutions to electromagnetic boundary problems, along with some cutting edge techniques in Bayesian estimation, and by retaining channel state information directly, one or more embodiments are directed to obtain a novel approach to the estimation and detection problem for multiple antennas/transmitters.
One or more embodiments of the invention are directed to the use of multi-antenna radar to identify and classify objects of interest as depicted in
The baseline transceiver block diagram is shown in
The direct path transmit waveform is isolated from the receivers via RF shielding, and delay filtering if needed (described below). If a change is detected in the environment, a position estimate is formed using Bayesian estimation with a likelihood function formed using a channel model that models the electromagnetic scattering over the presumed target type. The estimation is essentially performed using hypothesis testing on the target position, coupled with prior information about previous targets.
The target type (shape, dielectric constants, etc.) is determined by computing the Bayesian posterior probability of that type category. The Bayesian estimation is described below in Parameter Estimation and Tracking. The channel model uses data obtained through careful calibration of the antenna arrays, the antenna noise models, and the scattering type. The channel model is described below in System Model and the calibration procedure is described in Calibration. A more detailed step by step description of the calibration process is illustrated in
Numerous other advantages and features of the invention will become readily apparent from the following detailed description of the invention and the embodiments thereof, from the claims, and from the accompanying drawings.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee. A fuller understanding of the foregoing may be had by reference to the accompanying drawings, wherein:
While the invention is susceptible to embodiments in many different forms, there are shown in the drawings and will be described in detail herein the preferred embodiments of the present invention. It should be understood, however, that the present disclosure is to be considered an exemplification of the principles of the invention and is not intended to limit the spirit or scope of the invention and/or claims of the embodiments illustrated.
System Model
The invention proposes, in one or more embodiments, a general receiver model that can be utilized for localization, object classification or even medical imaging. The model focuses on a set of views of the environment that depend on a parameter vector of interest p. Consider a diverse set of K observations of an emitter (or passive scatterer) on one or more multi-antenna receivers/sensors. The observations are indexed by k for k=1:K. The “receivers” may in fact simply be views of the signal of interest at K different frequencies, K receivers at different spatial locations (possibly separated in time), a single receiver receiving a signal from K transmitters, or views of the signal after correlating over K different basis functions or code words.
The signal received by the k′th receiver (or during the k′th observation) is modeled by,
$$x_k(t)=\sum_{q=1}^{Q}\mathbf{h}_{kq}(\alpha_{kq},p_q,s_{kq}(t))+i_k(t), \qquad (1)$$
where $x_k(t)$ is the $M_k\times 1$ received waveform vector at sensor k and time t, $i_k(t)$ is an additive interference plus noise process for sensor k, and $\mathbf{h}_{kq}(\alpha_{kq},p_q,s_{kq}(t))$ is a vector channel process that modulates the transmitted signal $s_{kq}(t)$. Sensor k receives signals from possibly Q different emitters or from Q different reflectors. The channel process depends on a set of possibly random unknown parameters $\alpha_{kq}$ unique to sensor k and emitter q, and on a global parameter vector $p_q$, which is of primary interest to our receiver processing algorithms. The $p_q$ parameter might consist of geolocation coordinates, target velocity, X-ray mass absorption coefficients, or other signal invariants of interest.
If it is assumed that each view of the environment is statistically independent (though this requirement can be relaxed), one obtains the following likelihood function,
where $P(x|p)$ is the probability of observing the total view column vector $x=[x_1^T,x_2^T,\ldots,x_K^T]^T$ given $p=[p_1^T,p_2^T,\ldots,p_Q^T]^T$, $f_k(x)$ is the probability density function for the interference process $i_k$, and $g_k(\alpha_{kq})$ is the prior distribution of the noisy, free parameters of the channel process $\mathbf{h}_{kq}(\cdot)$.
Linear Channel Model
Often of great interest is the narrow band channel model, which treats the channel process as a simple complex multiply. This approximation is valid for linear time invariant systems, whose signal of interest is either narrow band, or has been channelized into several narrow band channels. In this case, in part the k index can serve as a channel index. Equation (1) is written as,
The notation can be further simplified by writing this in the time sampled digital domain as,
$$x_k(n)=H_k s_k(n)+i_k(n),$$
$$H_k\equiv[\mathbf{h}_{k1}(\alpha_{k1},p_1),\mathbf{h}_{k2}(\alpha_{k2},p_2),\ldots,\mathbf{h}_{kQ}(\alpha_{kQ},p_Q)]$$
where n is the time sample index, $H_k$ is the $M_k\times Q$ channel matrix as a function of the nuisance parameter vector $\alpha_k\equiv[\alpha_{k1}^T,\alpha_{k2}^T,\ldots,\alpha_{kQ}^T]^T$ and the parameter of interest vector $p\equiv[p_1^T,p_2^T,\ldots,p_Q^T]^T$, for all Q emitters/reflectors. Also the emitter signal waveforms are packed into the $Q\times 1$ signal vector $s_k(n)\equiv[s_{k1}(n),s_{k2}(n),\ldots,s_{kQ}(n)]^T$.
Colored Gaussian Interference
Some processing simplifications can be made by subsuming all unmodeled interfering waveforms into the single interference vector ik (n) and further presuming that this interference is complex Gaussian, yet not white. In particular it can be presumed the interference takes the form of a zero mean multivariate, complex circularly symmetric Gaussian vector,
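$$p_i(i_k|R_{i_k i_k})=|\pi R_{i_k i_k}|^{-1}\exp(-i_k^H R_{i_k i_k}^{-1} i_k).$$
The zero mean Gaussian normal distribution is referred to by $N(0,R_{i_k i_k})$, with the density written as
$$N(i,R)\equiv|\pi R|^{-1}\exp(-i^H R^{-1} i).$$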
This allows the likelihood function to be written as,
where Tr(⋅) is the matrix trace operator and o(a)≡aaH is the outer product operator.
We now perform some manipulations to expose the sufficient statistics for the probability function as follows:
where the time averaged cross and auto correlation matrices are defined by
and where etr (A)≡exp (Tr(A)). A matrix version of completing the square provides a way to identify sufficient statistics for this problem. We write:
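$$p(x_k|H_k,s_k)=|\pi R_{i_k i_k}|^{-N}\,\mathrm{etr}\Big(-N R_{i_k i_k}^{-1}\big[(H_k-\hat H_k)\hat R_{s_k s_k}(H_k-\hat H_k)^H+\hat R_{x_k x_k}-\hat H_k\hat R_{s_k s_k}\hat H_k^H\big]\Big).$$
Note that $\hat H_k\equiv\hat R_{x_k s_k}\hat R_{s_k s_k}^{-1}$ does not depend on $H_k$, so the likelihood can be written as
$$p(x_k|H_k,s_k)=\eta\,\mathrm{etr}\big(-N R_{i_k i_k}^{-1}(H_k-\hat H_k)\hat R_{s_k s_k}(H_k-\hat H_k)^H\big),$$
$$\eta\equiv|\pi R_{i_k i_k}|^{-N}\,\mathrm{etr}\big(-N R_{i_k i_k}^{-1}(\hat R_{x_k x_k}-\hat H_k\hat R_{s_k s_k}\hat H_k^H)\big).$$
This makes $\hat H_k\equiv\hat R_{x_k s_k}\hat R_{s_k s_k}^{-1}$, together with $\hat R_{s_k s_k}$, a sufficient statistic for estimating the channel matrix $H_k$.
We can write this in the whitened form,
$$p(x_k|H_k,s_k)=\eta\,\mathrm{etr}\big(-N(\tilde H_{s_k}-\hat{\tilde H}_{s_k})^H(\tilde H_{s_k}-\hat{\tilde H}_{s_k})\big) \qquad (5)$$
where $\tilde H_{s_k}\equiv R_{i_k i_k}^{-1/2}H_k\hat R_{s_k s_k}^{1/2}$ and $\hat{\tilde H}_{s_k}\equiv R_{i_k i_k}^{-1/2}\hat H_k\hat R_{s_k s_k}^{1/2}$, with the matrix square roots taken as Cholesky factors such that $R_{i_k i_k}^{-1}=R_{i_k i_k}^{-H/2}R_{i_k i_k}^{-1/2}$ and $\hat R_{s_k s_k}=\hat R_{s_k s_k}^{1/2}\hat R_{s_k s_k}^{H/2}$.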
The interference matrix and its Cholesky factors can be estimated independently, either by observing the system in a quasi-stationary state, or by canceling out all known signals of interest.
This result is dependent on $s_{kq}(n)$ being a sufficiently wide band signal to make $\hat R_{s_k s_k}$ invertible.
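The wide band channel estimate can be illustrated numerically. The following is a minimal NumPy sketch, assuming the estimate takes the usual least squares form (time-averaged cross-correlation of data and signal, multiplied by the inverse of the signal autocorrelation). The array sizes, the random channel, the interference scaling, the example delays and the helper name `crandn` are illustrative assumptions and are not taken from the disclosure. The last lines show why a narrow band signal needs separate treatment: the signal autocorrelation matrix collapses to rank one.

```python
import numpy as np

rng = np.random.default_rng(0)
M, Q, N = 4, 2, 2000           # antennas, emitters, time samples (illustrative)

def crandn(*shape):
    """Unit-variance circular complex Gaussian samples (hypothetical helper)."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

H = crandn(M, Q)               # true channel matrix H_k
S = crandn(Q, N)               # wide band emitter waveforms, one row per emitter
A = crandn(M, M)
R_ii = A @ A.conj().T / M + 0.1 * np.eye(M)     # colored interference covariance
L = np.linalg.cholesky(R_ii)
X = H @ S + 0.1 * (L @ crandn(M, N))            # x_k(n) = H_k s_k(n) + i_k(n)

R_xs = X @ S.conj().T / N      # time-averaged cross-correlation
R_ss = S @ S.conj().T / N      # time-averaged signal autocorrelation
# H_hat = R_xs R_ss^{-1}, computed with a linear solve instead of an explicit inverse
H_hat = np.linalg.solve(R_ss.conj().T, R_xs.conj().T).conj().T
rel_err = np.linalg.norm(H_hat - H) / np.linalg.norm(H)

# Narrow band case: every emitter carries the same waveform up to a phase,
# so the signal autocorrelation is rank one and cannot be inverted.
s_nb = crandn(1, N)
phases = np.exp(-2j * np.pi * np.array([[0.1], [0.3]]))   # assumed f_k * tau_q values
S_nb = phases @ s_nb
rank_nb = np.linalg.matrix_rank(S_nb @ S_nb.conj().T / N)
print(rel_err, rank_nb)
```

With a wide band waveform the relative error shrinks as the number of samples grows, whereas in the narrow band case only a single combined channel vector per frequency can be identified.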
Here we assume that the position parameter vector p is shared by all reflected multipath components.
Narrow Band Channels
The important case wherein skq(n) is a narrow band signal, such as a sinusoid, requires special treatment. This case would arise, for example, when the signal is a multi-tone signal common in stepped frequency radar, or a known signal type that has been channelized.
If one specializes to the case wherein k represents a frequency fk and q is associated with a path delay τq, we can approximate the signal of interest as,
$$s_{kq}(n)\approx s_k(n)e^{-2\pi j f_k\tau_q},$$
where sk (n) is a narrow band signal centered at fk.
{circumflex over (R)}x
where
Similarly we have,
{circumflex over (R)}s
This allows Equation (3) to be written as,
Similar to Equation (5), one can write the whitened version of the likelihood function for the narrow band channel as,
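$$p(x_k|H_k,s_k)=\eta\exp\big(-N\hat r_{s_k s_k}\|\tilde h_k-\tilde{\hat h}_k\|^2\big) \qquad (7)$$
$$\eta\equiv|\pi R_{i_k i_k}|^{-N}\exp\Big(-N\big[\mathrm{Tr}(R_{i_k i_k}^{-1}\hat R_{x_k x_k})-\hat r_{s_k s_k}\|\tilde{\hat h}_k\|^2\big]\Big)$$
where $\tilde h_k\equiv R_{i_k i_k}^{-1/2}h_k$ is the whitened channel vector and $\tilde{\hat h}_k$ is the correspondingly whitened channel estimate.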
Aperture Noise Model
For proper estimation of channel parameters it is critical to have an accurate noise model of the aperture vector $\mathbf{h}_{kq}(\alpha_{kq},p_q)$. This is because some forms of interference whitening can cause the true aperture vector to be canceled in the estimator derived from a naive implementation of maximum likelihood estimation.
For simplicity we refer to the parameterized aperture vector as $h=\mathbf{h}_{kq}(\alpha_{kq},p_q)$. If we subsume all the other reflectors/receivers/transmitters associated with the other q indices in Equation (2) into an interference vector, a narrow band channel receiver model can be written for the aperture vector as,
x(n)=hs(n)+i(n).
From this we can consider several different aperture noise models.
Additive Gaussian Noise
This model simply adds Gaussian noise to the true aperture ht,
h=ht+e,
where e is colored Gaussian complex noise. In general e may be dependent on the unknown parameters αkq, pq. It can be useful to consider a piecewise approximation of the e noise parameters for use in Bayesian estimation. In particular we could presume that the covariance matrices are dependent on an index that is unique to a given region or assumption about the nuisance parameters.
The whitened least squares estimators suggested by Equation (5) and Equation (7) are conducive to analyzing aperture noise models under the assumption that they are Gaussian.
Multiplicative Aperture Noise
The channel may be subject to phase noise due to oscillator imperfections. This is often modeled as multiplicative noise. In this case we would model the channel as,
h=ht⊙(1+e),
where ⊙ represents elementwise multiplication and e is colored Gaussian complex noise.
One approximation subsumes this noise model into the simpler Additive Gaussian Noise model, with e replaced by ht⊙e. Another treats this model by adding an aperture square gain term to the diagonal of the interference covariance matrix, further multiplied by the relative phase noise power.
A more constrained approach to aperture noise allows a noise multiplier for each spread frequency. So for frequency k we model the receiver channel as an Mk×1 vector,
$$\tilde h_k\alpha_k,$$
where $\tilde h_k$ is a possibly whitened component of the channel seen at frequency k across all receivers. The receiver model can be written as,
$$\tilde x_k=\tilde h_k\alpha_k+e_k$$
$$\alpha_k=1+\varepsilon_k.$$
Here, $\tilde x_k$ is the whitened received data, in the form of an estimated channel response, playing the role of $\tilde{\hat h}_k$ in Equation (7).
Assuming a complex Gaussian noise model we have,
Here σ2 is the variance of the additive white Gaussian noise (post whitening) and βk2 is the variance that controls the deviation of the per-frequency channel gain from unity. By varying βk2 we can express our confidence in the fidelity of the channel model. A large βk2 suggests that there is a lot of frequency dependent model error in the channel, perhaps due to phase noise arising from a stepped frequency tuner.
We can now integrate away the nuisance parameter representing the gain αk to obtain,
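$$P(\tilde x_k|\tilde h_k)=\det(\pi R_{x_k x_k})^{-1}\exp\big(-(\tilde x_k-\tilde h_k)^H R_{x_k x_k}^{-1}(\tilde x_k-\tilde h_k)\big) \qquad (9)$$
where
$$R_{x_k x_k}\equiv\sigma^2 I+\beta_k^2\tilde h_k\tilde h_k^H.$$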
The posterior expectation for αk and its maximum likelihood estimate is given by,
It can be shown that to evaluate the argument in the exponential of Equation (9), it suffices to substitute {circumflex over (α)}k into αk in Equation (8). Also the determinant is easily computed as,
$$\det(\pi R_{x_k x_k})=\pi^{M_k}\sigma^{2(M_k-1)}\big(\sigma^2+\beta_k^2\|\tilde h_k\|^2\big).$$
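The marginal likelihood with a per-frequency gain multiplier can be evaluated without forming or inverting the full covariance matrix. The sketch below is a minimal NumPy illustration, assuming a unit-mean complex Gaussian gain with variance `beta2` and white post-whitening noise with variance `sigma2`, which gives a covariance of the form identity plus rank one. It uses the Sherman-Morrison identity for the quadratic form and the matrix determinant lemma for the determinant, then checks both against a dense computation. The function names `log_marginal` and `alpha_hat`, the posterior-mean expression for the gain, and the test values are assumptions made for illustration.

```python
import numpy as np

def log_marginal(x, h, sigma2, beta2):
    """log P(x|h) for x = h*alpha + e, alpha ~ CN(1, beta2), e ~ CN(0, sigma2*I)."""
    M = h.size
    hh = np.vdot(h, h).real                    # ||h||^2
    r = x - h                                  # residual about the unit mean gain
    # Sherman-Morrison: R^{-1} = (I - beta2 h h^H / (sigma2 + beta2 ||h||^2)) / sigma2
    quad = (np.vdot(r, r).real
            - beta2 * abs(np.vdot(h, r)) ** 2 / (sigma2 + beta2 * hh)) / sigma2
    # Determinant lemma: det(pi R) = pi^M sigma2^(M-1) (sigma2 + beta2 ||h||^2)
    logdet = M * np.log(np.pi) + (M - 1) * np.log(sigma2) + np.log(sigma2 + beta2 * hh)
    return -logdet - quad

def alpha_hat(x, h, sigma2, beta2):
    """Posterior mean of the gain alpha (assumed Gaussian prior centered at 1)."""
    return (np.vdot(h, x) / sigma2 + 1 / beta2) / (np.vdot(h, h).real / sigma2 + 1 / beta2)

rng = np.random.default_rng(3)
M, sigma2, beta2 = 6, 0.2, 0.5
h = rng.standard_normal(M) + 1j * rng.standard_normal(M)
x = 1.3 * h + 0.3 * (rng.standard_normal(M) + 1j * rng.standard_normal(M))

# Dense reference computation
R = sigma2 * np.eye(M) + beta2 * np.outer(h, h.conj())
r = x - h
marg_quad = np.vdot(r, np.linalg.solve(R, r)).real
dense = -np.linalg.slogdet(np.pi * R)[1] - marg_quad

# Substituting the gain estimate into the joint exponent reproduces the marginal exponent
a = alpha_hat(x, h, sigma2, beta2)
joint_quad = np.linalg.norm(x - h * a) ** 2 / sigma2 + abs(a - 1) ** 2 / beta2
print(log_marginal(x, h, sigma2, beta2), dense, joint_quad, marg_quad)
```

The fast form costs O(M) per evaluation, which matters when the likelihood is evaluated at many candidate positions.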
In the case that the multiplicative noise is applied over both antenna number and frequency number, we have the receiver channel vector modeled as,
{tilde over (x)}mk={tilde over (h)}mkαmk+emk,
where m is the antenna index and k is the frequency index. This model leads to the conditional distribution of,
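$$P(\tilde x_k|\tilde h_k)=\det(\pi R_{x_k x_k})^{-1}\exp\big(-(\tilde x_k-\tilde h_k)^H R_{x_k x_k}^{-1}(\tilde x_k-\tilde h_k)\big),$$
where
$$R_{x_k x_k}\equiv\sigma^2 I+\mathrm{diag}(|\tilde h_k|^2)\,\mathrm{diag}(\beta_k^2),$$
where $\mathrm{diag}(|\tilde h_k|^2)$ is an M×M diagonal matrix whose m′th diagonal element is $|\tilde h_{mk}|^2$, and $\mathrm{diag}(\beta_k^2)$ is a diagonal matrix whose m′th diagonal element is $\beta_{mk}^2$, the variance of the gain multiplier $\alpha_{mk}$.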
Multi-Target and Multi-Scatter Models
We now generalize the model of Multiplicative Aperture Noise to deal with the narrow band antenna model in the presence of either multiple scatterers or multiple emitters.
In this case we use the channel model proposed in Equation (6),
xk=Hkgk+ek (10)
Hk=[hk1,hk2, . . . hkQ]
gk≡[αk1,αk2, . . . αkQ]T
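$$g_k\sim N(\mathbf{1},R_{g_k g_k})$$
where $\mathbf{1}$ is the all ones vector, and where we drop the tilde notation for whitened data, assuming that the data and channels have been whitened if needed. The q index is the scatterer index or emitter index.
From this model we obtain $x_k\sim N(H_k\mathbf{1},R_{e_k e_k}+H_kR_{g_k g_k}H_k^H)$, so that
$$p(x_k|H_k)=\det(\pi R_{x_k x_k})^{-1}\exp\big(-(x_k-H_k\mathbf{1})^H R_{x_k x_k}^{-1}(x_k-H_k\mathbf{1})\big),$$
$$R_{x_k x_k}\equiv R_{e_k e_k}+H_kR_{g_k g_k}H_k^H.$$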
The mixture model in Equation (10) is very suggestive of multi-user detection (MUD). Indeed either successive interference cancellation (SIC) or parallel interference cancellation (PIC) are viable techniques for separating each superimposed channel vector. The fact that these vectors are parameterized by only a few position and gain parameters means that it is possible to separate them using their positions and delays as a filter. Typically the processing will keep a set of parameters associated with a given emitter/scatterer. These parameters can be used to generate associated channel vectors and thus to cancel them from the current environment using SIC or PIC. The position parameters are then re-estimated for each active scatterer, which enables the process to repeat so that the cancellation can improve iteratively.
Note also that one can treat one of the “scatterers” as the static background field. It can be canceled from the environment using the same MUD techniques if one wants the processing to focus only on time varying phenomena.
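The iterative cancel-and-re-estimate loop described above can be sketched as follows. This is a minimal illustration only: the calibrated channel model is replaced by a hypothetical far field, half-wavelength uniform linear array response `steer(theta)`, the position parameter is reduced to a single angle searched over a grid, and each scatterer is refit by a simple matched filter. None of these choices come from the disclosure; they stand in for whatever parameterized channel vectors an embodiment would use.

```python
import numpy as np

M = 8  # number of antennas (illustrative)

def steer(theta):
    """Hypothetical half-wavelength ULA response, standing in for h(p)."""
    return np.exp(1j * np.pi * np.arange(M) * np.sin(theta))

grid = np.linspace(-np.pi / 2, np.pi / 2, 721)       # candidate angle parameters
A = np.stack([steer(t) for t in grid], axis=1)       # M x G table of channel vectors

def fit_one(resid):
    """Best single scatterer (grid index, complex gain) for a residual vector."""
    corr = A.conj().T @ resid / M                    # matched filter over the grid
    g = int(np.argmax(np.abs(corr)))
    return g, corr[g]

def sic(x, n_scat, n_iter=10):
    """Successive interference cancellation over n_scat scatterers."""
    idx = [0] * n_scat
    gain = np.zeros(n_scat, dtype=complex)
    for _ in range(n_iter):
        for q in range(n_scat):
            # Cancel the current estimates of all other scatterers, then refit q
            others = sum(gain[j] * A[:, idx[j]] for j in range(n_scat) if j != q)
            idx[q], gain[q] = fit_one(x - others)
    return grid[idx], gain

rng = np.random.default_rng(4)
noise = 0.01 * (rng.standard_normal(M) + 1j * rng.standard_normal(M))
x = 1.0 * steer(-0.4) + 0.6j * steer(0.5) + noise    # two superimposed channel vectors
theta_hat, gain_hat = sic(x, n_scat=2)
order = np.argsort(theta_hat)
print(theta_hat[order], np.abs(gain_hat[order]))
```

Treating the static background as one more entry in the cancellation list follows the same pattern: its vector is subtracted before each refit, so only the time varying components are re-estimated.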
Multi-Aperture Target and Phase Normalization
The Multi-target signal model in Equation (10) can be used as a basis for a more complex and accurate target model. The receiver can be modeled as,
where the target channel αk is seen as a mixture of multiple related component channels, (e.g. a human body with torso and limbs). The uk vector is presumed to be the stationary background clutter. For simplicity one can presume that the coefficients are Gaussian distributed, as is the background clutter with,
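$$g_k\sim N(\bar g_k,R_{gg}),$$
$$u_k\sim N(\bar u_k,R_{uu}).$$
The probabilities for the signal on and signal off cases are,
$$p(x_k|\mathrm{ON})=N(x_k-H_k\bar g_k-\bar u_k,\;H_kR_{gg}H_k^H+R_{uu}),$$
$$p(x_k|\mathrm{OFF})=N(x_k-\bar u_k,R_{uu}),$$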
from this one can compute the posterior probability of the target present (ON) and target not present (OFF),
where p1 is the prior probability of the target being present and p0=1−p1.
An additional modification to our model is to allow the removal of bulk phase terms for each channel component. Thus we have,
where cq is a unit modulus phase equalization term. This is often necessary for smaller wavelengths, wherein the bulk phase can change rapidly over small changes in the target position.
One can use successive interference cancellation (SIC) or parallel interference cancellation techniques to learn cq, simply by rewriting,
Finally, one can use similar techniques to add and subtract channel vectors from the collection that defines a target. The channels that have the lowest gain parameters can be dropped and a new channel can be added near to the canceled signal,
We either pick hp as part of the calibrated array manifold or simply as a raw data estimate.
Position/Parameter Uncertainty
Each channel vector has a dependency on its position and other extraneous parameters, $h_{kq}(p_q,u)$. It is of interest to presume that there is some uncertainty with regard to these parameters. For example u may contain the positions of the receiver antennas, whilst $p_q$ may contain the uncertain scatterer/emitter positions. We may assume or estimate priors for these parameters, $p_q\sim f(p_q)$ and $u\sim g(u)$. If these prior distributions are Gaussian, then we have $p_q\sim N(p_{0q},R_p)$ and $u\sim N(u_0,R_u)$. Defining the real multivariate normal distribution by $\mathcal{N}(\alpha,R)\equiv\det(2\pi R)^{-1/2}\exp(-\tfrac{1}{2}\alpha^TR^{-1}\alpha)$, we can write the likelihood function as,
Using Bayes theorem we can write out the posterior distribution as,
While we do not have a closed form solution for the posterior distribution we can use a Gaussian mixture model and importance sampling to produce a close approximation to it. These techniques are discussed in more detail in Gaussian-Mixture-Filter, herein.
Range Filtering
For many of the applications it is possible to know the precise timing of the transmit waveform. This is true, for example, for radar applications, for global positioning systems, and possibly for cellular broadcast channels. This permits the receiver to filter signals based on their range by exploiting the time delay of arrival. This is useful for removing unwanted multipath and other types of co-channel interference.
Consider the narrow band model where the signal is received at multiple frequencies indexed by k. At the receiver's m′th antenna we might see a signal of the form,
$$x_{km}=h_{km}e^{-2\pi j f_k\tau_m},$$
where $\tau_m$ is the signal delay seen at antenna m, and where $f_k$ is the signal frequency for index k. We perform an inverse discrete Fourier transform, $\mathcal{F}^{-1}$, and obtain,
If it is assumed that the frequency hopping is linear, $f_k=f_0+k\Delta$, and that $h_{km}\approx h_m$ is dominated by a frequency independent mode, we then have,
One can plot what the simple delay channel looks like in the time domain. Here assuming K=128, Δ=20 MHz and τm=5 ns, we plot the impulse response in
As the number of frequencies K→∞, the time response would approach an impulse function. For an emitter/reflector that has an approximately known range, i.e. during tracking, it would be possible to use its round trip or total delay to filter out unwanted multipath or interfering signals.
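The delay-domain view and the range gate can be sketched numerically with the values quoted above (K=128, Δ=20 MHz, τ=5 ns). The following minimal NumPy example assumes a unit-gain, frequency-flat single path, an arbitrary start frequency `f0`, an interfering path at 30 ns, and a rectangular gate of plus or minus three delay bins; all of these, along with the function name `range_gate`, are illustrative assumptions rather than parameters from the disclosure.

```python
import numpy as np

K, delta_f, tau = 128, 20e6, 5e-9        # values quoted in the text
f0 = 2.4e9                               # assumed start frequency (not from the text)
f = f0 + np.arange(K) * delta_f          # linear frequency hopping f_k = f0 + k*delta
x = np.exp(-2j * np.pi * f * tau)        # unit-gain, frequency-flat single path

bin_width = 1.0 / (K * delta_f)          # delay spacing of the K-point inverse DFT
y = np.fft.ifft(x)                       # delay-domain (impulse) response
peak_bin = int(np.argmax(np.abs(y)))
tau_est = peak_bin * bin_width           # coarse delay estimate from the peak bin

def range_gate(x_freq, lo_bin, hi_bin):
    """Keep only delay bins lo_bin..hi_bin, then return to the frequency domain."""
    y_delay = np.fft.ifft(x_freq)
    mask = np.zeros(len(x_freq))
    mask[lo_bin:hi_bin + 1] = 1.0
    return np.fft.fft(y_delay * mask)

# Add an unwanted path at 30 ns and remove it by gating around the tracked delay
x_mix = x + 0.8 * np.exp(-2j * np.pi * f * 30e-9)
x_gated = range_gate(x_mix, peak_bin - 3, peak_bin + 3)
err_before = np.linalg.norm(x_mix - x) / np.linalg.norm(x)
err_after = np.linalg.norm(x_gated - x) / np.linalg.norm(x)
print(peak_bin, tau_est, err_before, err_after)
```

Because the true delay rarely falls exactly on a delay bin, a rectangular gate also removes a small part of the desired signal; a wider or tapered gate trades that loss against interference rejection.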
The required processing is illustrated in
The time delay filtering concept in
Note for other system paradigms such as multi-transmitter geolocation, such as global positioning systems (GPS), one can do time delay filtering simply by limiting the lags of the spreading code correlators to a predetermined range.
Parameter Estimation and Tracking
One primary tool, used in one or more embodiments, for estimating free parameters is Bayesian estimation. It provides both asymptotically optimal estimates of our parameters and post estimation error bounds. Bayesian estimation techniques have been given a large boost by Markov Chain Monte Carlo methods, importance sampling and particle filters, since these techniques do not require the evaluation of intractable integrals. Furthermore the particle techniques do not require the computation of derivatives or sequential optimization, and are particularly suited for multiprocessor computation, either on a multi-core processor or an embedded graphics processing unit (GPU).
Gaussian Mixture Particle Filters
A particularly simple and efficient particle filter technique involves using a Gaussian or Gaussian mixture density as a prior to generate a lot of random points. Those points are then weighted by the conditional distribution, to then obtain a Gaussian mixture posterior estimate. The technique can use a simple Gaussian, or to approximate more complex priors we can use Gaussian mixtures.
While most of the prior art deals with dynamic systems and therefore updates its particles/points using the system model, we are initially concerned with simply estimating the posterior distribution from our set of random guesses (aka particles) generated by our priors. Our notation below drops the tilde for simplicity, but the received data and channel models could, without loss of generality, be whitened to make them interference resistant.
Bayes theorem for computing the posterior distribution can be written as,
where x is the received data estimate of the channel, and p is the parameter vector containing the location and occurs in the estimator as the argument of h(p), the channel vector model. If the number of received antennas is a constant M for each channel number k, then x and h are of dimension MK. One possible example of a likelihood function can be derived from Equation (9),
As can be seen, it might be very difficult to optimize the conditional likelihood function using ordinary techniques due to its complex dependency on p.
Define the real multivariate Gaussian probability density function as,
$$\mathcal{N}(p;R)\equiv\det(2\pi R)^{-1/2}\exp\big(-\tfrac{1}{2}p^TR^{-1}p\big).$$
One can propose to model the prior and posterior probabilities as a Gaussian mixture. The prior is assumed to adhere to,
where Σjηj=1. One can now propose the following algorithm to update our Gaussian mixture so that it approximates the posterior probability given in Equation (12) with p(p)=p0(p).
Gaussian Mixture Filter Update
1. Sample N random points from the Gaussian mixture prior p0(p), pn.
2. Compute the conditional likelihood function over all the random points,
λn=p(x|pn).
3. Normalize the likelihoods to sum to one,
$$\lambda_n\leftarrow\lambda_n\Big/\sum_{n'=1}^{N}\lambda_{n'}.$$
4. Resample a subset of Q<N points from the original N, where the probability of choosing pn equals λn. Call this set of renumbered points pv(q) for subindex q and the associated original index, n=v(q).
5. Use the EM algorithm and the points pv(q) to learn a new set of η′j, m′j and R′j that fit the sampled points pv(q), q=1 . . . Q.
We now delve into some of the details of the algorithm.
Sampling the Mixture and Resampling
For a given mixture probability of the form,
we can generate N pseudo-random samples from any such distribution for which we can sample directly from fj(p). For each n we randomly select the category j to which pn will belong, based on the category probabilities ηj. Selecting the category can be done in approximately O(log(J)) comparisons by using a binary search into the cumulative sum array of ηj, namely çj ≡ Σ_{q=1}^{j} ηq. When a uniform random variate u is chosen, we need merely find j such that çj-1≤u<çj, using binary search, and initializing ç0=0.
Once the category is chosen for a given n, we generate a random sample using the pdf fj(p). For example when $f_j(p)=\mathcal{N}(p-m_j;R_j)$, we can sample from any given multivariate Gaussian distribution by setting
$$p_n=m_j+C_j^Te_n,$$
where en is the n′th generated unit variance Gaussian random vector with independent components, each component selected according to any number of standard procedures for generating Gaussian samples, e.g. the ziggurat method. Here the Cj matrix is the upper Cholesky factor of the positive definite covariance matrix Rj. These computations are highly amenable to parallel processing as well.
After this we choose to sub-sample our original large sample of N points in proportion to the likelihood.
Again we can compute a cumulative sum of the likelihoods vn ≡ Σ_{q=1}^{n} λq, followed by a binary search to find an n for which a newly generated uniform random variable uq satisfies vn−1≤uq<vn. This is our n=v(q).
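The sampling and resampling steps can be sketched with a cumulative sum and a binary search, here using `numpy.searchsorted`. This is a minimal illustration, assuming real-valued parameter vectors and a toy Gaussian likelihood centered on a made-up observation; the function names `sample_mixture` and `resample`, the mixture parameters and the sample counts are all assumptions chosen for the example.

```python
import numpy as np

def sample_mixture(eta, means, covs, n, rng):
    """Draw n points from a Gaussian mixture with weights eta."""
    cum = np.cumsum(eta)                              # cumulative category probabilities
    u = rng.random(n)
    cat = np.searchsorted(cum, u, side="right")       # j with cum[j-1] <= u < cum[j]
    cat = np.minimum(cat, len(eta) - 1)               # guard against rounding in cum[-1]
    d = means.shape[1]
    pts = np.empty((n, d))
    for j in range(len(eta)):
        sel = np.nonzero(cat == j)[0]
        C = np.linalg.cholesky(covs[j]).T             # upper factor, R_j = C^T C
        e = rng.standard_normal((sel.size, d))        # unit variance Gaussian vectors
        pts[sel] = means[j] + e @ C                   # row form of m_j + C^T e
    return pts, cat

def resample(points, lam, q, rng):
    """Pick q points with probability proportional to their likelihoods lam."""
    cum = np.cumsum(lam / lam.sum())
    u = rng.random(q)
    v = np.minimum(np.searchsorted(cum, u, side="right"), len(lam) - 1)
    return points[v], v

rng = np.random.default_rng(1)
eta = np.array([0.3, 0.7])
means = np.array([[-2.0, 0.0], [3.0, 1.0]])
covs = np.array([[[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.6], [0.6, 2.0]]])
pts, cat = sample_mixture(eta, means, covs, 20000, rng)

x_obs = np.array([3.0, 1.0])                          # toy observation (assumed)
lam = np.exp(-0.5 * np.sum((pts - x_obs) ** 2, axis=1))   # toy likelihood p(x|p_n)
sub, v = resample(pts, lam, 5000, rng)
frac0 = np.mean(cat == 0)
cov1 = np.cov(pts[cat == 1].T)
frac_sub1 = np.mean(cat[v] == 1)
print(frac0, cov1, frac_sub1)
```

Both the per-category sampling and the likelihood evaluation are independent across points, which is what makes the procedure easy to parallelize.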
Let us compute the probability distribution for pv(q). We have,
We have Σq≠np(x|pq)+p(x|p)≈N ∫pp(x|p)p0(p)dp, by the law of large numbers, i.e. the sample mean approaches the expected mean as the number of independent samples N, approaches infinity, and p is also chosen independently. This shows that the conditional distribution of pv(q) asymptotically approaches our desired posterior distribution. In what follows we label the resampled points as pv(q)→pq.
Expectation Maximization Algorithm
The Expectation Maximization (EM) algorithm can be used to learn the free parameters of a Gaussian mixture model from collected data. In Sampling the Mixture and Resampling it was shown how to approximately sample from the posterior distribution given an arbitrary likelihood function. Thus this is the final step needed to update the parameters associated with the posterior distribution p(p|x).
The Gaussian mixture model for the posterior distribution will take the form,
The initial values for ηj, mj, and Rj are chosen to be the same as the prior. The EM algorithm consists of first taking the expectation of the logarithm of the conditional distribution p1(p|m,R), which is a lower bound to the original function, followed by a maximization of this lower bound over the unknown parameters mj and Rj to find the next iterate. This can be summarized as provided below.
Expectation Step
Compute the posterior likelihood of a sampled point pq being in category j,
The parameter Nj can be thought of as the expected number of points associated to category j. Note ΣjNj=Q.
Maximization Step
The updated mixture parameters are given by,
The Expectation and Maximization steps are performed repeatedly until the log-likelihood function increase is below a predefined threshold. The log likelihood function is defined as,
The form of EM algorithm suggests a modification that potentially allows one to skip the resampling component of Sampling the Mixture and Resampling. Using the original sampling of pn drawn from the prior distribution, modify (13) to read,
This causes a given pn to be directly weighted by its likelihood function.
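The EM iteration, including the likelihood-weighted variant that skips resampling, can be sketched as below. This is a minimal NumPy illustration under stated assumptions: real-valued parameter vectors, standard Gaussian mixture EM updates, and weighting implemented by multiplying each point's category responsibilities by its normalized likelihood. The function names `gauss_pdf` and `em_fit`, the small diagonal regularizer, and the test data are hypothetical choices made for the example and should not be read as the exact update of the disclosure.

```python
import numpy as np

def gauss_pdf(p, m, R):
    """Real multivariate normal density evaluated at the rows of p."""
    diff = p - m
    quad = np.einsum("nd,nd->n", diff, np.linalg.solve(R, diff.T).T)
    _, logdet = np.linalg.slogdet(2 * np.pi * R)
    return np.exp(-0.5 * (quad + logdet))

def em_fit(points, eta, means, covs, weights=None, tol=1e-6, max_iter=200):
    """Fit mixture weights, means and covariances; weights=None gives plain EM."""
    Q, d = points.shape
    w = np.ones(Q) if weights is None else np.asarray(weights, float) * Q / np.sum(weights)
    eta, means, covs = eta.copy(), means.copy(), covs.copy()
    prev = -np.inf
    for _ in range(max_iter):
        # Expectation step: (weighted) probability of each point belonging to category j
        dens = np.stack([eta[j] * gauss_pdf(points, means[j], covs[j])
                         for j in range(len(eta))], axis=1)
        total = dens.sum(axis=1)
        loglik = np.sum(w * np.log(total))
        resp = w[:, None] * dens / total[:, None]
        Nj = resp.sum(axis=0)                         # expected points per category
        # Maximization step
        eta = Nj / Nj.sum()
        for j in range(len(eta)):
            means[j] = resp[:, j] @ points / Nj[j]
            diff = points - means[j]
            covs[j] = (resp[:, j, None] * diff).T @ diff / Nj[j] + 1e-9 * np.eye(d)
        if loglik - prev < tol:                       # stop when the increase is small
            break
        prev = loglik
    return eta, means, covs, loglik

rng = np.random.default_rng(2)
data = np.concatenate([rng.normal(-3, 1, (600, 1)), rng.normal(3, 1, (1400, 1))])
eta1, m1, R1, ll = em_fit(data, np.array([0.5, 0.5]),
                          np.array([[-1.0], [1.0]]), np.array([[[1.0]], [[1.0]]]))

# Weighted variant: prior samples weighted directly by a toy likelihood, no resampling
prior_pts = rng.normal(0, 1, (20000, 1))
lik = np.exp(-0.5 * (2.0 - prior_pts[:, 0]) ** 2)     # observation 2.0, unit noise variance
eta2, m2, R2, _ = em_fit(prior_pts, np.array([1.0]), np.array([[0.0]]),
                         np.array([[[1.0]]]), weights=lik)
print(eta1, m1.ravel(), m2.ravel(), R2.ravel())
```

In the weighted example the prior is a unit Gaussian and the likelihood is Gaussian with unit variance, so the fitted single component should land close to the known posterior with mean 1 and variance 0.5.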
Geolocation and Tracking
We can use our Bayesian particle filter techniques to facilitate both geolocation and tracking. For a single target objective we can use Equation (9) as an objective function to perform an initial maximization over position to find p. The position parameter enters the equation through the channel model h(p).
Dynamic Models
One simple way to approach dynamic models, is to simply re-estimate changing parameters and allow them to deviate from their current values at each look at the environment. This can be accommodated by our Bayesian approach, simply by bounding the variance of the parameter of interest from below. So for example if ρ(p|x) is the posterior distribution of position parameter p given the observed data x, after many observations we might have,
We then can exponentially devalue stale information by updating in the log-domain,
$$\log(p_N(p|x))=\log(\rho(p|x_N))+\lambda\log(p_{N-1}(p|x))+c,$$
where 0<λ<1 and c is a normalization constant so that pN(p|x) integrates to 1. The exponential average puts a lower bound on the variance, which would tend towards zero if all the prior information was weighted equally. This has the effect of allowing a search over the parameter space in the vicinity of the current optimal p, with enough variance so that p can change according to system dynamics. This model works well for slow moving reflectors/emitters.
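The effect of the exponential forgetting factor can be seen in a small numerical sketch. The example below is a minimal illustration on a one-dimensional parameter grid, assuming each look contributes a Gaussian log-density with variance `s2` around a slowly drifting measurement; the grid, the drift rate, the forgetting factor and the function name `forget_update` are assumptions made for the example. For Gaussian looks the variance settles near `(1 - lam) * s2` rather than shrinking to zero, which is the lower bound on the variance mentioned above.

```python
import numpy as np

def forget_update(log_prev, log_look, lam, dp):
    """log p_N = log rho(p|x_N) + lam * log p_{N-1} + c, with c chosen to normalize."""
    log_new = log_look + lam * log_prev
    log_new = log_new - log_new.max()                      # avoid overflow in exp
    return log_new - np.log(np.sum(np.exp(log_new)) * dp)  # integrate to one on the grid

grid = np.linspace(-10.0, 10.0, 4001)       # candidate values of the parameter p
dp = grid[1] - grid[0]
s2, lam = 1.0, 0.8                          # per-look variance and forgetting factor
log_post = np.full(grid.size, -np.log(20.0))    # flat initial density on the grid

for n in range(200):
    z = 0.01 * n                            # slowly drifting measurement (assumed)
    log_look = -0.5 * (grid - z) ** 2 / s2  # unnormalized Gaussian look
    log_post = forget_update(log_post, log_look, lam, dp)

post = np.exp(log_post)
mean = np.sum(grid * post) * dp
var = np.sum((grid - mean) ** 2 * post) * dp
print(mean, var)
```

The posterior mean follows the drifting parameter with a small lag, and the retained variance leaves room for the next look to move the estimate.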
Another approach for system dynamics is to assume that p adheres to a first order discrete constant velocity model, i.e.,
pN=pN−1+v.
The model then seeks to learn v and to update pN based on the observed statistics. Again we can limit the variance for estimating v so that we can allow the acceleration to change slowly over time.
Every emitter q in the environment will have an associated set of parameters, pq, that will need to be tracked and maintained. Moreover, for some systems, e.g. detecting hand gestures, there may be multiple emitters/reflectors associated with the same target, i.e. a hand, arm and human body. Emitters and reflectors can typically be associated by position over time relative to one another. We can use machine learning techniques to associate such objects and to classify their trajectories using channel vectors as the training data. A Bayesian belief network is a good model for this.
Detecting Environment Changes
During normal operation of any proposed sensor device, the environment may be quasi-stationary. Many applications require detection of moving or living objects and thus are improved by removing stationary objects. To this end we can consider a hypothesis channel model of the form,
xoff=u+e0
xon=u+h+e1,
where the channel vectors are stacked over all spread frequencies/look indices and have length MK elements, where u is the stationary receive vector due to either stationary emitters or stationary scatterers, e1 is the noise vector for the new signal present case, e0 is the signal off noise, and h is the channel vector for a new or moving emitter.
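Because we continuously monitor the array we can learn an approximate prior distribution for u and h wherein $u\sim N(\bar u,R_{uu})$ and $h\sim N(\bar h,R_{hh})$. The joint densities of x and u are then,
$$p(x,u|\mathrm{ON})=N(x-u-\bar h,R_{hh}+R_{ee})\,N(u-\bar u,R_{uu})$$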
p(x,u|OFF)=N(x−u,Ree)N(u−ū,Ruu).
After integrating out u one obtains,
$$p(x|\mathrm{ON})=N(x-\bar u-\bar h,R_{uu}+R_{hh}+R_{ee})$$
p(x|OFF)=N(x−ū,Ruu+Ree)
From this we can see that the posterior probability of x adhering to the ON model is,
where p1 is the prior probability that we see a new signal arriving and p0=1−p1. In general the distribution for h will be a Gaussian mixture, conditioned on the current set of captured targets and the possibility of a new acquisition. Thus we may choose to further condition the distribution on a category k, that defines the existing tracked targets plus an additional zero mean distribution for an unknown new target.
We also need to compute the posterior distribution for u so that we can update its statistics. We have,
p(u|x,ON)=N(ū1,R11),
$$\bar u_1=\bar u+R_{uu}(R_{uu}+R_{hh}+R_{ee})^{-1}(x-\bar u-\bar h)$$
R11=Ruu−Ruu(Ruu+Rhh+Ree)−1Ruu;
p(u|x,OFF)=N(ū0,R00),
ū0=ū+Ruu(Ruu+Ree)−1(x−ū)
R00=Ruu−Ruu(Ruu+Ree)−1Ruu.
The preferred modality is to update only when the OFF state is detected. Note also the similarity of these updates to the Kalman filter. The latter has a close relationship to Bayesian inference, when the distributions in question are normally distributed.
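The detection test and the background update can be sketched together. The following minimal NumPy example assumes complex circular Gaussian densities of the form used earlier, a marginal ON covariance equal to the sum of the background, target and noise covariances, and an update of the background statistics in the OFF state only. The function names `cn_logpdf`, `prob_on` and `update_background_off`, and all numeric values, are illustrative assumptions.

```python
import numpy as np

def cn_logpdf(r, R):
    """log of |pi R|^{-1} exp(-r^H R^{-1} r) for a residual vector r."""
    _, logdet = np.linalg.slogdet(np.pi * R)
    return -logdet - np.vdot(r, np.linalg.solve(R, r)).real

def prob_on(x, u_bar, h_bar, Ruu, Rhh, Ree, p1):
    """Posterior probability that x follows the ON (new signal present) model."""
    l_on = np.log(p1) + cn_logpdf(x - u_bar - h_bar, Ruu + Rhh + Ree)
    l_off = np.log(1.0 - p1) + cn_logpdf(x - u_bar, Ruu + Ree)
    return np.exp(l_on - np.logaddexp(l_on, l_off))   # computed in the log domain

def update_background_off(x, u_bar, Ruu, Ree):
    """Posterior mean and covariance of the background u given x in the OFF state."""
    G = Ruu @ np.linalg.inv(Ruu + Ree)                # Kalman-like gain
    return u_bar + G @ (x - u_bar), Ruu - G @ Ruu

MK = 4                                                # stacked vector length (illustrative)
u_bar = np.zeros(MK, dtype=complex)
h_bar = 2.0 * np.ones(MK, dtype=complex)
Ruu, Rhh, Ree = 0.1 * np.eye(MK), 0.5 * np.eye(MK), 0.05 * np.eye(MK)

p_when_present = prob_on(u_bar + h_bar, u_bar, h_bar, Ruu, Rhh, Ree, p1=0.5)
p_when_absent = prob_on(u_bar, u_bar, h_bar, Ruu, Rhh, Ree, p1=0.5)

x_off = u_bar + 0.3                                   # small background fluctuation
u_new, R_new = update_background_off(x_off, u_bar, Ruu, Ree)
print(p_when_present, p_when_absent, u_new, np.diag(R_new))
```

Updating only in the OFF state keeps a newly arrived target from being absorbed into the background estimate.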
Shape and Gesture Identification
Suppose that a target has a collection of emitters/scatterers associated with it, $h_q$, $q\in\mathcal{S}_e$, where e is a target index and $\mathcal{S}_e$ is the set of scatterers associated with a given target. The shape identification problem is to identify the set of channel vectors $h_q$ that correspond to a shape category $c_e$. The identification problem also often has a time dependency, so that one typically makes an identification over a time period $t\in[\tau-T,\tau]$. An example of this would be a human hand making a right circular motion. To identify this, we require data over a period of time long enough to verify that it is a circular motion in the right direction.
Mapping a set of channel vectors to a shape is a good candidate for machine learning and for the use of a Bayesian belief network. The training process in learning and identifying targets is part of the calibration process for the array. Each target will have a very different spatial signature, especially after the target is observed over time. The category associated with a target's shape, the target's orientation and its position are all candidate parameters for learning within the p vector of the overall parameter estimation problem.
While the data set is large, the identification problem can often be broken down into a set of smaller more reasonable steps. For example we can consider the problem of detecting a human hand wave. If we model body parts as cylinders, we need to find a cylinder “object” making changes in the elevation coordinate, or more precisely a cylinder that is rotating about an axis in the x-y plane. The gesture detection problem is roughly outlined in
The initial state occurs when the target of interest appears stationary or has not yet entered the field of view. Once motion is detected via the techniques in Detecting Environment Changes we then use Bayesian inference to detect the object location, its orientation and object shape using the Bayesian inference techniques of Gaussian Mixture Particle Filters and Geolocation and Tracking.
For the gesture detection we pay attention to its orientation and its elevation coordinate. After two changes in elevation coordinate we can presume that the object is waving. Not shown in
Many other gestures and motion detection schemes can be broken up into discrete steps, whose transitions can be appropriately detected by Bayesian inference and machine learning in general. It is important for us to work directly with the channel vectors, since this gives us the optimal theoretical estimation and detection performance.
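The hand wave example can be sketched as a small state machine over a track of elevation estimates. This is only an illustration: it interprets "two changes in elevation coordinate" as two reversals of direction, and the threshold min_step and the function name detect_wave are hypothetical. In the approach described here the elevation estimates would come from the Bayesian tracking stage rather than from a fixed list.

```python
def detect_wave(elevations, min_step=0.05, reversals_needed=2):
    """Return True once the elevation track has reversed direction often enough.

    elevations: sequence of elevation estimates (radians), one per time step.
    min_step: hypothetical noise threshold; smaller changes are ignored.
    """
    if not elevations:
        return False
    reversals = 0
    direction = 0              # +1 rising, -1 falling, 0 not yet known
    anchor = elevations[0]     # last elevation that counted as a real change
    for value in elevations[1:]:
        delta = value - anchor
        if abs(delta) < min_step:
            continue           # treat small changes as a stationary target
        step = 1 if delta > 0 else -1
        if direction != 0 and step != direction:
            reversals += 1     # the motion changed from up to down or down to up
            if reversals >= reversals_needed:
                return True
        direction = step
        anchor = value
    return False


waving = detect_wave([0.0, 0.2, 0.4, 0.2, 0.0, 0.2])
rising_only = detect_wave([0.0, 0.1, 0.2, 0.3])
jitter = detect_wave([0.0, 0.01, 0.0, 0.01])
```

Each transition in such a scheme could instead be triggered by a likelihood test, which is closer to the Bayesian treatment described above.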
Using HOSVD in Machine Learning
Creating a neural network, Bayesian network or other machine learning structure, allows one to learn an arbitrary nonlinear function through a controlled data structure. An example of such a structure is provided in
In this directed graph each node represents a computation. For a Bayesian network the graph represents conditional dependency and each edge is weighted by the conditional probability, and the nodes represent a sum of products whose output is the posterior probability, i.e. for node k,
The initial probabilities and conditionals are possibly dependent on the received input data and par(k) is the set of nodes which are parents of node k in the directed graph.
For a neural network the weights on each edge multiply the values at the parent nodes; the products are summed and then passed through a nonlinear function to obtain the node output,
The machine learning literature contains many techniques for learning the weights and the structure of these networks in an attempt to match an arbitrary nonlinear function.
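The two node computations described above can be sketched as follows. The forms used here are assumptions consistent with the text, not the original equations: the Bayesian node is taken to be the sum over parents of conditional probability times parent probability, and the neural node is taken to be a nonlinearity (tanh, chosen arbitrarily) applied to a weighted sum. The function names and numbers are hypothetical.

```python
import math


def bayes_node(parent_probs, cond_probs):
    """Sum of products: sum over parents j of p(k | j) * p(j)."""
    return sum(c * p for c, p in zip(cond_probs, parent_probs))


def neural_node(parent_values, weights, activation=math.tanh):
    """Weighted sum of the parent outputs passed through a nonlinearity."""
    return activation(sum(w * v for w, v in zip(weights, parent_values)))


# Two parents with probabilities 0.3 and 0.7 and conditionals 0.9 and 0.2.
p_k = bayes_node([0.3, 0.7], [0.9, 0.2])

# Two parents whose weighted contributions cancel exactly.
y_k = neural_node([1.0, -1.0], [0.5, 0.5])
```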
We introduce here a new approach that exploits the structure compression properties of higher order singular value decomposition. In
The number of branches required to approximate most functions is significantly smaller than the requirement to tabulate the multivariable function. For example, when we look at HOSVD decompositions for Green function kernels in Basis Functions and HOSVD and Equation (31) we find only a tiny fraction of the singular values are significant as FIG. (11) shows.
Thus there may be significant computational savings in using this flattened structure, once the individual eigenfunctions are approximated by splines. This structure provides an innovative way of implementing many of the machine learning networks and for approximating the many functions used in partial differential equation boundary value problems, such as the Helmholtz wave equation for electromagnetic propagation.
Channel Modeling
A key component of this invention is the ability to successfully model the electro-magnetic channel seen by our receivers. Particularly challenging is the problem of electro-magnetic scattering. We propose a series of approximations to the scattering problem that exploit known geometries and some novel expansions of the electromagnetic reproducing kernel.
Our goal is to create a series of basis functions that can be used to calibrate our receiver array in the presence of various scattering objects. These functions can also be used to interpolate our scatterers over differing range and angle of arrival values.
Electro-Magnetic Scattering
For our calibration model, we desire to pick a set of basis functions that match the electromagnetic environment of the receiver. This is especially important for nearby scatterers for radar applications. There are some applications that require us to model a sizeable reflector within a few feet of the transmitter. For example for security purposes we might want to differentiate between an animal or human reflector, or we may want to detect hand or arm gestures or detect the presence of a forklift entering a warehouse etc.
For this reason we need to extend the emitter model beyond that of a point source. If we model the scatterer as a parameterized surface that admits a local orthogonal coordinate system, we can exploit a type of Green's theorem to model an electromagnetic field as a function of the field parameters on the distant closed surface. Indeed it is well known that the wave equation has a reproducing kernel, similar to Cauchy's theorem in complex analysis.
In the geometric Clifford algebra Cl0,3, the microscopic Maxwell's equations can be written as,
where c is the velocity of light, ∂t is the partial derivative operator with respect to time, D≡e1∂x+e2∂y+e3∂z is the Dirac operator, F≡E+icB is the electromagnetic field vector (or Faraday vector), ρ is the charge density and j is the current density vector. A vector is expressed by v≡Σk vkek, with the sum taken over k=1,…,3, and the coordinate vectors ek have an associative product that has the properties eqem=−emeq for q≠m and eq²=1. The pseudoscalar i≡e1e2e3 commutes with the ek and has the structural property i²=−1.
A product of vectors in the geometric algebra includes both the inner product and the cross product via the formula,
ab=a·b+ia×b.
Because of this it is possible to express Maxwell's equations in the concise form of Equation (14). In a region of no sources, and assuming the time dependency is harmonic, of the form exp(2πjft), we see that the electromagnetic field vector satisfies the Helmholtz equation,
DF=−jkF,
where the wavenumber is k=2πf/c.
Note that the j used here is a square root of −1 for tracking phase changes for a complex phasor at frequency f, which is different from the pseudoscalar i. An important property of the Dirac operator within a Clifford algebra is the fact that D²=∂x²+∂y²+∂z² is the scalar Laplace operator of the standard wave equation.
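The product rule ab=a·b+ia×b, and the fact that the square of a vector is a scalar (the algebraic reason D² reduces to the scalar Laplacian), can be checked numerically. The sketch below uses the Pauli matrices as a stand-in for e1, e2, e3, since they satisfy the same stated rules (they anticommute and square to 1). The matrix representation and all helper names are choices made for this illustration, not part of the original description.

```python
def matmul(a, b):
    """Product of two 2x2 complex matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


# Pauli matrices standing in for the coordinate vectors e1, e2, e3.
E = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
IDENT = [[1, 0], [0, 1]]
PSEUDO = matmul(matmul(E[0], E[1]), E[2])   # pseudoscalar i = e1 e2 e3


def vec(v):
    """Embed a 3-vector as v1 e1 + v2 e2 + v3 e3."""
    return [[sum(v[k] * E[k][i][j] for k in range(3)) for j in range(2)]
            for i in range(2)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def max_diff(m1, m2):
    return max(abs(m1[i][j] - m2[i][j]) for i in range(2) for j in range(2))


a, b = (1.0, 2.0, 3.0), (4.0, 5.0, 6.0)

# Left side: geometric product ab. Right side: a.b + i (a x b).
lhs = matmul(vec(a), vec(b))
i_axb = matmul(PSEUDO, vec(cross(a, b)))
rhs = [[dot(a, b) * IDENT[i][j] + i_axb[i][j] for j in range(2)]
       for i in range(2)]
product_error = max_diff(lhs, rhs)

# The square of a vector is the scalar |a|^2.
square_error = max_diff(matmul(vec(a), vec(a)),
                        [[dot(a, a) * IDENT[i][j] for j in range(2)]
                         for i in range(2)])
```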
The Cauchy integral formula for our Geometric algebra can be written in general by,
where dx*≡dydze1−dxdze2+dxdye3=Σkdx*kek, and ⋅* is the Hodge star operator, R is a volume in three space and ∂R is the boundary of the volume. The vector form dx* will be normal to any parameterized surface. In (15) if we substitute h→b and g→Da, we get,
Similarly if we substitute h→bD and g→a we get,
Subtracting these two equations yields a second order Green's formula,
Note that it is possible to extend both the Cauchy equation and Green's formula to the time harmonic case, addressing the Helmholtz equation. This is simply a matter of adding and subtracting the eigen value, or simply letting them cancel out for the Green's function. We can thus write,
Bicomplex Algebra
The addition of another commutative square root of negative one, j, for the purpose of tracking harmonic phase, causes the underlying geometric algebra to split into 2 isomorphic components. One component corresponds to outgoing waves with a wavenumber k, and the other to incoming waves with wavenumber −k. It also forces plane waves to decompose into right and left circular polarization.
The two algebras coincide with coefficients of either ι+≡½(1+ij) or ι−≡½(1−ij). The algebra isomorphism is better understood by noting,
ι+ι+ = ι+
ι−ι− = ι−
ι+ι− = 0
ι+ + ι− = 1
iι− − iι+ = j
ι−exp(iwt) + ι+exp(−iwt) = exp(jwt).
Thus, we can write the Clifford electromagnetic field as,
where the bar here is conjugation with respect to j. The j is replaced by i and its conjugate in the split algebra. This isomorphism allows one to drop the additional j and implement the algebra in our calculations by tracking Clifford numbers for both
One can recover the original representation by replacing,
ι+ → ½(1+ij)
ι− → ½(1−ij),
resulting in,
where,
Fr=½(F(k,r)+
Fi=½(F(k,r)−
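The identities satisfied by the two idempotents ½(1+ij) and ½(1−ij) can be verified with a small bicomplex number class. The sketch keeps only the two commuting square roots of −1 (i and j), which is all these identities need; the pseudoscalar i is represented by Python's built-in imaginary unit. The class name Bicomplex, the helper names and the test angle are hypothetical.

```python
import cmath
import math


class Bicomplex:
    """Number p + q*j, where p and q are ordinary complex numbers in the unit i."""

    def __init__(self, p, q=0.0):
        self.p = complex(p)
        self.q = complex(q)

    def __add__(self, other):
        return Bicomplex(self.p + other.p, self.q + other.q)

    def __sub__(self, other):
        return Bicomplex(self.p - other.p, self.q - other.q)

    def __mul__(self, other):
        # (p1 + q1 j)(p2 + q2 j) with j*j = -1 and j commuting with i.
        return Bicomplex(self.p * other.p - self.q * other.q,
                         self.p * other.q + self.q * other.p)

    def is_close(self, other, tol=1e-12):
        return abs(self.p - other.p) < tol and abs(self.q - other.q) < tol


ONE, ZERO = Bicomplex(1.0), Bicomplex(0.0)
I, J = Bicomplex(1j), Bicomplex(0.0, 1.0)
PLUS = Bicomplex(0.5) * (ONE + I * J)    # 1/2 (1 + ij)
MINUS = Bicomplex(0.5) * (ONE - I * J)   # 1/2 (1 - ij)


def exp_i(angle):
    return Bicomplex(cmath.exp(1j * angle))


def exp_j(angle):
    return Bicomplex(math.cos(angle), math.sin(angle))


wt = 0.7
checks = {
    "plus is idempotent": (PLUS * PLUS).is_close(PLUS),
    "minus is idempotent": (MINUS * MINUS).is_close(MINUS),
    "product vanishes": (PLUS * MINUS).is_close(ZERO),
    "sum is one": (PLUS + MINUS).is_close(ONE),
    "j is recovered": (I * MINUS - I * PLUS).is_close(J),
    "phasor splits": (MINUS * exp_i(wt) + PLUS * exp_i(-wt)).is_close(exp_j(wt)),
}
```

The last check shows why the split works in practice: a phasor in j becomes a pair of ordinary phasors in i with opposite signs, one on each idempotent component.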
Boundary Conditions
For the scattering problem it is important to understand electromagnetic boundary conditions. We presume linear media so that Equation (14) holds. In Equation (16) set h=1 and let g=F, the electro-magnetic field vector, so that (D+jk)
Let the volume of integration be an infinitely thin box that spans the boundary of two regions as shown in
where ρs is the surface charge density and js is the surface current density and n is the outward facing normal from surface A in region 1 to the left, and is negative to the normal in region 2 to the right of the boundary. dS is the scalar area differential. Since F is piecewise continuous and the volume of R→0, we also have, ∫RjkF→0. The sides of the box shrink to 0 rapidly and don't contribute to the area integral. The A region is presumed small enough so that F is constant there.
In one fell swoop, this proves the electromagnetic boundary conditions,
This shows in particular that,
n×(E1−E2)=0
n·(B1−B2)=0
The tangential component of the electric field is continuous, as is the normal component of the magnetic field. We can project the Faraday vector onto its tangential E component and normal B component via the surface projection operator,
Q−F≡½(F+nF̃n),
where ã is the principal involution defined by ẽk=−ek for k=1,…,3, and (ab)˜=ãb̃. Thus the continuity relations can be written as,
Q−(F1−F2)=0.
If you define the complementary projection operator onto the normal component of E by,
Q+F=½(F−nF̃n),
then we have on the surface,
For a perfect conductor the surface current paravector K can be found from the incident Faraday vector F0 by,
K=2⟨nF0⟩,
where ⟨a⟩ is the real part of the Clifford number a with respect to the pseudoscalar i=e1e2e3. Here the fields are evaluated at the surface of the conductor. We can also write down the reflected field from a perfect conductor as,
Fr(x)=−nF̃0(nx̃n)n,
Also of interest is the reversion involution defined by (ab)†=b†a† and ek†=ek. Note that ĩ=−i, and i†=−i. Also ⟨a⟩=½(a+a†). For non-ideal conductors we will generalize this rule somewhat via an unknown reflection coefficient α, so that,
Fr(x)=−αnF̃0(nx̃n)n (18)
The reflection coefficient α is allowed to be complex in j but not in i. These formulas require that the coordinate system is chosen so that the reflecting conducting plane passes through the origin. Otherwise we need to replace nx̃n with n(x−x0)˜n+x0, for some x0 on the reflecting surface.
This model yields the following surface current paravector (vector plus a scalar),
K=2⟨nF0α†⟩ (19)
where F0 is evaluated at the reflector surface. For monochromatic waves it suffices to evaluate the Faraday vector over a closed surface to determine its radiated pattern throughout space. This makes Equation (19) particularly useful for determining the electromagnetic field reflected off arbitrary surfaces.
It can be shown that the reflected electromagnetic field vector in Equation (18) satisfies Dirac's equation,
since the reflection formula is in fact just an improper Lorentz transformation. We can use Equation (19) as an approximation for non-ideal conductors, wherein the complex reflection coefficient α is a property of the underlying material. Since α can be learned during calibration and estimated from the received data, changes in α can be attributed to changes in the underlying dielectric material of different objects. This has applications to medical imaging, where the dielectric may change as a result of sugar levels, or tumors and so forth.
The Reproducing Kernel
Consider the Helmholtz form of the Cauchy equation in (16). Suppose we can find a function h(x) in the geometric algebra that has the property that applying the right Helmholtz operator yields the Dirac delta function,
h(x−x0)(D−jk)=δ(x−x0).
Furthermore consider a Faraday vector g(x) satisfying the Helmholtz version of Maxwell's equation,
Substitution into (16) yields,
Assuming that x0 lies in the spatial region R (possibly unbounded assuming decaying radiation conditions), we have,
For the case of no source charges or currents in the field of interest, we have an analogy to Cauchy's equation in complex analysis,
The reproducing kernel for geometric algebra can be written as,
This can be written as,
For the Green's theorem in (17), it is helpful to first define the Electromagnetic paravector potential as,
a=cA−ϕ,
where A is the electro-magnetic vector potential and where ϕ is the scalar potential. We also assume the Lorenz gauge which forces
where ⟨⋅⟩S is the scalar part of a Clifford number. This means,
Maxwell's equations now give us,
For the time harmonic case we get the Helmholtz version of this as,
In Equation (17) we now use a as the electromagnetic paravector potential and b as a Green's function satisfying (D²+k²)b(x−x0)=δ(x−x0). This yields,
It is customary to choose a Green's function that is constant or even zero on ∂R, however our typical use case evaluates the paravector potential at an arbitrary point x0 in space, requiring our surface to vary with the choice of receiver location x0. For this reason it suffices to simply choose the standard Green's function for the sphere,
from which we can obtain,
In a region with no sources we can therefore write the reproducing kernel for the paravector potential as,
Note also that if the region R is unbounded and that we assume a radiation decay condition that makes the surface integrals vanish, we obtain the usual time harmonic solution for the retarded potential,
Also note that the scalar potential, in particular, can be written as,
Reproducing Kernel Channel Model and Surface Inference
The reproducing kernel formulas suggest a possible numerical model for the reflected waveform. We pick a hypothesized surface R centered on the reflector position p, and with boundary ∂R. Given a model for the electromagnetic field from the transmitter, we can compute the reflected field from Equation (18), based on the local normal to the surface ∂R. From this we can compute the field at the remote receiver antenna using one of the reproducing kernel formulas in Equation (20) and Equation (24).
We summarize this algorithm as provided below.
Reproducing Kernel Algorithm
1. Hypothesize a position p for the center of the emitter/reflector region R.
2. Compute a set of Faraday vectors for the reflected field αqFr(xq)=−αqnqF̃0(p+rq)nq, where F0(x) is the transmitted Faraday vector seen at position x, and nq is the unit normal vector seen at position xq=p+rq∈∂R.
3. Compute the Faraday vector at antenna position y using an approximation to the reproducing kernel formula,
where Aq is the area associated with the sampled surface point xq.
4. Compute per antenna voltage hm≈bm⋅E(ym), where bm is the appropriate vector effective length for receiver antenna m, and E(ym) is the electric field component of g(ym). Computations can be reduced by bringing the E field projection and the inner product into the sum/integral, i.e.
where ⟨u⟩RV is the real vector part of u. Of course more sophisticated quadrature formulas can be used to obtain the integral approximation.
5. Use the computed channel vector hm as part of the Gaussian Mixture Filter Update algorithm from Gaussian Mixture Filter Update for estimating the reflector/emitter position p.
Another algorithm suggests itself from the form of the Reproducing Kernel Algorithm. It is theoretically possible to compute the contribution of all the nodes to the m′th antenna via,
ϑmq≡Aqbm·⟨h(xq−ym)nqFr(xq)⟩RV,
in advance after calibration. In this mode we would compute ϑmq over a dense grid xq, indexed by q. Thus with the ϑmq precomputed we can write an approximation for the channel vector as,
In this model we might consider the reflection coefficients αq and even the boundary set ∂R as free parameters, not to mention the implicit dependency on the surface normal unit vectors nq. By making these parameters invariant or part of some constrained manifold we might actually be able to infer the boundary from the collection of the received data over time. Allowing the boundary to change permits us to possibly detect human gestures, breathing or more complex objects in the field. The free parameters can be added to the global invariant parameter vector p and optimized using Gaussian Mixture Filtering.
This suggests the algorithm as provided below.
Surface Inference Algorithm
1. Hypothesize a boundary region ∂R, containing a center as part of the parameter vector p.
2. Identify all nodes q∈∂R and compute or update the unit normal vectors nq if necessary, based on the shape of ∂R.
3. Compute hm=Σq:xq∈∂R αqϑmq.
4. Use the Gaussian Mixture Filter algorithm to update the mixing gains αq in a locally constrained manner (i.e. small variance), and use nearby q to determine whether the boundary ∂R should be updated to achieve a better fit.
5. Test for convergence: stop if no further improvement can be made in the likelihood; otherwise return to step 1.
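The channel evaluation inside the Surface Inference Algorithm can be sketched as a table lookup followed by a weighted sum. The sketch assumes the channel approximation is linear in the reflection coefficients, so that each antenna voltage is the sum over boundary nodes of αq times the precomputed ϑmq. The function name channel_vector and all numeric values are hypothetical.

```python
def channel_vector(theta, alpha):
    """Approximate channel vector from a precomputed node table.

    theta[m][q]: precomputed contribution of boundary node q to antenna m.
    alpha[q]: complex reflection coefficient (mixing gain) of node q.
    Returns one complex voltage per antenna.
    """
    return [sum(a * t for a, t in zip(alpha, row)) for row in theta]


# Hypothetical table for 2 antennas and 2 boundary nodes.
theta = [
    [1.0 + 0.0j, 0.5j],
    [0.25 + 0.0j, -1.0j],
]
alpha = [0.8 + 0.0j, 0.5 + 0.5j]

h = channel_vector(theta, alpha)
```

Because the table is fixed after calibration, only the gains αq and the choice of active nodes change between filter iterations, which keeps the per-update cost to one small matrix-vector product.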
Factoring Common Phase Center
For higher frequencies (larger k), spherical harmonics produce phase terms of the form e−jkr, where r is the total distance traveled for the transmitted waveform (about twice the distance to the reflector). The
wave solution is always asymptotically true in the far field. Because r is usually much larger than the inter-element spacing of the array, in order to avoid floating point overflows or underflows it helps to factor out the phase response due to the center of the array.
Suppose the reflector is at position y and the center of the array is at x0, which we also assume is the transmitter position for simplicity. Assume also that xm=x0+dm is the position of receive antenna m. The total distance traveled to antenna m is,
rm=y0+∥x0−y+dm∥.
The total distance to x0 is,
If we factor out the phase center we have the relative phase term,
erel≡exp(−jk(rm−r0)).
But now we can write,
The last expression avoids the loss of precision inherent in subtracting two large numbers rm−r0 in the phase response. These terms are inherent in all models of electro-magnetic propagation, and it is often beneficial to work with the relative phase. Indeed one can smooth the array response by tossing away the bulk phase term e−jkr
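One standard way to avoid subtracting two large path lengths is to rationalize the difference of norms. The sketch below writes a=x0−y and d=dm, so that rm−r0=∥a+d∥−∥a∥=(2a·d+d·d)/(∥a+d∥+∥a∥). This identity is an assumption about the intended rewrite, since the original expression is not reproduced here; the function names and numbers are hypothetical.

```python
import cmath
import math


def norm(v):
    return math.sqrt(sum(c * c for c in v))


def path_difference(a, d):
    """Compute ||a + d|| - ||a|| without subtracting two large numbers.

    a: vector from the reflector to the array center (x0 - y).
    d: offset of the receive antenna from the array center (dm).
    """
    a_plus_d = [ai + di for ai, di in zip(a, d)]
    numerator = 2.0 * sum(ai * di for ai, di in zip(a, d)) + sum(di * di for di in d)
    return numerator / (norm(a_plus_d) + norm(a))


def relative_phase(k, a, d):
    """Relative phase term exp(-jk(rm - r0)) for wavenumber k."""
    return cmath.exp(-1j * k * path_difference(a, d))


# A very distant reflector and a small antenna offset (hypothetical values).
a = (1.0e8, 0.0, 0.0)
d = (0.0, 0.1, 0.0)

naive = norm([ai + di for ai, di in zip(a, d)]) - norm(a)   # suffers cancellation
stable = path_difference(a, d)
phase = relative_phase(2.0, a, d)
```

With these values the naive subtraction loses the entire difference in double precision, while the rationalized form retains it.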
Analytic Scattering Models
We now turn our attention to techniques which approximate the electro-magnetic field with analytic solutions using known functions or series of functions. We seek to obtain sets of basis functions that can be used to fit the scattering model at hand, usually as part of the calibration process.
We first look at some useful coordinate systems, whose level curves represent objects of interest. We also derive their Dirac operators.
Rectangular Coordinate System
This coordinate system is good for representing rectangular surfaces and boxes.
Cylindrical Coordinate System
This coordinate system is good for representing cylindrical surfaces.
Spherical Coordinate System
This coordinate system is good for representing spheres.
Prolate Spheroidal Coordinate System
Prolate spheroidal coordinate systems are good for modeling an ellipse that has been rotated about the z-axis.
Confocal Ellipsoidal Coordinates
This confocal ellipsoidal coordinate system has level surfaces corresponding to ellipsoids when ξ1 is constant,
hyperboloids of one sheet when ξ2 is constant,
and hyperboloids of two sheets when ξ3 is constant
where we additionally require,
ξ1<c²<ξ2<b²<ξ3<a².
Coordinates can be derived as,
The Dirac operators for the confocal ellipsoidal coordinates must be derived implicitly since (ξ1,ξ2,ξ3) can only be found from the roots of the third order polynomial equation in z,
Cauchy-Kovalevskaya Extension Theorem
Similar to the reproducing kernel concept in the Reproducing Kernel portion of this document, it is possible to generate an analytic solution to the wave equation, given its response on some closed surface. Suppose that the Dirac operator in a given coordinate system can be written as,
for a coordinate parameterization (ψ1,ψ2, ψ3). Suppose we know that given a fixed ψ3=0, the coordinate system forms a closed surface (e.g. for spherical coordinates if ψ3=r−1, then the closed surface is a sphere).
Define,
Furthermore suppose we have a differential operator μ(ψ3) that depends on ψ3 and let ϕ(ψ1,ψ2) be the value of F restricted to the surface ψ3=0. We will make the assumption that,
(D+jk)μ(ψ3)ϕ(ψ1,ψ2)=0 (26)
so that F(ψ1,ψ2,ψ3)=μ(ψ3)ϕ(ψ1,ψ2), satisfies the Helmholtz equation. If we require the operator μ(ψ3) to be valid for any surface function ϕ(ψ1,ψ2), then Equation (26) requires the following functional equation,
This yields the operator solution to μ(ψ3),
μ(ψ3)=exp(−(Dψ3)−1D12ψ3).
Thus the solution to the Helmholtz equation becomes,
F(ψ1,ψ2,ψ3)=exp(−(Dψ3)−1D12ψ3)ϕ(ψ1,ψ2).
As long as ϕ(ψ1,ψ2) is C∞, we can expand the operator function exp(−(Dψ3)−1D12ψ3) in power series of functions and derivatives with respect to ψ1 and ψ2 to obtain a solution that converges locally. This generalizes the Cauchy-Kovalevskaya extension theorem.
Another form of this has slightly better convergence properties. Starting from Equation (27) we write,
noting that the initial conditions require μ(0)=1. If Dc≡(Dψ3)−1D12, then we can write this as the integro-differential operator,
(1+∫Dc)μ=1,
where ∫g denotes the integral of g(ψ3) with respect to ψ3 from 0 to s. One can now argue that,
which is equivalent to repeated applications of (28). We can now use this theorem to find the field reflected by the incident wavefront by assuming that
ϕ(ψ1,ψ2)=−αnF̃0(nx̃n)n,
for x∈∂R defined by ψ3=0, and where F0(x) is the incident (transmitted) wave.
It is possible to extend the Kovalevskaya extension to actually recover the surface current paravector K for a perfect conductor and hence the reflected wave. Suppose we are given the reflected wave Fr, we can recover the outgoing surface component of this wave for ψ3≥0 via,
We actually take the Hardy projection onto the boundary, which requires that the limit be non-tangential, meaning that the limit sequence exists in a cone smaller than a half plane about its limit.
Our extended Kovalevskaya extension recovers the original function from the boundary, namely
μP+Fr=Fr.
We now invoke the surface projection operators, Q+ + Q− = 1.
We used here the electromagnetic continuity condition at the conductor boundary which requires Q−P+(Fr+F0)=0, where F0 is the incident wave.
This can be combined with Equation (28) to obtain the recursive formulas,
Convergence can be guaranteed for “small” μQ+P+, which is possible as ψ3→0, for sufficiently smooth F0.
The main obstacle for use appears to be the proliferation of symbolic terms for larger values of q.
Basis Functions and HOSVD
For a set of basis functions to approximate the scattered field, it is convenient to look at the separable solutions of the Helmholtz equation, (D²+k²)F=0. There are 11 orthogonal coordinate systems that are separable by the Helmholtz operator. Several of those coordinate systems were described in Analytic Scattering Models. We examine the spherical coordinate system and the associated spherical harmonics that are solutions of the Helmholtz equation, as one possible example for generating appropriate basis functions.
The Laplacian in spherical coordinate systems can be written as,
If we substitute ψ=R(kr)Θ(θ)Φ(ϕ) into the Helmholtz equation we obtain three separate ordinary differential equations:
From this we can obtain the general solutions,
R(r)=B1hn1(kr)+B2hn2(kr),
where hn1(kr) is the spherical Hankel function of the first kind, representing an inward traveling wave and hn2(kr) is the spherical Hankel function of the second kind, representing an outward traveling wave. These functions satisfy some recursion relations and can be expressed in terms of Bessel functions of fractional order. Additionally we have,
The polar angle dependency is solved by,
Θ(θ)=Pnm(cos(θ)),
where Pnm(x) is the associated Legendre function of the first kind. The azimuthal dependency is solved by,
Φ(ϕ)=exp(mjϕ).
The angular functions are typically collated into a single set of functions known as spherical harmonics Ynm(θ,ϕ),
The spherical harmonics are orthogonal satisfying,
where δm′m is the Kronecker delta. These functions are also complete, allowing the representation of any function on the sphere, f(θ,ϕ), as a series of spherical harmonics. The spherical harmonics are often written as a function of the unit normal vector
as Ynm({circumflex over (x)}), which is a polynomial function of the components of {circumflex over (x)}, called harmonic polynomials.
It is possible to decompose the spherical Green's functions h01(kr) and h02(kr) into spherical waves when
We can then write,
where jl(z) is the spherical Bessel function of the first kind, which is the real part of hl1(z) for real z.
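For order zero the spherical Hankel functions have simple closed forms, which makes the stated relationship to the spherical Bessel function easy to check. The sketch below uses the standard expressions for order zero: the first kind is −j·exp(jz)/z, the second kind is j·exp(−jz)/z, and the spherical Bessel function is sin(z)/z. The function names and the test argument are hypothetical.

```python
import cmath
import math


def h0_first(z):
    """Spherical Hankel function of the first kind, order 0."""
    return -1j * cmath.exp(1j * z) / z


def h0_second(z):
    """Spherical Hankel function of the second kind, order 0."""
    return 1j * cmath.exp(-1j * z) / z


def j0(z):
    """Spherical Bessel function of the first kind, order 0."""
    return math.sin(z) / z


z = 2.5
real_part_gap = abs(h0_first(z).real - j0(z))
conjugate_gap = abs(h0_second(z) - h0_first(z).conjugate())
```

The second kind carries the phase exp(−jz), which, with the exp(2πjft) time convention used here, corresponds to the outward traveling wave.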
Note that,
r=√(r1²+r0²−2r1r0cos(θ)),
where r0=∥x0∥, r1=∥x∥, and x0ᵀx=r1r0cos(θ).
Suppose that r(r0,r1,cos(θ))=√(r1²+r0²−2r1r0cos(θ)), and that we seek to find a decomposition similar to Equation (29) of the form,
Furthermore we seek such a decomposition that gives us the best fit to r(⋅), with the minimum number of terms. This problem is very similar to singular value decomposition for matrices, but here the problem is multi-linear, containing multiple “eigen-vectors”, αl(r0), βl(r1), and γl(cos(θ)). This problem can be solved using the multilinear singular value decomposition, or higher order SVD (HOSVD). This decomposition reduces a multilinear form (tensor) into a sum of mixed eigenvectors of the form,
where the Uji(n) are unitary matrices (entry (j,i)), and where si
The indices for these tensors all represent the values of continuous parameters, that have been densely sampled.
An example of the use of the HOSVD on the spherical Green's function on the left hand side of Equation (29) is shown in
Furthermore those functions of r1, r0 and cos(θ) are smooth as can be seen by the locus plots in
Harmonic functions generated by using HOSVD operations can be used as basis functions in general. Indeed we can apply the Helmholtz equation as a constraint, while matching Dirichlet style boundary conditions for any given surface. Suppose we decomposed the reproducing kernel in Equation (23) in a form similar to Equation (30),
where x=xe1+ye2+ze3, is the receiver location and dl are Clifford numbers. The field values are evaluated at the receiver location x0, which is fixed for each receive antenna. We can either use the reproducing kernel to produce the field at the receiver, or we can use these functions to fit the boundary conditions on a closed surface, using any convenient parameterization of that surface. Typically we would place the coordinate center at the center of the scattering object.
For the reproducing kernel case we have from Equation (20) and Equation (32) for rectangular coordinates
The scalar voltage could then be written as,
where b is the vector effective length for the receive antenna we are analyzing and Fr(x) is found from Equation (18). We can also use any coordinate parameterization for which the scatterer represents a level surface, not just a rectangular decomposition.
If 3-space is parameterized by ψ1,ψ2,ψ3, and the closed surface of the scatterer is described by ψ1=ρ, a constant, we can use the HOSVD to decompose the reflected wave
The HOSVD operation is performed over a discretized function of several variables, but we can use splines to interpolate the function values αl(ψ1), βl(ψ2) and γl(ψ3). We also modify the algorithm to satisfy the Helmholtz equation,
(D²+k²)v(ψ1,ψ2,ψ3)=0
where
The operator D2 will be a scalar operator that can be discretized and treated as a multi-linear tensor contraction on v(ψ1,ψ2,ψ3). This can be flattened to simply look like a matrix. To obtain the approximation we attempt to solve the Lagrangian problem,
Here the integration ∫ψ1ψ2ψ3 is approximated by a sum over a region of points likely to represent a receiver/scatterer location. Similar to the power method for eigenvalue problems, we can use alternating directions optimization to obtain functions and coefficients for this approximation problem.
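The alternating optimization mentioned above can be illustrated on the simplest case, a single separable term. The sketch below is a rank-one higher-order power iteration on a sampled three-way table: it updates one factor at a time while holding the other two fixed. It is not the full HOSVD and it omits the Helmholtz constraint and the Lagrangian; the function names and the test tensor are hypothetical.

```python
import math


def unit(v):
    """Return v scaled to unit length, together with its original length."""
    length = math.sqrt(sum(x * x for x in v))
    return [x / length for x in v], length


def rank_one_fit(T, sweeps=10):
    """Fit T[i][j][k] ~ s * alpha[i] * beta[j] * gamma[k] by alternating updates."""
    n1, n2, n3 = len(T), len(T[0]), len(T[0][0])
    beta, _len = unit([1.0] * n2)
    gamma, _len = unit([1.0] * n3)
    alpha, s = [0.0] * n1, 0.0
    for _sweep in range(sweeps):
        # Update each factor by contracting T with the other two factors.
        alpha, _len = unit([sum(T[i][j][k] * beta[j] * gamma[k]
                                for j in range(n2) for k in range(n3))
                            for i in range(n1)])
        beta, _len = unit([sum(T[i][j][k] * alpha[i] * gamma[k]
                               for i in range(n1) for k in range(n3))
                           for j in range(n2)])
        gamma, s = unit([sum(T[i][j][k] * alpha[i] * beta[j]
                             for i in range(n1) for j in range(n2))
                         for k in range(n3)])
    return s, alpha, beta, gamma


# Hypothetical test table that is exactly separable.
a0, b0, c0 = [1.0, 2.0], [1.0, 2.0, 2.0], [3.0, 4.0]
T = [[[x * y * z for z in c0] for y in b0] for x in a0]

s, alpha, beta, gamma = rank_one_fit(T)
residual = max(abs(T[i][j][k] - s * alpha[i] * beta[j] * gamma[k])
               for i in range(2) for j in range(3) for k in range(2))
```

Further terms could be obtained by subtracting the fitted term and repeating, although that deflation strategy is a simplification compared with a joint multi-term fit.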
Now implicitly v(ψ) is dependent on the center of the scattering body xr, which is one of the parameters we need to estimate. After solving the Lagrange constrained minimization problem in Equation (33) we obtain an approximating series solution Σlϑlαl(ψ1)βl(ψ2)γl(ψ3) also dependent on xr. Thus the voltage seen at receiver position x0 is given by V0(xr)=Σlϑlαl(ψ10)βl(ψ20)γl(ψ30), where (ψ10,ψ20,ψ30) is the parameterized coordinates of x0 relative to xr. If we compute this over a dense set of possible reflector positions we obtain another non-linear function that can be decomposed as a function of xr=(xr,yr,zr). This function can be decomposed again using HOSVD techniques to obtain,
Of course the voltage could theoretically be obtained using any electromagnetic modeling tool that can model the scatterer and obtain the voltage at the receive antennas. The key point here is that these computations can be done offline as part of the calibration process. Only the final “tabulated” voltage as a function of scatterer position needs to be retained for our algorithm purposes.
Image Techniques
Similar to the electrostatic case, it is possible to place image sources to satisfy electromagnetic boundary conditions. For a given dipole moment vector b we can write the electromagnetic vector for the simplest case of the Hertzian dipole as,
F=E+icB,
where,
where er≡Dr=x/r and P∥(er)b=er(er⋅b)=½(b+erber) and P⊥(er)b=b−er(er⋅b)=½(b−erber).
Suppose we have a Hertzian dipole 1 a distance h above an electromagnetic screen. We intend to place an image dipole 2 at a distance h below the screen, so that we can generate a reflected field, −nF̃(nx̃n)n, whose superposition with the incident field satisfies the boundary conditions.
This situation is illustrated in
If the coordinate origin is on the electromagnetic screen, and x1 is the center position of dipole 1 and n is the unit normal to the conducting surface, then the coordinate center for dipole 2, x2, is its reflection through the conducting plane P, i.e. x2=−nx1n. Note that the reflection operator is a homomorphism, i.e. if Rn(x)≡nx̃n, then Rn(ab)=Rn(a)Rn(b), and Rn(x)=x, when x∈P. We can write the reflected field as,
Suppose we decompose the E and B fields into their tangential and normal components,
E=En+Et,
B=Bn+Bt,
then we have,
Rn(E)=−En+Et
Rn(B)=−Bn+Bt.
From this and the fact that the radii are the same for both the reflected field and the original field, we see that at boundary surface the total field is,
This shows that the correct boundary conditions are satisfied inasmuch as the total tangential component of the electric field and the total normal component of the magnetic field vanish.
Because of the homomorphism properties of the reflection operator it is seen that the reflected field is the same as that due to an image dipole with dipole moment b2=nbn at the reflected position x2=−nx1n. The reflected electromagnetic field becomes,
Imaging techniques are a powerful way to handle rectangular boundaries. They are a good way to model electromagnetic propagation near the ground or over water by treating the ground as an electromagnetic conducting screen. Multiple plane boundaries can be modeled by reflecting each additional image across the new plane introduced into the environment. In the next section we will show how the concept can be extended to spherical boundaries using conformal mappings in 4-space. Furthermore it is straightforward to extend the technique to arbitrary spherical harmonics using the reflection operator Rn(F).
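For ordinary vectors and a unit normal n, the reflection −nxn reduces to the familiar formula x−2(x·n)n, and the image moment nbn reduces to 2(n·b)n−b. The sketch below places an image dipole using these vector forms. It assumes a screen through the origin; the function names and the numeric positions are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def reflect(x, n):
    """Reflect x through the plane through the origin with unit normal n."""
    s = 2.0 * dot(x, n)
    return tuple(xi - s * ni for xi, ni in zip(x, n))


def image_moment(b, n):
    """Image dipole moment n b n = 2 (n.b) n - b for a unit normal n."""
    return tuple(-c for c in reflect(b, n))


n = (0.0, 0.0, 1.0)            # screen is the x-y plane
x1 = (0.3, -0.2, 1.5)          # dipole 1, a height 1.5 above the screen
x2 = reflect(x1, n)            # image dipole position below the screen

in_plane = (2.0, 1.0, 0.0)     # a point on the screen stays fixed
fixed = reflect(in_plane, n)

b_vertical = image_moment((0.0, 0.0, 1.0), n)     # normal moment keeps its direction
b_horizontal = image_moment((1.0, 0.0, 0.0), n)   # tangential moment is reversed
```

Additional plane boundaries can be handled by applying reflect again to each existing image position with the normal of the new plane.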
Conformal Mappings
A conformal map in a Clifford algebra takes the form of a Möbius transform,
ϕ(x)=(ax+b)(cx+d)⁻¹,
where a, b, c and d are members of the Clifford group Γ3, which are products of non-zero vectors in Cl0,3. These mappings preserve the surface integrals in (16), namely
where
This implies,
Assuming h(y) is the reproducing kernel in (21), and that g(y) is left monogenic (i.e. satisfies the Helmholtz equation (D+jk)g=0), we obtain,
where y0=ϕ(x0). This demonstrates that one can use conformal maps to change the surfaces over which a solution can be evaluated.
In particular the Möbius transform is known to map spheres to spheres, where a “sphere” of infinite radius is a half plane. Thus we can formulate a solution to the boundary value problem in the half plane, using say the imaging solution of the prior section and then use a conformal map to map the problem to a sphere or vice-versa.
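The sphere-to-sphere property can be illustrated with a short numerical sketch. It uses inversion in the unit sphere, x ↦ x/|x|², as one representative conformal map of 3-space. This choice, the plane offset `d` and the sampled points are assumptions made for illustration only. Under this map the plane z = d is expected to land on the sphere of radius 1/(2d) centered at (0, 0, 1/(2d)).

```python
import numpy as np

def invert(x):
    """Inversion in the unit sphere, x -> x / |x|^2, applied row-wise."""
    x = np.asarray(x, dtype=float)
    return x / np.sum(x * x, axis=-1, keepdims=True)

d = 2.0                                   # assumed offset of the plane z = d
rng = np.random.default_rng(0)
xy = rng.uniform(-5.0, 5.0, size=(200, 2))
plane_pts = np.column_stack([xy, np.full(len(xy), d)])

sphere_pts = invert(plane_pts)

# Predicted image of the plane: a sphere through the origin.
center = np.array([0.0, 0.0, 1.0 / (2.0 * d)])
radius = 1.0 / (2.0 * d)

dist = np.linalg.norm(sphere_pts - center, axis=1)
max_dev = np.max(np.abs(dist - radius))
print("largest deviation from the predicted sphere:", max_dev)
```

Because the inversion is its own inverse, applying `invert` a second time maps the sphere back to the plane, which is the "vice-versa" direction mentioned above.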
Another way we can use conformal maps is to use the Vekua operator to map a solution of the Laplace equation (k=0) to a solution of the full Helmholtz equation. Conformal maps preserve the analytic property of being annihilated by the Dirac operator. Thus if Dg(x)=0, then DJ(ϕ,x)g(ϕ(x))=0. However we can use the Vekua operator to map this to a solution that also satisfies the Helmholtz wave equation D²u(x)+k²u(x)=0, via,
where J1(⋅) is the first order Bessel function of the first kind. The Vekua transform is actually a general way to associate solutions of the ordinary Laplace equation with solutions of the Helmholtz equation.
Calibration
The calibration procedure for an array that is required to perform geolocation consists of fitting a set of interpolating basis functions to a set of observed voltages seen on the array, for targets/reflectors at known positions in space. A lab or anechoic chamber is typically used for initial calibration. Calibration occurs not just for the free parameters of the channel model, but can also occur over different target types for a radar application, or even dynamic targets that involve gestures, beating hearts, or varying dielectric properties of materials.
Let vmkq(pq) be the voltage seen by the m′th sensor, k′th frequency/channel and the q′th calibration collect, and where pq is the “position” and shape parameter vector, for the q′th calibration event. The calibration process attempts to fit a linear combination of basis functions to the observed voltages. The fit metric is usually least squares,
The basis functions blm(pq) are a function of the position/shape parameter vector pq, antenna index m and calibration index q. While it is generally not a good idea to calibrate in the presence of interference, we might also presume that the received data and the basis functions have been interference whitened over the antenna index. We of course expect the basis functions to be different for different shapes in radar applications whose signals are reflected off of scatterers of different geometries.
The necessary conditions for the least squares calibration are given by,
So for each antenna-frequency pair we have the following L×L linear equation solution,
amk=Rb
where the l′th element of amk is αmkl the l, n′th element of the matrix Rb
It is interesting to note that certain coordinate systems allow the interpolation of the wave equation over all space once the voltages are defined over a closed surface. Thus it suffices to sample the environment in the near-field over an appropriate closed surface. In general however we are often required to grossly sub-sample the array response. In this case we can often choose the few dominant terms in our basis function, either found using HOSVD as in Basis Functions and HOSVD, or using something similar to Generalized Spherical harmonics from the same Section. The higher order harmonics have decreasing contributions to the overall wave function, especially in the far field.
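By way of a hedged example, the least squares calibration for a single antenna can be sketched as follows. The sketch assumes the standard normal-equation form R_b a = r, with R_b[l, n] = Σ_q b_l*(p_q) b_n(p_q) and r[l] = Σ_q b_l*(p_q) v_q, and for simplicity it assumes the same basis matrix is used for every frequency index. The function name `calibrate_gains`, the array shapes and the synthetic data are assumptions, not part of the described system.

```python
import numpy as np

def calibrate_gains(B, V):
    """Solve the L x L normal equations R_b a_k = r_k for one antenna.

    B : (Q, L) complex array, B[q, l] = basis function l at calibration point q
    V : (Q, K) complex array, V[q, k] = observed voltage, collect q, frequency k
    Returns A : (L, K) complex array of combining weights.
    """
    R_b = B.conj().T @ B          # (L, L) Gram matrix of the basis functions
    r = B.conj().T @ V            # (L, K) correlation of basis with voltages
    return np.linalg.solve(R_b, r)

# Synthetic, noise-free calibration table (illustrative values only).
rng = np.random.default_rng(1)
Q, L, K = 40, 4, 3
B = rng.standard_normal((Q, L)) + 1j * rng.standard_normal((Q, L))
A_true = rng.standard_normal((L, K)) + 1j * rng.standard_normal((L, K))
V = B @ A_true

A_hat = calibrate_gains(B, V)
print("max gain error:", np.max(np.abs(A_hat - A_true)))
```

For an ill-conditioned basis, a QR- or SVD-based solver such as `np.linalg.lstsq(B, V, rcond=None)` may be numerically preferable to forming R_b explicitly.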
Obtaining The Basis Functions
From Channel Modeling as described herein, a variety of techniques for generating analytic formulas for the wavefunction are shown, even for complex scatterers. Here we give a brief overview of the procedure.
1. Generate a model for the transmitted electromagnetic field.
2. Predict the field being scattered (for radar applications) off of the desired target type.
3. Use the prediction to generate basis functions for the received voltage at each antenna receiver and frequency.
4. Collect data in the near field for multiple scatterer/receiver positions.
5. Use least squares to estimate the complex combining weights for the basis functions.
The model for the transmitted electromagnetic field is at least partially determined by numerical modeling of the antenna array. We have seen a formula for the Hertzian dipole given in Equation 35. This formula actually corresponds to the first order Spherical Harmonics. In general we might write the electromagnetic vector potential as,
$$A_u(x) = h^{(2)}_{n(u)}(kr)\, Y_{n(u)}^{m(u)}(\hat{x})\, b,$$
where the u index has been flattened to include all m and n components of the spherical harmonics, and b is a dipole moment vector of the transmitting antenna. We can also use $h^{(1)}_{n(u)}(kr)$ for incoming waves. Similar considerations can be made for other coordinate systems.
From this we obtain the electric field from Equation (22) as,
We then use one of the several scattering modeling techniques described in Analytic Scattering Models to obtain an induced scattered field Ĕu(xm;pq) at receiver position xm and scatterer parameter vector pq. The scattered field itself may require a series solution for each transmitted Eu(x) and we may see a multi-parameter basis function of the form, Ĕpu(xm;pq), whose indices (p,u) can either be index flattened to Ĕl(xm; pq), or further sub-selected on the basis of the largest singular values using HOSVD techniques described in Basis Functions and HOSVD.
Finally the actual voltage and the scalar basis function are obtained via the vector effective length for antenna m,
blm(pq)=bm·Ĕl(xm;pq).
Once the complex gains for the model are determined from (37), we have a series solution model for the received voltage that can be used in the parameter estimation and tracking models of Geolocation and Tracking.
Calibration Model Modifications
There are a number of modifications to the basis function model we have proposed so far that may come into play. The first occurs when we only have a sparse number of calibration points to work with. This can happen when calibration is very expensive or cannot be automated.
We can relax the requirement of learning a new set of gains for every frequency index k, by creating a structured version of our calibration gains, namely:
αmkl=βmlγk.
This asserts that the basis functions, via their dependence on the wavenumber k, suffice to interpolate over frequency, apart from some frequency dependent gains on the transmit to receive antenna path. The structured gains can be found by holding the βml terms constant and minimizing over γk, followed by holding the γk constant and optimizing over βml. This is repeated until convergence.
Another modification occurs if we use the pre-computed grid points suggested in (25). This would use more of a discrete point model for the reflector surface points. Those values, rather than the basis function coefficients, become the output of the calibration process, though modeling the transmit antenna response may well use interpolating basis functions as described earlier.
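The alternating procedure for the structured gains can be sketched, for a single antenna, as shown here. This is a minimal illustration under stated assumptions: the per-frequency basis matrices `B[k]`, the model V[k] ≈ γk·(B[k] β), the initialization γk = 1 and the fixed iteration count are all choices made for the example, and the function name `fit_structured_gains` is hypothetical.

```python
import numpy as np

def fit_structured_gains(B, V, n_iter=50):
    """Alternating least squares for V[k] ~ gamma[k] * (B[k] @ beta).

    B : (K, Q, L) complex basis values per frequency k and calibration point q
    V : (K, Q) complex observed voltages
    Returns beta (L,), gamma (K,), and the residual after each iteration.
    """
    K, Q, L = B.shape
    gamma = np.ones(K, dtype=complex)     # assumed starting point
    residuals = []
    for _ in range(n_iter):
        # beta step: gamma held fixed, all frequencies stacked into one problem
        stacked_B = (gamma[:, None, None] * B).reshape(K * Q, L)
        beta = np.linalg.lstsq(stacked_B, V.reshape(K * Q), rcond=None)[0]
        # gamma step: beta held fixed, one scalar least squares fit per frequency
        S = B @ beta                      # (K, Q) predicted voltages for gamma = 1
        gamma = np.sum(S.conj() * V, axis=1) / np.sum(np.abs(S) ** 2, axis=1)
        residuals.append(float(np.sum(np.abs(V - gamma[:, None] * S) ** 2)))
    return beta, gamma, residuals

# Synthetic, noise-free example (illustrative values only).
rng = np.random.default_rng(2)
K, Q, L = 5, 30, 3
B = rng.standard_normal((K, Q, L)) + 1j * rng.standard_normal((K, Q, L))
beta_true = rng.standard_normal(L) + 1j * rng.standard_normal(L)
gamma_true = 1.0 + 0.2 * (rng.standard_normal(K) + 1j * rng.standard_normal(K))
V = gamma_true[:, None] * (B @ beta_true)

beta, gamma, residuals = fit_structured_gains(B, V)
print("first and last residual:", residuals[0], residuals[-1])
```

Each half-step solves an exact linear least squares problem, so the residual cannot increase from one iteration to the next. Note that β and γ are only determined up to a reciprocal complex scale factor, so only the products βl γk are meaningful.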
Finally we may find that multiple nearby receive antennas mutually couple. If we model these antennas as re-radiating scatterers, then each antenna has a fixed contribution to another nearby antenna that is a complex multiple of its receive voltage. This can be modeled as an additional mixing matrix added to the final received voltage, i.e.,
The mixing matrix can also be discovered as part of the calibration process. Again we use iterative alternating directions least squares optimization for each linear component as a technique for determining these coefficients from a sufficiently dense calibration table vmkq(pq).
In one or more of the embodiments of the present invention there is provided a method of parameter estimation in a multi-channel signal environment system where a plurality of receive antennas receive signals from one or more transmitters that transmit a signal or wave that is reflected from one or more targets, or receive antennas that receive directly from the transmitters, whose received signals are processed over multiple frequencies or channels by a digital receiver. One or more processors may be provided, with processing comprising the steps of: (a) calibrating before the operation of the digital receiver to determine the free parameters of a mathematical model of a channel either as the channel model parameters or in the form of tabulated data; (b) calibrating during the operation of the digital receiver to determine the channel model; (c) comparing received antenna array voltages to an analytic or table driven channel model from a calibrated template without relying only on information lossy intermediate steps such as delay time of arrival or angle of arrival measurements; (d) creating a statistical likelihood function modeling receiver noise to determine model channel parameters or prior channel uncertainty; (e) saving target reflector/emitter parameters to be reused for dynamic tracking; and (f) using Bayesian particle filtering or Maximum Likelihood Methods to update mixture models for the unknown parameters.
The method may use Bayesian detection or other Statistical Signal Processing Techniques for the estimation of channel parameters such as location parameters, shape parameters and reflector electromagnetic parameters. Various aspects of the invention could be defined wherein:
the transmitted signal is a frequency stepped radar;
a statistical likelihood function is used to determine target type or target position;
a channel template for each target type is used to further classify the target type or estimate the target position;
static direct path clutter is removed via a cancellation algorithm;
the parameter estimation blindly calculates an unknown gain and phase constant βq over each channel observation q, to absorb bulk phase and gain changes between observations;
the statistical likelihood function takes the form of a Cauchy quotient and is therefore a member of the numerical range of a parameterized matrix as exhibited in Equation (14);
the target classification is determined from the likelihood function using the posterior probability of detection via Bayes theorem;
the target classification is determined from the likelihood function after blindly estimating any position dependent parameters during the classification process itself;
the target classification is determined from the likelihood function after blindly estimating phase terms such as time delays of arrival;
the target classification is determined from the likelihood function after blindly estimating phase delays using fast Fourier transform (FFT) processing;
the target classification or position parameters are determined from the likelihood function that assumed the scale parameters βq=β, are independent of the collection index and can therefore be solved using the generalized eigenvalue problem in Equation (8);
the DSP processing hardware is configured to enable parallel likelihood function calculations wherein the same instruction is shared across all processing resources;
the DSP hardware has built in support for parallel reduction for primitive associative operations like addition, maximization, minimization or multiplication to enable the parallel computation of the likelihood function;
multiple transmit and receive RF chains are packed into a radio frequency (RF) integrated circuit (IC) with clocks controlled by a phase-locked loop;
the signals from each antenna are down converted and digitized by a bank of analog to digital converters, where they are further decimated to obtain a sample of the channel for each antenna and frequency in use;
the wave function can be further interpolated using basis functions that are reflected through objects of predetermined shape or presumed electrical properties;
interpolating uses a likelihood function to compute a fixed set of channel vectors α, modulating the basis functions, that can be used to interpolate the channel at an arbitrary target position;
the method further includes blindly calculating an unknown gain and phase constant βq over each channel observation q, to absorb bulk phase and gain changes between observations;
the method further includes computing separate channel vectors αc for each target type c;
the method further includes jointly computing the phase constants βq and the channel vectors αc by alternating directions optimization; ie iteratively estimating each parameter while holding the other fixed;
the method further includes assuming the inner product of the basis functions is independent of position resulting in an eigenvalue problem (10) that is used to learn the channel vectors during calibration;
the method further includes assuming the scale parameters βq=β, are independent of the collection index, and can therefore be solved using the generalized eigenvalue problem in Equation (8);
the method further calibrates a blind classifier by blindly estimating any position dependent parameters during the classification process itself, without necessarily using known target or array transceiver positions;
the method further calibrates a blind classifier by blindly estimating phase terms such as time delays during the classification process itself, without necessarily using known target or array transceiver positions;
the method further calibrates a blind classifier by blindly estimating phase delays using FFT processing, during the classification process itself, without necessarily using known target or array transceiver positions;
the DSP processing hardware is configured to enable parallel likelihood function calculations, yet can share the same instruction across all processing resources;
the DSP hardware has built in support for parallel reduction for primitive associative operations like addition, maximization, minimization or multiplication to enable the parallel computation of the likelihood function;
multiple transmit and receive RF chains are packed into an RF IC, with clocks controlled by a phase-locked loop;
the signals from each antenna are down-converted and digitized by a bank of analog to digital converters wherein the signals are further decimated to obtain a sample of the channel for each antenna and frequency in use; or
the likelihood function is itself used to further refine the presumed positions of the transmitter and receiver to improve the calibration channel model.
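As a non-limiting illustration of the blind gain and phase estimation and the template-based classification recited above, the following sketch compares an observed array voltage vector against one channel template per target type. It assumes white Gaussian receiver noise, so that maximizing the likelihood reduces to minimizing the least squares residual after solving for the best complex scalar β. It is not a reproduction of Equation (14). The target type names, the template values and the function names are hypothetical.

```python
import numpy as np

def blind_gain_residual(v, a):
    """Fit v ~ beta * a for an unknown complex scalar beta.

    Returns the least squares estimate of beta and the squared residual.
    """
    beta = np.vdot(a, v) / np.vdot(a, a)   # np.vdot conjugates its first argument
    resid = float(np.linalg.norm(v - beta * a) ** 2)
    return beta, resid

def classify(v, templates):
    """Return the template name with the smallest residual, plus all scores."""
    scores = {name: blind_gain_residual(v, a)[1] for name, a in templates.items()}
    return min(scores, key=scores.get), scores

# Hypothetical channel templates for two target types over M antennas.
rng = np.random.default_rng(3)
M = 16
templates = {
    "sphere": rng.standard_normal(M) + 1j * rng.standard_normal(M),
    "plate": rng.standard_normal(M) + 1j * rng.standard_normal(M),
}

# Observation: the "sphere" template with an unknown bulk gain and phase.
beta_true = 0.5 * np.exp(1j * 1.2)
noise = 0.01 * (rng.standard_normal(M) + 1j * rng.standard_normal(M))
v = beta_true * templates["sphere"] + noise

best, scores = classify(v, templates)
beta_hat, _ = blind_gain_residual(v, templates["sphere"])
print("selected target type:", best)
print("estimated gain/phase constant:", beta_hat)
```

The per-template residuals are independent of one another, so they could be evaluated in parallel and combined with a single minimum reduction.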
From the foregoing and as mentioned above, it is observed that numerous variations and modifications may be effected without departing from the spirit and scope of the novel concept of the invention. It is to be understood that no limitation with respect to the embodiments illustrated herein is intended or should be inferred. It is intended to cover, by the appended claims, all such modifications within the scope of the appended claims.
The present invention claims priority to U.S. Provisional Application Ser. No. 62/764,814 filed on Aug. 16, 2018, which is incorporated in its entirety by reference.
Number | Date | Country |
---|---|---|
20200057163 A1 | Feb 2020 | US |

Number | Date | Country |
---|---|---|
62764814 | Aug 2018 | US |