Deciding whether a received signal is a signal of interest

Information

  • Patent Grant
  • 8572010
  • Patent Number
    8,572,010
  • Date Filed
    Tuesday, August 30, 2011
  • Date Issued
    Tuesday, October 29, 2013
Abstract
A method of deciding whether an observed received signal is a particular signal of interest (SOI) includes the steps of: providing a statistical model of the particular signal of interest and a statistical model of the environment when the particular signal of interest is not present, and processing the observed received signal in accordance with a decision rule that uses the provided statistical models to decide whether the observed received signal is the particular signal of interest. The decision rule is:
Description
BACKGROUND OF THE INVENTION

The present invention generally pertains to signal communications and is particularly directed to deciding whether an observed received signal, such as a radio communication signal, is a signal of interest (SOI).


Various different attributes of an observed received signal may be processed to decide whether the observed received signal is a signal of interest.


The resources that may be used for making such a decision pertain to data processing and computing, receiver gain, the frequency to which the receiver is tuned and the timing of when the receiver is being tuned to a particular frequency. In some circumstances the time constraints for making such a decision are such that there is a need to optimize the use of such resources while maintaining the probability of accurately making the decision.


When one or more of these resources is limited, it may not be practical to continuously observe the full spectrum of interest; and it may not be practical to continuously process all of the various different attributes of the observed received signal that are typically processed in order to decide whether the observed received signal is a signal of interest. When a network of sensors (such as antenna elements) for observing attributes of the received signal is used, it may not be possible to communicate all of what is observed from each sensor to a computing center, because of the bandwidth limitations of the coupling with the sensors.


SUMMARY OF THE INVENTION

The present invention provides a method of deciding whether an observed received signal is a particular signal of interest (SOI), comprising the steps of:


(a) providing a statistical model of the particular signal of interest and a statistical model of the environment when the particular signal of interest is not present; and


(b) with a computer, processing the observed received signal in accordance with a decision rule that uses the provided statistical models to decide whether the observed received signal is the particular signal of interest;


wherein the decision rule is:









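$$
\text{decision} =
\begin{cases}
\text{SOI} & \text{if } \mathrm{LLR}(y) > +\lambda \\
{\sim}\text{SOI} & \text{if } \mathrm{LLR}(y) < -\lambda \\
\text{unsure} & \text{otherwise}
\end{cases}
\quad\text{where}\quad
\mathrm{LLR}(y) = \log \frac{P(a_1(y), a_2(y), \ldots \mid \text{SOI})}{P(a_1(y), a_2(y), \ldots \mid {\sim}\text{SOI})},
\qquad \text{[Eq. 1]}
$$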








where LLR is log likelihood ratio, P denotes a probability, a1(y) and a2(y) are functions of first and second attributes of the observed signal y in a sequential order that said attribute functions are presented for computation of LLR(y), and +λ and −λ are real numbers,


wherein the statistical model of the particular signal of interest includes functions of attributes that individually have a probability distribution function f1 when the observed received signal is the particular signal of interest such that

KL(f1,f0)>0,  [Eq. 2]

where f0 is a probability distribution function (PDF) of the respective individual attribute when the observed received signal is not a signal of interest.


In one aspect of the method of the present invention, at least one of the attribute functions is expressed as:

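$$
\begin{aligned}
a(y_C, y_R)_{f_c} &= f_c \\
a(y_C, y_R)_{\Delta f} &= \Delta f \\
a(y_C, y_R)_{t_c} &= t_c \\
a(y_C, y_R)_{\Delta t} &= \Delta t,
\end{aligned}
$$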

where yC is a sample of the observed signal received by a commutated element of a commutated antenna array, yR is a sample of the observed signal received by a reference element of the antenna array, f is frequency, and t is time.


In another aspect of the method of the present invention, at least one of the attribute functions is expressed as one of the following:












$$
a(y_C, y_R)_{S/N} = \frac{\left|\langle y_C, y_R\rangle\right|^2}{\|y_C\|^2\,\|y_R\|^2 - \left|\langle y_C, y_R\rangle\right|^2},
\qquad \text{(i)}
$$








where yC is a sample of the observed signal received by a commutated element of a commutated antenna array and yR is a sample of the observed signal received by a reference element of the antenna array;

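$$
a(y_C, y_R)_{AOA} = \arg\max_{\Phi}\; \mathrm{Color}^{*}\, U \left(U^{*}\,\mathrm{diag}(\mathrm{Energy})\,U\right)^{-1} U^{*}\,\mathrm{Color}
$$

where

$$
\begin{aligned}
U_n &= u_{k_n}(\Phi) \\
\mathrm{Color}_n &= \langle y_C, y_R\rangle_{R_n} \\
\mathrm{Energy}_n &= \langle y_R, y_R\rangle_{R_n}
\end{aligned}
\qquad \text{(ii)}
$$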

yC is a sample of the observed signal received by a commutated element of a commutated antenna array and yR is a sample of the observed signal received by a reference element of the antenna array;












$$
a(y, k)_{\text{Sinusoids}} = \frac{\|Y(k)\,w\|^2}{\|y(k) - Y(k)\,w\|^2}
\quad\text{where}\quad
w = \left(Y(k)^{*}\,Y(k)\right)^{-1} Y(k)^{*}\,y(k)
\qquad \text{(iii)}
$$








and k is the k in k-means;

$$
a(y, k)_{\text{PSK } k\text{-ary 1}} = a(y^{k}, k)_{\text{Sinusoids}},
\qquad \text{(iv)}
$$

and k is the k in k-means;












$$
a(y, k)_{\text{PSK } k\text{-ary 2}} = \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)}
\quad\text{where}\quad
\{C_i\} = \text{k-means}(\Delta y)
\qquad \text{(v)}
$$








and k is the k in k-means;












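$$
a(y, k)_{\text{PSK } k\text{-ary 3}} = \max_{\tilde{T} \in \left[\frac{1}{2}\Delta f,\; 2\Delta f\right]} \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)}
\quad\text{where}\quad
\{C_i\} = \text{k-means}\Big(\bigcup_i s_i\Big),
\qquad \text{(vi)}
$$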








and k is the k in k-means;












$$
a(y, k)_{\text{QAM } k\text{-ary 1}} = \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)}
\quad\text{where}\quad
\{C_i\} = \text{k-means}(y^{2}),
\qquad \text{(vii)}
$$








and k is the k in k-means;












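$$
a(y, k)_{\text{QAM } k\text{-ary 2}} = \max_{\tilde{T} \in \left[\frac{1}{2}\Delta f,\; 2\Delta f\right]} \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)}
\quad\text{where}\quad
\{C_i\} = \text{k-means}\Big(\bigcup_i s_i\Big),
\qquad \text{(viii)}
$$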







k is the k in k-means, f is the frequency of the signal, and U denotes a union operator;












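$$
a(y)_{AM} = \frac{\left|\langle V(\tilde{\omega}_c, y, \omega),\, V(\tilde{\omega}_c, y, -\omega)\rangle\right|^2}{\|V(\tilde{\omega}_c, y, \omega)\|^2\,\|V(\tilde{\omega}_c, y, -\omega)\|^2 - \left|\langle V(\tilde{\omega}_c, y, \omega),\, V(\tilde{\omega}_c, y, -\omega)\rangle\right|^2}
$$

$$
\tilde{\omega}_c = \arg\max_{\omega_c} \left|\langle V(\omega_c, y, \omega),\, V(\omega_c, y, -\omega)\rangle\right|^2
\qquad \text{(ix)}
$$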








and ω is frequency.


The following is an explanation of the meaning of the term “function of an attribute of the received signal”.


Let x(M) denote a transmitted signal, where M is the transmitted message. Let y=C(x(M)) denote the associated received signal, where C denotes the effect of the channel. Let x(M)∈H⊂ℂ^N for some set H, where H denotes some “class” of signals. For example, H may be the set of all 4-ary PSK or AM modulated signals, in which case H is a proper subset of ℂ^N. Let y∈ℂ^N. x(M) and y are random variables. Let a denote an attribute function defined on ℂ^N, and a(y) an attribute of y. a(y) is used to decide whether or not x(M)∈H. Let f1 denote the PDF of a(y) when x(M)∈H, and f0 denote the PDF of a(y) when x(M)∉H. For an attribute to be useful it is necessary that f1 and f0 are dissimilar. A very important measure of dissimilarity is the Kullback-Leibler divergence, defined as










$$
KL(f_1, f_0) = \int f_1(u)\, \log \frac{f_1(u)}{f_0(u)}\, du
\qquad \text{[Eq. 3]}
$$








Notionally, a(y) is “useful” if

KL(f1,f0)>0.  [Eq. 4]

(KL(f1,f0) is always non-negative.)
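
As an illustration of Equations 3 and 4, the following minimal Python sketch approximates KL(f1,f0) by numerical integration. The two Normal PDFs, the integration limits, and all function names are assumptions chosen for illustration and are not part of the described method.

```python
import math

def gaussian_pdf(u, mean, std):
    """PDF of a Normal(mean, std**2) distribution evaluated at u."""
    z = (u - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def kl_divergence(f1, f0, lo, hi, steps=20000):
    """Approximate the integral of f1(u) * log(f1(u) / f0(u)) du on [lo, hi]
    with the midpoint rule."""
    du = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        u = lo + (i + 0.5) * du
        p1, p0 = f1(u), f0(u)
        if p1 > 0.0 and p0 > 0.0:
            total += p1 * math.log(p1 / p0) * du
    return total

# Hypothetical attribute PDFs: f1 when the SOI is present, f0 otherwise.
f1 = lambda u: gaussian_pdf(u, 2.0, 1.0)
f0 = lambda u: gaussian_pdf(u, 0.0, 1.0)

kl_useful = kl_divergence(f1, f0, -10.0, 12.0)   # PDFs differ
kl_useless = kl_divergence(f0, f0, -10.0, 12.0)  # identical PDFs
```

For two unit-variance Normal PDFs whose means differ by 2, the closed-form divergence is 2, so such an attribute would be notionally useful; for identical PDFs the divergence is zero.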


The present invention additionally provides non-transitory computer readable storage media that include computer executable program instructions for causing one or more computers to perform and/or enable one or more of the steps of the above-described methods. Each storage medium may include a plurality of non-transitory media.


Additional features of the present invention are described with reference to the detailed description.





BRIEF DESCRIPTION OF THE DRAWING


FIG. 1 is a block diagram of an exemplary system in which the method of the present invention is performed.



FIG. 2 is a diagram generally showing an exemplary embodiment of the method of the present invention.



FIG. 3 is a diagram showing additional processing steps in an exemplary embodiment of the method shown in FIG. 2.



FIG. 4 is a diagram showing another additional processing step in an exemplary embodiment of the method shown in FIG. 2.



FIG. 5 is a diagram showing still other additional processing steps in an exemplary embodiment of the method shown in FIG. 2.



FIG. 6 is a diagram showing yet another additional processing step in an exemplary embodiment of the method shown in FIG. 2.



FIG. 7 is a diagram showing a further additional processing step in an exemplary embodiment of the method shown in FIG. 2.



FIG. 8 shows the rectangular decomposition of a received signal.



FIGS. 9A and 9B illustrate probability distribution functions f1 and f0 for an attribute, ordinary energy.



FIG. 10 is an exemplary histogram of k-means clusters for an attribute, PSK.



FIG. 11 is a diagram of a system for implementing operation of the method shown in FIG. 2.





DETAILED DESCRIPTION

Referring to FIG. 1, an exemplary system in which the method of the present invention is performed includes an array of commutated antenna elements 10, a reference antenna element 11, a receiver 12, a computer 14 and a computer display monitor 16. In some embodiments the receiver 12 and the computer 14 are combined in a single assembly. The computer 14 contains at least one digital signal processor and non-transitory computer readable storage medium/media that include computer executable program instructions for causing the computer to perform and/or enable the various processing steps that are described herein. These instructions are stored in the computer readable storage medium/media of the computer when the computer is manufactured and/or upon being downloaded via the Internet or from a portable non-transitory computer readable storage medium/media containing such instructions.


Samples 20 of an observed signal received by the array of commutated antenna elements 10 are obtained by the receiver 12 and samples 22 of the observed received signal are obtained by the receiver 12 from the reference antenna element 11. Preferably, the samples 20, 22 of the observed received signal are observed in zero-mean additive white Gaussian noise (AWGN).


Individual samples 20 are obtained from different elements of the array of commutated antenna elements 10. The samples 20, 22 are obtained at different times, at different frequencies and/or by using different CDMA signal access codes. The usefulness of attributes of the received signal increases with the number of samples because the probability distribution functions (PDFs) become more localized, with one caveat: from an information-theoretic point of view, the Sampling Theorem must be respected. Additional samples that are not statistically independent are not constructive.


The samples 20, 22 of the observed received signal are processed by the computer 14 to decide whether the observed received signal is a particular signal of interest.


Referring also to FIG. 2, a statistical model 24 of the particular signal of interest and a statistical model 25 of the environment when the particular signal of interest is not present are provided. The statistical model of the particular signal of interest includes functions of attributes that individually have a probability distribution function f1 when the observed received signal is the particular signal of interest such that

KL(f1,f0)>0,  [Eq. 2]

where f0 is a probability distribution function of the respective individual attribute when the observed received signal is not a signal of interest.


The computer 14 processes the observed received signal samples 20, 22 in accordance with a decision rule that uses the provided statistical models 24, 25 to decide whether the observed received signal is the particular signal of interest, as shown at 26. The decision rule is:









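$$
\text{decision} =
\begin{cases}
\text{SOI} & \text{if } \mathrm{LLR}(y) > +\lambda \\
{\sim}\text{SOI} & \text{if } \mathrm{LLR}(y) < -\lambda \\
\text{unsure} & \text{otherwise}
\end{cases}
\quad\text{where}\quad
\mathrm{LLR}(y) = \log \frac{P(a_1(y), a_2(y), \ldots \mid \text{SOI})}{P(a_1(y), a_2(y), \ldots \mid {\sim}\text{SOI})},
\qquad \text{[Eq. 1]}
$$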








where LLR is log likelihood ratio, P denotes a probability, a1(y) and a2(y) are functions of first and second attributes of the observed signal y in a sequential order that said attribute functions are presented for computation of LLR(y), and +λ and −λ are real numbers.


P(a1(y),a2(y), . . . |SOI) is a model for SOI; and P(a1(y),a2(y), . . . |˜SOI) is a model for ˜SOI. For example, when y is a SOI, one would expect LLR(y) to be very large and positive, and specifically, LLR(y)>>λ. Opposite expectations apply when y is ˜SOI.


The decision rule defined by Equation 1 is ternary. The possible decisions are SOI, ˜SOI, and “unsure”.
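
The ternary rule of Equation 1 can be sketched in a few lines of Python. The function name and the optional separate lower threshold are illustrative assumptions; the lower threshold is included only because the description later notes that +λ and −λ need not have the same magnitude.

```python
def decide(llr, lam_pos, lam_neg=None):
    """Ternary decision on a log likelihood ratio.

    lam_pos plays the role of +lambda. lam_neg plays the role of -lambda and
    defaults to -lam_pos (the symmetric case of Equation 1).
    """
    if lam_neg is None:
        lam_neg = -lam_pos
    if llr > lam_pos:
        return "SOI"
    if llr < lam_neg:
        return "~SOI"
    return "unsure"
```

With λ = 2, for example, `decide(3.0, 2.0)` yields "SOI", `decide(-3.0, 2.0)` yields "~SOI", and any value in the closed interval [−2, +2] yields "unsure".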


Referring to FIG. 3, when it is determined, as shown at 27, that processing by the computer 14 of a particular observed received signal in accordance with a decision rule that includes the statistical model for the particular SOI has resulted in a false decision either that the observed received signal is a signal of interest, as shown at 28, or that the observed signal is not a signal of interest, as shown at 30, the operator causes the computer 14 to construct a revised statistical model for the particular observed signal, as shown at 32, and/or to revise at least one of the parameters +λ and −λ of the decision rule that is used in the decision processing step for the particular observed signal, as shown at 34. A determination that such processing has resulted in a false decision can be made by a system operator or by some agent external to the system.


Preferably, the decision processing step 26 is incremental in that LLR(y) is computed incrementally for cumulatively ordered sampled attributes a1(y),a2(y), . . . until it is decided whether or not the observed received signal is a signal of interest, as shown in FIG. 4.


The computer 14 first computes









$$
\mathrm{LLR}(y)_1 = \log \frac{P(a_1(y) \mid \text{SOI})}{P(a_1(y) \mid {\sim}\text{SOI})},
$$





as shown at 36. When LLR(y)1>+λ or LLR(y)1<−λ, it is decided that the received signal is SOI or ˜SOI, respectively, and the decision processing step is complete, as shown at 38. However, when −λ≤LLR(y)1≤+λ, an “unsure” decision has been made, and the decision processing step continues by next computing









$$
\mathrm{LLR}(y)_2 = \log \frac{P(a_1(y), a_2(y) \mid \text{SOI})}{P(a_1(y), a_2(y) \mid {\sim}\text{SOI})},
$$





as shown at 40.


When LLR(y)n is either >+λ or <−λ, a decision as to whether or not the received signal is a SOI has been made and the decision processing step is complete.


It is possible that after some finite number of computations of LLR(y)n, the decision may remain “unsure”. Nonetheless, the decision processing step will be completed with a decision of SOI or ˜SOI provided that the attribute functions are defined reasonably and are presented for computation of LLR(y) in an order that minimizes the average time for making a decision as to whether or not the observed received signal is a signal of interest. When the received signal y is a SOI, LLR(y)n usually increases “almost” monotonically.
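
A minimal Python sketch of the incremental computation follows. It assumes that the attributes are statistically independent (an assumption the description discusses elsewhere), so that each attribute contributes its own log-ratio to LLR(y)n, and it models every attribute PDF as a Normal distribution purely for illustration. The function names, model parameters, and sample values are hypothetical.

```python
import math

def gaussian_logpdf(x, mean, std):
    """Log of the Normal(mean, std**2) PDF at x."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))

def sequential_decision(values, soi_models, env_models, lam):
    """Accumulate LLR(y)_n one attribute at a time and stop at the first
    decisive value. Each model is a hypothetical (mean, std) pair."""
    llr, used = 0.0, 0
    for value, (m1, s1), (m0, s0) in zip(values, soi_models, env_models):
        # Independence assumption: each attribute adds its own log-ratio.
        llr += gaussian_logpdf(value, m1, s1) - gaussian_logpdf(value, m0, s0)
        used += 1
        if llr > lam:
            return "SOI", used, llr
        if llr < -lam:
            return "~SOI", used, llr
    return "unsure", used, llr

soi = [(1.0, 1.0)] * 3   # attribute PDFs when the SOI is present
env = [(0.0, 1.0)] * 3   # attribute PDFs when it is not
result = sequential_decision([1.2, 0.9, 1.5], soi, env, lam=2.0)
```

In this example the cumulative values are approximately 0.7, 1.1 and 2.1, so the decision stays “unsure” for two attributes and becomes SOI once the third is included.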


Preferably, the attribute functions a1(y),a2(y), . . . are presented for computation of LLR(y) in an order for incremental computation that minimizes the average time for making a decision as to whether or not the observed received signal is a signal of interest. This criterion derives from a classical construction in statistics, the Bayesian Hypothesis Test, wherein it is desired to make a decision among a candidate set of hypotheses. Here the hypotheses are SOI and ˜SOI. (The ternary and incremental decision is an extension.) For the Bayesian Hypothesis Test a cost function is specified. There is a cost for deciding SOI when the correct hypothesis is ˜SOI, and there is a cost for deciding ˜SOI when the correct hypothesis is SOI. The Bayesian Hypothesis Test construction minimizes the expected cost. It is also necessary to specify the a priori probabilities of each hypothesis. The Bayesian Hypothesis Test is described at http://en.wikipedia.org/wiki/Bayesian_inference.


For example, one may select the attribute functions a(y)Energy and a(yC,yR)fc and detect a target signal by observing energy over a threshold within the frequency band of a SOI.


Initial Identification of a Target Signal as a SOI


Some approaches for identifying a target signal as a signal of interest (SOI), for use in establishing the criteria that are used for constructing the statistical models, include the following:


First, after making several passes of the environment while assuming that no SOI is present, a statistical model P(a1(y),a2(y), . . . |˜SOI) of the environment is constructed and stored in a SOI database. When an unusual signal y is detected the unusual signal is processed in relation to the statistical model of the environment to determine whether the unusual signal y is an outlier.


Note that if one were to use only a(y)Energy as an attribute function, ordinary energy detection may be used with a constant false-alarm rate. Here λ has the slightly different interpretation of testing whether the PDF of the attribute is sufficiently small to classify the unusual signal y as an outlier. Note also that it is not sufficient to use a cumulative distribution function (CDF) of the attribute, as such a technique would fail for a PDF of an attribute that is concentrated at multiple locations.
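
The point about the CDF can be illustrated with a small Python sketch. The environment model here is a hypothetical mixture of two Normal PDFs concentrated near 0 and near 10; the thresholds and function names are likewise assumptions made only for this example.

```python
import math

def normal_pdf(u, mean, std):
    z = (u - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def normal_cdf(u, mean, std):
    return 0.5 * (1.0 + math.erf((u - mean) / (std * math.sqrt(2.0))))

# Hypothetical environment model: attribute concentrated at two locations.
MODES = [(0.0, 1.0), (10.0, 1.0)]

def env_pdf(u):
    return sum(normal_pdf(u, m, s) for m, s in MODES) / len(MODES)

def env_cdf(u):
    return sum(normal_cdf(u, m, s) for m, s in MODES) / len(MODES)

def outlier_by_pdf(u, density_floor=1e-3):
    """Outlier when the environment PDF at u is sufficiently small."""
    return env_pdf(u) < density_floor

def outlier_by_cdf(u, tail=0.01):
    """Outlier only when u falls in an extreme tail of the CDF."""
    c = env_cdf(u)
    return c < tail or c > 1.0 - tail
```

An attribute value of 5 lies between the two concentrations: the PDF test flags it as an outlier, whereas the CDF there is about 0.5, so a tail test on the CDF does not.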


Secondly, a system operator may specify characteristics of a SOI. For example, the operator may specify a(y)Energy>τ and a(yC,yR)AOA∈(−15°,+15°). A reasonable approach is to regard this as a statement about a PDF. For example, assume that a(y)Energy is uniformly distributed in some bounded range greater than τ and a(yC,yR)AOA is uniformly distributed in the interval (−15°,+15°). The detection process allows these PDFs to be refined once the operator affirms a labeled target as a SOI.


Once the operator affirms a target signal as a SOI, a statistical model P(ai(y)|SOI) of the target signal is entered into the SOI database as a particular signal of interest. A SOI database entry for a particular signal of interest may be revised upon subsequent identifications of the particular target signal. This calculation is more expensive and preferably is performed in the background. After that, the earlier discussion applies.


Estimation of +λ


An initial value of +λ is estimated by initially labeling target signals in a graphical user interface (GUI) as a SOI in accordance with the estimated value of +λ. The initial value of +λ may be estimated prior to deployment of the system in the field. A target signal that is labeled as a SOI is entered into a SOI database. When it is subsequently determined by a system operator that a particular target signal which has been labeled as a SOI for the estimated +λ is ˜SOI, pursuant to feedback from the operator, the particular target signal is removed from the SOI database, and the value of +λ is increased; whereby in the future, it may be more difficult to decide if a received signal is a SOI.


The operator also may provide feedback for the other error type by indicating an instance when a particular target signal is an SOI after the particular target signal had been implicitly labeled as ˜SOI by not being labeled as SOI in accordance with the estimated value of +λ.


The estimation of +λ is made in relation to a lower value region of +λ in which it is desired that the decision be “unsure”. If one wants to make it more difficult for the decision to be SOI, one should change the value of +λ to increase the region in which the decision is “unsure”. One may not want to make a corresponding change to the region in which the decision is ˜SOI; whereupon +λ and −λ would not be positive and negative values of the same real number.
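
The feedback loop described above might be sketched as follows. The class name, the method names, and in particular the fixed increment applied to +λ are hypothetical; the description states only that +λ is increased and that the mislabeled target is removed from the SOI database.

```python
class SoiLabeler:
    """Labels target signals and adjusts +lambda on operator feedback."""

    def __init__(self, lam_pos, lam_neg, step=0.5):
        self.lam_pos = lam_pos    # +lambda
        self.lam_neg = lam_neg    # -lambda, held as an independent value
        self.step = step          # hypothetical increment, not from the text
        self.soi_database = set()

    def label(self, signal_id, llr):
        if llr > self.lam_pos:
            self.soi_database.add(signal_id)
            return "SOI"
        return "~SOI" if llr < self.lam_neg else "unsure"

    def operator_rejects(self, signal_id):
        """Operator reports that a signal labeled SOI is actually ~SOI."""
        self.soi_database.discard(signal_id)
        self.lam_pos += self.step  # widens the "unsure" region from above

labeler = SoiLabeler(lam_pos=2.0, lam_neg=-2.0)
first = labeler.label("sig-1", 2.2)
labeler.operator_rejects("sig-1")
second = labeler.label("sig-2", 2.2)
```

After the rejection, the same LLR value of 2.2 no longer exceeds +λ, while −λ is left unchanged, so the two thresholds are no longer positive and negative values of the same number.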


For the Bayesian Hypothesis Test the optimal +λ depends on the cost function and the a priori probabilities of the hypotheses.


Determining that Attributes are Useful Attributes


Whenever a system operator affirms that a target signal is an SOI, there is an opportunity to determine which attributes of the target signal are useful attributes by estimating the probability distribution function (PDF) of each of the individual attributes. Such estimates are made by using kernel density estimation, which is described at http://en.wikipedia.org/wiki/Kernel_density_estimation. For kernel density estimation, the PDF is approximated as a linear combination of translates of a kernel function. The kernel function integrates to 1. For a linear combination of N kernel functions, dividing by N ensures that the linear combination integrates to 1, as required for a PDF. The bandwidth parameter of the kernel function stretches the kernel function along its domain. A smaller bandwidth concentrates the kernel function in a smaller region of its domain. Hence a smaller bandwidth produces a linear combination that varies more rapidly. Conversely, a larger bandwidth produces a linear combination that varies more smoothly.


The preferred kernel function is the PDF of a Normal distribution. This is probably the most common function used for kernel density estimation. There is no particular significance to the Normal distribution other than that, as a PDF, this kernel integrates to 1, as required.


The bandwidth parameter is determined by a cross-validation technique. For a modest number of samples S, the samples are partitioned as S=S1∪S2. For a given bandwidth, one estimates the distribution by using S1 and then assesses the efficacy of the asserted bandwidth by computing the likelihood of S2.
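
The kernel density estimate and the cross-validated bandwidth choice can be sketched in Python as follows. This is an illustrative sketch rather than the implementation used in the described system: the half-and-half partition, the candidate bandwidths, the synthetic samples, and all names are assumptions.

```python
import math
import random

def kde_pdf(u, samples, bandwidth):
    """Average of Normal kernels centered on the samples, evaluated at u."""
    norm = len(samples) * bandwidth * math.sqrt(2.0 * math.pi)
    return sum(math.exp(-0.5 * ((u - s) / bandwidth) ** 2) for s in samples) / norm

def heldout_log_likelihood(s1, s2, bandwidth):
    """Log-likelihood of S2 under the density estimated from S1."""
    # The tiny floor avoids log(0) when a held-out sample is far from S1.
    return sum(math.log(kde_pdf(u, s1, bandwidth) + 1e-300) for u in s2)

def select_bandwidth(samples, candidates, seed=0):
    """Partition S into S1 and S2, then keep the best-scoring bandwidth."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    half = len(shuffled) // 2
    s1, s2 = shuffled[:half], shuffled[half:]
    return max(candidates, key=lambda h: heldout_log_likelihood(s1, s2, h))

# Synthetic attribute samples standing in for operator-affirmed observations.
rng = random.Random(1)
attribute_samples = [rng.gauss(0.0, 1.0) for _ in range(200)]
best = select_bandwidth(attribute_samples, [0.001, 0.5, 100.0])
```

A bandwidth that is far too small assigns almost no density to held-out samples, and one that is far too large spreads the density too thinly, so the held-out likelihood favors the intermediate candidate here.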


The statistical model P(a1(y),a2(y), . . . |SOI) is a joint distribution. To construct a reasonable estimate of a distribution, there must be enough samples. For a single attribute distributed along a line, this is simple enough. But even for a joint distribution with two attributes, it may be difficult to get enough samples in a plane. This is the so-called curse of dimensionality, and it only gets worse with many attributes. The decision rule therefore uses a product of the PDFs of the individual attributes. This may be regarded as a joint PDF where it is assumed that the attributes a1(y),a2(y), . . . are statistically independent. This has been found to be a generally satisfactory assumption, albeit not exactly correct.


Criteria for Screening Attributes


When screening attributes that may be considered for representation in the statistical model, an attribute m is preferred over an attribute n for construction of the statistical model of a particular signal of interest when C(n){P(m)−1}>C(m){P(n)−1}. C(i) is the time cost of computing attribute i and P(i) is the probability of satisfying the associated criteria.
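
The screening inequality can be evaluated directly, as in the following Python sketch. The function name and the example costs and probabilities are hypothetical.

```python
def prefer(m, n):
    """Return True when attribute m is preferred over attribute n.

    Each argument is a (time_cost, probability) pair, i.e. (C(i), P(i)).
    Implements C(n) * (P(m) - 1) > C(m) * (P(n) - 1).
    """
    cost_m, prob_m = m
    cost_n, prob_n = n
    return cost_n * (prob_m - 1.0) > cost_m * (prob_n - 1.0)

attr_m = (1.0, 0.9)   # hypothetical: cost 1, probability 0.9
attr_n = (1.0, 0.5)   # hypothetical: cost 1, probability 0.5
m_over_n = prefer(attr_m, attr_n)
n_over_m = prefer(attr_n, attr_m)
```

For positive time costs the inequality is equivalent to (1 − P(m))/C(m) < (1 − P(n))/C(n), so with equal costs the attribute with the higher probability of satisfying its criteria is the one preferred.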


Presentation of Attribute Functions in an Optimal Order for Computation of LLR(y)


As discussed above, the decision processing step is incremental in that LLR(y) is computed incrementally for cumulatively ordered sampled attribute functions a1(y), a2(y), . . . until it is decided whether or not the observed received signal is a signal of interest; and it is preferable that the attribute functions are presented for computation of LLR(y) in an order that minimizes the average time for making a decision as to whether or not the observed received signal is a signal of interest.


In deciding whether to present a particular attribute function a1(y) for computation before another attribute function a2(y), one should determine whether a1(y) is likely to increase LLR(y)1 more rapidly than a2(y), and should also weigh the relative time costs of observing and computing a1(y) and a2(y). The expected increase in LLR(y)1 associated with presenting a1(y) is the quantity KL(P(a1|SOI), P(a1|˜SOI)). With Ci denoting the time cost of observing and computing ai(y), the attribute function to present next is selected in accordance with:









$$
i = \arg\max_i \frac{KL\big(P(a_i \mid \text{SOI}),\; P(a_i \mid {\sim}\text{SOI})\big)}{C_i}
\qquad \text{[Eq. 5]}
$$







An attribute function a(y) that is based upon observations of a particular attribute over different time intervals may be treated in the statistical model as two different attribute functions. For example, an AOA estimate computed over 10 ms takes less observational computation time than the same AOA estimate computed over 20 ms, but may be less useful because it may have a higher variance.
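
A simple way to apply Equation 5 is to rank the candidate attribute functions by their ratio of divergence to time cost, as in the Python sketch below. Sorting once by this ratio is a simplification that assumes the ratios do not change as attributes are consumed; the attribute names, divergences, and costs are invented for illustration, with the 10 ms and 20 ms AOA estimates treated as two distinct attribute functions.

```python
def order_attributes(attributes):
    """Sort (name, kl, time_cost) triples by KL per unit time, best first."""
    ranked = sorted(attributes, key=lambda a: a[1] / a[2], reverse=True)
    return [name for name, _kl, _cost in ranked]

# Hypothetical divergences (nats) and time costs (ms).
candidates = [
    ("aoa_20ms", 2.4, 20.0),   # lower variance, longer observation
    ("psk",      3.0, 50.0),
    ("energy",   0.8,  1.0),
    ("aoa_10ms", 1.5, 10.0),   # higher variance, shorter observation
]
presentation_order = order_attributes(candidates)
```

With these numbers the ratios are 0.8 for energy, 0.15 for the 10 ms AOA estimate, 0.12 for the 20 ms AOA estimate, and 0.06 for PSK, so the attributes would be presented in that order.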


Another consideration is an assumption of statistical independence. When independence of attributes is assumed the PDF of an attribute is an estimate of the marginal PDF with respect to other attributes. Essentially, it is an average over the other attributes. In particular, it is marginal with respect to S/N. The instantaneous S/N may differ considerably from the average and it is appropriate to revise the estimate of the rate of change of LLR(y) for a particular attribute selection once that attribute has been selected. This is very similar to the exploitation/exploration paradigm in reinforcement learning.


Reinforcement Learning


In some exemplary embodiments, reinforcement learning is used to allocate computer resources for performing the decision processing step 26.


Reinforcement learning is an area of machine learning in computer science, concerned with how an agent ought to take actions in an environment so as to maximize some notion of cumulative reward. Reinforcement learning involves trial-and-error and feedback to arrive at the cumulative reward. Examples of cumulative rewards that are applicable to the present method of determining whether a particular observed signal is a signal of interest are: (1) minimizing the time cost of computation, such as by minimizing the average time for making a decision as to whether or not the observed received signal is a signal of interest; (2) minimizing the time cost of observing the attributes; and (3) minimizing the time cost of communicating the observed attributes.


Referring to FIG. 5, during the construction of a statistical model for a particular signal of interest, the attribute functions a1(y),a2(y), . . . are so ordered, as shown at 42, that during the decision processing step 26 the attribute functions a1(y),a2(y), . . . are presented for computation of LLR(y) in an order that minimizes the average time for making a decision as to whether or not the observed received signal is a signal of interest.


The average time to make a decision that completes the decision processing step 26 is estimated by using the formula








$$
\frac{KL(f_1, f_0)}{\text{Time Cost}},
$$





as shown at 44. The estimate of the average time to make a decision is revised in accordance with cumulative estimates, as shown at 46.


In some exemplary embodiments, reinforcement learning is used to allocate spectral tuning resources and/or temporal tuning resources and/or receiver gain resources for the statistical model construction step.


The time cost of communicating observed attributes is relevant to a situation in which both the time of arrival and the angle of arrival of the observed received signal are communicated from a sensor network. Computing the time of arrival requires samples on the order of the bandwidth (i.e. the Nyquist Theorem). This may involve a lot of data; and the time cost of sending this data may be very large.


A useful reinforcement learning method is the so-called N-armed bandit paradigm, which is described at http://en.wikipedia.org/wiki/Multi-armed_bandit.
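
One common strategy for the N-armed bandit problem is epsilon-greedy selection, sketched below in Python. It is offered only as an example of the paradigm, not as the strategy used in the described system. Each arm stands for an attribute function, and the reward is a hypothetical increase in LLR(y) per millisecond; the class name, epsilon value, and rewards are assumptions.

```python
import random

class EpsilonGreedyBandit:
    def __init__(self, n_arms, epsilon=0.1, seed=0):
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.counts = [0] * n_arms     # pulls per arm
        self.values = [0.0] * n_arms   # running mean reward per arm

    def best_arm(self):
        return max(range(len(self.values)), key=lambda a: self.values[a])

    def select_arm(self):
        for arm, count in enumerate(self.counts):
            if count == 0:
                return arm                                   # try each arm once
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.counts))      # explore
        return self.best_arm()                               # exploit

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]

# Hypothetical rewards: LLR gained per millisecond for three attribute functions.
REWARD_PER_MS = [0.1, 0.5, 0.2]
bandit = EpsilonGreedyBandit(n_arms=3)
for _ in range(200):
    arm = bandit.select_arm()
    bandit.update(arm, REWARD_PER_MS[arm])
```

The running means play the role of the cumulative estimates that are revised as decisions are made; in practice the rewards would be noisy, which is why some exploration is retained.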


Distinguishing Different Observed Received Signals of Interest From One Another


Referring to FIG. 6, in an exemplary embodiment, different observed received signals of interest are distinguished from one another by using the method of the present invention. Statistical models that are different for a plurality of different particular signals of interest are separately provided, as shown at 50; and the decision processing step 26 (FIG. 2) is performed, as shown at 52, by separately using the provided statistical models of the different particular signals of interest. The different observed received signals are then distinguished from one another in accordance with the differences in the decisions made when separately using the provided statistical models for the different particular signals of interest, as shown at 54.


Referring to FIG. 7, in an exemplary embodiment, a statistical model is constructed for the union of a particular set of observed signals and other observed signals by using attributes associated with selected criteria for distinguishing the particular set of observed signals from the other observed signals, as shown at 56.


Attribute Functions


The following is a discussion of how some attribute functions may be used in the statistical models in exemplary embodiments of the present invention for different types of attributes. An attribute is a function of an instance of an observed signal.


Energy or S/N


For any rectangle one may compute,











$$
a(y_C, y_R)_{S/N} = \frac{\left|\langle y_C, y_R\rangle\right|^2}{\|y_C\|^2\,\|y_R\|^2 - \left|\langle y_C, y_R\rangle\right|^2}
\qquad \text{[Eq. 6]}
$$








where ⟨·,·⟩ denotes the inner product and yC and yR denote samples from the commutated and reference elements, respectively. From earlier material, and stated here without proof, this is an estimate of S/N for 2 antenna elements in AWGN and a single coordinate descent step. The denominator is always non-negative (by the Cauchy-Schwarz inequality). As the energy of the noise approaches zero, yC→syR for some complex scalar s and a(yC,yR)→∞.
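
Equation 6 can be evaluated directly from the two sample vectors, as in this minimal Python sketch. The function names are hypothetical, and returning infinity when the denominator vanishes is an assumption that mirrors the limiting behavior described above.

```python
import math

def inner(a, b):
    """Inner product <a, b> = sum of a_i * conj(b_i)."""
    return sum(x * y.conjugate() for x, y in zip(a, b))

def snr_attribute(y_c, y_r):
    """S/N attribute of Equation 6 for commutated and reference samples."""
    cross = abs(inner(y_c, y_r)) ** 2
    denom = inner(y_c, y_c).real * inner(y_r, y_r).real - cross
    # denom is zero when y_c is a complex multiple of y_r (no noise).
    return math.inf if denom <= 0.0 else cross / denom

y_r = [1 + 0j, 1j]
noiseless = snr_attribute([2 + 0j, 2j], y_r)      # y_c = 2 * y_r
orthogonal = snr_attribute([1 + 0j, -1j], y_r)    # <y_c, y_r> = 0
partial = snr_attribute([1 + 0j, 0j], [1 + 0j, 1 + 0j])
```

The three cases give infinity, 0, and 1 respectively, spanning the range from perfectly correlated to uncorrelated samples.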


Ordinary Energy

$$
a(y)_{\text{Energy}} = \|y\|^2
\qquad \text{[Eq. 7]}
$$


Notice that instead of a(yC,yR) this attribute function has been written as a(y). This highlights the fact that this attribute function calculation does not require both the commutated and reference element samples, and either y=yR or y=yC may be used. Or, one could use both for a slight improvement. Under the assumption of AWGN for the channel, the distribution of a(y) is easy to understand. a(y) has a chi-square distribution for x(M)∉H. If we may also assume x(M)∈H has constant energy, then a(y) is non-central chi-square for x(M)∈H. Typical f1 and f0 are shown in FIGS. 9A and 9B, where one can see that a(y) is useful as KL(f1,f0)>0.
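
The separation between f0 and f1 for the energy attribute can be checked with a small Monte Carlo sketch in Python. For simplicity it assumes real-valued samples with unit-variance Gaussian noise and a constant-amplitude signal; the sample count, trial count, and names are illustrative assumptions.

```python
import random

def energy(y):
    """Ordinary energy attribute of Equation 7: squared norm of y."""
    return sum(abs(v) ** 2 for v in y)

def mean_energy(signal, trials=2000, seed=0):
    """Average energy of signal plus unit-variance Gaussian noise."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        total += energy([s + rng.gauss(0.0, 1.0) for s in signal])
    return total / trials

N = 16
mean_f0 = mean_energy([0.0] * N)   # noise only: chi-square, mean N
mean_f1 = mean_energy([1.0] * N)   # signal present: non-central, mean N + ||x||^2
```

Under these assumptions the noise-only mean is close to 16 and the signal-present mean is close to 32, which is consistent with the two PDFs being distinct.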


Spectral and Temporal Support.


The support of a function refers to the subset of its domain where the function is non-zero. For the rectangular decomposition of a received signal, as shown in FIG. 8, the spectral support is characterized by fc and Δf, and the temporal support is characterized by tc and Δt.


In some embodiments, energy and fc could be the only attributes used to identify an SOI. This is clearly unnecessarily restrictive.


For the present invention one may use any of the following attribute functions to identify a SOI:

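$$\begin{aligned} a(y_C,y_R)_{f_c} &= f_c \\ a(y_C,y_R)_{\Delta f} &= \Delta f \\ a(y_C,y_R)_{t_c} &= t_c \\ a(y_C,y_R)_{\Delta t} &= \Delta t \end{aligned} \qquad \text{[Eq. 8]}$$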

As before, one may also use a(y) rather than a(yC,yR).


Here x(M)∈H for some SOI, and otherwise x(M)∉H for other signals that are not of interest. Generally, one would expect a(yC,yR) to have a broader distribution for x(M)∉H (as characterized by f0). For example, in some concepts of operation, the notion is that the entirety of the probability mass of f1 is located near fc.


Angle of Arrival (AOA)


In the rectangular decomposition of a received signal, as shown in FIG. 8, for any rectangle R the color of the rectangle Color(R) is defined as the complex scalar,

$$\mathrm{Color}(R) = \langle y_C, y_R\rangle_R \qquad \text{[Eq. 9]}$$


The subscript R on the inner product indicates that the computation of the inner product is restricted by the spectral and temporal support of the rectangle R. What is rendered in FIG. 8 is Arg{Color(R)}. It has been previously demonstrated that the color sequence in the rectangular decomposition is a sufficient statistic for computing an MLE (maximum-likelihood estimate) of the AOA. Accordingly, one may define an attribute function as a point estimate of the AOA in accordance with,

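$$a(y_C,y_R)_{AOA} = \arg\max_{\Phi}\; \mathrm{Color}^{*}\,U\left(U^{*}\,\mathrm{diag}(\mathrm{Energy})\,U\right)^{-1}U^{*}\,\mathrm{Color}$$

where

$$\begin{aligned} U_n &= u_{k_n}(\Phi) \\ \mathrm{Color}_n &= \langle y_C, y_R\rangle_{R_n} \\ \mathrm{Energy}_n &= \langle y_R, y_R\rangle_{R_n} \end{aligned} \qquad \text{[Eq. 10]}$$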
ukn(Φ) is taken from the calibration tables for the antenna; it is the response of antenna element kn to a signal from angle Φ. Note that this is a multiple-angle calculation; to compute a single angle, Φ is restricted to a single tuple. As in the case of the above discussed spectral and temporal support attributes, x(M)∈H for some SOI, and otherwise x(M)∉H for other signals that are not of interest. Generally, one would expect a(yC,yR) to have a broader distribution for x(M)∉H (as characterized by f0). A related attribute is the a posteriori PDF of the AOA. Here the range of a(yC,yR) is the set of all non-negative functions that integrate to 1 defined on the interval [−π,+π].
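The single-angle case of Eq. 10 can be sketched as a grid search over Φ. The specification takes the element responses from antenna calibration tables. The `steering` function below instead assumes an idealized uniform circular array of eight elements, and the noise-free Color values are synthetic, so this is an illustration of the objective only and not the calibrated computation.

```python
import numpy as np

def steering(phi, element_angles, radius_wavelengths=0.5):
    """Hypothetical calibration-table entry u_k(phi) for a uniform circular array."""
    return np.exp(2j * np.pi * radius_wavelengths * np.cos(phi - element_angles))

def aoa_point_estimate(color, energy, element_angles, grid):
    """Grid search for arg max over phi of Color* U (U* diag(Energy) U)^-1 U* Color."""
    best_phi, best_val = None, -np.inf
    for phi in grid:
        u = steering(phi, element_angles)          # U_n = u_{k_n}(phi)
        num = abs(np.vdot(u, color)) ** 2          # |U* Color|^2
        den = np.vdot(u, energy * u).real          # U* diag(Energy) U
        if num / den > best_val:
            best_phi, best_val = phi, num / den
    return best_phi

element_angles = 2 * np.pi * np.arange(8) / 8      # one rectangle per element (assumed)
grid = np.linspace(-np.pi, np.pi, 360, endpoint=False)
true_phi = grid[137]
rng = np.random.default_rng(3)
energy = rng.uniform(0.5, 2.0, size=8)             # Energy_n = <yR, yR>_{R_n}
color = energy * steering(true_phi, element_angles)  # noise-free Color_n
estimate = aoa_point_estimate(color, energy, element_angles, grid)
print(true_phi, estimate)
```

With a single angle, U is a vector, so the matrix inverse in Eq. 10 reduces to division by a scalar.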


For the next group of attribute functions, specific structural assumptions are made about x(M). These attribute functions include: (a) multiple sinusoids, including CW and FSK, (b) PSK, (c) FM and QAM, (d) AM and DSB, and (e) SSB. For these attribute functions, the channel is more problematic for several reasons. First, frequency error is unconstrained. Second, digital modulation samples obtained away from the center of a symbol may not exhibit the characteristics of the constellation.


CW and FSK.


Spectrum estimation is a classic signal processing problem and there are many different approaches, each with advantages and disadvantages.


Suppose x(M)n=s eiωn where s is a complex scalar that models amplitude and phase and ω is the frequency of the sinusoid. Of course, s and ω are unknown. Let y=x(M)+v where v is AWGN.


One approach is to compute the MLE for ω and s and then compute the associated S/N. Here,

$$\omega_{MLE} = \arg\max_{\omega}\,\left|ℑ(y,\omega)\right|^2 \qquad \text{[Eq. 11]}$$
$$s_{MLE} = ℑ(y,\omega_{MLE}) \qquad \text{[Eq. 12]}$$

ℑ denotes the Fourier transform.


With

$$\tilde{x}_n = s_{MLE}\,e^{i\omega_{MLE} n}, \qquad \frac{S}{N} = \frac{\|\tilde{x}\|^2}{\|y-\tilde{x}\|^2}. \qquad \text{[Eq. 13]}$$







While this method enjoys certain statistical optimality and this is a simple calculation for a single sinusoid, the product space of the parameters quickly becomes problematic for multiple sinusoids.
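For a single sinusoid, Eqs. 11 to 13 can be sketched with a zero-padded FFT standing in for the continuous maximization over ω. The function name `sinusoid_snr`, the oversampling factor, the 1/N scaling of the transform and the test signal are assumptions made for illustration.

```python
import numpy as np

def sinusoid_snr(y, oversample=8):
    """Grid-search version of Eqs. 11-13 for one sinusoid in AWGN."""
    y = np.asarray(y, dtype=complex)
    n = len(y)
    # Zero-padded DFT, scaled by 1/n so that the peak value approximates s.
    spectrum = np.fft.fft(y, n * oversample) / n
    k = int(np.argmax(np.abs(spectrum) ** 2))
    omega = 2 * np.pi * k / (n * oversample)      # Eq. 11 on a discrete grid, in [0, 2*pi)
    s = spectrum[k]                               # Eq. 12
    x_fit = s * np.exp(1j * omega * np.arange(n))
    resid = y - x_fit
    snr = np.vdot(x_fit, x_fit).real / np.vdot(resid, resid).real   # Eq. 13
    return omega, s, snr

rng = np.random.default_rng(1)
n = 512
omega0 = 2 * np.pi * 37 / n
s0 = 1.5 * np.exp(0.7j)
noise = 0.1 * (rng.standard_normal(n) + 1j * rng.standard_normal(n))
tone = s0 * np.exp(1j * omega0 * np.arange(n)) + noise
omega_hat, s_hat, snr_tone = sinusoid_snr(tone)
_, _, snr_noise = sinusoid_snr(noise)
print(omega_hat, abs(s_hat), snr_tone, snr_noise)
```

The grid resolution limits the accuracy of the frequency estimate. Extending the search to several sinusoids would require a joint search over all frequencies, which is the growth in the parameter space noted above.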


One alternative method is AR spectral modeling. Here, y(n) is approximated as a linear combination of immediately preceding samples. One earlier sample is required for each sinusoid. For example, for one sinusoid $y(n) = w_1\,y(n-1)$ where $w_1 = e^{i\omega}$. In general, y=Yw for an appropriately defined Y. Generally, w is estimated by ordinary LS as $w = (Y^{*}Y)^{-1}Y^{*}y$, and the computational complexity of this algorithm is determined by the computation of w. For a small number of sinusoids Cramer's rule may be used. For larger problems Gaussian elimination may notionally be used, but recognizing that Y*Y is Toeplitz, the Levinson-Durbin algorithm is preferred for its $O(N^2)$ complexity.

$$\frac{S}{N} = \frac{\|Yw\|^2}{\|y-Yw\|^2}.$$






In AR spectral modeling the estimated frequencies of the sinusoids are determined by the locations of the zeros of the FIR filter determined by w.


Using an AR spectral model,

$$a(y,k)_{\text{Sinusoids}} = \frac{\left\|Y^{(k)}w\right\|^2}{\left\|y^{(k)} - Y^{(k)}w\right\|^2} \quad\text{where}\quad w = \left(Y^{(k)*}\,Y^{(k)}\right)^{-1}Y^{(k)*}\,y^{(k)} \qquad \text{[Eq. 14]}$$








Here k denotes the order of the AR model and the number of sinusoids. k is the number of tuples of w. For example, for CW k=1 and for binary FSK k=2.
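A minimal sketch of Eq. 14 follows, assuming that row n of Y(k) holds the k samples immediately preceding y(n). It uses a general least-squares solver rather than Levinson-Durbin, purely for brevity. The helper `ar_frequencies`, which reads frequencies from the zeros of the prediction-error filter, and all test signals are illustrative assumptions.

```python
import numpy as np

def ar_attribute(y, k):
    """Eq. 14: ||Yw||^2 / ||y - Yw||^2 with w from ordinary least squares."""
    y = np.asarray(y, dtype=complex)
    target = y[k:]                                            # y^(k)
    # Column j holds y(n-j), j = 1..k, aligned with target sample y(n).
    Y = np.column_stack([y[k - j: len(y) - j] for j in range(1, k + 1)])
    w, *_ = np.linalg.lstsq(Y, target, rcond=None)            # w = (Y*Y)^-1 Y* y
    fit = Y @ w
    resid = target - fit
    return np.vdot(fit, fit).real / np.vdot(resid, resid).real, w

def ar_frequencies(w):
    """Angles of the zeros of the FIR filter 1 - sum_j w_j z^-j."""
    return np.angle(np.roots(np.concatenate(([1.0], -np.asarray(w)))))

rng = np.random.default_rng(2)
n = 2000
t = np.arange(n)

def awgn(scale):
    """Complex white Gaussian noise with the given per-component scale."""
    return scale * (rng.standard_normal(n) + 1j * rng.standard_normal(n))

cw = np.exp(1j * 0.4 * t) + awgn(0.05)                        # CW, k = 1
two_tone = np.exp(1j * 0.4 * t) + np.exp(1j * 1.1 * t) + awgn(0.05)
attr_cw, w_cw = ar_attribute(cw, 1)
attr_two, _ = ar_attribute(two_tone, 2)
attr_noise, _ = ar_attribute(awgn(1.0), 1)
print(attr_cw, ar_frequencies(w_cw), attr_two, attr_noise)
```

Because the regressors are themselves noisy, the least-squares weights are slightly biased. The sketch makes no attempt to correct for this.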


Notice that because of modulation, binary FSK is not a sum of two pure sinusoids. An AR model is essentially an FIR filter, and symbol transitions induce a transient of length k−1. This degrades S/N, and a more elegant solution would be to detect these transients (by anomalous energy) and delete these samples from the calculation of w. This produces much better behavior for a(y,k), but also destroys the Toeplitz structure of Y*Y and thereby significantly increases computational difficulty.


PSK


Consider first the idealized case where the receiver samples only at the center of a transmit symbol. For k-ary PSK,

$$x^{(M)}_n = s\,e^{i\omega n}\,e^{i\frac{2\pi}{k}m_n}$$

where s is an unknown complex gain and ω is an unknown frequency error. $m_n$ is the unknown transmit data sequence. Let y=x(M)+v where v is AWGN.


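A first approach is to observe that $\left(x^{(M)}_n\right)^k = s'\,e^{ik\omega n}$ and, absent noise, $(y_n)^k = \left(x^{(M)}_n\right)^k$. Let $y^k$ denote this vector. $y^k$ is a sinusoid at frequency kω and it is reasonable to define,

$$a(y,k)_{\text{PSK } k\text{-ary } 1} = a(y^k,k)_{\text{Sinusoids}} \qquad \text{[Eq. 15]}$$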
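The effect of the k-th power can be checked numerically. In the sketch below, the QPSK test signal, its gain, the frequency error placed on FFT bin 5 and the helper `strongest_line_fraction` are assumptions chosen for illustration. The helper reports what fraction of the spectral energy falls in the strongest FFT bin.

```python
import numpy as np

def strongest_line_fraction(v):
    """Return (index of strongest FFT bin, fraction of total energy in that bin)."""
    power = np.abs(np.fft.fft(v)) ** 2
    return int(np.argmax(power)), power.max() / power.sum()

rng = np.random.default_rng(4)
k, n = 4, 256
omega = 2 * np.pi * 5 / n                       # frequency error, placed on FFT bin 5
symbols = rng.integers(0, k, n)                 # unknown transmit data m_n
x = 0.9 * np.exp(0.2j) * np.exp(1j * omega * np.arange(n)) * np.exp(2j * np.pi * symbols / k)
y = x + 0.02 * (rng.standard_normal(n) + 1j * rng.standard_normal(n))

raw_bin, raw_fraction = strongest_line_fraction(y)         # modulation spreads the spectrum
pow_bin, pow_fraction = strongest_line_fraction(y ** k)    # k-th power removes the modulation
print(raw_fraction, pow_bin, pow_fraction)
```

Raising the samples to the k-th power also multiplies the relative noise level by roughly k, so this approach favors small k and moderate to high S/N.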


A second approach is to observe that

$$x^{(M)}_n\,\overline{x^{(M)}_{n-1}} = s\,e^{i\omega}\,e^{i\frac{2\pi}{k}m}$$

and absent noise

$$y_n\,\overline{y_{n-1}} = x^{(M)}_n\,\overline{x^{(M)}_{n-1}}.$$

Let

$$(\Delta y)_n = y_n\,\overline{y_{n-1}}.$$

Δy is the same k-ary constellation rotated by ω.


The efficacy of a 4-ary PSK model may be determined by applying k-means (with k=4) and computing S/N.





Let {Ci}=k-means(Δy) where {Ci} is a set of clusters which represent a partition of the points of Δy. For any single cluster C we may define

$$\frac{S}{N} = \frac{\left|\mathrm{Mean}(C)\right|^2}{\mathrm{Variance}(C)}.$$

Similarly, for the set of all clusters {Ci} we may define

$$\frac{S}{N} = \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)}.$$






Accordingly,

$$a(y,k)_{\text{PSK } k\text{-ary } 2} = \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)} \quad\text{where}\quad \{C_i\} = k\text{-means}(\Delta y) \qquad \text{[Eq. 16]}$$







For the purpose of implementation, it is desirable to seed each k-means trial with k points that are uniformly spaced on a circle with random rotation. The result is independent of the radius of the circle used for this seeding.
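The attribute of Eq. 16, together with the seeding described above, can be sketched as follows. The k-means loop is a plain Lloyd iteration written out for self-containment. Taking the best value over several trials, the iteration count and the QPSK test signal are assumptions of this sketch, not details from the specification.

```python
import numpy as np

def kmeans_complex(points, k, rng, iters=20):
    """Lloyd's k-means on complex points, seeded on a circle with random rotation."""
    rotation = np.exp(1j * rng.uniform(0.0, 2.0 * np.pi))
    centers = rotation * np.exp(2j * np.pi * np.arange(k) / k)
    for _ in range(iters):
        labels = np.argmin(np.abs(points[:, None] - centers[None, :]), axis=1)
        for i in range(k):
            members = points[labels == i]
            if members.size:
                centers[i] = members.mean()
    return labels

def cluster_snr(points, labels, k):
    """Sum of |Mean(C_i)|^2 divided by sum of Variance(C_i)."""
    num = den = 0.0
    for i in range(k):
        members = points[labels == i]
        if members.size:
            num += abs(members.mean()) ** 2
            den += members.var()          # for complex input, var uses |.|^2
    return num / den

def psk_attribute(y, k, trials=5, seed=0):
    """Eq. 16 evaluated on (Delta y)_n = y_n * conj(y_{n-1}); best of several trials."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=complex)
    dy = y[1:] * np.conj(y[:-1])
    return max(cluster_snr(dy, kmeans_complex(dy, k, rng), k) for _ in range(trials))

rng = np.random.default_rng(5)
n, k = 2000, 4
noise = rng.standard_normal(n) + 1j * rng.standard_normal(n)
qpsk = np.exp(1j * 0.1 * np.arange(n)) * np.exp(2j * np.pi * rng.integers(0, k, n) / k)
attr_psk = psk_attribute(qpsk + 0.05 * noise, k)
attr_noise = psk_attribute(noise, k)
print(attr_psk, attr_noise)
```

Because the seeds lie on a circle centered at the origin, the initial nearest-center assignment splits the plane into angular sectors, which is why the radius of the seeding circle does not matter.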


Now consider what happens when samples are not necessarily taken at the center of transmit symbols. Suppose that the center of the first transmit symbol is at time t0 and the center of the nth symbol is at time t0+(n−1)T. There are two issues here. The receiver does not know t0 or T. Generally, the difficulty here is to determine an appropriate set of sampling instants, by a reasonable computation. The following method has been found to satisfy this requirement.


Begin by fixing an estimate of T. Let $\tilde{T}$ be an estimate of T. Consider

$$\tilde{T} \in \left[\frac{1}{2\,\tilde{\Delta}f},\; \frac{2}{\tilde{\Delta}f}\right]$$

where $\tilde{\Delta}f = a(y_C,y_R)_{\Delta f}$. Let

$$S_i = \bigcup_{t\in[i\tilde{T},\,(i+1)\tilde{T}]} y(t+\tilde{T})\,\overline{y(t)}.$$








The density with which samples are taken in a $\tilde{T}$ interval may vary, but generally four samples are satisfactory. For $T=\tilde{T}$ there is at least one sample of the form

$$s\,e^{i\omega}\,e^{i\frac{2\pi}{k}m}.$$






Other samples will be in a neighborhood of this value and generally there will be no bias in the error because of source compression (i.e. adjacent symbols are independent).


Let {Ci}=k-means(∪iSi) and μi=Mean(Ci). The clusters {Ci} determine a partition of ∪iSi, and the μi are the cluster centers. FIG. 10 is an exemplary histogram of clusters.


Let $s'_i = \arg\min_{u\in S_i} |\mu_i - u|^2$. That is, s′i is that element of Si that is closest to μi. Let μ′i=k-means(∪is′i). This process may be iterated. It appears that typically five iterations are sufficient to determine the correct subset of the samples, that is, to determine the centers of the transmit symbols. Thus,












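$$a(y,k)_{\text{PSK } k\text{-ary } 3} = \max_{\tilde{T}\in\left[\frac{1}{2\tilde{\Delta}f},\,\frac{2}{\tilde{\Delta}f}\right]} \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)} \quad\text{where}\quad \{C_i\} = k\text{-means}\Big(\bigcup_i s'_i\Big) \qquad \text{[Eq. 17]}$$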







It is still appropriate to retain a(y,k)PSK k-ary 2 as defined previously, for instances where the knowledge of symbol timing is present.


FM and QAM.


The treatment of FM and QAM is similar to the above discussed treatment of PSK. Consider first the idealized case where the receiver samples only at the center of a transmit symbol. For QAM, $x^{(M)}_n = s\,e^{i\omega n}\,u(m_n)$ where s is an unknown complex gain and ω is an unknown frequency error. $u(m_n)$ is an unknown transmit data symbol, an element of the QAM constellation. Let y=x(M)+v where v is AWGN.


A first approach is to observe that $|x^{(M)}_n|^2 \in A$ where #A=k and, absent noise, $|y_n|^2 = |x^{(M)}_n|^2$. (# denotes cardinality.) Let $y^2$ denote this vector. Thus $(y^2)_n \in A$ where #A=k.


k is the number of distinct amplitude rings. For QAM, k>1 and is typically small (e.g., 2 or 3). For FM, k=1, and FM is distinguished from k-ary PSK because FM does not exhibit the k-ary PSK constellation.


It follows that one may use,

$$a(y,k)_{\text{QAM } k\text{-ary } 1} = \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)} \quad\text{where}\quad \{C_i\} = k\text{-means}(y^2) \qquad \text{[Eq. 18]}$$
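Eq. 18 can be sketched as a one-dimensional clustering of the squared magnitudes. The quantile seeding used here is an assumption of this sketch, since the text gives no seeding rule for the one-dimensional case. The 16-QAM test signal with three amplitude rings, its gain and its frequency error are likewise illustrative.

```python
import numpy as np

def kmeans_1d(values, k, iters=50):
    """Lloyd's k-means on real values, seeded at evenly spaced quantiles (an assumption)."""
    centers = np.quantile(values, (np.arange(k) + 0.5) / k)
    for _ in range(iters):
        labels = np.argmin(np.abs(values[:, None] - centers[None, :]), axis=1)
        for i in range(k):
            if np.any(labels == i):
                centers[i] = values[labels == i].mean()
    return labels

def qam_ring_attribute(y, k):
    """Eq. 18: cluster (y^2)_n = |y_n|^2 into k amplitude rings."""
    y2 = np.abs(np.asarray(y, dtype=complex)) ** 2
    labels = kmeans_1d(y2, k)
    used = [i for i in range(k) if np.any(labels == i)]
    num = sum(y2[labels == i].mean() ** 2 for i in used)
    den = sum(y2[labels == i].var() for i in used)
    return num / den

rng = np.random.default_rng(6)
n = 4000
levels = np.array([-3.0, -1.0, 1.0, 3.0])
u = rng.choice(levels, n) + 1j * rng.choice(levels, n)      # 16-QAM symbols, 3 rings
noise = rng.standard_normal(n) + 1j * rng.standard_normal(n)
y = np.exp(0.4j) * np.exp(1j * 0.05 * np.arange(n)) * u + 0.05 * noise
attr_qam = qam_ring_attribute(y, 3)
attr_noise = qam_ring_attribute(noise, 3)
print(attr_qam, attr_noise)
```

Working with squared magnitudes makes the attribute insensitive to the unknown frequency error ω and to the phase of s, which is why no carrier recovery appears in the sketch.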







As above, now consider what happens when samples are not necessarily taken at the center of transmit symbols. Begin by fixing an estimate of T and consider

$$\tilde{T} \in \left[\frac{1}{2\,\tilde{\Delta}f},\; \frac{2}{\tilde{\Delta}f}\right].$$






Let $S_i = \bigcup_{t\in[i\tilde{T},\,(i+1)\tilde{T}]} |y(t)|^2$. For $T=\tilde{T}$ there exists s∈Si such that s∈A.


Let {Ci}=k-means(∪iSi) and μi=Mean(Ci). The clusters {Ci} determine a partition of ∪iSi, and the μi are the cluster centers.


As before, let $s'_i = \arg\min_{u\in S_i} |\mu_i - u|^2$ and μ′i=k-means(∪is′i), and iterate.


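Thus,

$$a(y,k)_{\text{QAM } k\text{-ary } 2} = \max_{\tilde{T}\in\left[\frac{1}{2\tilde{\Delta}f},\,\frac{2}{\tilde{\Delta}f}\right]} \frac{\sum_i \left|\mathrm{Mean}(C_i)\right|^2}{\sum_i \mathrm{Variance}(C_i)} \quad\text{where}\quad \{C_i\} = k\text{-means}\Big(\bigcup_i s'_i\Big) \qquad \text{[Eq. 19]}$$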








Note that this algorithm is identical to the earlier algorithm for PSK. The difference is in the definition of Si.


For PSK

$$S_i = \bigcup_{t\in[i\tilde{T},\,(i+1)\tilde{T}]} y(t+\tilde{T})\,\overline{y(t)}$$

and for QAM $S_i = \bigcup_{t\in[i\tilde{T},\,(i+1)\tilde{T}]} |y(t)|^2$. There are also some minor differences in an appropriate implementation of k-means. It is still appropriate to retain a(y,k)QAM k-ary 1 as defined previously, for instances where the knowledge of symbol timing is present.


AM or DSB.


Suppose $x_n = s\,e^{i\omega_c n}\,u_n$ where $\operatorname{Im} u_n = 0$; that is, $u_n$ is purely real. Note the use of $\omega_c$ rather than the earlier ω.


Recall, from the inverse Fourier transform, $u_n = \int_{\omega} s_-(\omega)\,e^{-i\omega n} + s_+(\omega)\,e^{+i\omega n}$. $\operatorname{Im} u_n = 0$ implies that $s_-(\omega)\,e^{-i\omega n}$ and $s_+(\omega)\,e^{+i\omega n}$ are conjugates, and this implies that $s_-(\omega)$ and $s_+(\omega)$ are conjugates. Thus, |ℑ(u,ω)| is an even function of ω and Arg ℑ(u,ω) is an odd function of ω. The important observation is ℑ(u,ω)=ℑ(u,−ω). This is the characteristic that will be used to identify AM (or DSB).


Suppose one knows ωc. Letting $V(\omega_c,y,\omega) = ℑ(e^{-i\omega_c n}\,y_n,\,\omega)$, one may test whether or not V(ωc,y,ω)≈V(ωc,y,−ω). This is the same relationship that exists between the commutated and reference samples in the presence of a target signal (in AWGN), in the construction of the rectangular decomposition. Hence, the following estimate is appropriate:












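$$a(y)_{AM} = \frac{\left|\langle V(\tilde{\omega}_c,y,\omega),\,V(\tilde{\omega}_c,y,-\omega)\rangle\right|^2}{\left\|V(\tilde{\omega}_c,y,\omega)\right\|^2\left\|V(\tilde{\omega}_c,y,-\omega)\right\|^2 - \left|\langle V(\tilde{\omega}_c,y,\omega),\,V(\tilde{\omega}_c,y,-\omega)\rangle\right|^2}$$

$$\tilde{\omega}_c = \arg\max_{\omega_c}\,\left|\langle V(\omega_c,y,\omega),\,V(\omega_c,y,-\omega)\rangle\right|^2 \qquad \text{[Eq. 20]}$$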








The first term in the denominator has no dependence on ωc (by Parseval's theorem). Hence, a(y)AM is maximized over ωc when the numerator is maximal. This justifies the estimate $\tilde{\omega}_c$.


Practically, the computation of $\tilde{\omega}_c$ is much simpler than it appears. One need only compute three FFTs. Compute one FFT, flip it over the ω-axis (a reversal), and correlate this with the unmodified transform to determine $\tilde{\omega}_c$. This correlation may again be computed with two FFTs.
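The three-FFT procedure can be sketched as follows. As a simplification, this sketch correlates the magnitude of the spectrum with its reversal rather than the complex transform itself, which is an assumption and not the exact inner product of Eq. 20. The function name, the AM test signal and its carrier bin are also illustrative.

```python
import numpy as np

def estimate_carrier_bin(y):
    """Estimate the carrier FFT bin from the symmetry of the spectrum magnitude."""
    mag = np.abs(np.fft.fft(y))                         # FFT 1: spectrum of y
    # Correlating mag with its reversal equals circularly convolving mag with itself,
    # which takes two more FFTs (one forward, one inverse).
    autoconv = np.fft.ifft(np.fft.fft(mag) ** 2).real   # FFTs 2 and 3
    shift = int(np.argmax(autoconv))                    # peak at twice the carrier bin (mod N)
    return shift / 2.0                                  # carrier bin, modulo N/2

rng = np.random.default_rng(7)
n, carrier_bin = 256, 40
t = np.arange(n)
# Real message u_n: a carrier term plus two tones (illustrative).
u = 1.0 + 0.5 * np.cos(2 * np.pi * 3 * t / n) + 0.3 * np.cos(2 * np.pi * 7 * t / n + 0.4)
y = 0.8 * np.exp(1.1j) * np.exp(2j * np.pi * carrier_bin * t / n) * u
y = y + 0.01 * (rng.standard_normal(n) + 1j * rng.standard_normal(n))
estimate = estimate_carrier_bin(y)
print(estimate)
```

The estimate is ambiguous by half the FFT length, because a spectrum that is symmetric about one bin is also circularly symmetric about the bin N/2 away. A practical implementation would need a further rule to resolve this.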


Implementation of Operation


Referring to FIG. 11, operation of the method of the present invention is implemented by a scan policy controller 60 in combination with a machine interface (MI) 62, a computer 64, signal observation resources 66 and a SOI database 68.


The MI 62 is embodied in the computer 64 (14 in FIG. 1) and the computer monitor (16 in FIG. 1) or may be embodied separately therefrom.


Also referring to FIGS. 1 and 2, the computer 64 (14 in FIG. 1) is programmed for constructing the statistical models of particular signals of interest 24 and the statistical model of the environment 25, and for processing the observed received signal during step 26 in accordance with the decision rule to decide whether the observed received signal is a SOI.


The signal observation resources 66 include antennas and receivers 10, 11, 12 that can be controlled to access specified attributes of the observed received signal for use in constructing the statistical models 24, 25.


The scan policy controller 60 is a collection of apparatus that control the signal observation resources 66 and the operation of the computer 64. The scan policy controller 60 is embodied in the computer 64 (14 in FIG. 1) and the computer monitor (16 in FIG. 1) or may be embodied separately therefrom.


The SOI database 68, which may be included in the computer 64, stores a statistical model for the environment 25 and statistical models for particular SOI's. Statistical models of target signals that are identified as signals of interest are entered into the SOI database upon affirmation by the operator that a particular target signal that has been identified as a SOI is indeed a SOI. The statistical models for particular SOI's are specified by the system operator.


When the operator desires a decision by the computer 64 as to whether an observed received signal is a particular SOI, the operator causes the statistical model for the particular SOI to be accessed by the scan policy controller 60 from the SOI database 68 and used for construction of the statistical model for the observed received signal; and the scan policy controller 60 decides whether the observed received signal is the particular SOI.


The MI 62 facilitates operation of the scan policy controller by a system operator by enabling the operator to specify and/or revise the attributes of a target signal that are used to construct the statistical model and/or to revise parameters of the decision rule in response to decisions made by the computer 64 (14 in FIG. 1) as to whether the observed received signal is a SOI.


A significant benefit of the methods described herein is reducing operator workload. This is realized by automating operator tasks that have historically been manual, wherever possible.


The benefits specifically stated herein do not necessarily apply to every conceivable embodiment of the present invention. Further, such stated benefits of the present invention are only examples and should not be construed as the only benefits of the present invention.


While the above description contains many specificities, these specificities are not to be construed as limitations on the scope of the present invention, but rather as examples of the preferred embodiments described herein. Other variations are possible and the scope of the present invention should be determined not by the embodiments described herein but rather by the claims and their legal equivalents.


Regarding the method claims, except for those steps that can only occur in the sequence in which they are recited, and except for those steps for which the occurrence of a given sequence is specifically recited or must be inferred, the steps of the method claims do not have to occur in the sequence in which they are recited.


LIST OF ACRONYMS



  • AM Amplitude modulation

  • AOA Angle-of-arrival

  • AR Autoregressive

  • AWGN Additive white Gaussian noise

  • CDF Cumulative distribution function

  • CW Continuous wave

  • DSB Double sideband

  • FIR Finite impulse response

  • FM Frequency modulation

  • FSK Frequency shift keying

  • GUI Graphical user interface

  • KL Kullback-Leibler divergence

  • LLR Log likelihood ratio

  • LS Least-squares

  • MLE Maximum-likelihood estimate (or estimator)

  • MI Machine interface

  • PDF Probability density function

  • QAM Quadrature amplitude modulation

  • SOI Signal-of-interest







  • S/N Signal-to-noise ratio

Claims
  • 1. A method of deciding whether an observed received signal is a particular signal of interest (SOI), comprising the steps of: (a) providing a statistical model of the particular signal of interest and a statistical model of the environment when the particular signal of interest is not present; and (b) with a computer, processing the observed received signal in accordance with a decision rule that uses the provided statistical models to decide whether the observed received signal is the particular signal of interest; wherein the decision rule is:
  • 2. A method according to claim 1, wherein the attribute functions a1(y),a2(y), . . . are presented for computation of LLR(y) in an order that is in accordance with the time cost of observing the attributes.
  • 3. A method according to claim 1, wherein the attribute functions a1(y),a2(y), . . . are presented for computation of LLR(y) in an order that is in accordance with the time cost of computation.
  • 4. A method according to claim 1, wherein the attribute functions a1(y),a2(y), . . . are presented for computation of LLR(y) in an order that is in accordance with the time cost of communicating the observed attribute.
  • 5. A non-transitory computer readable storage medium for use with a computer in a system for deciding whether an observed received signal is a particular signal of interest (SOI), wherein the computer readable storage medium contains computer executable program instructions for causing the computer to perform the step of: (a) processing the observed received signal in accordance with a decision rule that uses a statistical model of the particular signal of interest and a statistical model of the environment when the particular signal of interest is not present to decide whether the observed received signal is the particular signal of interest; wherein the decision rule is:
  • 6. A method of deciding whether an observed received signal is a particular signal of interest (SOI), comprising the steps of: (a) providing a statistical model of the particular signal of interest and a statistical model of the environment when the particular signal of interest is not present; and (b) with a computer, processing the observed received signal in accordance with a decision rule that uses the provided statistical models to decide whether the observed received signal is the particular signal of interest; wherein the decision rule is:
  • 7. A method according to claim 6, wherein the attribute functions a1(y), a2(y), . . . are presented for computation of LLR(y) in an order that is in accordance with the time cost of observing the attributes.
  • 8. A method according to claim 6, wherein the attribute functions a1(y), a2(y), . . . are presented for computation of LLR(y) in an order that is in accordance with the time cost of computation.
  • 9. A method according to claim 6, wherein the attribute functions a1(y), a2(y), . . . are presented for computation of LLR(y) in an order that is in accordance with the time cost of communicating the observed attribute.
  • 10. A non-transitory computer readable storage medium for use with a computer in a system for deciding whether an observed received signal is a particular signal of interest (SOI), wherein the computer readable storage medium contains computer executable program instructions for causing the computer to perform the step of: (a) processing the observed received signal in accordance with a decision rule that uses a statistical model of the particular signal of interest and a statistical model of the environment when the particular signal of interest is not present to decide whether the observed received signal is the particular signal of interest; wherein the decision rule is: