The invention is concerned with the calibration of phased array antennas of the type used in applications such as Direction Finding (DF), signal separation and enhanced reception or simple beam steering.
These techniques are well known but one problem commonly encountered is that knowledge is required of the response of the array to signals arriving from different directions.
The set of complex responses across an array of n elements may be termed a point response vector (PRV) and the complete set of these vectors over all directions is known as the array manifold (of n dimensions). Normally a finite sampled form of the manifold is stored for use in the DF processing.
The (sampled) manifold can be obtained, in principle, either by calibration or by calculation or perhaps by a combination of these. Calibration, particularly over two angle dimensions (for example azimuth and elevation) is difficult and expensive, and calculation, particularly for arrays of simple elements, is much more convenient. In this case, if the positions of the elements are known accurately (to a small fraction of a wavelength, preferably less than 1%) the relative phases of a signal arriving from a given direction can be calculated easily, at the frequency to be used. The relative amplitudes should also be known as functions of direction, particularly for simple elements, such as monopoles or loops. If the elements are all similar and orientated in the same direction then the situation corresponds to one of equal, parallel pattern elements, and the relative gains across the set of elements are all unity for all directions.
The problem with calculating the array response is that this will not necessarily match the actual response for various reasons. One reason is that the signal may arrive after some degree of multipath propagation, which will distort the response. Another is that the array positions may not be specified accurately, and another that the element responses may not be as close to ideal as required. Nevertheless, in many practical systems these errors are all low enough to permit satisfactory performance to be achieved. However, one further source of error that it is important to eliminate, or reduce to a low level, is the matching of the channels between the elements and the points at which the received signals are digitized, and from which point no further significant errors can be introduced (
One solution to channel calibration is to feed an identical test signal into all the channels immediately after the elements. The relative levels and phases of these after digitization give directly the compensation (as the negative phase and reciprocal amplitude factor) which could be conveniently applied digitally to all signals before processing, when using the system (
One problem which arises during the measurement of phase angles is that of ‘unwrapping’ the measured value. The indicated value will lie within a range having a magnitude of 360° (or 2π radians) with no indication of whether the true value equals this indicated value or includes a whole number multiple of 360°/2π radians. The term ‘unwrapping’ is used in the art to describe the process of resolving such indicated values to determine the true values. According to a first aspect of the invention, a method of processing a signal comprises the steps set out in claim 1 appended hereto. According to a second aspect of the invention, apparatus for processing a signal comprises the features set out in claim 5 attached hereto.
For any array, the phase response across the array is a function of the element positions. For example, for a linear array the phase response across the array is a linear function of the element positions along the axis of the array, and this is the case whatever the direction of the observed signal (though the line has different slopes for different signal directions, of course). Thus if a signal of opportunity is available the received array phases are determined and the best linear fit to these values, as related to element position, is determined. It is assumed that this linear response is close to the ideal response for this signal and that the deviations of the received values from this line are the phase errors which require compensation. In the case of equal, parallel element patterns, the amplitude responses should be equal so variations, as factors, from a mean (in this case the geometric mean) give the required corrections.
The invention will now be described, by way of non limiting example, with reference to the attached figures in which:
The following detailed description is concerned with the case of a one-dimensional antenna array having evenly spaced elements. However, this should not be seen as limiting as the invention is equally applicable to array antennas of other shapes or configuration (e.g. two dimensional planar, spherical etc), whether or not the array elements are evenly spaced (so long as the element positions are known).
Referring to
Here it is assumed that the relative phases have been found and that the required multiples of 2π have been added to make the phases approximately linear with element position along the array axis. This process is known as unwrapping the phase values.
A number of approaches to the problem of phase unwrapping are possible and further details on how the problem may be approached are included later.
Since the phase φk for each element k is directly proportional to the position xk, a plot of the (correctly adjusted) phase shifts against element positions should provide a straight line. This is the case, whatever the value of θ, the signal direction; the value of θ (and of λ) will determine the slope of the line. In practice, there will be channel phase errors which add to these path difference phases, so that the (corrected) phase values will be scattered about the line, rather than lying exactly on it (
The basis of one aspect of the invention is that, given the phase measurements and the element positions, the straight line through this set of points which gives the best fit, in some sense, is found and it is assumed that this is close to the response due to the signal. In fact it is only necessary that the slope of this line should agree with the slope due to the signal (which is 2π sin θ/λ) as any phase offset which is common to all the channels is of no physical significance. In fact if the actual signal direction is not known, then the correct slope will not be known, and the ‘best fit’ line may not have this slope exactly. However, if there is no correlation between the phase errors and the element positions, as would generally be expected to be the case, and if there is a sufficient number of elements to smooth statistical fluctuations adequately, then the match should be good. For a definition of ‘best fit’ the sum of the squares of the errors (of the given points from the line) should be minimized—i.e. a least mean square error solution is sought.
Let the element positions and the phases be given by
x=[x1 x2 . . . xn]T and p=[p1 p2 . . . pn]T
respectively, where xk and pk are the position of element k and the phase measured in channel k. Let
p=ax+b (1)
be the best fit line, where a and b have yet to be determined. The errors of the measured points from this line is given by
e=p−(ax+b1) (2)
where x contains the n element positions so ax+b1 are the n phases at these points, given by the best fit line. The sum of the squared errors is given by
where 1 is the n-vector of ones, [1 1 . . . 1]T. For any given a the task is to find b which minimizes the total squared error, s. Thus:
(using uTv=vTu for any vectors u and v of equal length). This derivative is zero when
1T(p−(ax+b1))=1Tp−(a1Tx+b1T1)=n
Here
is the mean of the components of p, and similarly for
With this value for b the line becomes p=
e=p−
with the definition that Δp=p−
The total squared error is now given by
E=(Δp−aΔx)T(Δp−aΔx)=ΔpTΔp−2aΔxTΔp+a2ΔxTΔx.
and this is zero when
This is the estimate of the slope of the best fit line, and putting this into the expression for e (equation (6)) gives the estimate of the channel phase matching error
This method of the invention can be extended to apply for planar arrays and for volume, or 3D, arrays. In the planar case the phase at an element k, relative to that at the origin, is given by
φk=(2π/λ)(uxk+vyk) (8)
where the coordinates for the position of element k are (xk, yk, 0) and (u, v, w) are the direction cosines for the signal position (u replaces sin θ in the linear case) using the same coordinate system. (The path difference is the projection of the position vector [xk yk 0] onto the unit signal direction vector [u v w], and this is given by their inner product. Again the path difference is converted into radians of phase shift at the signal frequency by multiplying by 2π/λ.) As in the linear array case the phase is a linear function of the element position, in this case in two dimensions. Ideally the phase values from a single signal will all lie in a plane so in this case the plane that is the best fit through the set of measured points is sought.
Let the plane be given by
p=ax+by +c (9)
then the errors (the difference between the measured phases p and the line) are given by
e=p−(ax+by +c1) (10)
and applying the result found for a linear array above, that the sum of the errors should be zero (or 1Te=0), gives
0=1Tp−(a1Tx+b1Ty+c1T1)=n
so c=
and e=p−
where, as before,
and similarly for x and y.
The total squared error is given by
E=e
T
e=(Δp−(aΔx+bΔy))T(Δp−(aΔx+bΔy))
and in this case E must be minimized with respect to both a and b. Thus
These are two simultaneous equations which can be put in the form
or, introducing the notation Dxp=ΔxTΔp, etc.,
with the solution
(using Dyx=Dxy).
For the volume arrays the phase of element k, again given by the inner product, is
(φk=(2π/λ)(uxk+vyk+wzk) (16)
where the element position is (xk, yk, zk). The 3D hyperplane that the phases should lie on is given by
p=ax+by+cz+d (17)
and the errors are given by
e=p−(ax+by+cz+d1). (18)
Making the sum of the errors zero leads to
e=p−
and then requiring that E should be minimized with respect to a, b and c, leads to
which gives the required values of the three coefficients.
In the case of equal parallel pattern elements the gains (as real amplitude, or modulus, factors) should all be equal. If the measured gains are a1, a2, . . . , an, then the geometric mean of these â, rather than the arithmetic means (as in the phase case) is taken, and then the error factors are ak/â and the correction factors to be applied to the data before processing are the reciprocals of these. (Alternatively one could just apply factors 1/ak, so effectively setting the channel gains (including the gains of the array elements) to unity. As the set of n channel outputs can be scaled arbitrarily, this is equally valid, but may require changes to any thresholds, as level sensitive quantities.)
If the element patterns are not parallel (all with the same pattern shape and oriented in the same direction) then this calibration will only be valid for the direction of the signal used, which in general is not known. (Even if it is known, the calibration information could only be used for correcting the manifold vector for this single direction.) Thus this method is not applicable to mixed element arrays (e.g. containing monopoles and loops) or to arrays of similar elements (e.g. all loops) differently oriented. If the element patterns are parallel but not equal (i.e. if the array elements have different gains) then this calibration will effectively equalize all the gains, which will then agree with the stored manifold values (if this assumption has been made in computing the manifold vectors). However this will modify the channel noise levels, in the case of systems which are internal noise limited (rather than external noise limited as may be the case at HF), so that the noise is spatially ‘non-white’, which is undesirable in the processing. Thus this method is really limited to arrays with equal, parallel pattern elements, but this is in fact a very common form of array, and this calibration should be simple and effective for this case. The method does not otherwise depend on the array geometry so is applicable to linear, planar or volume arrays.
Considering the case of a regular linear array first, in the absence of errors the path differences between adjacent elements will all be the same, so also will be the resulting phase differences. However, the measured phases are all within an interval of 2π radians (e.g. −π to +π) so if the cumulative phase at an element is outside this range then a multiple of 2π radians will be subtracted or added, in effect, to give the observed value. In order to obtain the linear relationship between phase and element position the correct phase shifts need to be found, adding or subtracting the correct multiples of 2π to the observed values. Taking the differences between all the adjacent elements yields some that correspond to the correct phase slope, say Δφ, and some with a figure 2π higher or lower (e.g. Δφ−2π). These steps in the set of differences indicate where the increments of 2π should be added in (and to all succeeding elements). However, with channel phase errors present the difference between (Δφ+errors) and (Δφ−2π+errors) is not a simple value of 2π and it is necessary to set some thresholds to decide whether a given value is in fact near to Δφ (which itself is not known, as the signal direction is not known) or near to Δφ−2π. This problem is solved by taking a second set of differences−the differences between adjacent values of the first set. When there are two adjacent values of (Δφ+errors) their difference is (zero+errors) and when adjacent values are (Δφ+errors) and (Δφ−2π+errors) the difference is (2π+errors). Thus all the second differences are near zero, ±2π, ±4π and so on. To find the values that there would be without errors the set is simply rounded to the nearest value of 2π to get the correct, error free, second differences. (It is assumed that the errors are small enough that four such errors, some differing in sign, which accumulate in the second differences, do not reach ±π radians. An estimate of the standard deviation of the phase errors is given below, showing that up to 20° to 30° can be handled). In fact it is convenient to measure phase in cycles for this process, so that the second differences are rounded to the nearest integer.
Having found the integer values for the second differences in phase (measured in cycles) the process is now reversed: starting with the first difference set to zero, the next difference is obtained by incrementing by the first of the second differences, and so on. Having obtained the (error-free) set of first differences, now containing integer values (in cycles), this process is repeated to find the set of cycles to be added and then these are applied to the measured set of phases to obtain the full (unwrapped) set of phases.
The two differencing processes may be considered to be analogous to differentiation, the first reducing the linear slope to a constant value, Δφ (except for the integer cycle jumps), and the second reducing this constant to zero (where there are no jumps). Reversing the process is analogous to integration, which raises the problem of the arbitrary constant. In fact an error by one cycle (or more) may be present at the first difference stage, and integrating this contribution gives an additional slope of one phase cycle (or more) per element. However, the error estimation process described above is independent of the actual slope so the fact that the slope may be different from the true one makes no difference.
A more formal analysis of the phase correction determination is given below, including the solution for the case where the array is not regular. Here the second differences, used to eliminate u, have to take into account the irregular values of dk (and their first differences, Δdk) so the expressions become more complicated.
Let the full phase in channel k be given by
Φk=dku+φ0+εk (k=1 to n) (A1)
where dk is the distance of element k along the array axis from some reference point, u is the direction cosine for the source direction along the array axis (in fact u=sin θ, where θ is the angle of the signal measured from the normal to the array axis), φ0 is a fixed phase value and εk is the channel phase error. It is often convenient in practice to take an end element of the array as the reference point, and then regard this as the reference channel, measuring all channel phases and amplitudes relative to those of this channel. The term dku is the path difference for the signal, between the reference point and element k, measured in cycles, and all phases here are in cycles, which is more convenient than radians or degrees for this problem, both in theory and in the practical computation. This phase may be many cycles (or multiples of 2π radians) but the measured phases will be within a range of 2π radians, or one cycle, and these are taken to be between −½ and +½ cycles and to be given by
(φk=Φk+mk=dku+mk+φ0+εk (k=1 to n) (A2)
where mk is the number of cycles added to the full phase value (or removed, if mk is negative). The problem in phase unwrapping is to find the values of mk.
In order to remove φ0 and also the effect of the arbitrary choice of reference point the first differences are formed, given by
Δφk=uΔd+Δmk+Δεk(k=1 to n−1) (A3)
where
Δxk=xk+1−xk (A4)
for x representing φ, d, m or ε, and Δdk=Δd as all the Δdk are equal for a uniform, or regular, array. Next, the second differences are taken to obtain
Δ2φk=Δ2mk+Δ2εk (k=1 to n−2) (A5)
as the term uΔd is constant (with k) so its differences disappear. As all the values of mk are integral, so also are all their first and second differences. If the errors are not too great then the second differences in the errors (Δ2εk=εk+2−2εk+1+εk) will be less than ½ in magnitude, so if the values of Δ2φk are rounded to the nearest integer the correct values for Δ2mk are obtained. Let
Δ2Mk=round(Δ2φk)=int(Δ2φk+½) (A6)
where int(x) gives the highest integer in x, then with moderate error levels
Δ2Mk=Δ2mk (A7)
will normally be obtained.
To find the values of Mk, a summing operation (the inverse of the differencing process) is carried out twice. From (A4),
ΔMk+1=ΔMk+Δ2Mk (k=1 to n−2) (A8)
but value for ΔM1 has not been defined. This is analogous to the ‘arbitrary constant’ of integration, which is set to zero here. The second reverse operation gives:
M
k+1
=M
k
+ΔM
k (k=1 to n−1) (A9)
again putting M1=0. Because these values of M1 and ΔM1 may not be the same as m1 and Δm1 (which are not known) the resultant values of mk may not be the same as the values obtained for Mk, but it is now shown that the differences (if any) are of no significance for this calibration purpose, and that the set of Mk values is equivalent to the actual set of mk. In a processing program generated, (A4) was used twice to obtain the first and second differences of φ, before rounding, according to (A6), and then using (A8) and (A9) to obtain the set of Mk. Finally Φk is obtained from φk using Mk, ignoring any differences between Mk and mk.
Equivalence of Set {Mk} and {mk}
Let Δma and mb be the arbitrary choices (or constants of ‘integration’) taken for ΔM1 and M1 respectively. Putting
ΔM1=Δma=(Δma−Δm1)+Δm1, (A10)
the next first difference for ΔM is
ΔM2=ΔM1+Δ2M1=ΔM1+Δ2m1=ΔM1+(Δm2−Δm1)=(Δma−Δm1)+Δm2 (A11)
where (A8), (A7), (A4) and (A10) have been used. Continuing,
ΔMk=(Δma−Δm1)+Δmk (k=1 to n−1) (A12)
in general. Now let
M
1
=m
b=(mb−m1)+m1, (A13)
then
M
2
=M
1
+ΔM
1=(mb−m1)+m1+(Δma−Δm1)+Δm1=(mb−m1)+(Δma−Δm1)+m2, (A14)
using (A13), (A10) and (A4) (Δm1=m2−m1). Note that every time ΔMk is added, the quantity (Δma−Δm1) is included, so that finally
M
k=(mb−m1)+(k−1)(Δma−Δm1)+mk. (k=1 to n) (A15)
The term (mb−m1) is a constant phase shift (over all k) and the term (k−1)(Δma−Δm1) corresponds to a constant phase slope, so when the corrections Mk are added to φk to obtain Φk the irregular jumps mk are correctly compensated for while adding an overall phase (when mb≠m1) and a change in slope (when Δma≠Δm1). However, the phase error estimation of the invention is independent both of absolute phase and of the phase slope, so these differences do not affect the resultant estimates in any way.
The full phase is given by (A1) and the measured phase by (A2), but, in the case of the non-uniform linear array (A3) is replaced, for the first differences in phase, by
Δφk=uΔdk+Δmk+Δεk. (k=1 to n−1) (A16)
In this equation the quantities Δφk, Δdk are known, the error differences Δεk are not known but will be removed by rounding, at the appropriate point, and Δmk is to be found, for each k. However u is unknown and while it is removed by taking second differences in the uniform case, this will not be the case here because, in general uΔdk+1 and uΔdk will differ so their difference does not disappear.
Rearranging the equation gives
and taking differences again, gives
which is again rearranged as
It is known that Δmk+1 is integral, so if the errors are not too great, as before, the relation
holds.
From this equation (the first ‘summation’) all the Δmk, given Δm1 could be found. As this is not known ΔM1 is set to 0, and the set {ΔMk} is found, equivalent, for the purpose of finding the best fit, to {mk}, as shown in the section “Equivalence of set {Mk} and {mk}” above. Thus with ΔM1=0 the equation
is solved to obtain the set {ΔMk: k=1 to n−1}. Then the set {Mk: k=1 to n} is obtained as before, putting ΔM1=0, and using (A9).
Note that (A20) is the equation, for the non-uniform case, equivalent to (A8) for the uniform case. Putting Δdk+1=Δdk, for the linear case, then (A20) becomes
using the fact that ΔMk is integral, and then equations (A4) and (A6).
Table 1 shows data derived from actual measurements using a one dimensional linear array with 10 equispaced elements.
For convenience & simplicity of explanation, channel 1 is taken as the measurement reference, so that all measured phase shifts are relative to channel 1.
Column 2 shows average values of measured phase relative to channel 1, calculated from a large number of acquired data (not shown).
Column 3 shows the results of the first differencing process, i.e. the difference in phase between adjacent array elements. The entries in column 3 are given by subtracting the corresponding entry in column 2 from the next entry in column 2.
Column 4 shows the results of the second differencing process: the entries in column 4 are given by subtracting the corresponding entry in column 3 from the next entry in column 3.
Column 5 shows Diffk (k=1 to 8), the set of second difference values of Column 4, rounded to the nearest multiple of 360° and expressed in cycles through subsequent division by −360°. (The negative sign is required to ensure the phase unwrap values will have the correct sense).
The results in column 5 now need to be summed twice in order to obtain the phase unwrap values. The results of the first summation are given by:
dΦ
k+1
=dΦ
k+Diffk
dΦ
1=0 (k=1 to 8)
The results of the first summation are shown in column 6.
The second summation is given by
Φk+1=Φk+dΦk
Φ0=0 (K=1 to 9)
The results of the second summation are shown in column 7.
Since, in this example, the rounded second differences were optionally divided by −360° to give the values shown in column 5, the results of the second summation shown in column 7 are now multiplied by 360° to give the amount of phase unwrapping to be associated with each channel. Thus, the entries in column 8 show the values to be added to the measured phases for each of the channels, in order to establish the actual phase shift of each channel, relative to channel 1.
A program has been written to simulate a phase error mismatch problem using a regular linear array, at half wavelength spacing. The three input arguments are n, the number of elements, θ, the angle of the signal source, relative to the normal to the axis of the array, and the standard deviation of the channel phase errors. On running the program a set of n channel phase errors are taken from a zero mean normal distribution with the given standard deviation. These are added to the phases at the elements due to the signal, from direction θ, which give the linear phase response. As mentioned previously, it is convenient to express these phases in cycles, rather than radians or degrees. These phases are then reduced, by subtracting a number of whole cycles from each, to the range −½ to +½ (equivalent to −π to +π radians), to give the values that would be measured. This is the basic data that the channel error estimation algorithm would be provided with.
The processing begins by ‘unwrapping’ the phases—restoring the cycles that have been removed from the approximately linear response. This is implemented by the process described previously, and relies on the errors being not too excessive. (The errors to the kth second difference are εk−2εk+1+εk+2, where εk is the error in channel k. The variance at the second difference level is thus 6σ2 (from σ2+4σ2+σ2) if σ2 is the variance of the errors, so the standard deviation is increased √6 times. Thus for σ=30°, the s.d. of the second difference errors is about 73.5°, so ±180° corresponds to the 2.45 s.d. points, and the probability of exceeding these limits, and causing an error, is between 1% and 2%. If σ=20° errors occur at the 3.67 s.d. points, giving a probability of error of about 2×10−4. This is the probability for each of the n−2 differences, not for the array as a whole.)
Having obtained the full path difference phase shifts, the processing for evaluating the estimate of the slope a of the best fit line from equation (7) is applied and then the estimate of the channel errors is found from equation (6).
Table 2 shows five sets of errors for this example. The first line is the set of channel errors taken from the normal distribution with a standard deviation of 10°. The second line gives the cycles of error resulting from the unwrapping process—in this case there is no error in all ten channels. The third line gives the estimated errors across the ten channels, and the fourth is the difference between lines three and one—i.e. the errors in estimating the channel errors. Finally the fifth line removes the mean value from line three (on the basis that a common phase can be subtracted across the array) and an interesting result is observed. The residual errors increment regularly across the array—in other words they correspond to a linear response and so are due to a small error between the true response (corresponding to the signal direction of 30°) and the best fit line. This is not a failure of the method, but a result of the particular finite set of error data used, as indicated in
Without information of the actual direction of the signal it is impossible to know what is the correct slope and the best that can be done is to make some best fit, in this case based on the least squared error solution. The slope of the best fit line matches that of the signal response if the phase error vector and the element position vector are orthogonal—i.e. if the phases and the positions are uncorrelated. This will not normally be exactly true for finite samples (10 in this simulation case) but would become more nearly true as the number of elements increases.
However, examination of the phase slope error that has been introduced reveals that the DF error this introduces is small. In the example above the phase difference between elements after calibration by this method is 0.5°. With elements at a half wavelength apart the phase difference for a signal at δθ from broadside is 180° sin δθ, or 180°δθ, for a small angle. Thus in this case δθ=0.5/180=1/360 radians or about 0.16°. (The DF measurement error increases as secθ with movement to an angle θ from broadside, as the phase difference between elements between θ and θ+δθ is approximately 180° cos θδθ so in this case, δθ=0.16°secθ and if θ=60°, for example, δθ=0.32°.
Finally some more examples are presented in Table 3.
In example (a) it can be seen that there is an error of one cycle per element in estimating the unwrapping phases. As this is a linear error across the array it does not affect the error estimates. In example (b) there is an error of one cycle on all the elements. As this is a constant phase error, again it does not affect the estimation of the slope of the line or the error estimates. In this case the residual errors are very small (giving a slope of 0.1° per element) but this is just a consequence of the particular set of errors chosen (and not related to the change of signal direction to 80°). Another run, with the same input arguments, gave errors of 1.8° per element. With high channel errors (from a distribution with a standard deviation of 30° in example (c)) the possibility of errors at the second difference stage occurs, and this is shown here. Here the sixth difference the error is 37.6°−2×(−75.1°)+17.5° which exceeds 180°, resulting in an extra cycle being inserted at this point (and the following points, because of the integration). This has caused the ‘corrected’ phase to be non-linear and led to errors. This result, however, was only obtained after several runs with these arguments, without this error appearing.
On increasing the s.d. of the channel errors from 10° (in case (a)) to 20° (case (d)) it can be seen that the residual errors increase, from 1.7° per element to 2.2° per element. Of course, these values will vary statistically, and a proper estimate could only be obtained by taking a large number of cases. However, the residual errors can be expected to be generally proportional to the input error magnitudes, given by the standard deviation of the distribution.
Finally, it can be expected that increasing the number of elements, and hence the number of points that the best fit process averages over, will reduce the residual errors. Comparison of (e) and (d) shows that the errors have fallen from (−)2.2° per element to 0.6, though again this comparison is for only one run in each case, and a large number should be carried out for firm data.
Number | Date | Country | Kind |
---|---|---|---|
0520332.8 | Oct 2005 | GB | national |
0524624.4 | Dec 2005 | GB | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/GB06/50315 | 10/5/2006 | WO | 00 | 9/18/2008 |