VARIABLE STEP-SIZE LEAST MEAN SQUARE METHOD FOR ESTIMATION IN ADAPTIVE NETWORKS

Information

  • Patent Application
  • 20120106357
  • Publication Number
    20120106357
  • Date Filed
    October 27, 2010
  • Date Published
    May 03, 2012
Abstract
The variable step-size least mean square method for estimation in adaptive networks uses a variable step-size to provide estimation for each node in the adaptive network, where the step-size at each node is determined by the error calculated for each node, as opposed to conventional least mean square algorithms used in adaptive filters and the like, where the choice of step-size reflects a tradeoff between misadjustment and the speed of adaptation.
Description
BACKGROUND OF THE INVENTION

1. Field of the Invention


The present invention relates generally to sensor networks and, particularly, to wireless sensor networks in which a plurality of wireless sensors are spread over a geographical location. More particularly, the invention relates to estimation in such a network utilizing a variable step-size least mean square method for estimation.


2. Description of the Related Art


In reference to wireless sensor networks, the term “diffusion” is used to identify the type of cooperation between sensor nodes in the wireless sensor network. The data that is to be shared by any sensor is diffused into the network in order to be captured by its respective neighbors that are involved in cooperation.


Wireless sensor networks include a plurality of wireless sensors spread over a geographic area. The sensors take readings of some specific data and, if they have the capability, perform some signal processing tasks before the data is collected from the sensors for more thorough processing.


A “fusion-center based” wireless network has sensors transmitting all the data to a fixed center, where all the processing takes place. An “ad hoc” network is devoid of such a center and the processing is performed at the sensors themselves, with some cooperation between nearby neighbors of the respective sensor nodes.


Recently, several algorithms have been developed to exploit this nature of the sensor nodes and cooperation schemes have been formalized to improve estimation in sensor networks.


Least mean squares (LMS) algorithms are a class of adaptive filters used to mimic a desired filter by finding the filter coefficients that minimize the mean square of the error signal (i.e., the difference between the desired and the actual signal). The LMS algorithm is a stochastic gradient descent method, in that the filter is only adapted based on the error at the current time.
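The LMS idea described above can be illustrated with a minimal sketch for a single filter. The function name `lms_identify`, the Gaussian input and noise, the step-size value and the iteration count are all assumptions chosen for illustration; none of them comes from this specification.

```python
import random

def lms_identify(w_true, mu=0.05, n_iter=2000, noise_std=0.01, seed=0):
    """Estimate the coefficient vector w_true with a plain LMS filter (illustrative)."""
    rng = random.Random(seed)
    m = len(w_true)
    w = [0.0] * m  # initial filter coefficients
    for _ in range(n_iter):
        u = [rng.gauss(0.0, 1.0) for _ in range(m)]  # input sample (assumed Gaussian)
        noise = rng.gauss(0.0, noise_std)
        d = sum(ui * wi for ui, wi in zip(u, w_true)) + noise  # desired signal
        e = d - sum(ui * wi for ui, wi in zip(u, w))           # error at the current time
        w = [wi + mu * e * ui for wi, ui in zip(w, u)]         # stochastic gradient step
    return w

w_hat = lms_identify([0.5, -1.0, 2.0])
```

Only the current error `e` drives each update, which is what makes the method a stochastic gradient descent rather than a batch least-squares solution.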



FIG. 2 diagrammatically illustrates an exemplary adaptive network having N nodes, where the network has a predefined topology. In the following, boldface letters are used to represent vectors and matrices and non-bolded letters represent scalar quantities. Matrices are represented by capital letters and lower-case letters are used to represent vectors. The notation $(\cdot)^T$ stands for transposition for vectors and matrices, and expectation operations are denoted as $E[\cdot]$. For each node k, the number of neighbors is given by $N_k$, including the node k itself, as shown in FIG. 2. At each iteration i, the output of the system at each node is given by:






$$d_k(i) = u_{k,i}\, w^0 + \nu_k(i), \tag{1}$$


where $u_{k,i}$ is a known regressor row vector of length M, $w^0$ is an unknown column vector of length M, and $\nu_k(i)$ represents noise. The output and regressor data are used to produce an estimate of the unknown vector, given by $\psi_{k,i}$.


The adaptation can be performed using two different techniques. The first technique is the Incremental Least Mean Squares (ILMS) method, in which each node updates its own estimate at every iteration and then passes on its estimate to the next node. The estimate of the last node is taken as the final estimate of that iteration. The second technique is the Diffusion LMS (DLMS), where each node combines its own estimate with the estimates of its neighbors using some combination technique, and then the combined estimate is used for updating the node estimate. This method is referred to as Combine-then-Adapt (CTA) diffusion. It is also possible to first update the estimate using the estimate from the previous iteration and then combine the updates from all neighbors to form the final estimate for the iteration. This method is known as Adapt-then-Combine (ATC) diffusion. Simulation results show that ATC diffusion outperforms CTA diffusion.


Using LMS, the ATC diffusion algorithm is given by:










$$\left\{ \begin{aligned} f_{k,i} &= y_{k,i-1} + \mu_k\, u_{k,i}^T \left( d_k(i) - u_{k,i}\, y_{k,i-1} \right) \\ y_{k,i} &= \sum_{l \in N_k} c_{lk}\, f_{l,i} \end{aligned} \right\}, \tag{2}$$







where $\{c_{lk}\}_{l \in N_k}$ are the combination weights for node k, which are fixed, $\{f_{l,i}\}_{l \in N_k}$ are the local estimates of the nodes neighboring node k, and $\mu_k$ is the node step-size.
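The adapt-then-combine recursion with a fixed step-size can be sketched as follows. This is a hedged illustration only: the function name `atc_diffusion_lms`, the uniform combination weights $c_{lk} = 1/N_k$, the Gaussian regressors and noise, and all numeric defaults are assumptions made for the example and are not taken from this specification.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def atc_diffusion_lms(w_true, neighbors, mu=0.05, n_iter=1500,
                      noise_std=0.05, seed=1):
    """Fixed step-size ATC diffusion LMS (illustrative sketch).

    neighbors[k] lists the neighbors of node k, including k itself.
    Uniform combination weights c_lk = 1 / N_k are an assumption.
    """
    rng = random.Random(seed)
    n, m = len(neighbors), len(w_true)
    y = [[0.0] * m for _ in range(n)]  # y_{k,0} for every node
    for _ in range(n_iter):
        f = []
        for k in range(n):
            u = [rng.gauss(0.0, 1.0) for _ in range(m)]     # regressor u_{k,i}
            d = dot(u, w_true) + rng.gauss(0.0, noise_std)  # output d_k(i)
            e = d - dot(u, y[k])
            # Adapt: local estimate f_{k,i} from the node's previous estimate
            f.append([yj + mu * e * uj for yj, uj in zip(y[k], u)])
        # Combine: average the local estimates over each neighborhood
        y = [[sum(f[l][j] for l in neighbors[k]) / len(neighbors[k])
              for j in range(m)] for k in range(n)]
    return y

# Hypothetical 5-node ring in which each node hears its two adjacent nodes
ring = [[(k - 1) % 5, k, (k + 1) % 5] for k in range(5)]
estimates = atc_diffusion_lms([1.0, -0.5], ring)
```

Swapping the order of the two inner steps (combine first, then adapt on the combined estimate) would give the CTA variant described earlier.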


The conventional Diffusion Least Mean Square (LMS) technique uses a fixed step-size, which is chosen as a trade-off between steady-state misadjustment and speed of convergence. Fast convergence and low steady-state misadjustment cannot both be achieved with this technique.


Thus, a variable step-size least mean square method for estimation in adaptive networks solving the aforementioned problems is desired.


SUMMARY OF THE INVENTION

The variable step-size least mean square method for estimation in adaptive networks uses a variable step-size to provide estimation for each node in the adaptive network, where the step-size at each node is determined by the error calculated for each node. This is in contrast to conventional least mean square algorithms used in adaptive filters and the like, in which the choice of step-size reflects a tradeoff between misadjustment and the speed of adaptation.


In a first embodiment, a variable step-size incremental least mean square method for estimation in adaptive networks is given by the following steps: (a) establishing a network having N nodes, where N is an integer greater than one, and establishing a Hamiltonian cycle among the nodes such that each node is connected to two neighboring nodes, one from which it receives data and one to which it transmits data; (b) establishing an integer i and initially setting i=1; (c) calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, where $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, where M is an integer; (d) calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k-1,i}$, where $y_{k,i}$ represents an estimate of an output vector for each node k at iteration i; (e) calculating a node step size $\mu_k$ for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$, where α and γ are unitless, selectable parameters; (f) calculating an estimate of an output vector $y_{k,i}$ for each node k as $y_{k,i} = y_{k-1,i} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k-1,i})$; (g) if $e_k(i)$ is greater than a selected error threshold, then setting i=i+1 and returning to step (c); otherwise, (h) defining a set of output vectors $y_k$ for each node k, where $y_k = y_{k,i}$; and (i) storing the set of output vectors in computer readable memory.


In an alternative embodiment, a variable step-size diffusion least mean square method for estimation in adaptive networks is given by the following steps: (a) establishing an adaptive network having N nodes, where N is an integer greater than one, and for each node k, a number of neighbors of node k is given by $N_k$, including the node k, where k is an integer between one and N; (b) establishing an integer i and initially setting i=1; (c) calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, where $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, where M is an integer; (d) calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k,i-1}$, where $y_{k,i}$ represents an estimate of an output vector for each node k at iteration i; (e) calculating a node step size $\mu_k$ for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$, where α and γ are unitless, selectable parameters; (f) calculating a local estimate for each node neighboring node k, $f_{k,i}$, for each node k as $f_{k,i} = y_{k,i-1} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k,i-1})$; (g) calculating an estimate of an output vector $y_{k,i}$ for each node k as








$$y_{k,i} = \sum_{l=1}^{N_k} c_{lk}\, f_{l,i},$$




where l is an integer and $c_{lk}$ represents a combination weight for node k, which is fixed; (h) if $e_k(i)$ is greater than a selected error threshold, then setting i=i+1 and returning to step (c); otherwise, (i) defining a set of output vectors $y_k$ for each node k, where $y_k = y_{k,i}$; and (j) storing the set of output vectors in computer readable memory.


These and other features of the present invention will become readily apparent upon further review of the following specification.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a block diagram of a system for implementing a variable step-size least mean square method for estimation in adaptive networks according to the present invention.



FIG. 2 is a diagram schematically illustrating an adaptive network with N nodes.



FIG. 3 is a diagram schematically illustrating network topology for an exemplary simulation of the variable step-size least mean square method for estimation in adaptive networks according to the present invention.



FIG. 4A is a graph of signal-to-noise ratio as chosen for simulation at each node in the exemplary simulation of FIG. 3.



FIG. 4B is a graph of simulated values of noise power at each node in the exemplary simulation of FIG. 3.



FIGS. 5A and 5B are comparison plots illustrating mean square deviation as a function of time for two exemplary scenarios using the variable step-size least mean square method for estimation in adaptive networks according to the present invention.



FIGS. 6A and 6B are comparison plots illustrating steady-state mean square deviations at each node for the exemplary scenarios of FIGS. 5A and 5B, respectively.





Similar reference characters denote corresponding features consistently throughout the attached drawings.


DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The variable step-size least mean square method for estimation in adaptive networks uses a variable step-size to provide estimation for each node in the adaptive network, where the step-size at each node is determined by the error calculated for each node. This is in contrast to conventional least mean square algorithms used in adaptive filters and the like, in which the choice of step-size reflects a tradeoff between misadjustment and the speed of adaptation. An example of a conventional LMS algorithm is shown in chapter ten of A. H. Sayed, Adaptive Filters, New York: Wiley, 2008, pgs. 163-166. An example of a variable step-size LMS algorithm is shown in R. H. Kwong and E. W. Johnston, “A Variable Step Size LMS Algorithm”, IEEE Transactions on Signal Processing, Vol. 40, No. 7, July 1992, each of which is herein incorporated by reference in its entirety.


In a first, incremental variable step-size least mean square method, the estimate of the unknown vector yk,i is given by:






$$y_{k,i} = y_{k-1,i} + \mu_k(i)\, u_{k,i}^T \left( d_k(i) - u_{k,i}\, y_{k-1,i} \right). \tag{3}$$


Every node updates its estimate based on the updated estimate coming from the previous node. The error is given by:






$$e_k(i) = d_k(i) - u_{k,i}\, y_{k-1,i}. \tag{4}$$


The node step size μk is given by:





$$\mu_k(i) = \alpha\, \mu_k(i-1) + \gamma\, e^2(i), \tag{5}$$


where α and γ are controlling parameters. Updating equation (3) with the step-size given by equation (5) results in the variable step-size incremental least mean square method defined by the following recursion:






$$y_{k,i} = y_{k-1,i} + \mu_k(i)\, u_{k,i}^T \left( d_k(i) - u_{k,i}\, y_{k-1,i} \right). \tag{6}$$


Thus, the variable step-size incremental least mean square method is given by the following steps: (a) establishing an adaptive network having N nodes, where N is an integer greater than one, and for each node k, a number of neighbors of node k is given by $N_k$, including the node k, where k is an integer between one and N, and establishing a Hamiltonian cycle among the nodes such that each node is connected to two neighboring nodes, each node receiving data from one of the neighboring nodes and transmitting data to the other neighboring node; (b) establishing an integer i and initially setting i=1; (c) calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, where $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, where M is an integer; (d) calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k-1,i}$; (e) calculating a node step size for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$; (f) calculating an estimate of an output vector $y_{k,i}$ for each node k as $y_{k,i} = y_{k-1,i} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k-1,i})$; (g) if $e_k(i)$ is greater than a selected error threshold, then setting i=i+1 and returning to step (c); otherwise, (h) defining a set of output vectors $y_k$ for each node k, where $y_k = y_{k,i}$; and (i) storing the set of output vectors in computer readable memory.
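The incremental steps above can be sketched in a few lines. This is an illustration under stated assumptions, not the claimed implementation: the function name `vss_incremental_lms` is hypothetical, the nodes are visited in index order as a stand-in for the Hamiltonian cycle, the error in the step-size recursion is taken to be the node's own error $e_k(i)$, the initial step-size `mu_init` and the Gaussian data model are assumed, and a fixed iteration count replaces the error-threshold test of step (g). The defaults for α and γ are the incremental values quoted in the simulation section.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def vss_incremental_lms(w_true, n_nodes=5, alpha=0.997, gamma=2e-4,
                        mu_init=0.05, n_iter=2000, noise_std=0.1, seed=2):
    """Variable step-size incremental LMS (illustrative sketch)."""
    rng = random.Random(seed)
    m = len(w_true)
    mu = [mu_init] * n_nodes  # mu_k(0); the starting value is an assumption
    y = [0.0] * m             # estimate handed from node to node
    for _ in range(n_iter):
        for k in range(n_nodes):  # one pass around the cycle per iteration i
            u = [rng.gauss(0.0, 1.0) for _ in range(m)]     # regressor u_{k,i}
            d = dot(u, w_true) + rng.gauss(0.0, noise_std)  # output, eq. (1)
            e = d - dot(u, y)                               # error, eq. (4)
            mu[k] = alpha * mu[k] + gamma * e * e           # step-size, eq. (5)
            y = [yj + mu[k] * e * uj for yj, uj in zip(y, u)]  # update, eq. (6)
    return y, mu  # y is the estimate leaving the last node

y_final, mu_final = vss_incremental_lms([1.0, -0.5])
```

Because each step-size is fed by the squared error, it stays large while the estimate is far from the unknown vector and shrinks as the error approaches the noise floor.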


In an alternative embodiment, a second, diffusion variable step-size least mean square method of estimation is used. The diffusion variable step-size least mean square method of estimation is based upon equation set (2), and the error is given by:






$$e_k(i) = d_k(i) - u_{k,i}\, y_{k,i-1}. \tag{7}$$


Applying this error to equation (5) yields the following update equations:










$$\left\{ \begin{aligned} f_{k,i} &= y_{k,i-1} + \mu_k(i)\, u_{k,i}^T \left( d_k(i) - u_{k,i}\, y_{k,i-1} \right) \\ y_{k,i} &= \sum_{l \in N_k} c_{lk}\, f_{l,i} \end{aligned} \right\}. \tag{8}$$







Thus, the variable step-size diffusion least mean square method is given by the following steps: (a) establishing an adaptive network having N nodes, where N is an integer greater than one, and for each node k, a number of neighbors of node k is given by $N_k$, including the node k, where k is an integer between one and N; (b) establishing an integer i and initially setting i=1; (c) calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, where $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, where M is an integer; (d) calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k,i-1}$; (e) calculating a node step size for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$; (f) calculating a local estimate for each node neighboring node k, $f_{k,i}$, for each node k as $f_{k,i} = y_{k,i-1} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k,i-1})$; (g) calculating an estimate of an output vector $y_{k,i}$ for each node k as








$$y_{k,i} = \sum_{l=1}^{N_k} c_{lk}\, f_{l,i},$$




where l is an integer and $c_{lk}$ represents a combination weight for node k, which is fixed; (h) if $e_k(i)$ is greater than a selected error threshold, then setting i=i+1 and returning to step (c); otherwise, (i) defining a set of output vectors $y_k$ for each node k, where $y_k = y_{k,i}$; and (j) storing the set of output vectors in computer readable memory.
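The diffusion steps above can be sketched in the same spirit. Again this is an illustration under assumptions rather than the claimed implementation: the name `vss_diffusion_lms`, the uniform combination weights $c_{lk} = 1/N_k$, the use of each node's own error $e_k(i)$ in the step-size recursion, the initial step-size `mu_init`, the Gaussian data model and the fixed iteration count (in place of the error-threshold test) are all choices made for the example. The defaults for α and γ are the diffusion values quoted in the simulation section.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def vss_diffusion_lms(w_true, neighbors, alpha=0.998, gamma=2e-5,
                      mu_init=0.05, n_iter=2000, noise_std=0.1, seed=3):
    """Variable step-size ATC diffusion LMS (illustrative sketch).

    neighbors[k] lists the neighbors of node k, including k itself.
    """
    rng = random.Random(seed)
    n, m = len(neighbors), len(w_true)
    mu = [mu_init] * n                 # mu_k(0); the starting value is an assumption
    y = [[0.0] * m for _ in range(n)]  # y_{k,0} for every node
    for _ in range(n_iter):
        f = []
        for k in range(n):
            u = [rng.gauss(0.0, 1.0) for _ in range(m)]     # regressor u_{k,i}
            d = dot(u, w_true) + rng.gauss(0.0, noise_std)  # output, eq. (1)
            e = d - dot(u, y[k])                            # error, eq. (7)
            mu[k] = alpha * mu[k] + gamma * e * e           # step-size, eq. (5)
            # Adapt: local estimate f_{k,i}, first line of eq. (8)
            f.append([yj + mu[k] * e * uj for yj, uj in zip(y[k], u)])
        # Combine: second line of eq. (8) with assumed uniform weights
        y = [[sum(f[l][j] for l in neighbors[k]) / len(neighbors[k])
              for j in range(m)] for k in range(n)]
    return y, mu

# Hypothetical 5-node ring in which each node hears its two adjacent nodes
ring = [[(k - 1) % 5, k, (k + 1) % 5] for k in range(5)]
estimates, step_sizes = vss_diffusion_lms([1.0, -0.5], ring)
```

Each node keeps its own step-size, so a node with a noisier measurement settles at a different step-size than its neighbors while still benefiting from their estimates in the combine step.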


To analyze the performance of the variable step-size incremental least mean square method, the variance relation is given by:






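$$E\left[\|\bar{y}_k\|_{\bar{S}}^2\right] = E\left[\|\bar{y}_{k-1}\|_{\bar{S}'}^2\right] + \mu_k^2\, \sigma_{\nu,k}^2\, \mathrm{Tr}(L_k \bar{S}) \tag{9}$$

$$\bar{S}' = \bar{S} - \mu_k \left( L_k \bar{S} + \bar{S} L_k \right) + \mu_k^2 \left( L_k\, \mathrm{Tr}(\bar{S} L_k) + 2 L_k \bar{S} L_k \right) \tag{10}$$


where $\bar{S}$ is a Gaussian normalized weighting matrix, $L_k$ is a diagonal matrix containing the eigenvalues for the regressor vector at node k, and $\sigma_{\nu,k}^2$ is the noise variance. Equations (9) and (10) show how the variance of the Gaussian normalized error vector $\bar{y}_k$ iterates from node to node. When equation (5) is incorporated into the results, an independence assumption is invoked, resulting in the following: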






$$E\left[\mu_k(i)\, u_{k,i}^T\, e_k(i)\right] = E\left[\mu_k(i)\right] E\left[u_{k,i}^T\, e_k(i)\right], \tag{11}$$


thus yielding the following new recursions, respectively, from equations (9) and (10):






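$$E\left[\|\bar{y}_k^{\,i}\|_{\bar{S}}^2\right] = E\left[\|\bar{y}_{k-1}^{\,i}\|_{\bar{S}'}^2\right] + E\left[\mu_{k,i-1}^2\right] \sigma_{\nu,k}^2\, \mathrm{Tr}(L_k \bar{S}) \tag{12}$$

$$\bar{S}' = \bar{S} - E\left[\mu_{k,i-1}\right] \left( L_k \bar{S} + \bar{S} L_k \right) + E\left[\mu_{k,i-1}^2\right] \left( L_k\, \mathrm{Tr}(\bar{S} L_k) + 2 L_k \bar{S} L_k \right). \tag{13}$$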


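At steady state, the step-size expectation values in equations (12) and (13), $\bar{\mu}_k = E[\mu_{k,\infty}]$ and $\overline{\mu_k^2} = E[\mu_{k,\infty}^2]$, respectively, are given by:

$$\bar{\mu}_k = \frac{\gamma \left( M_k + \sigma_{\nu,k}^2 \right)}{1 - \alpha} \tag{14}$$

$$\overline{\mu_k^2} = \frac{2 \alpha \gamma\, \bar{\mu}_k \left( \sigma_{\nu,k}^2 + M_k \right) + 3 \gamma^2 \left( \sigma_{\nu,k}^2 + M_k \right)^2}{1 - \alpha^2} \tag{15}$$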







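where $M_k$ is the steady-state misadjustment for the step-size, given by:

$$M_k = \frac{1 - \left[ 1 - \dfrac{2 (3 - \alpha)\, \gamma\, \sigma_{\nu,k}^2}{1 - \alpha^2}\, \mathrm{Tr}(L) \right]}{1 + \left[ 1 - \dfrac{2 (3 - \alpha)\, \gamma\, \sigma_{\nu,k}^2}{1 - \alpha^2}\, \mathrm{Tr}(L) \right]}. \tag{16}$$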







The above can be directly incorporated into steady-state equations for calculation of mean square deviation (MSD) and excess mean square error (EMSE) in order to obtain the values of MSD and EMSE. Similarly, the variance relation for the variable step-size diffusion least mean square method is given by:






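$$E\left[\|\bar{y}^{\,i}\|_{\bar{S}}^2\right] = E\left[\|\bar{y}^{\,i-1}\|_{F\bar{S}}^2\right] + b^T s \tag{17}$$

$$F = \left( G^T \odot G^* \right) \left[ I_{N^2 M^2} - \left( I_{NM} \odot L D \right) - \left( L D \odot I_{NM} \right) + \left( D \odot D \right) A \right], \tag{18}$$

where

$$s = \mathrm{bvec}\{\bar{S}\}, \text{ and} \tag{19}$$

$$b = \mathrm{bvec}\{R_\nu D^2 L\}, \tag{20}$$


where $R_\nu$ is the noise auto-correlation matrix, $\mathrm{bvec}\{\cdot\}$ is the block vectorization operator, D is the block diagonal step-size matrix for the whole network, G is the block combiner matrix, A is the block vectorized form of the fourth order weighted moment of the regressor vectors, and $\odot$ is the block Kronecker product. Applying equation (11), equations (18) and (20) respectively yield:

$$F = \left( G^T \odot G^* \right) \left[ I_{N^2 M^2} - \left( I_{NM} \odot L\, E[D] \right) - \left( L\, E[D] \odot I_{NM} \right) + E\left[ \left( D \odot D \right) A \right] \right] \tag{21}$$

$$b = \mathrm{bvec}\{R_\nu\, E[D^2]\, L\}. \tag{22}$$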


Since the step-size matrix is block-diagonal, the above operations are relatively straightforward. Steady-state matrices may be formed for the step-sizes using equations (14) and (15). Using these matrices directly in the steady-state equations provides the MSD and EMSE values for the variable step-size diffusion least mean square method.
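The steady-state step-size moments of equations (14) and (15) can be evaluated numerically with a short helper. This is a sketch only: the function name `steady_state_step_moments` is hypothetical, and the misadjustment $M_k$ is passed in as an input rather than computed, so the example does not depend on the exact form of equation (16). The numeric values in the usage line other than α and γ are made up for illustration.

```python
def steady_state_step_moments(alpha, gamma, noise_var, misadjustment):
    """Return (E[mu], E[mu^2]) at steady state following eqs. (14) and (15).

    noise_var is the node noise variance and misadjustment is M_k,
    supplied by the caller.
    """
    level = noise_var + misadjustment
    mean_mu = gamma * level / (1.0 - alpha)                  # eq. (14)
    mean_mu_sq = (2.0 * alpha * gamma * mean_mu * level
                  + 3.0 * gamma ** 2 * level ** 2) / (1.0 - alpha ** 2)  # eq. (15)
    return mean_mu, mean_mu_sq

# Incremental-method parameters from the simulation section; noise_var and
# misadjustment here are illustrative values only.
mean_mu, mean_mu_sq = steady_state_step_moments(
    alpha=0.997, gamma=2e-4, noise_var=1e-3, misadjustment=0.0)
```

The two returned values are the per-node entries that would populate the steady-state step-size matrices $E[D]$ and $E[D^2]$.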


Simulation examples are described below to illustrate the performance of the variable step-size LMS methods over adaptive networks. Results are compared with conventional fixed-step LMS methods. A network topology with N=15 nodes is considered, as shown in FIG. 3. For the variable step-size incremental LMS method, α is set to 0.997 and γ is set to $2\times10^{-4}$. For the variable step-size diffusion LMS method, α is set to 0.998 and γ is set to $2\times10^{-5}$. These exemplary values ensure that the convergence rate is the same for all methods. Two distinct scenarios are presented in the results below. In the first scenario, the signal-to-noise ratio (SNR) and the noise power profile are shown in FIGS. 4A and 4B, respectively. In the second scenario, the noise power for Node 5 is increased from $6\times10^{-4}$ to $1\times10^{-2}$, which reduces the SNR to approximately 18 dB. As a result, there is a visible deterioration in performance.



FIG. 5A illustrates the results obtained for the first scenario. As can be seen in FIG. 5A, there is a great improvement in performance obtained through the usage of both incremental and diffusion variable step-size LMS methods. There is an approximately 25 dB difference in favor of the variable step-size diffusion LMS method, compared to the conventional fixed step-size method. In the second scenario, the performance of the variable step-size diffusion LMS method deteriorates only by approximately 3 dB, whereas that of the variable step-size incremental LMS method deteriorates by nearly 9 dB, as shown in FIG. 5B.



FIGS. 6A and 6B illustrate the steady-state MSD values at each node for both the incremental and diffusion methods, with the variable step-size diffusion LMS method being shown to be superior to both the incremental method and the fixed step-size conventional LMS method.
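For readers reproducing such comparisons, the mean square deviation in decibels can be computed with a small helper. This is a sketch under assumptions: the function name `msd_db` is hypothetical, and the MSD is approximated by averaging the squared deviation from the unknown vector over a list of estimates (for example, one per simulation run), which is a common convention rather than something stated in this specification.

```python
import math

def msd_db(estimates, w_true):
    """Average squared deviation of the estimates from w_true, in dB."""
    sq_devs = [sum((a - b) ** 2 for a, b in zip(y, w_true)) for y in estimates]
    return 10.0 * math.log10(sum(sq_devs) / len(sq_devs))

# Illustrative values only: one estimate that is off by 0.1 in one coordinate
example = msd_db([[0.1, 0.0]], [0.0, 0.0])
```

On this scale, a difference of 25 dB corresponds to a ratio of about 316 between two MSD values, since $10^{2.5} \approx 316$.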



FIG. 1 illustrates a generalized system 10 for implementing the variable step-size least mean square method for estimation in adaptive networks, although it should be understood that the generalized system 10 may represent a stand-alone computer, computer terminal, portable computing device, networked computer or computer terminal, or networked portable device. Data may be entered into the system 10 by the user via any suitable type of user interface 18, and may be stored in computer readable memory 14, which may be any suitable type of computer readable and programmable memory. Calculations are performed by the processor 12, which may be any suitable type of computer processor, and may be displayed to the user on the display 16, which may be any suitable type of computer display. The system 10 preferably includes a network interface 20, such as a modem or the like, allowing the computer to be networked with either a local area network or a wide area network.


The processor 12 may be associated with, or incorporated into, any suitable type of computing device, for example, a personal computer or a programmable logic controller. The display 16, the processor 12, the memory 14, the user interface 18, network interface 20 and any associated computer readable media are in communication with one another by any suitable type of data bus, as is well known in the art. Additionally, other standard components, such as a printer or the like, may interface with system 10 via any suitable type of interface.


Examples of computer readable media include a magnetic recording apparatus, an optical disk, a magneto-optical disk, and/or a semiconductor memory (for example, RAM, ROM, etc.). Examples of magnetic recording apparatus that may be used in addition to memory 14, or in place of memory 14, include a hard disk device (HDD), a flexible disk (FD), and a magnetic tape (MT). Examples of the optical disk include a DVD (Digital Versatile Disc), a DVD-RAM, a CD-ROM (Compact Disc-Read Only Memory), and a CD-R (Recordable)/RW.


It is to be understood that the present invention is not limited to the embodiments described above, but encompasses any and all embodiments within the scope of the following claims.

Claims
  • 1. A variable step-size least mean square method for estimation in adaptive networks, comprising the steps of: (a) establishing an adaptive network having N nodes, where N is an integer greater than one, and establishing a Hamiltonian cycle among the nodes such that each node is connected to two neighboring nodes, each node receiving data from a first of the neighboring nodes and transmitting data to a second of the neighboring nodes; (b) establishing an integer i and initially setting i=1; (c) calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, where $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, wherein M is an integer; (d) calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k-1,i}$, wherein $y_{k,i}$ represents an estimate of an output vector for each node k at iteration i; (e) calculating a node step size $\mu_k$ for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$, wherein α and γ are unitless, selectable parameters; (f) calculating the estimate of the output vector $y_{k,i}$ for each node k as $y_{k,i} = y_{k-1,i} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k-1,i})$; (g) if $e_k(i)$ is greater than a selected error threshold, then setting i=i+1 and returning to step (c); otherwise, (h) defining a set of output vectors $y_k$ for each node k, where $y_k = y_{k,i}$; and (i) storing the set of output vectors in computer readable memory.
  • 2. A variable step-size least mean square method for estimation in adaptive networks comprising the steps of: (a) establishing an adaptive network having N nodes, wherein N is an integer greater than one, and for each node k, a number of neighbors of node k is given by $N_k$, including the node k, wherein k is an integer between one and N; (b) establishing an integer i and initially setting i=1; (c) calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, wherein $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, wherein M is an integer; (d) calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k,i-1}$, wherein $y_{k,i}$ represents an estimate of an output vector for each node k at iteration i; (e) calculating a node step size $\mu_k$ for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$, wherein α and γ are unitless, selectable parameters; (f) calculating a local estimate for each node neighboring node k, $f_{k,i}$, for each node k as $f_{k,i} = y_{k,i-1} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k,i-1})$; (g) calculating an estimate of an output vector $y_{k,i}$ for each node k as
  • 3. A system for performing variable step-size least mean square estimation for an adaptive network, the adaptive network having N nodes, and for each node k, a number of neighbors of node k is given by $N_k$, including the node k, wherein k is an integer between one and N, the system comprising: a processor; computer readable memory coupled to the processor; a user interface coupled to the processor; a display; and software stored in the memory and executable by the processor, the software having: means for establishing an integer i and initially setting i=1; means for calculating an output of the adaptive network at each node k as $d_k(i) = u_{k,i} w^0 + \nu_k(i)$, wherein $u_{k,i}$ represents a known regressor row vector of length M, $w^0$ represents an unknown column vector of length M and $\nu_k(i)$ represents noise in the adaptive network, wherein M is an integer; means for calculating an error value $e_k(i)$ at each node k as $e_k(i) = d_k(i) - u_{k,i} y_{k,i-1}$, wherein $y_{k,i}$ represents an estimate of an output vector for each node k at iteration i; means for calculating a node step size $\mu_k$ for each node k as $\mu_k(i) = \alpha \mu_k(i-1) + \gamma e^2(i)$, wherein α and γ are unitless, selectable parameters; means for calculating a local estimate for each node neighboring node k, $f_{k,i}$, for each node k as $f_{k,i} = y_{k,i-1} + \mu_k(i) u_{k,i}^T (d_k(i) - u_{k,i} y_{k,i-1})$; means for calculating an estimate of an output vector $y_{k,i}$ for each node k as