This application claims the benefit under 35 U.S.C. §119(a) of Korean Patent Applications No. 10-2010-0133789, filed on Dec. 23, 2010, and No. 10-2011-0090383, filed on Sep. 6, 2011, the entire disclosures of which are incorporated herein by reference for all purposes.
1. Field
The following description relates to system modeling, and more particularly, to an apparatus and method for adaptively modeling a system by estimating system model parameters using statistical characteristics of signals.
2. Description of the Related Art
A signal processing system needs an algorithm for optimizing itself under a predetermined criterion upon signal processing when no information about a certain input signal is given, in order to vary system characteristics as necessary. Such an algorithm is called an adaptive algorithm.
A representative adaptive algorithm is a steepest descent algorithm. However, the steepest descent algorithm has a problem that it is not easy to adjust a convergence speed.
The following description relates to an apparatus and method for adaptive gravity search, capable of easily adjusting a convergence speed and a convergence pattern.
In one general aspect, there is provided a gravity search adaptive apparatus for modeling a system, including: a system modeling unit configured to receive the same signal as a signal input to the system, to convert the signal using a system modeling parameter, and to output the converted signal; and an adaptive controller configured to use a gravity search adaptive algorithm to detect the system modeling parameter so that an error signal which is a difference between an output signal of the system and an output signal of the system modeling unit converges on a minimum value.
In another general aspect, there is provided a gravity search adaptive method including: calculating a system modeling parameter of a system modeling unit for minimizing an error signal which is a difference between an output signal of a system and an output signal of the system modeling unit when the same signal has been input to the system and the system modeling unit; and applying the system modeling parameter to the system modeling unit.
Other features and aspects will be apparent from the following detailed description, the drawings, and the claims.
Throughout the drawings and the detailed description, unless otherwise described, the same drawing reference numerals will be understood to refer to the same elements, features, and structures. The relative size and depiction of these elements may be exaggerated for clarity, illustration, and convenience.
The following description is provided to assist the reader in gaining a comprehensive understanding of the methods, apparatuses, and/or systems described herein. Accordingly, various changes, modifications, and equivalents of the methods, apparatuses, and/or systems described herein will be suggested to those of ordinary skill in the art. Also, descriptions of well-known functions and constructions may be omitted for increased clarity and conciseness.
Referring to
The system 110 processes and outputs input signals. The processing characteristics of the system 110 are not known or changes in time depend on environmental factors.
In
The system modeling unit 120 has been modeled as mathematical representation of the system 110. The system modeling unit 120 is used to model a unknown system or a system whose characteristics vary depending on environmental factors. {right arrow over (w)}[k], which is an (N×1) vector, represents a parameter vector of the system modeling unit 120. The system modeling unit 120 receives the input signal x[k] of the system 110 and applies {right arrow over (w)}[k] to the input signal x[k], thus outputting an output signal ŷ[k].
The adaptive controller 130 controls the system 110 and the system modeling unit 120 so that the same signal is applied to the system 110 and the system modeling unit 120, and compares an output signal from the system 110 to an output signal from the system modeling unit 120 to thereby estimate a parameter of the system modeling unit 120. In detail, the adaptive controller 130 includes an error detector 131 and a modeling parameter calculator 132.
The error detector 131 detects a difference between the output signal y[k] of the system 110 and the output signal ŷ[k] of the system modeling unit 120, thus outputting the detected difference as an error signal e[k]. Then, the modeling parameter calculator 132 calculates a modeling parameter vector for reducing power of the error signal e[k] output from the error detector 131, and applies the modeling parameter vector to the system modeling unit 120. Then, the modeling parameter calculator 132 continues to calculate modeling parameters until the error signal e[k] between the output signal y[k] of the system 110 and the output signal ŷ[k] of the system modeling unit 120 obtained when the previously calculated modeling parameter is applied to the system modeling unit 120 converges on a minimum value, and supplies the calculated modeling parameters to the system modeling unit 120 to thereby update the system modeling unit 120.
That is, the adaptive controller 130 estimates a system modeling parameter for approximating the output value of the system modeling unit 120 to the output value y[k] of the system 110, thereby estimating the characteristics of the system 110.
The output signal ŷ[k] of the system modeling unit 120 at time k is expressed by equation 1 below.
In equation 1, H represents Hermitian Transpose, and (N×1) vectors {right arrow over (w)}[k] and {right arrow over (x)}[k] are defined as follows.
An adaptive algorithm is aimed at obtaining a modeling parameter vector for minimizing the power of an error signal which is a difference between the output signal of a real system and the output signal of a system modeling unit.
When noise n[k] exists in a measured signal, the error signal e[k] can be expressed by equation 2 below.
e[k]=y[k]−ŷ[k]+n[k] (2)
Also, the power ε of the error signal e[k] is given as equation 3 below.
In equation 3, E{•} represents an ensemble average.
Also, R, which is a (N×N) matrix, can be expressed by equation 4 below.
R=E{
[k]
[k]
H} (4)
In equation 3, {right arrow over (P)}, which is a (N×1) vector, can be defined as equation 5 below.
=E{[k]y*[k]} (5)
In equation 3, ε is also called a cost function, and since the cost function c is a quadratic function of {right arrow over (w)}[k], an optimum coefficient vector {right arrow over (w)}opt at which the gradient of ε becomes zero so that c reaches a minimum value can be obtained. If {right arrow over (∇)}[k] is a gradient vector of ε, {right arrow over (∇)}[k] can be expressed by equation 6 below.
If the inverse matrix R−1 of R exists, the optimum coefficient vector {right arrow over (w)}opt and the optimum power εopt of the error signal ε when {right arrow over (w)}[k]={right arrow over (w)}opt are given as equations 7 and 8 below.
opt
=R
−1
(7)
εopt=σy2−R−1+σn2 (8)
However, since directly calculating the inverse matrix R−1 in equation 7 requires a large amount of calculation, applying a method of directly obtaining the inverse matrix R−1 in implementing a real system is improper in many cases. In those cases, an adaptive algorithm which repeatedly updates {right arrow over (w)}[k] so that {right arrow over (w)}[k] gradually converges on {right arrow over (w)}opt is utilized, instead of directly calculating the inverse matrix R−1. One of adaptive algorithms that has been widely used for the purpose is a steepest descent algorithm, and the steepest descent algorithm is also called a gradient descent algorithm.
The steepest descent algorithm calculates a gradient vector {right arrow over (∇)}[k] of ε with respect to the parameter vector {right arrow over (w)}[k] using the fact that ε is a quadratic function of the parameter vector {right arrow over (w)}[k], and updates the parameter vector {right arrow over (w)}[k] by the magnitude of the calculated gradient vector {right arrow over (∇)}[k] in the opposite direction of the gradient to thereby obtain a next parameter vector Δ{right arrow over (w)}[k+1].
By repeating the process, the parameter vector {right arrow over (w)}[k] finally converges on the optimum coefficient vector {right arrow over (w)}opt for minimizing ε as the k value increases.
The steepest descent algorithm can be expressed by equation 9 below.
A convergence speed of the steepest descent algorithm can be adjusted using a μ value. That is, the μ value, which is a proportional coefficient used to update the optimum coefficient vector {right arrow over (w)}opt, decides the convergence speed. If the μ value is large, the convergence speed is fast, and if the μ value is small, the convergence speed is slow.
Meanwhile, if it is defined that Δ{right arrow over (w)}[k]={right arrow over (w)}[k]−{right arrow over (w)}opt, equations 10 and 11 can be induced from equations 7 and 8.
Δ[k+1]=(I−μR)Δ[k]=(I−μR)k+1Δ[0] (10)
ε=εopt+Δ[k]HRΔ[k] (11)
If R=VAV−1 is given as eigen-value decomposition of R, Δ{right arrow over (w)}[k+1] can be rewritten as equation 12 below.
Δ[k+1]=V(I−μΛ)k−1V−1Δ[0] (12)
As seen from equation 12, in order to make the steepest descent algorithm converge, the value has to satisfy equation 13 below.
0<μ<1/λmax (13)
In order to implement the steepest descent algorithm written as equation 9, the matrix R and the vector {right arrow over (P)} defined in equations 4 and 5 respectively have to be given. One method is to calculate time averages of samples and use the time averages as estimated values of R and {right arrow over (P)}, instead of directly obtaining R and {right arrow over (P)} using an ensemble average. A well-known Least Mean Squares (LMS) algorithm uses a single sample of each of {right arrow over (x)}[k] and y[k] in order to estimate R and {right arrow over (P)}, as written by equations 14 and 15 below.
{circumflex over (R)}=
[k]
[k]
H (14)
=[k]y*[k] (15)
As such, by estimating the instantaneous values of R and {right arrow over (P)}, the steepest descent algorithm is simplified to a LSM algorithm, which is expressed as equation 16 below.
[k+1]=[k]+μ[k]e*[k] (16)
The convergence speed of the LSM algorithm may be adjusted using the μ value. As seen in equation 13, as the eigen-value is large, a small μ value may be used, and if the eigen-value is small, a large μ value may be used.
Accordingly, in order to overcome the problem of the steepest descent algorithm, gravity is used. Since a new adaptive algorithm proposed in this specification has been developed based on physical characteristics that occur as a natural phenomenon, it has a stable characteristic by the law of energy conservation. Also, since the proposed adaptive algorithm has degrees of freedom two in adjusting the convergence speed of the adaptive algorithm, the adaptive algorithm can adjust the convergence speed and the convergence pattern more flexibly according to application fields.
Since gravity pulls down the ball, the ball moves downward to the bottom of the container along the inner surface of the container. At this time, the potential energy of the ball is converted to kinetic energy to move the ball, the kinetic energy is lost due to friction with the liquid, and resultantly the ball is stopped at the lowest position (that is, the bottom of the container) of the container.
At this time, the behavior of the ball, that is, how fast the ball moves toward the bottom of the container or how quickly the ball is stopped at the bottom depends on the mass m of the ball, the magnitude g of gravitational acceleration g, and the friction force f of the liquid.
As such, a natural phenomenon can be used as a model to obtain an adaptive algorithm for solving a problem of finding a global minimum of a cost function which is in the form of a quadratic function. The adaptive algorithm is called a “gravity search adaptive algorithm”, and the gravity search adaptive algorithm is stable and has a low risk of divergence compared to other adaptive algorithms that use no physical phenomenon associated with the law of energy conservation.
For easy understanding, a procedure of inducing a 1-dimensional gravity search adaptive algorithm whose coefficients are real numbers, a procedure of inducing a N-dimensional gravity search adaptive algorithm whose coefficients are real numbers, and a procedure of inducing a N-dimensional gravity search adaptive algorithm whose coefficients are complex numbers will be described in this order below.
<1-Dimensional Gravity Search Adaptive Algorithm>
In the case of a 1-dimensional gravity search adaptive algorithm, the cost function of equation 3 can be simplified to equation 17 below.
ε=σy2−2pw[k]+σx2w[k]2 (17)
where w[k] is a value resulting from k-th iteration of the coefficient w, p=E{x[k]y[k]}, and σx=E{x2[k]}.
First-order and second-order derivatives of the cost function with respect to w[k] are written as equations 18 and 19 below.
In
m{umlaut over (w)}+f{dot over (w)}=F (20)
where F represent a force applied to the ball in the direction of the w-axis, m represents the mass of the ball, f represents a friction force depending on viscosity of liquid, and {dot over (w)} and {umlaut over (w)} are first-order and second-order derivatives with respect to time, respectively. The F value depends on the magnitude g of gravitation acceleration, the shape of the container, and the position and movement of the ball.
If the F value is assumed to be a constant and v={dot over (w)}, equation 20 can be rewritten to equation 21 below.
A solution of the differential equation of equation 21 can be expressed as equation 22 below.
By calculating the second term of the right side of equation 22, equation 22 can be arranged to equation 23 below.
In equation 23, w(t) can be obtained as equation 24 below. v(t) is a derivative of w(t). So, if we know w(t), v(t) can be calculated by calculating the integral of w(t).
If First-order Taylor Series Approximation is used,
when |x|<<1, and accordingly, if
equations 23 and 24 can be rewritten to equation 25 and 26 below.
In equation 24, if it is assumed that v(τ) is a constant in the integral interval, equation 24 can be rewritten to equation 27 below.
w(t)≈w(t0)+v(t0)(t−to)≈w(t0)+v(t)(t−t0) (27)
In this case, equation 27 becomes identical to equation 26. This means that the First-order Taylor Series Approximation has the same effect as the assumption that v(τ) is a constant. Also, since equation 27 has been obtained under the same assumption that v(τ) is a constant, an approximate value having the same accuracy can be obtained. In this specification, equation 27 is used.
Equations 25 and 27 can be arranged and expressed as matrix equation 28 below.
Here, w(t) can be substituted by w[k+1], v(t) can be substituted by v[k+1], w(t0) can be substituted by w[k], v(t0) can be substituted by v[k], and F can be substituted by F[k], and t−t0=Ts.
Ts can be considered as a sampling interval and can be, for convenience of expansion, normalized to Ts=1.
When the substitutions are applied, equation 28 can be rewritten to equation 29 below.
Here, since
β has to satisfy equation 30 below.
1−ε<β<1,ε<<1 (30)
In equation 29, F[k] consists of two components as written in equation 31 below.
F[k]=F
g
[k]+F
a
[k] (31)
where Fg[k] is a force applied to the ball by gravity, and Fa[k] is a force generated when the ball changes its direction while moving.
In
Here, considering the fact that the direction of the force applied to the ball is in the opposite direction of the gradient vector, Fg[k] can be expressed using the characteristics of trigonometric functions of equation 32, as equation 33 below.
By applying tan θk=∇[k] to equation 33, equation 33 is rewritten to equation 34.
Then, a process of obtaining Fa will be described. When a differentiable function y=f(x) is given, curvature κ can be expressed as equation 35 below.
Acceleration generated when a certain object has a curvilinear motion at a velocity of v is in a direction perpendicular to the concave part of the curve, and if curvature of the curve is κ, the magnitude of the acceleration a can be expressed as equation 36 below.
a=v
2κ (36)
When a ball goes down or up along the inner inclined surface of a container as illustrated in
As illustrated in
As seen from
equation 40 is induced, as follows.
If y′=∇[k] and
are applied to equation 40, equation 41 is induced from |Fa[k]|=maw
Finally, a 1-dimensional gravity search adaptive algorithm induced from equations 29, 34, and 41 is expressed by equation 42 below.
where
where m is the mass of the ball, f is the friction coefficient of liquid, and g is the gravitational acceleration. g[k] is defined by equation 43 below.
By applying equation 18 to equation 43, equation 44 is deduced as follows.
<N-dimensional Gravity Search Adaptive Algorithm>
The 1-dimensional gravity search adaptive algorithm can be easily expanded to an N-dimensional gravity search adaptive algorithm. It is assumed that {right arrow over (x)}[k] and {right arrow over (w)}[k] are N-dimensional signal and coefficient vectors, respectively.
Since {right arrow over (F)}g[k] indicates the opposite direction of the N-dimensional gradient vector {right arrow over (∇)}[k], equation 45 is deduced from equation 33.
At this time, since
tan θk can be expressed as equation 46 below.
Referring to
Here, since
tan θk can be expressed as equation 48 below.
Accordingly, in the case of the N-dimensional gravity search adaptive algorithm, {right arrow over (F)}g[k] can be represented by equation 49 below.
In order to calculate a[k], curvature of the container in the direction of the movement of the ball has to be obtained.
The direction of the N-dimensional velocity vector {right arrow over (v)}[k] is generally not parallel to the direction of {right arrow over (∇)}[k].
Since
equation 50 can be obtained.
However, if
for an arbitrary ε when ε<<1, tan Φk can be expressed as equation 51 below.
If s is a trajectory of the ball that has moved along the inner surface of the container, curvature is defined as a change rate in movement direction when the ball moves by a unit distance. An example of such curvature is shown in
Here, equation 52 is arranged to equation 53 below.
Accordingly, equation 54 can be obtained from equation 53.
Here,
can be represented as a product of R and a vector, as shown in equation 55.
Here, since
Where, vn[k] is the nth element of the vector {right arrow over (v)}[k]. Accordingly, equation 57 is deduced as follows.
Accordingly, equation 58 is deduced from equation 57, as follows.
As a result, the curvature is given as equation 59 below.
Accordingly, the acceleration a can be obtained by equation 60 below.
Since the acceleration a is always in a direction perpendicular to the surface and is not related to the direction of {right arrow over (v)}[k], {right arrow over (a)}[k] can be expressed by equation 61 below. That is, th e direction of a is perpendicular to the surface, and the direction of {right arrow over (v)}[k] is tangential to the surface, but can be in any direction.
At this time, since equation 62 is satisfied, equation 63 is deduced.
Finally, the N-dimensional gravity search adaptive algorithm is arranged to equation 64 below.
where
where m is the mass of the ball, f is the friction coefficient of the liquid, and g is the gravitational acceleration. Also, g[k] is defined as equation 65 below.
<N-dimensional Gravity Search Adaptive Algorithm Having Complex Coefficients>
If {right arrow over (x)}[k] is an N-dimensional complex signal vector and {right arrow over (w)}[k] is an N-dimensional complex coefficient vector, the cost function given by equation 3 above can be rewritten to equation 66 below.
Real vectors r; i; r[k], i[k] and real matrices Rr and Ri are defined by equations 67, 68, and 69, respectively, below.
=r+ji (67)
[k]=r[k]+ji[k] (68)
R=R
r
+jR
i (69)
Here, the real matrices Rr and Ri satisfy equation 70 below.
R
r
T
=R
r
R
i
T
=−R
i (70)
Equations 67, 68, and 69 are applied to equation 66, and equation 66 is rearranged to obtain equation 71 below.
ε=θy2−2[k]T+[k]TR[k]+σn2 (71)
Here, {right arrow over (p)}, {right arrow over (w)}[k], and R are defined by equation 72 below.
Equations 71 and 72 represent a problem with 2N-dimensional real coefficients. Accordingly, the gravity search adaptive algorithm using real numbers as coefficients, as obtained in equations 64 and 65, is used to calculate a 2N-dimensional gravity search adaptive algorithm as expressed as equations 71 and 72 below.
It will be seen from equation 72 that equations 73 and 74 are different expressions of an adaptive algorithm for obtaining {right arrow over (wr)}[k] and {right arrow over (wi)}[k].
Accordingly, after obtaining {right arrow over (wr)}[k] and {right arrow over (wi)}[k] from the adaptive algorithm of equations 73 and 74, equations 67, 68, and 69 are used to obtain an N-dimensional complex gravity search adaptive algorithm, as expressed as equations 75 and 76 below.
<Simplification of N-Dimensional Gravity Search Adaptive Algorithm>
{right arrow over (g)}[k] in equation 76 is composed of two terms; the first term is generated when gravity is applied to the ball, and the second term is generated by acceleration when the ball changes its movement direction while moving along the curved surface of the container.
Since a change in acceleration due to a change in movement direction of a ball has a very small magnitude compared to the gravitational acceleration, for simplification of implementation, the acceleration due to the change in movement direction is ignored since it has little influence on performance of the adaptive algorithm. If the second term of {right arrow over (g)}[k] in equation 76 is ignored, a simplified gravity search adaptive algorithm is deduced, which is expressed by equation 77 below.
In equations 77 and 78, β is a parameter associated with inertia. If β is large, that is, if β is close to 1, the velocity of the ball does not change easily since the ball has a strong tendency to maintaining its own velocity, while if β is small, that is, if β is close to 0, the ball has a weak tendency to maintain its own velocity. If β is zero, the velocity of the ball depends only on a force currently applied to the ball regardless of the previous velocity of the ball. That is, when β is zero, the adaptive algorithm of equations 75 and 76 is rewritten to equation 79 below.
Also, when β is zero, the adaptive algorithm of equations 77 and 78 is rewritten to equation 81 below.
<Gravity Search LMS (GS-LMS) Algorithm>
Like the steepest descent algorithm, in order to implement a GS-LMS algorithm, R and {right arrow over (p)} have to be given. Like when inducing a LMS algorithm, the case of estimating R and {right arrow over (p)} from instantaneous values of the signal as given in equations 14 and 15 is called a GS-LMS algorithm.
A gradient vector can be deduced from equations 14 and 15, which is expressed as equation 81 below.
Equation 81 is applied to equation 76, thus deducing equation 82.
If it is defined that z[k]=[k]H[k] and {right arrow over (g)}[k] is represented in terms of z[k], equation 82 can be rewritten to equation 83 below.
Here, {•} represents the real part of the complex number. Accordingly, from
equations 83 and 73, the GS-LMS algorithm is given as equations 84 and 85 below.
Meanwhile, if the acceleration component due to a change in movement direction of the ball is ignored from {right arrow over (g)}[k], equations 75 and 79 are rewritten to a simplified GS-LMS algorithm, which is expressed as equations 86 and 87 below.
If β is zero, equation 79 can be rewritten to a simplified GS-LMS algorithm, which is expressed as equation 88 below.
If β is zero, equation 80 can be rewritten to a simplified GS-LMS algorithm, which is expressed as equation 89 below.
In equation 87, {right arrow over (g)}[k] has been normalized with respect to both {right arrow over (x)}[k] and {right arrow over (e)}[k]. Accordingly, the convergence speed of the simplified GS-LMS algorithm of equations 86 and 87 may become slow during the early stage of adaptation when the power of e[k] is large. In this case, one method for improving the convergence speed is to vary g over time.
For example, it is possible to make g be in proportion to the power of an error signal e[k], as follows.
g[k]={right arrow over (g)}|e[k]|
2 (90)
By applying equation 90 to equation 87, equations 91 and 92 are obtained as follows.
Here, a range of the {tilde over (g)} value is given by equation 93 below.
If β is zero, equations 91 and 92 can be rewritten to equation 94 below.
The adaptive algorithm written as equation 94 has a structure that is very similar to the normalized LMS algorithm of equation 95.
For easy comparison, equation 94 is rearranged to equation 96 below.
If equations 97 and 98 below are satisfied, equation 95 becomes identical to equation 96.
This means that if there is no memory in the GS-LMS algorithm of equation 94, the GS-LMS algorithm becomes a normalized LMS algorithm. According to a different interpretation, this means that a GS-LMS algorithm is an enhanced version of the normalized LMS by introducing memory or inertia.
The above interpretation about the GS-LMS algorithm has very important significance. The reason is because the characteristics of the GS-LMS algorithm can be understood based on the well-known characteristics of the normalized LMS algorithm.
It is assumed that equations 99 and 100 below are satisfied.
1<<4e[k][k]H[k]e*[k] (99)
4δ|e[k]|2<<4e[k][k]h[k]e*[k] (100)
In this case, the GS-LMS algorithm may be further simplified. That is, under the assumption, equation 92 can be rewritten to equation 101 below.
By arranging equation 101, the algorithms of equations 91 and 92 are simplified to equation 102 and 103 below.
Also, the algorithms of equations 101 and 103 become identical to the normalized LMS algorithm when β is zero.
The gravity search adaptive apparatus illustrated in
Referring to
Then, the adaptive controller 1030 applies the system modeling parameter to the system modeling unit 1020 (1120). Then, the adaptive controller 1030 applies the same signal to both the system 1010 and the system modeling unit 1020 (1130). Then, the adaptive controller 1030 calculates an error signal which is a difference between an output signal of the system 1010 and an output signal of the system modeling unit 1020 (1140). Successively, the adaptive controller 1030 compares the error signal to the previously calculated error signal to thereby determine whether the error signal converges on a minimum value (1150).
If it is determined in operation 1150 that the error signal does not converge on a minimum value, the adaptive controller 1030 returns to operation 1110 to update the system modeling parameter using the gravity search adaptive algorithm. That is, the procedure from operation 1110 to operation 1150 is repeated until the error signal converges on the minimum value.
Also, when calculating a system parameter using the gravity search adaptive algorithm, under an assumption that a ball moves along the inner surface of a parabolic, concave container filled up with liquid, the adaptive controller 1030 sets the mass m of the ball, a friction force g of liquid, a sampling interval Ts, and a force Fk applied to the ball, and applies the set values to equation 37 to thereby calculate or update a parameter of the system modeling unit 1020.
Meanwhile, if it is determined in operation 1150 that the error signal converges on the minimum, the adaptive controller 1030 estimates the characteristics of the system 1010 using a final parameter calculated in operation 1160.
Therefore, as described above, the gravity search adaptive apparatus and method can easily adjust a convergence speed and a convergence pattern.
A number of examples have been described above. Nevertheless, it will be understood that various modifications may be made. For example, suitable results may be achieved if the described techniques are performed in a different order and/or if components in a described system, architecture, device, or circuit are combined in a different manner and/or replaced or supplemented by other components or their equivalents. Accordingly, other implementations are within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2010-0133789 | Dec 2010 | KR | national |
10-2011-0090383 | Sep 2011 | KR | national |