This application claims the priority under 35 U.S.C. § 119 of European Patent application no. 16199563.4, filed on Nov. 18, 2016, the contents of which are incorporated by reference herein.
The present disclosure relates generally to an adaptive filter. In particular, the present disclosure relates to an adaptive filter using resource sharing of computational blocks for performing filter coefficient convergence algorithm. More particularly, the present disclosure relates to a bootstrapping technique to increase the convergence rate of adaptive filter using resource sharing.
An adaptive filter is a computational device that attempts to model the relationship between two signals in real time in an iterative manner.
Such adaptive filters are used in many applications e.g. for canceling undesired signal components. Echo cancelers/equalizer (for inter-symbol interference cancellation) are a typical application of the adaptive filter for canceling an echo resulting from the trans-hybrid coupling of a hybrid circuit with an echo replica derived from the input signal of the adaptive filter. Adaptive filters are often realized either as a set of program instructions running on an arithmetical processing device such as a microprocessor or DSP chip, or as a set of logic operations implemented in a field-programmable gate array (FPGA) or in a semicustom or custom VLSI integrated circuit.
The adaptive filter has a tapped-delay line and a tap-weight coefficient controller for producing a sum of tap signals weighted respectively by tap-weight coefficients. According to a known adaptive convergence algorithm such as the LMS (least mean square) algorithm, the tap-weight (filter) coefficients are updated by correlations between the tap signals and a residual error of a correction signal, which is represented by the sum of the weighted tap signals.
Fast convergence of the tap-weight coefficients are of primary concern for designing an adaptive filter. In particular fast convergence at adaptive filters using resource sharing is a major desire in view of power efficient implementation for a cost sensitive market.
The present invention provides an adaptive filter, a method of injecting offsets into the iterative convergence algorithm for adjusting filter coefficients and a non-transitory, tangible computer readable storage medium bearing computer executable instructions for performing the aforementioned method at an adaptive filter as described in the accompanying claims. Specific embodiments of the invention are set forth in the dependent claims. These and other aspects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
The accompanying drawings, which are incorporated herein and form a part of the specification, illustrate the present invention and, together with the description, further serve to explain the principles of the invention and to enable a person skilled in the pertinent art to make and use the invention.
Embodiments of the present disclosure will be described below in detail with reference to drawings. Note that the same reference numerals are used to represent identical or equivalent elements in figures, and the description thereof will not be repeated. The embodiments set forth below represent the necessary information to enable those skilled in the art to practice the invention. Upon reading the following description in light of the accompanying drawing figures, those skilled in the art will understand the concepts of the invention and will recognize applications of these concepts not particularly addressed herein. It should be understood that these concepts and applications fall within the scope of the disclosure and the accompanying claims.
Referring now to
Referring now to
Assuming a linear relationship between input signal s(n) and output signal y(n), the adaptive filter can take the form of a finite-impulse-response (FIR) filter as exemplified in herein with reference to
where S(n)=[s(n), s(n−1), . . . , s(n−L+1)]T is the input signal vector.
As shown in
The adaptive convergence algorithm of an adaptive filter for adjusting the filter coefficients ci(n) is performed to minimize a cost function selected with respect to a respective use case of the adaptive filter. The adjusting of the filter coefficients ci(n) is performed in an iterative procedure:
C(n+1)=C(n)+μ(n)·G(e(n),S(n),Φ(n))
where G(e(n),S(n),Φ(n)) is a nonlinear vector function, μ(n) is the so-called step size, e(n) is the error signal and S(n) is the input signal vector. Φ(n) is a vector of states that may be used to describe pertinent information of the characteristics of the input signal, error signal and/or filter coefficients.
The adaptive filter comprises a coefficient-adjusting module 125, which performs the aforementioned adaptive convergence algorithm. At least the error signal e(n) and the input signal vector S(n) is input to the coefficient adjusting module 125, which may further comprise at least L memory storage locations to store the filter coefficients ci(n) and to supply the stored filter coefficients ci(n) for generating the output signal y(n). Further parameters required by the adaptive convergence algorithm implemented in the coefficient-adjusting module 125 such as the step size μ(n) may be predefined and/or configurable.
Least mean squares (LMS) functions are used in a class of adaptive filters to mimic a desired filter by finding the filter coefficients that relate to producing the least mean squares of the error signal e(n) (difference between the desired and the actual signal). It is a stochastic gradient descent methodology in that the filter coefficients are only adapted based on the error signal at a current time.
In particular, LMS algorithms are based on the steepest descent methodology to find filter coefficients, which may be summarized as following:
C(n+1)=C(n)+μ·e(n)·S(n)
ci(n+1)=ci(n)+μ·e(n)·s(n−i)
where C(n)=[c0(n), c1(n), . . . , cL-1(n)]T, S(n)=[s(n), s(n−1), . . . , s(n−L+1)]T, μ is the step size and L is the order of the filter.
The filter coefficients are determined in an iterative procedure starting with initial values of the filter coefficients Cinit(n)=[c0init(n), c1init(n), . . . , cL-1init(n)]T. The initial values are predefined. In a non-limiting example, the initial values of the filter coefficients ciinit(n) may be set to zero, i.e. Cinit(n)=[0,0, . . . ,0]T=zeros(L), but non-zero initial values likewise possible. As the LMS algorithm does not use the exact values of the expectations, the filter coefficients would never reach the optimal convergence values in the absolute sense, but a convergence is possible in mean. Even though the filter coefficients may change by small amounts, they change about the convergence values. The value of step-size μ should be chosen properly. In the following, a filter coefficient changing by small amounts about its optimal convergence value, will be referred to as a filter coefficient, which has reached steady state.
A LMS computational block 120.0 to 120.L−1 may be arranged for each filter coefficient ci(n) of the adaptive filter shown in
Referring now to
For the sake of illustration, the exemplary adaptive filter schematically shown in
Referring now to
With reference to
This means that when using computational resources sharing each filter coefficient is updated each k-th iteration in time, herein k=2. In general, the computational resources sharing may be implemented with higher values of k, which will be denoted as sharing factor k in the following, wherein k is integer and k>1. The number of LMS computational blocks corresponds to the filter order L=6 divided by the sharing factor k=2: L/k=3. The exemplified adaptive filter comprises the three LMS computational blocks 120.1, 120.3 and 120.5.
Those skilled in the art understand that the above-described resource sharing scheme is only an illustrative scheme to improve the understanding of the concept of the present application but not intended to limit the present application.
The adjusting of filter coefficients in adaptive filter requires computational blocks configured to carry out the adaptive convergence algorithm. Each computational block is enabled to perform the adjusting procedure of one filter coefficient ci(n) at one cycle. Therefore, the number of computational blocks in traditional adaptive filters corresponds to the order L of the adaptive filter or the number of tapped delay signal s(n−i) provided by the tapped-delay-line. In adaptive filters using computational resources sharing, the number of computational blocks is less than the order L of the adaptive filter. Accordingly, only a subset of the filter coefficients is adjusted in one cycle. In an example of the present application, the number of filter coefficients is an integer multiple of the number of filter coefficients in each subset. The integer multiple corresponds to the sharing factor k.
As schematically illustrated in and described with reference to
The reduced rate of convergence is further illustrated with reference to
In order to improve the convergence rate, an offset determined based on the monitored development of the filter coefficient ci(n). The determined offset is injected to the filter coefficient ci(n) at predefined periods of time. The injection of the determined offset is stopped in case the filter coefficient ci(n) varies about the convergence value, which is indicative of the filter coefficient ci(n) having reached the steady state.
The offset Offi is determined based on a value difference of the filter coefficient ci(n). The value difference Δi is determined over a period of time N·Ts, where Ts is the sampling time and fs is the sampling frequency of the adaptive filter (Ts=1/fs) and N is an predetermined integer value, N≥1, wherein
The value difference Δi is determined with n=0, wherein n is a sampling index relating to the sampling time Ts. The sampling index n=0 is indicative of the start of the filter coefficient adjusting procedure.
The value difference Δi(n) is further determined each period of time M·N·Ts after the potential injection of the offset Offi. The value difference Δij(j) is hence determined at n=j·M·N, where j=1, 2, 3 . . . (t=n·Ts). Accordingly,
At each period of time M·N·Ts, the offset Offi is added to the filter coefficient ci(n) provided that the current slope of the development of the filter coefficient ci(n) is below a predefined threshold. A current slope of the development of the filter coefficient ci(n) below the predefined threshold is considered to be indicative of a filter coefficient ci(n) slightly varying about the optimal convergence value.
The offset Offi(j) is determined based on the value difference Δij(j) and the sharing factor k:
When using the above vector representation, the offset Offi(j) can be described as following:
wherein C′(n)=[c0′(n), c1′(n), . . . , cL-1′(n)]T,
Hence, the offset can be written as following:
and the injection can be written as following:
C(n)=C(n)+
As schematically illustrated in
The adaptive convergence algorithm is performed for at each cycle, whereas offsets are injected on a periodic basis having a period greater than the iteration cycle of the adaptive convergence algorithm.
It should be noted that current slopes of the development of the filter coefficient ci(n) at the above-mentioned points in time (j+1)·M·N·Ts, j=0, 1, 2 . . . are positive such that the offset Offi(j) is added. Otherwise, if the slope is negative, the offset Offi(j) would be subtracted. As described below, the slope may be approximated by a difference quotient determined with respect to a predefined period of time, which may be shorted than the injection period.
The periodic injection of the offset is well recognizable in
The convergence rate of the adaptive filter with computational resource sharing and offset injection substantially corresponds to the convergence rate of the adaptive filter without computational resource sharing. The convergence rate of the adaptive filter with computational resource sharing (and without offset injection) is significantly lower.
Referring now to
The improved convergence rate of a filter coefficient ci(n) is obtained by an offset value Offi(j), which is added or subtracted at a predetermined period of time. The adding or subtracting of the offset value Offi(j) depends on a current slope of the development of the filter coefficient ci(n), in particular the adding or subtracting depends on whether the slope is positive (raising filter coefficient) or negative (falling filter coefficient). The offset value Offi(j) is based on a periodically determined value difference Δi(j), wherein j is an index to the periods.
The method performs with respect to the sampling time Ts and the sampling index n, respectively, wherein time t=n·Ts and n=0, 1, 2 . . . .
In an initial operation S100, the adaptive convergence algorithm is initialized and values are assigned for a first time period T1 and a second time period T2. Typically, the sample index n is set to n=n0 and the initial value of the filter coefficient ci(n) is set to ci(n)=ciinit(n). In an example, the sample index n is set to n0=0. In an example, the initial value of the filter coefficient ci(n) is set to ci(n)=0. The sharing factor k is predefined by the implementation of the adaptive filter with computational resources sharing. A threshold value TH is assigned, which allows to determine whether or not the filter coefficient ci(n) has reached the steady state.
The first time period T1 is defined with respect to two parameter N and M, where N and M are integers and N>1, M>1. For instance, the first time period T1=N·M·Ts. The second time period T2 is defined with respect to the parameter N. For instance, the second time period T2=N·Ts. In an example, the parameter N is greater than the sharing factor k (N>k). The second time period T2 occurs M times in the first time period T1.
In an operation S100, the sample index is increased by one (n=n+1).
In an operation S120, the development of the filter coefficient ci(n) is monitored. The development is monitored on the basis of the change of the value of the filter coefficient ci(n) developing over time. For instance, a slope is determined from the value of the filter coefficient ci(n), in particular with respect to the second time period T2.
In an operation S130, it is determined whether or not an offset is injected into the iteration of the filter coefficient ci(n). In particular, such offset is injected each first time period T1, only. More particularly, the offset is only injected in case the filter coefficient ci(n) has not reached the steady state, e.g. in case an absolute value of the monitored slope exceeds the predefined threshold TH, which is considered as indicative of the filter coefficient ci(n) still significantly differing from the optimal convergence value. The offset to be injected is based on the monitored slope and further on the sharing factor.
In an operation S140, the iteration to calculate the filter coefficient ci(n) is performed. In accordance with the present example, the filter coefficient ci(n) is determined using the LMS algorithm:
ci(n+1)=ci(n)+μ·e(n)·s(n−i)
The step size μ may be predefined in the initial operation. For the sake of completeness it should be noted that the step size μ may be a variable parameter dependent on the sampling index n: μ=μ(n).
Referring now to
In an operation S200, the filter coefficient ci(n) is monitored based on the development of the value of the filter coefficient ci(n) within the first time period T1. For instance, a slope or a value difference is determined at least at the beginning of each first time period T1 and the ending of each first time period T1. The slope or value difference is determined from the change of the values of the filter coefficient ci(n), e.g. over the second time period T2.
In an operation S210, it is determined whether or not the second time period T2 has lapsed. For instance, if the current sampling index n is a multiple of N and n is not zero (n>0) then the second time period T2 has lapsed as indicated by following condition
n mod N=0
In case the second time period T2 has lapsed, a slope or value difference is determined in an operation S220. The slope is determined based on the change of the filter coefficient value ci(n) over time/sampling index. In an example, the slope ci′ is determined based on the values of the filter coefficient ci(n) and filter coefficient ci(n−N) at sampling index n and n−N:
Alternatively, the value difference Δi may be determined, which should be considered as an equivalent value to the aforementioned slope:
Δi=ci(n)−ci(n−N)=N·ci′(n)
In an operation S230, it is determined whether or not the determined slope ci′ or change Δi relates to the beginning of the first time period T1, for instance the first occurrence of the second time period T2 in the first time period T1:
(n−N)mod(N·M)=0
If the determined slope or value difference relates to the beginning of the first time period T1 then the slope ci′ or change Δi may be stored in an operation S240 for later use. The stored slope ci* or change Δi* is used for determining the offset.
In an operation S250, the monitoring of the development of the filter coefficient ci(n) is completed.
Referring now to
In an operation S300, injecting an offset into the iteration of the filter coefficient ci(n) is performed provided that the filter coefficient ci(n) has not reached steady state.
In an operation S310, it is determined whether or not the first time period T1 has lapsed. For instance, if the current sampling index n is a multiple of N·M and n is not zero (n>0) then the first time period T1 has lapsed as indicated by following condition
n mod(N·M)=0
If the first time period T1 has lapsed, the offset Offi is determined in an operation S320. The offset Offi is based on the stored slope ci* or change Δi* to consider the development of the filter coefficient ci(n) over the first time period T1. The offset Offi is further based on the sharing factor k, which enables to consider the reduced convergence rate because of the computational resources sharing of the adaptive filter. For instance,
Offi=(k−1)·ci*·M·N; or
Offi=(k−1)·Δi*·M
As aforementioned, the offset Offi is injected into the iteration of the filter coefficient ci(n) if the filter coefficient ci(n) has not reached steady state.
In an operation S330, the current slope ci′ or the current value difference Δi is compared against the predefined threshold TH. The current slope ci′ is for instance determined from a difference quotient based on the filter coefficient ci(n) at different points in time, e.g. points in time n and (n−N). The current value difference Δi is for instance determined from a value difference based on the filter coefficient ci(n) at different points in time, e.g. points in time n and (n−N). In an example, the current slope ci′ is the slope determined by the previous operation relating to the monitoring of the filter coefficient ci(n). In an example, the current value difference Δi is the value difference determined by the previous operation relating to the monitoring of the filter coefficient ci(n).
|ci′|<THc; or
|Δi|<THΔ
wherein THΔ≈THc·N in the present example.
If an absolute value of the current slope ci′ or the current value difference Δi is less (or equal to) than the predefined threshold (THc and THΔ, respectively), it is assumed that the filter coefficient ci(n) has reached the steady state and only slightly varies about the optimal convergence value. In this case, the offset Offi is not injected.
Otherwise, if the absolute value of the current slope ci′ or the current value difference Δi is greater than the predefined threshold, the offset Offi is injected into the iteration calculation of the filter coefficient ci(n) in an operation S340; for instance:
ci(n)=ci(n)+Offi
In an operation S350, the injecting of an offset is completed.
Referring now to
In an operation S300′, injecting an offset into the iteration of the filter coefficient ci(n) is performed provided that the filter coefficient ci(n) has not reached steady state.
In an operation S310, it is determined whether or not the first time period T1 has lapsed. If the first time period T1 has lapsed, the offset Offi is determined in an operation S320.
In an operation S330, the current slope ci′ or the current change Δi is compared against the predefined threshold TH (THc and THΔ, respectively).
If an absolute value of the current slope ci′ or the current value difference Δi is less (or equal to) than the predefined threshold, it is assumed that the filter coefficient ci(n) has reached the steady state and only slightly varies about the optimal convergence value. In this case, the offset Offi is not injected.
Otherwise, if the absolute value of the current slope ci′ or the current value difference Δi is greater than the predefined threshold, the offset Offi is injected into the iteration calculation of the filter coefficient ci(n).
The operations S310 to S330 correspond to the respective operations described above with reference to
In an operation S340, it is determined whether the development of the filter coefficient ci(n) over time shows an ascending or descending behavior. Whether the filter coefficient ci(n) ascends or descends over time can be determined from the current slope ci′ or the current value difference Δi. If the current slope ci′ or the current change Δi is greater than 0, the filter coefficient ci(n) ascends over time, otherwise if the current slope ci′(n) or the current change Δi is less than 0, the filter coefficient ci(n) descends over time:
ci′,Δi>0: ascending or ci′,Δi<0: descending.
If the filter coefficient ci(n) ascends over time, the offset Offi is added in an operation S370:
ci(n)=ci(n)+Offi
If the filter coefficient ci(n) descends over time, the offset Offi is subtracted in an operation S380:
ci(n)=ci(n)−Offi
In an operation S390, the injecting of an offset is completed.
Referring now to
According to the filter order L, the tapped-delay-line has L−1 delay elements 110.1 to 110.L−1 and provides L tapped delay signal s(n−i), i=0, . . . , L−1,
The exemplified adaptive filter of
The LMS computational block 120.1 is for instance used to adjust the filter coefficients c0(n) to c2(n) and the LMS computational block 120.L/k is for instance used to adjust the filter coefficients cL-3(n) to cL-1(n). Those skilled in the art will understand that the computational resources sharing exemplary adaptive filter of
The adaptive filter further comprises L multipliers 130 for multiplying each tapped delay signal s(n−i) with the respective filter coefficient ci(n), where i=0 to L−1, and L−1 adders 140 for adding the weighted output signal contributions Yi(n) to obtain the output signal y(n). The adaptive filter further comprises at least L memory locations to store the L filter coefficients ci(n).
The adaptive filter further comprises a monitoring block 200, which has access to the filter coefficients ci(n) and which is arranged to monitor the development of the filter coefficients ci(n). In particular, the monitoring block 200 is configured to carry out the method of monitoring in particular as described above with reference to the flow diagrams shown in
The adaptive filter further comprises a offset calculation block 210, which receives information from the monitoring block 200 about the development of the values of the filter coefficients ci(n) and is arranged to compute offsets values Offi for the filter coefficients ci(n) on a periodic time scale and inject the computed offsets Offi into the adjusting procedure of the filter coefficients ci(n). In particular, the offset computation block 210 is configured to carry out the method of monitoring in particular as described above with reference to the flow diagrams shown in
It should be noted that the offset injection should not be understood to be limited to the LMS (least mean square) algorithm for adjusting the filter coefficient, with regard to which the methodology to improve the convergence rate has been illustratively explained above. The LMS algorithm is but one of an entire family of algorithms, which are based on approximations to steepest descent procedures. The family of algorithms further comprises for instance the sign-error algorithm, the sign-delta algorithm, sign-sign algorithm, zero-forcing algorithm and power-to-two quantized algorithm. The steepest descent procedures are based on the mean-squared error (MSE) cost function, which has been shown as useful for adaptive FIR filters. However, further algorithm are known, which are based on non-MSE criteria. The illustrated offset injection is in principle applicable with iteratively determined filter coefficients in the above-mentioned general form:
C(n)=C(n)+
C(n+1)=C(n)+μ(n)·G(e(n),S(n),Φ(n))
Referring now to
The exemplary adaptive filter comprises a number of computational blocks 260. In particular, the number of computational blocks 260 is determined at implementation or design stage. Each of the computational blocks 260 is enabled to perform the adjusting procedure of a filter coefficient ci(n) at one cycle. The adjusting procedure is carried out in accordance with an adaptive convergence algorithm. The computational blocks 260 are accordingly configured. The computational blocks 260 are not fixedly assigned to one or more tapped delay signals s(n−i). A symbol routing logic 300 is provided in the adaptive filter, which is configurable to selectively route any tapped delay signals s(n−i) to any computational block 260. Hence, each of the computational blocks 260 is freely assignable to one tapped delay signal s(n−1) at one cycle.
For managing the computational blocks 260, each of the computational blocks 260 is allocated to one of a number of w clusters 250.j, wherein j=1, . . . , w and w is a positive non-zero integer. The number w of clusters is configurable. Each of the plurality of clusters 250.1 to 250.w comprises an individual set of Cj computational blocks 260, wherein j=1, . . . w. The number of computational blocks 260, j comprised in each cluster 250.1 to 250.w may differ. For instance, the cluster 250.1 comprises a set of C1 computational blocks CB 260.1.1 to 260.1.C1, the cluster 250.2 comprises a set of C2 computational blocks CB 260.2.1 to 260.2.C2 and the cluster 250.w comprises a set of Cw computational blocks CB 260.w.1 to 260.w.Cw.
The symbol routing logic 300 routes each one of a number of w sets of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w to a respective one of the clusters 250.1 to 250.w. Each set of tapped delay signals {s(n−i)} comprises Mj tapped delay signals s(n−i), wherein j=1, . . . , w. The number of tapped delay signals s(n−i) comprised by each set may differ. For instance, a first set {s(n−i)} of tapped delay signals s(n−i) is routed to the cluster 250.1 and comprises M1 tapped delay signals s(n−i), a second set {s(n−i)} of tapped delay signals s(n−i) is routed to the cluster 250.2 and comprises M2 tapped delay signals s(n−i), a w-th set {s(n−i)} of tapped delay signals s(n−i) is routed to the cluster 250.w and comprises Mw tapped delay signals s(n−i).
The number of sets of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w correspond to the number of clusters 250.1 to 250.w.
The filter coefficients ci(n) are stored in a coefficient memory storage 270, to which the computational blocks 260 have access to read a respective filter coefficient ci(n) from a respective memory location thereof and write an updated filter coefficient ci(n) to the respective memory location thereof.
The allocation of each computational block 260 to a respective one of the clusters 250.1 to 250.w and the operation of the computational blocks 260 is under control of a cluster controller block 320. The cluster controller block 320 is configured to turn on/off the computational blocks 260 individually and/or cluster-wise. The cluster controller block 320 is further arranged to configure the computational blocks 260 to enable access to the required filter coefficient ci(n) corresponding to the tapped delay signal s(n−i) supplied thereto by the symbol routing logic 300.
The routing of the tapped delay signals s(n−1) is under control of a routing controller block 310, which configures the symbol routing logic (300) accordingly. The routing controller block 310 is configured to allocate each tapped delay signal s(n−i) to one of the sets of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w. The routing controller block 310 configures the symbol routing logic 300 to route each set of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w to each respective one of the clusters 250.1 to 250.w. Each set of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w is assigned to one of the clusters 250.1 to 250.w. Each cluster 250.1 to 250.w receives the tapped delay signals s(n−i) of one set of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w.
The routing controller block 310 and the cluster controller block 320 receive information from a monitoring block 200, which has access to the coefficients memory 270 and which is arranged to monitor the development of the filter coefficients ci(n). The monitoring block 200 is enabled to supply information relating to the development of the filter coefficients ci(n) to the routing controller block 310 and the cluster controller block 320, which are arranged to dynamically operate the exemplified adaptive filter based on the received information.
The operation of the adaptive filter with controllable computational resource sharing according to an embodiment of the present application will be further explained with reference to
As exemplified in the filter coefficient plot of
The controllable computational resource sharing enables to consider the above considerations in the operation of the adaptive filter while meeting performance requirements at a reduced power consumption.
The symbol routing logic 300 allows to partition the total number of L tapped delay signals s(n−i) generated on each sampling cycle by the tapped-delay-line into w signal sets of tapped delay signals {s(n−i)}.1 to {s(n−i)}.w. Each signal set may comprise a different number of tapped delay signals s(n−i). The total number of L tapped delay signals s(n−i) are for instance partitioned into the five sets 400.1 to 400.5, each comprising a different number of successive tapped delay signals {s(n−i)}, where i=i1, . . . , i2, where i1 and i2 are integers, i1<i2, 0<i1, i2<L−1, and L is the order of the adaptive filter.
The total number of L tapped delay signals s(n−i) may be partitioned into sets based on the monitored, assumed or expected contribution levels to the output signal y(n). The total number of L tapped delay signals s(n−i) may be partitioned into sets based on the monitored, assumed or expected value amounts of the associated filter coefficients ci(n). Initially a partitioning of the tapped delay signals s(n−i) to signal sets may be predefined, for instance, the tapped delay signals s(n−i) may be evenly assigned to signal sets having substantially the same number of tapped delay signals s(n−i), e.g. when the initial values of the filter coefficients are set to zero. When initially starting the filter coefficient adjusting with initial non-zero values, the allocation of the tapped delay signals s(n−i) to different signal sets may be based in the initial non-zero values, which may be considered to be indicative of levels of contribution or significance levels of the respective tapped delay signal s(n−i). During operation of the adaptive filter, the total number of L tapped delay signals s(n−i) may be repartitioned for instance in response to the monitored value amounts of the filter coefficients ci(n).
As illustratively shown in
Each of the signal sets is associated to one of the clusters, herein five clusters according to the five signal sets. For instance, the cluster 1 is associated with the third signal set 400.3. Computational blocks are allocated to each one of the five clusters. The numbers of computational blocks allocated to the clusters may differ. However, as understood from the above discussion, the crucial factor, which defines the computational performance of a cluster, is given by the individual sharing factor ki, wherein i=1 to w and w is the number of clusters. The sharing factor ki defines the ratio between the number of tapped delay signals and filter coefficients ci(n), respectively, assigned to a cluster i and the number of computational blocks allocated to the cluster i. The sharing factors ki of the different clusters may differ from each other.
The allocation of tapped delay signal s(n−i) may be performed based on one or more threshold levels applied to the amount values of the filter coefficients ci(n) or the initial values of the filter coefficients ci(n). The allocation of the tapped delay signals to different sets values may be based on normalized values of the filter coefficients ci(n). Normalized values of the filter coefficients ci(n) may improve the comparableness. The allocation of the tapped delay signals s(n−i) to the five signal sets exemplified in
In an example of the present application, clusters, to which signal sets with less dominant filter coefficients ci(n) are assigned, may be operated with a higher sharing factor than clusters, to which signal sets with more dominant filter coefficients ci(n) are assigned.
The cluster controller block 320 is arranged to allocate the computational blocks to the clusters. Initially, the computational blocks may be allocated to clusters according to an initial allocation scheme; for instance, the computational blocks may be evenly allocated to clusters comprising substantially the same number of computational blocks. During operation of the adaptive filter, the allocation of the computational blocks may be adapted for instance in response to the monitored contribution levels and/or the state of convergence of the filter coefficients ci(n).
As further illustratively shown in
The tapped delay signs are allocated to one of the N+1 signal sets (corresponding to the number N+1 of value subranges) based on the allocation of the values of the respective filter coefficient ci(n) to the one of the N+1 value subranges. Accordingly, each signal sets may comprise one, two or more subsets of successive tapped delay signals s(n−i). Herein, the signal sets 400.1 and 400.2 each comprise two continuous subsets of tapped delay signal s(n−i) and the signal set 400.4 comprises one subsets of successive tapped delay signal s(n−i). Each of the three signal sets 400.1 to 400.3 is assigned to one of three clusters.
The number of computational blocks allocated to each of the three cluster may be further selected based on the normalized values of the filter coefficients ci(n) in the respective signal set. In case the normalized values of the filter coefficients ci(n) of a signal set are low in comparison to the other ones, a low number of computational blocks is allocated to the respective cluster, which means that the filter coefficients ci(n) of the signal set with low values are adjusted using a high sharing factor k. In case the normalized values of the filter coefficients ci(n) of a signal set are high in comparison to the other ones, a high number of computational blocks is allocated to the respective cluster, which means that the filter coefficients ci(n) of the signal set with high values are adjusted using a low sharing factor k. In case the normalized values of the filter coefficients ci(n) of a signal set are medium in comparison to the other ones, a medium number of computational blocks is allocated to the respective cluster, which means that the filter coefficients ci(n) of the signal set with high values are adjusted using a medium sharing factor k.
Referring back to
Further referring to
For instance, in case the filter coefficients ci(n), which are assigned to one cluster for adjusting procedure has reached the steady state, the computational blocks of the cluster can be disabled at least temporarily to reduce the power consumption. In particular, the computational blocks of the cluster may be disabled for a predefined off-time interval Toff, after which the disabled computational blocks are put into operation again.
For the above-description, it is well understood that the suggested design of the adaptable filter with configurable computational resources sharing enables to flexibly and dynamically assign computational power for a configurable subset of tapped delay signals s(n−i) and filter coefficients ci(n), respectively. Thereby, the available computational power of the computational blocks employed for performing the adjusting procedure according to an adaptive convergence algorithm is efficiently usable while the overall number of implemented computational blocks can be reduced to an economic number.
Although not shown in
Those of skill in the art would understand that information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To illustrate clearly this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary designs, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code means in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
The previous description of the disclosure is provided to enable any person skilled in the art to make or use the disclosure. Various modifications to the disclosure will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other variations without departing from the spirit or scope of the disclosure. Thus, the disclosure is not intended to be limited to the examples and designs described herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Number | Date | Country | Kind |
---|---|---|---|
16199563 | Nov 2016 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
4985900 | Rhind et al. | Jan 1991 | A |
5309481 | Viviano et al. | May 1994 | A |
5777910 | Lu | Jul 1998 | A |
5809058 | Sato | Sep 1998 | A |
5946351 | Ariyavisitakul et al. | Aug 1999 | A |
6108681 | Wittig et al. | Aug 2000 | A |
7120656 | Lam et al. | Oct 2006 | B1 |
7499513 | Tetzlaff | Mar 2009 | B1 |
9001883 | Tsai | Apr 2015 | B2 |
20020150155 | Florentin et al. | Oct 2002 | A1 |
20050031045 | Mayor et al. | Feb 2005 | A1 |
20080013657 | Aouine | Jan 2008 | A1 |
20090198754 | Chang et al. | Aug 2009 | A1 |
20120071107 | Falck et al. | Mar 2012 | A1 |
20130170538 | Ki et al. | Jul 2013 | A1 |
20140140416 | Yamazaki | May 2014 | A1 |
20140241181 | Barrass | Aug 2014 | A1 |
20140348276 | Pandey | Nov 2014 | A1 |
20180145725 | Pandey | May 2018 | A1 |
Entry |
---|
Apolinario et al. “QRD-RLS Adaptive Filtering, Chapter 2: Introduction to Adaptive Filters”, Springer Publishing Company, pp. 23-49 (Jan. 2009). |
Non-Final Office Action dated Dec. 6, 2018 for U.S. Appl. No. 15/700,528 5 pgs. |
Notice of Allowance for related U.S. Appl. No. 15/700,528 10 pgs. (dated Apr. 17, 2019). |
Number | Date | Country | |
---|---|---|---|
20180141363 A1 | May 2018 | US |