The present invention relates to a system and method for filtering, and, in particular, to a system and method for adaptive filtering.
Adaptive filtering is used in a variety of situations, including power amplifier predistortion systems. Predistortion is a technique which improves the linearity of power amplifiers, for example in telecommunications systems. A power amplifier with nonlinearities causes interference on other radio channels. Predistortion circuits inversely model the power amplifier's gain and phase characteristics to produce a system that is more linear. Inverse distortion is introduced into the input of the power amplifier, cancelling nonlinearities in the amplifier. The characteristics of the adaptive filter may vary by the type of power amplifier or power amplifier sub-system architecture.
An adaptive equalizer provides feedback, for example to equalize the channel gain across frequency bandwidths to compensate for different gains at different frequencies. Also, adaptive filters may be used for interference calculations in other types of adaptive systems. An adaptive filter self-adjusts its transfer function based on an optimization algorithm from an error signal. In an example, an adaptive process uses a cost function, which is a criterion for optimum performance of the filter, as an input to an optimization algorithm. The algorithm determines how to modify the filter transfer function to minimize the cost of the next iteration.
An embodiment method for training an adaptive filter includes receiving, by a processor from a device, an input signal and a training reference signal and determining a correlation matrix in accordance with the input signal, the training reference signal, and a filter type. The method also includes determining a plurality of coefficients in accordance with the correlation matrix and adjusting the adaptive filter in accordance with the plurality of coefficients.
An embodiment method for training an adaptive filter includes receiving, by a processor from a device, an input signal and a training reference signal and determining a tri-angle matrix in accordance with the input signal and the training reference signal. The method also includes storing the tri-angle matrix and determining a plurality of coefficients in accordance with the tri-angle matrix. Additionally, the method includes adjusting the adaptive filter in accordance with the plurality of coefficients.
An embodiment computer includes a processor and a computer readable storage medium storing programming for execution by the processor. The programming includes instructions to receive an input signal and a training reference signal and determine a correlation matrix in accordance with the input signal, the training reference signal, and a filter type. The programming also includes instructions to determine a plurality of coefficients in accordance with the correlation matrix and adjust an adaptive filter in accordance with the plurality of coefficients.
The foregoing has outlined rather broadly the features of an embodiment of the present invention in order that the detailed description of the invention that follows may be better understood. Additional features and advantages of embodiments of the invention will be described hereinafter, which form the subject of the claims of the invention. It should be appreciated by those skilled in the art that the conception and specific embodiments disclosed may be readily utilized as a basis for modifying or designing other structures or processes for carrying out the same purposes of the present invention. It should also be realized by those skilled in the art that such equivalent constructions do not depart from the spirit and scope of the invention as set forth in the appended claims.
For a more complete understanding of the present invention, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawing, in which:
Corresponding numerals and symbols in the different figures generally refer to corresponding parts unless otherwise indicated. The figures are drawn to clearly illustrate the relevant aspects of the embodiments and are not necessarily drawn to scale.
It should be understood at the outset that although an illustrative implementation of one or more embodiments are provided below, the disclosed systems and/or methods may be implemented using any number of techniques, whether currently known or in existence. The disclosure should in no way be limited to the illustrative implementations, drawings, and techniques illustrated below, including the exemplary designs and implementations illustrated and described herein, but may be modified within the scope of the appended claims along with their full scope of equivalents.
An embodiment uses a flexible adaptation architecture to achieve convergence in different manners and adapts quickly for different types of training targets. A common hardware architecture may be used for various training algorithms, such as least squares (LS)-GIVEN, least squares QR decomposition (LS-QRD), QR decomposition recursive least squares (QRD-RLS) or (QRD), and least mean squares (LMS). Configurable parameters include the training algorithm, the filter type, the matrix size, the number of samples for the correlation matrix calculation, the number of rows within the correlation matrix (correlation U vectors or U vector) for basic tri-angle rotation matrix calculation, multiple level adaptations with forgetting factors λ1, λ2, and λ3, coefficient update parameter μ, and regularization factor α.
Correlation matrix engine 102 generates a U vector and a correlation matrix Rxd. In one example, the U vector is composed of the input signal x, which is the output of the pre-actuator model, when direct training is used, or the plant output when indirect training is used. The general formula for the training input U vector is a function of the input signal, and is given by:
U=f(x0,x1, . . . ,xi),
where i represents the total number of training model parameters. In general, the kth correlation matrix is defined by the recursive relation:
R
xd
=R
xd
b+λ1Rxd
Rxd
R
xd
b
=[U
H
×U U
H
×d]
M×(M+1).
The correlation matrix of the U vector is renamed to:
R
u
=[U
H
×U]
M×M
+αI
M.
Regularization fact a, which is programmable, handles the matrix ill condition. The U vector matrix is given by:
M is the number of coefficients, the superscript i is the ith sample within the kth correlation matrix, and N is the number of samples to be used for the correlation matrix calculation. The number of samples, N to be used for the Rxd matrix calculation may be programmable. The cross correlation vector of U and d is given by:
R
d
=[U
H
×d]
N×1,
where d is the training reference signal. In a direct learning algorithm, d is the error signal between the actuator input signal and the feedback signal. For indirect learning, d is the actuator output signal given by:
For QRD and LMS algorithms, the correlation matrix Rud
Vector engine 104 calculates diagonal R elements for tri-angle matrix engine 108 and rotation matrix Gn for rotation engine 106. For the nth rotation, the rotation matrix Gn is defined by:
where cn is given by:
and sn is given by:
The diagonal elements of the Rud matrix are then calculated by:
Rotation engine 106 calculates the rest of the elements in tri-angle matrix Rud. The additional elements include rnk(i,j) and znk(i). These additional elements may be obtained by:
Tri-angle matrix engine 108 implements tri-angle matrix Rud. The tri-angle matrix engine stores intermediate and final tri-angle matrix results. The general recursive jth tri-angle matrix is given by:
R
ud
=R
ud
b+λ3×Rud
where Rud
R
ud
b=λ2×Rud
The initial basic tri-angle matrix is configured based on the filter type. For LS-GIVEN, the initial tri-angle matrix equals the correlation matrix Rxd
For QRD or LS-QRD, rotation is performed on a single row or a block row within the correlation matrix. After i rotations, the tri-angle matrix becomes:
For the LMS configuration, the tri-angle matrix is not used. Tri-angle matrix engine block 108 may be disabled, and the memory used for storing tri-angle matrix elements may be assigned to extend the size of the correlation matrix.
Coefficient engine 110 performs back substitution and updates the coefficients. The coefficient calculation and update are based on the matrix update configuration. This may be performed on a single U vector, a basic block, or several basic blocks based on the algorithm chosen and the training system requirements. Table 1 below illustrates various matrix update configurations.
For LS-GIVEN, a single basic block correlation matrix or multiple basic block correlation matrices may be used. With either a single basic block correlation matrix or multiple basic block matrices, the tri-angle matrix may be a basic rotation matrix or multiple basic rotation matrices. In a basic rotation matrix, the matrix rotation and coefficient calculation are based on the basic correlation matrix and the basic tri-angle matrix. In basic rotation, λ1=0, λ2=1, and λ3=0. For multiple basic rotation matrices, calculations are based on the basic correlation matrix and multiple basic tri-angle matrices, where λ1=0, λ2=1, and λ3=(0, 1]. When multiple basic blocks are used for the correlation matrix, a basic rotation matrix or multiple basic rotation matrix may be used for the tri-angle matrix. For a basic rotation matrix as the tri-angle matrix, matrix rotation and coefficient calculation are based on multiple basic correlation matrices, and the basic tri-angle matrix, where λ1=(0, 1], λ2=1, and λ3=0. For multiple basic rotation matrix, the matrix rotation and coefficient calculation are based on multiple basic correlation matrices, and multiple basic tri-angle matrices, and λ1=(0, 1], λ2=1, and λ3=(0, 1].
In an example with a LS-QRD filter, the correlation matrix may be a partial basic block matrix, a single basic block matrix, or multiple basic block matrices. The tri-angle matrix may be a basic rotation matrix or multiple basic rotation matrices. There may be any combination of correlation matrix and tri-angle matrix. When a partial basic block correlation matrix is used with a basic rotation tri-angle matrix, matrix rotation and coefficient calculation are based on several rows within a basic correlation matrix or multiple correlation matrices and the basic tri-angle matrix, where λ1=0, λ2=(0, 1], and λ3=0. On the other hand, when a partial basic block correlation matrix is used with multiple basic rotation tri-angle matrices, matrix rotation and coefficient calculation are based on several rows within a basic correlation matrix or multiple correlation matrices and multiple basic tri-angle matrices, where λ1=0, λ2=(0, 1], and λ3=(0, 1]. Also, when a single basic block correlation matrix is used with a basic rotation tri-angle matrix, the matrix rotation and coefficient calculation are based on all rows of the basic correlation matrix and the basic tri-angle matrix, where λ1=0, λ2=(0, 1], and λ3=0. When a single basic block correlation matrix is used with multiple basic rotation matrices, the matrix rotation and coefficient calculation are based on all rows of a basic correlation matrix and multiple basic rotation matrices, where λ1=0, λ2=(0, 1], and λ3=(0, 1]. Additionally, when multiple basic block correlation matrices and a basic rotation tri-angle matrix are used, the matrix rotation and coefficient calculations are based on all rows of multiple basic correlation matrices and a basic tri-angle matrix, where λ1=(0, 1], λ2=(0, 1], and λ3=0. When multiple basic block correlation matrices are used with multiple basic rotation tri-angle matrices, matrix rotation and coefficient calculations are based on all rows of multiple basic correlation matrices and multiple basic tri-angle matrices, with λ1=(0, 1], λ2=(0, 1], and λ3=(0, 1].
In another example, a QRD filter is used, and the correlation matrix is a single U vector matrix. The tri-angle matrix may be a basic rotation matrix or multiple basic rotation matrices. When a basic rotation matrix is use as the tri-angle matrix, the matrix rotation and coefficient calculations are based on single rows of multiple basic correlation matrices and a basic tri-angle matrix, where λ1=0, λ2=(0, 1], and λ3=0. On the other hand, when the multiple basic rotation matrices are used, the matrix rotation and coefficient calculations are based on a single row of multiple basic correlation matrix and multiple basic tri-angle matrices, where λ1=0, λ2=(0, 1], and λ3=(0, 1].
With an LMS filter, the correlation matrix may be a single U vector matrix or a single basic block matrix. There is no tri-angle matrix for LMS. When the correlation matrix is a single U vector, the coefficient calculation and update is based on a single U vector with a single basic correlation matrix for a sample by sample update, where λ1=0, and λ2 and λ3 are not applicable. When a single basic block correlation matrix is used, some or all of the U vectors within a single basic correlation matrix are used for the calculation and update, with block based updating, where λ1=0, and λ2 and λ3 are not applicable.
In block 122, the correlation matrix is generated. λ1 is forgetting factor used for the correlation matrix calculation as defined in Equation (1). When λ1 is zero, the current correlation matrix Rxdi is calculated without basing itself on previous correlation matrix information. At a given time i, the correlation matrix, Rxdi, may be calculated independently or based on partial correlation matrix information from the previous iteration weighted by the forgetting factor.
Then, in block 124, λ2 is the forgetting factor for individual triangle matrix, and λ3 is the forgetting factor between triangle matrices. At a given time, the triangle matrix, Rxdi may be calculated independently or based on partial information of the previous triangle matrix Rxdi-1 weighted by the corresponding forgetting factor value.
Finally, in block 126, the coefficients are updated. In block 140, the coefficient 1 is calculated from the triangle matrix. Note that μ is the coefficient forgetting factor. At a given time i (or sample), the coefficients may be calculated independently or based on weighted coefficients of the previous iteration. Depending on the configuration, the coefficient may not be applied to the actuator in the current iteration i.e. multiple iterations can happen before a coefficient update is applied.
U=f(x0,x1, . . . ,xi),
where i is an integer indicating the number of training model parameters.
Next, in step 154, the correlation matrix is generated. In general, the kth correlation matrix is defined by the recursive relation:
R
xd
=R
xd
b+λ1Rxd
Rxd
Then, in step 156, rotation matrix Gn is calculated. For the nth rotation, the rotation matrix Gn is defined by:
where cn is given by:
and sn is given by:
In step 158, the diagonal elements of the tri-angle matrix are calculated. For instance, the diagonal elements of the tri-angle matrix may be calculated according to the formula:
The remaining elements of the tri-angle matrix are calculated in step 160. The additional elements include rnk(i,j) and znk(i). These additional elements may be calculated, for example by:
Next, in step 164, the tri-angle matrix is implemented. The tri-angle matrix engine stores the intermediate and final tri-angle matrix results. The initial basic tri-angle matrix is configured based on the filter type. For LS-GIVEN, the initial tri-angle matrix equals the correlation matrix Rxd
Finally, in step 166, back substitution and coefficient updates are performed. The coefficient calculation and update are based on the matrix update configuration. This may be performed on a single U vector, a basic block, or several basic blocks based on the algorithm chosen and the training system requirements.
Embodiments may have a variety of physical implementations. For example, a field programmable gate array (FPGA), application specific integrated circuit (ASIC), digital signal processor, or general purpose computer may be used. These are all considered as examples of the general category of processor.
The bus may be one or more of any type of several bus architectures including a memory bus or memory controller, a peripheral bus, video bus, or the like. CPU 274 may comprise any type of electronic data processor. Memory 276 may comprise any type of system memory such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous DRAM (SDRAM), read-only memory (ROM), a combination thereof, or the like. In an embodiment, the memory may include ROM for use at boot-up, and DRAM for program and data storage for use while executing programs.
Mass storage device 278 may comprise any type of storage device configured to store data, programs, and other information and to make the data, programs, and other information accessible via the bus. Mass storage device 278 may comprise, for example, one or more of a solid state drive, hard disk drive, a magnetic disk drive, an optical disk drive, or the like.
Video adaptor 280 and I/O interface 288 provide interfaces to couple external input and output devices to the processing unit. As illustrated, examples of input and output devices include the display coupled to the video adapter and the mouse/keyboard/printer coupled to the I/O interface. Other devices may be coupled to the processing unit, and additional or fewer interface cards may be utilized. For example, a serial interface card (not pictured) may be used to provide a serial interface for a printer.
The processing unit also includes one or more network interface 284, which may comprise wired links, such as an Ethernet cable or the like, and/or wireless links to access nodes or different networks. Network interface 284 allows the processing unit to communicate with remote units via the networks. For example, the network interface may provide wireless communication via one or more transmitters/transmit antennas and one or more receivers/receive antennas. In an embodiment, the processing unit is coupled to a local-area network or a wide-area network for data processing and communications with remote devices, such as other processing units, the Internet, remote storage facilities, or the like.
While several embodiments have been provided in the present disclosure, it should be understood that the disclosed systems and methods might be embodied in many other specific forms without departing from the spirit or scope of the present disclosure. The present examples are to be considered as illustrative and not restrictive, and the intention is not to be limited to the details given herein. For example, the various elements or components may be combined or integrated in another system or certain features may be omitted, or not implemented.
In addition, techniques, systems, subsystems, and methods described and illustrated in the various embodiments as discrete or separate may be combined or integrated with other systems, modules, techniques, or methods without departing from the scope of the present disclosure. Other items shown or discussed as coupled or directly coupled or communicating with each other may be indirectly coupled or communicating through some interface, device, or intermediate component whether electrically, mechanically, or otherwise. Other examples of changes, substitutions, and alterations are ascertainable by one skilled in the art and could be made without departing from the spirit and scope disclosed herein.