Data stream processing device with reconfigurable data stream processing resources and data stream processing method

BACKGROUND
Field of the Invention

The present invention generally relates to a data stream processing device. More specifically, the present invention relates to a data stream processing device configured to process data streams. Also, the present invention generally relates to a data stream processing method.

Background Information

Modern digital communication devices perform real time data stream processing. Such real time data stream processing can require substantial hardware resources, such as multipliers, adders, and other arithmetic units. It can be useful for a single device to be flexibly applied to perform different processing of data streams. Often such flexibility can be achieved by simply adding additional resources. However, generally such approach increases the cost of the device.

On the other hand, modern digital communication devices can include parallel, reconfigurable receivers or transmitters and have the ability to process multiple data streams simultaneously. Such reconfigurable devices are known in the field of data stream processing (see U.S. Pat. No. 9,954,698, for example). With such devices, multiple data streams can be filtered using parallel processing engines.

SUMMARY

Even though a device is designed to filter many parallel data streams, certain operating conditions such as bandwidth allocation or network configuration can limit this parallel processing to just one or two data streams in actual operation, which underutilizes the full hardware potential of the device.

One object is to provide a data stream processing device with which resources for processing data stream can be efficiently utilized.

In view of the state of the known technology, a data stream processing device is provided that includes a plurality of data providing units, a plurality of processing units, and control circuitry. The data providing units are configured to output data values received via a plurality of data inputs, respectively. The processing units are configured to generate data outputs based on the data values, respectively. The control circuitry includes a mode selection input and is configured to simultaneously provide data values of different data streams to the data inputs of the data providing units, respectively, in response to the mode selection input receiving a signal indicating a first mode, and simultaneously provide a plurality of successive groups of data values of one of the data streams to the data inputs of the data providing units, respectively, in response to the mode selection input not receiving the signal indicating the first mode.

Also, other features, aspects and advantages of the disclosed data stream processing device will become apparent to those skilled in the field of the data stream processing device from the following detailed description, which, taken in conjunction with the annexed drawings, discloses several illustrative embodiments of a data stream processing device with various features.

BRIEF DESCRIPTION OF THE DRAWINGS

Referring now to the attached drawings which form a part of this original disclosure:

FIG. 1 illustrates a block diagram of a data stream processing device with an array of M filters each having an N-tap FIR filter according to a first embodiment;

FIG. 2 illustrates an example of the N-tap FIR filter illustrated in FIG. 1;

FIG. 3 illustrates a flow chart of a data stream processing method of the data stream processing device illustrated in FIG. 1;

FIG. 4A illustrates a block diagram of a data stream processing device with an array of M filters according to a modification example of the first embodiment;

FIG. 4B illustrates filter coefficient settings of the array of the M filters illustrated in FIG. 4A while the M filters are combined with each other;

FIG. 5 illustrates a block diagram of a data stream processing device with an array of M linear channel equalizers each having an LMS linear channel equalizer according to a second embodiment; and

FIG. 6 illustrates an example of the LMS linear channel equalizer illustrated in FIG. 5.

DETAILED DESCRIPTION OF EMBODIMENTS

Selected embodiments will now be explained with reference to the drawings. It will be apparent to those skilled in the art from this disclosure that the following descriptions of the embodiments are provided for illustration only and not for the purpose of limiting the invention as defined by the appended claims and their equivalents.

The present application is related to a device and a method to efficiently utilize and combine signal processing resources in reconfigurable, parallel arrays of FIR Filters and linear channel equalizers. Specifically, the present application illustrates a device and a method to flexibly reconfigure and combine the signal processing resources of an array of independent, parallel FIR filters into a smaller array of independent FIR filters with longer tap length. Also, the present application illustrates a device and a method to further optimize the design of the filters when the filter coefficients exhibit symmetries of an odd or even function. Here, the impulse response of an FIR filter is also its coefficient set. Furthermore, the present application illustrates that the ideas embodied in this application readily extends to the design of adaptive linear channel equalizers, and enables an array of independent equalizers to reconfigure and combine both their filtering components and their coefficient adaptation components to form a smaller array of independent equalizers with longer tap lengths and superior frequency-selective properties.

First Embodiment

FIG. 1 illustrates an example of a data stream processing device 10 according to an exemplary embodiment. As shown in FIG. 1, the data stream processing device 10 includes a plurality of (M: M is two or more) digital filters F (F₀, F₁, . . . , F_M-1). Specifically, in the illustrated embodiment, the filters F (F₀, F₁, . . . , F_M-1) each have an N-tap digital filter (N is one or more), such as a finite impulse response (FIR) filter, for example. The data stream processing device 10 also includes control circuitry 11 that has one or more (M−1 in the illustrated embodiment) multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1). The multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) are coupled between adjacent pairs of the filters F (F₀, F₁, . . . , F_M-1), respectively. In the illustrated embodiment, the number M of digital filters F can be set to any number that is two or more, as needed and/desired. Also, the number N of the taps of the digital filters F can be set to any number that is one or more, as needed and/desired.

In the illustrated embodiment, by operation of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1), the operation mode of the data stream processing device 10 is switched between a “first mode” in which the filters F (F₀, F₁, . . . , F_M-1) independently perform filtering operations on a plurality of (M) independent input data streams U (U₀, U₁, . . . , U_M-1) and a “second mode” in which the filters F (F₀, F₁, . . . , F_M-1) perform a filtering operation on the input data stream U₀as a single filter.

Furthermore, in the illustrated embodiment, the data stream processing device 10 also includes a plurality of (M) coefficient registers R (R₀, R₁, . . . , R_M-1) for the filters F (F₀, F₁, . . . , F_M-1), respectively, and a summation circuit (e.g., summation unit) P. In the illustrated embodiment, the coefficient registers R (R₀, R₁, . . . , R_M-1) each store one or more (N) coefficient values C(0), C(1), . . . , C(N−1) (see FIG. 2). More specifically, as illustrated in FIG. 1, the coefficient register R₀stores coefficient values C₀(0), C₀(1), . . . , C₀(N−1), the coefficient register R₁stores coefficient values C₁(0), C₁(1), . . . , C₁(N−1), . . . , and the coefficient register R_M-1stores coefficient values C_M-1(0), C_M-1(1), . . . , C_M-1(N−1). In the illustrated embodiment, the coefficient registers R (R₀, R₁, . . . , R_M-1) are formed as separate registers (or register devices). However, the coefficient registers R (R₀, R₁, . . . , R_M-1) each can be formed as part of single register (or a single register device).

In the illustrated embodiment, as illustrated in FIG. 2, the filters F (F₀, F₁, . . . , F_M-1) each have a digital delay line 12 (e.g., a data providing unit, a data providing circuit or a tapped delay line) and a processing unit or circuit 14. For simplicity of the explanation, FIG. 2 illustrates a simplified configuration of the filters F (F₀, F₁, . . . , F_M-1) without showing the multiplexers Mux_A. In particular, FIG. 2 illustrates a simplified configuration of the filters F (F₀, F₁, . . . , F_M-1) while the operation mode of the data stream processing device 10 is set to the “first mode.” In this case, as illustrated in FIG. 2, the filters F (F₀, F₁, . . . , F_M-1) each receive an input data stream U (U₀, U₁, . . . , U_M-1) having input data or sample u(n) (u₀(n), u₁(n), . . . , u_M-1(n)) at sample period (time) n via a data input 18 of the digital delay line 12, and generate an output data stream Y (Y₀, Y₁, . . . , Y_M-1) having output data or sample y(n) (y₀(n), y₁(n), . . . , y_M-1(n)) at sample period n from a data output 20 of the processing unit 14. In the illustrated embodiment, the filters F can be formed by a single processor or multiple processors. Specifically, the digital delay line 12 and the processing unit 14 can be formed by a single processor or multiple processors.

As illustrated in FIG. 2, the digital delay line 12 includes one or more (N: N is one or more) successive delays D₀, D₁, . . . , D_N-1that are connected in series with respect to each other. The successive delays D₀, D₁, . . . , D_N-1are coupled to transfer input data received via the data input 18 through the successive delays D₀, D₁, . . . , D_N-1in response to a shift signal or clock. Specifically, the delays D₀, D₁, . . . , D_N-1can be implemented using, for example, registers, clocked registers, flip flops, or shift registers. Each delay, or a data output of each delay, can be referred to as a “tap” or “stage,” for example. Thus, the digital delay line 12 illustrated in FIG. 2 can be referred to as an N-stage or N-tap digital delay line or a tapped delay line with N taps.

Furthermore, the digital delay line 12 also includes one or more (N) delayed data outputs T₀, T₁, . . . , T_N-1that output delayed output data (e.g., data values received via the data input 18) of the delays D₀, D₁, . . . , D_N-1. Specifically, in the illustrated embodiment, the first delay D₀has an input that is coupled to the data input 18 and an output that is coupled to an input of the next delay D₁and also coupled to the delayed data output T₀. Also, the last or N-th delay D_N-1has an input that is coupled to an output of the prior or (N−1)-th delay D_N-2and an output that is coupled to the delayed data output T_N-1. Furthermore, except for the output of the last delay D_N-1of the last filter F_M-1(see FIG. 1), the delayed data output T_N-1of the last delay D_N-1is also coupled to an input of a respective one of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1). Furthermore, the middle delays D₁, . . . , D_N-2each have an input that is coupled to an output of a respective one of the prior delays D₀, . . . , D_N-3, respectively, and an output that is coupled to an input of the next delays D₂, . . . , D_N-1and also coupled to a respective one of the delayed data outputs T₁, . . . , T_N-2.

The processing unit 14 perform a function responsive to the delayed output data received from the digital delay line 12 to generate the output data stream Y (Y₀, Y₁, . . . , Y_M-1) from the data output 20. The function can also be referred to as a “process”, a “processing”, or a “procedure.” Performing the function can include performing a set of one or more operations, in sequence and/or combination, on the delayed output data received from the digital delay line 12. Such operations can include, but are not limited to, bitwise logical operations (such as, but not limited to, NOT, AND, OR, XOR, left shift, and right shift) and arithmetic operations (such as, but not limited to, addition, subtraction, multiplication, division, logarithmic, exponential, trigonometric, statistic, and comparison operations). In some implementations, the processing unit 14 can include a programmable processor that executes one or more program instructions to perform the function or a portion of the function. With this configuration, the processing unit 14 implements a finite impulse response (FIR) filter.

More specifically, the processing unit 14 includes one or more (N) delayed data inputs S₀, S₁, . . . , S_N-1connected to the delayed data outputs T₀, T₁, . . . , T_N-1, respectively. Furthermore, the processing unit 14 further includes one or more (N) multiplier circuits Q₀, Q₁, . . . , Q_N-1and a summation circuit V. The multiplier circuits Q₀, Q₁, . . . , Q_N-1calculate the products between the data values of the delayed output data received via the delayed data inputs S₀, S₁, . . . , S_N-1and the coefficient values C(0), C(1), . . . , C(N−1) stored in the corresponding one of the coefficient registers R (R₀, R₁, . . . , R_M-1), respectively. Then, the summation circuit V calculates the sum (sum(N)) of the products to generate the output data stream Y (Y₀, Y₁, . . . , Y_M-1) from the data output 20.

The coefficient registers R (R₀, R₁, . . . , R_M-1) are a register or memory element, and each store a set of the coefficient values C(0), C(1), . . . , C(N−1) for a respective one of the filters F (F₀, F₁, . . . , F_M-1). The coefficient registers R (R₀, R₁, . . . , R_M-1) can be formed as a single register or as separate registers. Also, the coefficient registers R (R₀, R₁, . . . , R_M-1) can be formed as part of the processing units 14 of the filters F (F₀, F₁, . . . , F_M-1), respectively. The coefficient values C(0), C(1), . . . , C(N−1) for the filters F (F₀, F₁, . . . , F_M-1) are preset to adjust the filter characteristics of the filters F (F₀, F₁, . . . , F_M-1). Of course, the coefficient values C(0), C(1), . . . , C(N−1) for the filters F (F₀, F₁, . . . , F_M-1) can be adaptively changed to obtain the desired filter characteristics.

As illustrated in FIG. 1, the filters F (F₀, F₁, . . . , F_M-1) are configured to receive the M independent input data streams U₀, U₁, . . . , U_M-1having input data u₀(n), u₁(n), . . . , u_M-1(n) at sample period n, respectively, and output the M output data streams Y₀, Y₁, . . . , Y_M-1having output data y₀(n), y₁(n), . . . , Y_M-1(n) at sample period n, respectively. Specifically, in the illustrated embodiment, the input data stream U₀is directly inputted to the data input 18 of the first filter F₀. The input data streams U₁, . . . , U_M-1are inputted to the data inputs 18 of the filters F₁, . . . , F_M-1via the multiplexers Mux_A₁, . . . , Mux_A_M-1, respectively.

In the illustrated embodiment, the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) are formed as a “switch”, “multiple input, single output switch,” “switching element”, “mux”, “signal selector”, or “data selector.” In the illustrated embodiment, each of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) has first and second inputs. The first inputs of the multiplexers Mux_A₁, . . . , Mux_A_M-1receive the input data streams U₁, . . . , U_M-1, respectively. The second inputs of the multiplexers Mux_A₁, . . . , Mux_A_M-1are connected to the delayed data outputs T_N-1of the last delays D_N-1of the filters F₀, F₁, F_M-2, respectively, to receive the delayed output data via the delayed data outputs T_N-1, respectively.

Furthermore, the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) each have a mode selection input MS for receiving a mode selection signal having a value of either “0” or “1.” The multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) each selectively couple either the first input or the second input to the data input 18 of a respective one of the filters F₁, . . . , F_M-1according to the value of the mode selection signal. In the illustrated embodiment, according to the value of the mode selection signal, the filters F (F₀, F₁, . . . , F_M-1) can function as M independent N-tap FIR filters or as a single MN-tap FIR filter. This mode selection signal can be inputted through an interface of the data stream processing device 10, for example, to change operation mode of the data stream processing device 10 between the first and second modes. Of course, this mode selection signal can be inputted according to an operation status of the data stream processing device 10.

FIG. 3 illustrates a flow chart of the data stream processing algorithm (e.g., data stream processing method) of the data stream processing device 10. The data stream processing device 10 receives the input data stream U (step S10). In response, the digital delay lines 12 output delayed output data (e.g., data values) received via the data inputs 18, respectively. The control circuitry 11 determines whether the mode selection signal indicates the first mode (step S12). If the value of the mode selection signal is “0,” which indicates the first mode (Yes in step S12), then the data stream processing device 10 is operated in the first mode (step S14). On the other hand, if the value of the mode selection signal is “1,” which does not indicate the first mode (No in step S12), then the data stream processing device 10 is operated in the second mode (step S16).

In particular, if the value of the mode selection signal is “0,” which indicates the first mode (Yes in step S12), then the multiplexers Mux_A₁, . . . , Mux_A_M-1couple the first inputs to the data inputs 18 of the filters F1, . . . , F_M-1, respectively, such that the input data streams U₁, . . . , U_M-1are inputted to the filters F₁, . . . , F_M-1via the data inputs 18 of the filters F₁, . . . , F_M-1, respectively. This allows the filters F₀, F₁, . . . , F_M-1to independently perform filtering operations on the independent input data streams U₀, U₁, . . . , U_M-1. Thus, in this case, the control circuitry 11 (the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1)) simultaneously provides data values of the independent input data streams U₀, U₁, . . . , U_M-1to the data inputs 18 of the filters F₀, F₁, . . . , F_M-1, respectively, such that the filters F₀, F₁, . . . , F_M-1independently performs filtering operations on the independent input data streams U₀, U₁, . . . , U_M-1, respectively, and independently output the output data streams Y₀, Y₁, . . . , Y_M-1from the data outputs 20 of the filters F₀, F₁, . . . , F_M-1, respectively. Therefore, in this case, the filters F (F₀, F₁, . . . , F_M-1) form an array of M FIR filters arranged in parallel to each other, as can be found in a device supporting parallel processing of M independent data streams. In other words, the corresponding pairs of the digital delay lines 12 and the processing units 14 form a plurality of FIR filters for the input data streams U₀, U₁, . . . , U_M-1, respectively, in response to the mode selection signal indicating the first mode.

On the other hand, if the value of the mode selection signal is “1,” which indicates the second mode (or does not indicate the first mode) (No in step S12), then the multiplexers Mux_A₁, . . . , Mux_A_M-1couple the second inputs to the data inputs 18 of the filters F1, . . . , F_M-1, respectively, such that the delayed output data from the last delays D_N-1of the filters F₀, F₁, . . . , F_M-2are inputted to the first delays Do of the filters F₁, . . . , F_M-1via the data inputs 18 of the filters F₁, . . . , F_M-1, respectively. Thus, the digital delay lines 12 of the M filters F (F₀, F₁, . . . , F_M-1) are serially connected to each other and are daisy-chained into a tapped delay line with MN taps in length, which multiplies the length of the FIR filter and improves its frequency selective performance, for example. Specifically, in this case, the control circuitry 11 (the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1)) simultaneously provides a plurality of (M) successive groups of N data values of the input data stream U₀to the data inputs 18 of the filters F (F₀, F₁, . . . , F_M-1). In particular, a first group of data values of the input data u₀(n), u₀(n−1), . . . , u₀(n−N+1) of the input data stream U₀is provided to the filter F₀, a second group of data values of the input data u₀(n−N), u₀(n−N−1), . . . , u₀(n−2N+1) of the input data stream U₀is provided to the filter F₁, . . . , an M-th group of data values of the input data u₀(n−(M−1)N), u₀(n−(M−1)N−1), . . . , u₀(n−MN+1) of the input data stream U₀is provided to the filter F_M-1, The filters F₀, F₁, . . . , F_M-1perform filtering operations on the successive groups of data values of the input data stream U₀, and output the output data streams Y₀, Y₁, . . . , Y_M-1from the data outputs 20 of the filters F₀, F₁, . . . , F_M-1, respectively. Furthermore, in this case, the summation circuit P calculates the sum (sum(M)) of the data values of the output data streams Y₀, Y₁, . . . , Y_M-1to generate an output data stream Y_shaving output data y_s(n) at sample period n, which is an output of the combined FIR filter with MN taps at sample period n. Therefore, in this case, the filters F (F₀, F₁, . . . , F_M-1) form a single FIR filter. In other words, the digital delay lines 12 and the processing units 14 form a single FIR filter for the input data stream U₀in response to the mode selection signal not indicating the first mode. In particular, the control circuitry 11 serially couples the digital delay lines 12 with respect to each other in response to the mode selection signal not indicating the first mode.

With the data stream processing device 10, the filter configuration of the parallel FIR filters F (F₀, F₁, . . . , F_M-1) can be reconfigurable. Thus, the signal processing (filtering) resources of the data stream processing device 10 can be efficiently utilized. In particular, efficient implementation to combine the M independent FIR filters F (F₀, F₁, . . . , F_M-1) with N taps into a single FIR filter with MN taps can be provided.

Thus, even if some of the signal processing (filtering) resources of the data stream processing device 10 (e.g., some of the filters F (F₀, F₁, . . . , F_M-1)) cannot be utilized for processing many parallel data streams due to certain operating conditions, such as bandwidth allocation or network configuration, the unused signal processing (filtering) resources can be efficiently reallocated to process a data stream by effectively multiplying the number of taps of the signal processing (filtering) resources in use, which enhances filtering performance with minimal increases in implementation complexity, area, and power.

In the illustrated embodiment, the filters F (F₀, F₁, . . . , F_M-1) are configured to be identical to each other. Specifically, the filters F (F₀, F₁, . . . , F_M-1) have the same number of taps (i.e., N taps). However, the filters F (F₀, F₁, . . . , F_M-1) can have different numbers of taps (i.e., different number of delays) with respect to each other as needed and/or desired.

In the illustrated embodiment, the filters F (F₀, F₁, . . . , F_M-1) form a single FIR filter in response to all of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) receiving the same mode selection signal with the same value “1”. However, the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) can be configured to independently receive different mode selection signals with different values “0” and “1” to form a plurality of groups of the filters F (F₀, F₁, . . . , F_M-1) each forming a single FIR filter. For example, if all of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) receive the same mode selection signal with the same value “1” except for a k-th multiplexer Mux_A_k, and the k-th multiplexer Mux_A_kreceives the mode selection signal with the value “0,” then a first group of the filters F₀, . . . , F_k-1can form a first single FIR filter for the input data stream U₀and a second group of the filters F_k, . . . , F_M-1can form a second single FIR filter for the input data stream U_k. In this case, the summation circuit P calculates the sum of the data values of the output data streams Y₀, . . . , Y_k-1to generate a first output data stream Y_s1as an output of the first single FIR filter with kN taps, and calculates the sum of the data values of the output data streams Y_k, . . . , Y_M-1to generate a second output data stream Y_s2as an output of the second single FIR filter with (M−k)N taps. In this case, the filters F (F₀, F₁, . . . , F_M-1) are grouped into two groups, but can also be grouped into more than two groups.

Referring now to FIGS. 4A and 4B, a data stream processing device 100 according to a modification example of the first embodiment will be described. In view of the similarity between the data stream processing devices 10 and 100, the parts of the data stream processing device 100 that are identical or similar to the parts of the data stream processing device 10 will be given the same reference numerals as the parts of the data stream processing device 10. Moreover, the descriptions of the parts of the data stream processing device 100 that are identical or similar to the parts of the data stream processing device 10 may be omitted for the sake of brevity.

As illustrated in FIG. 2, with the data stream processing device 10, the coefficient registers R (R₀, R₁, . . . , R_M-1) each independently store one or more (N) coefficient values C(0), C(1), . . . , C(N−1), and the coefficient values C(0), C(1), . . . , C(N−1) each are routed to a respective one of the multiplier circuits Q₀, Q₁, . . . , Q_N-1.

On the other hand, as illustrated in FIG. 4A, the data stream processing device 100 is configured such that coefficient values in each of the coefficient registers R (R₀, R₁, . . . , R_M-1) are symmetrically routed to the multiplier circuits of a respective one of the filters F (F₀, F₁, . . . , F_M-1) while the value of the mode selection signal is “0,” which indicates the first mode. Specifically, a coefficient value is routed to two multiplier circuits in the same filter. This reduces the number or storage space of the coefficient registers R (R₀, R₁, . . . , R_M-1) for storing the coefficient values. In this case (first mode), the filters F (F₀, F₁, . . . , F_M-1) independently function as M independent symmetric FIR filters with symmetric filter coefficients having symmetry of an even or odd function.

Also, as illustrated in FIG. 4A, the data stream processing device 100 is configured such that coefficient values in the coefficient registers R (R₀, R₁, . . . , R_M-1) are symmetrically routed to the multiplier circuits of a combined filter of the filters F (F₀, F₁, . . . , F_M-1) while the value of the mode selection signal is “1,” which indicates the second mode. Specifically, a coefficient value is routed to two multiplier circuits in two different filters. This also reduces the number or storage space of the coefficient registers R (R₀, R₁, . . . , R_M-1) for storing the coefficient values. In this case (second mode), the combination of the filters F (F₀, F₁, . . . , F_M-1) functions as a single symmetric FIR filter with symmetric filter coefficients having symmetry of an even or odd function. Thus, in the illustrated embodiment, the filters F (F₀, F₁, . . . , F_M-1) can function as M independent symmetric FIR filters or a single symmetric FIR filter. An example of such symmetric FIR filter or filters is a root raised cosine (RRC) filter. The RRC filter has the symmetry f(x)=f(−x) in its impulse response, which makes its impulse response an even function. Also, such symmetric FIR filter or filters can have another type of symmetry, such as f(x)=−f(−x). This type of symmetry can be implemented using a single set of filter coefficients, provided that the filter coefficients are negated on one side.

More specifically, as illustrated in FIG. 4A, the data stream processing device 100 is basically identical to the data stream processing device 10, except that the data stream processing device 100 further includes at least one (k) multiplexer Mux_B for each of the filters F (F₀, F₁, In particular, the data stream processing device 100 includes the k multiplexers Mux_B (Mux_B₀, . . . , Mux_B_k-1) for each of the N-tap filters F (F₀, F₁, . . . , F_M-1), where N=2k or 2k+1. Thus, for each of the filters F (F₀, F₁, . . . , F_M-1), the multiplexers Mux_B₀, . . . , Mux_B_k-1are coupled to k multiplier circuits Q_N-k, . . . , Q_N-1, respectively.

As also illustrated in FIG. 4A, each of the multiplexers Mux_B (Mux_B₀, . . . , Mux_B_k-1) has first and second inputs. The first inputs of the multiplexers Mux_B₀, . . . , Mux_B_k-1for each of the filters F (F₀, F₁, . . . , F_M-1) receive the coefficient values C(k−1), . . . , C(0), respectively, from a respective one of the coefficient registers R (R₀, R₁, . . . , R_M-1). Thus, when the filters F (F₀, F₁, . . . , F_M-1) function as parallel filters in response to the value of the mode selection signal being “0,” which indicates the first mode, each of the filters F (F₀, F₁, . . . , F_M-1) has symmetric filter coefficients. This is equivalent to the case in which each of the filters F (F₀, F₁, . . . , F_M-1) as illustrated in FIG. 2 has the following filter coefficient values: C(0)=C(N−1), C(1)=C(N−2), . . . , C(k−1)=C(k) (where N=2k) (or C(k−1)=C(k+1) (where N=2k+1)). In other words, in this case (first mode), for the N-tap filter, only k coefficient values C(0), C(1), . . . , C(k−1) (where N=2k) (or k+1 coefficient values C(0), C(1), . . . , C(k) (where N=2k+1)) need to be stored in each of the coefficient registers R (R₀, R₁, . . . , R_M-1), and the k coefficient values C(0), C(1), . . . , C(k−1) are symmetrically applied to the N data values outputted by the delayed data outputs T₀, T₁, . . . , T_N-1of the tapped delay lines 12 relative to the centers of the tapped delay lines 12, respectively (i.e., relative to a position between the delayed data outputs T_k-1and T_k(where N=2k) or a position of the delayed data output T_k(where N=2k+1) for each of the filters F (F₀, F₁, . . . , F_M-1) along each of the tapped delay lines 12).

On the other hand, the second inputs of the multiplexers Mux_B₀, . . . Mux_B_k-1for each of the filters F (F₀, F₁, . . . , F_M-1) receive the coefficient values C(k−1), . . . , C(0) from a coefficient register R (R₀, R₁, . . . , R_M-1) for a symmetrically corresponding one of the filters F (F_M-1, F_M-2. . . , F0). Thus, when the filters F (F₀, F₁, . . . , F_M-1) are combined into a single filter in response to the value of the mode selection signal being “1,” which indicates the second mode, the single filter formed by the filters F (F₀, F₁, . . . , F_M-1) has symmetric filter coefficients. In other words, in this case (second mode), for the MN-tap combined filter, only M sets of k coefficient values C(0), C(1), . . . , C(k−1) (where N=2k) (or M sets of k+1 coefficient values C(0), C(1), C(k) (where N=2k+1)) need to be stored in total in the coefficient registers R (R₀, R₁, . . . , R_M-1), and the M sets of k coefficient values C(0), C(1), . . . , C(k−1) are symmetrically applied to the MN data values outputted by the delayed data outputs T₀, T₁, . . . , T_N-1of the tapped delay lines 12 relative to the center of the series of the tapped delay lines 12 (i.e., relative to a position between the delayed data outputs T_N-1of the filter and the delayed data outputs T₀of the filter F_j(where M=2j), a position between the delayed data outputs T_k-1and T_kof the filter F_j(where M=2j+1 and N=2k) or a position of the delayed data output T_kof the filter F_j(where M=2j+1 and N=2k+1) along a series of the tapped delay lines 12). Thus, symmetric arrangement (symmetric arrangement having symmetry of an even function) of the filter coefficients are preserved when the parallel filters F (F₀, F₁, . . . , F_M-1) are combined into a single filter.

More specifically, referring to FIG. 4B, examples of the symmetric filter coefficient settings when the parallel filters F (F₀, F₁, . . . , F_M-1) are combined into a single filter will be described in detail. In the illustrated embodiment, for simplicity of the explanation, it is assumed that each of the M parallel filters F (F₀, F₁, . . . , F_M-1) has an even number of taps (i.e., N is even). However, the configurations discussed herein can be logically extended to a case in which each of the M parallel filters F (F₀, F₁, . . . , F_M-1) has an odd number of taps (i.e., N is odd).

In particular, FIG. 4B illustrates an example of the symmetric filter coefficient setting when M is odd, and an example of the symmetric filter coefficient setting when M is even. In this case, the multiplexors Mux_B (Mux_B₀, . . . , Mux_B_k-1) (FIG. 4A) apply symmetric filter coefficient having symmetry of an even function to each of the M parallel filters F (F₀, F₁, . . . , F_M-1) when the value of the mode selection signal is “0,” which indicates the first mode. When the value of the mode selection signal is switched to “1,” different symmetric coefficient values having symmetry of an even function are applied to the MN-tap combined filter.

Basically, for a MN-tap combined filter, MN combined filter coefficients K(0), K(1), . . . , K(MN−1) (“combined filter coefficient number” in FIG. 4B) are applied to the MN taps. However, in the illustrated embodiment, only MN/2 unique coefficient values (i.e., M sets of N/2 coefficient values) C₀(0), . . . , C₀(N/2−1), C₁(0), . . . , C₁(N/2−1), . . . , C_M-1(0), . . . , C_M-1(N/2−1) (“sub-filter coefficient number” shown in unshaded boxes in FIG. 4B) are needed for the MN combined filter coefficients K(0), K(1), . . . , K(MN−1) of the combined filter when the symmetric filter coefficient setting having symmetry of an even function is applied to the combined filter. In particular, only the MN/2 unique coefficient values C₀(0), . . . , C₀(N/2−1), C₁(0), . . . , C₁(N/2−1), . . . , C_M-1(0), . . . , C_M-1(N/2−1) are held or stored in the hardware registers (e.g., the coefficient registers R (R₀, R₁, . . . , R_M-1)), and these MN/2 unique coefficient values C₀(0), . . . , C₀(N/2−1), C₁(0), . . . , C₁(N/2−1), . . . , C_M-1(0), . . . , C_M-1(N/2−1) are symmetrically applied to form the MN combined filter coefficients K(0), K(1), . . . , K(MN−1) having symmetry of an even function as indicated by the arrows relative to the midpoint (center). In particular, the coefficient values in shaded boxes in FIG. 4B are obtained by wire connections to the hardware registers storing the corresponding coefficient values C₀(0), . . . , C₀(N/2−1), C₁(0), . . . , C₁(N/2−1), . . . , C_M-1(0), . . . , C_M-1(N/2−1).

Furthermore, in the illustrated embodiment, the symmetric filter coefficient settings having symmetry of an even function are applied to the M independent parallel filters F (F₀, F₁, . . . , F_M-1) with N taps, respectively, which also needs only need MN/2 unique coefficient values in total. For example, a set of N/2 unique coefficient values C₀(0), . . . , C₀(N/2−1) is symmetrically applied as the N filter coefficients K(0), . . . , K(N−1) having symmetry of an even function for the N-tap filter F₀. Similarly, other sets of N/2 unique coefficient values C₁(0), . . . , C₁(N/2−1), . . . , C_M-1(0), . . . , C_M-1(N/2−1) are also symmetrically applied as other sets of N filter coefficients K(N), . . . , K(2N−1), . . . , K(MN−N), . . . , K(MN−1) having symmetry of an even function for the N-tap filters respectively. Thus, this filter coefficient settings does not require additional storage space in the coefficient registers R (R₀, R₁, . . . , R_M-1) whether the filters F (F₀, F₁, . . . , F_M-1) are working independently or combined.

With the data stream processing device 100, efficient implementation to combine the M independent FIR filters with N taps F (F₀, F₁, . . . , F_M-1) each having a set of symmetric filter coefficients into a single FIR filter with MN taps having a set of symmetric filter coefficients can be provided.

The signal processing (filtering) resources of two or more FIR filters can be basically combined into a single, higher tap-length FIR filter by daisy-chaining them, where the output of one filter feeds the input of the next filter. However, in case of root raised cosine (RRC) filters, the convolution of two RRC filter impulse responses results in a raised cosine (RC) impulse response. Thus, when the system requires match-filtering, as is often the case in communication systems, an RC filter cannot be used in place of an RRC filter. It is possible to split an RRC filter into two RRC filters of shorter length using numerical method. However, this adds unnecessary complexity to the system design. On the other hand, with the data stream processing device 100, this can be avoided by preserving symmetric arrangement (symmetric arrangement having symmetry of an even function or mirror symmetry) of the filter coefficients even when the parallel filters F (F₀, F₁, . . . , F_M-1) are combined into a single filter.

Second Embodiment

Referring now to FIGS. 5 and 6, a data stream processing device 200 according to a second embodiment will be described. In view of the similarity between the data stream processing devices 10 and 200, the parts of the data stream processing device 200 that are functionally identical or similar to the parts of the data stream processing device 10 will be given the same reference numerals as the parts of the data stream processing device 10 but with “200” added thereto. Moreover, the descriptions of the parts of the data stream processing device 200 that are identical or similar to the parts of the data stream processing device 10 may be omitted for the sake of brevity.

As shown in FIG. 5, the data stream processing device 200 includes a plurality of (M: M is two or more) linear channel equalizers E (E₀, E₁, . . . , E_M-1). Specifically, in the illustrated embodiment, the equalizers E (E₀, E₁, . . . , E_M-1) each have an N-tap digital filter (N is one or more), such as a finite impulse response (FIR) filter, for example. The data stream processing device 200 also includes control circuitry 211 that has one or more (M−1) multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1). The multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) are coupled between adjacent pairs of the equalizers E (E₀, E₁, . . . , E_M-1), respectively. Furthermore, the control circuitry 211 also has a plurality of (M) multiplexers Mux_C (Mux_C₀, . . . , Mux_C_M-1) for the equalizers E (E₀, E₁, . . . , E_M-1), respectively. In the illustrated embodiment, by operating the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) and the multiplexers Mux_C (Mux_C₀, . . . , Mux_C_M-1), the operation mode of the data stream processing device 200 is switched between a “first mode” in which the equalizers E₀, E₁, . . . , E_M-1independently perform equalizing operations on independent input data streams U₀, U₁, . . . , U_M-1and a “second mode” in which the equalizers E₀, E₁, . . . , E_M-1perform an equalizing operation on the input data stream U₀as a single equalizer. In particular, the multiplexer Mux_A switches the input data streams U (U₀, U₁, . . . , U_M-1) inputted to a data input 218 of the equalizers E (E₀, E₁, . . . , E_M-1), respectively.

Specifically, as illustrated in FIG. 6, the equalizers E (E₀, E₁, . . . , E_M-1) are formed as a linear channel equalizer with least mean squares (LMS) coefficient adaptation. Specifically, as illustrated in FIG. 6, the equalizers E (E₀, E₁, . . . , E_M-1) each have a digital delay line or tapped delay line 212 (e.g., a data providing unit or circuit), a data storage or data storage facility 213 (e.g., a data storage or memory) and a processing unit or circuit 214. For simplicity of the explanation, FIG. 6 illustrates a simplified configuration of the equalizers E (E₀, E₁, . . . , E_M-1) without showing the multiplexers Mux_A and Mux_C. In particular, FIG. 6 illustrates a simplified configuration of the equalizers E (E₀, E₁, . . . , E_M-1) while the operation mode of the data stream processing device 200 is set to the “first mode.” In this case, as illustrated in FIG. 6, the equalizers E (E₀, E₁, . . . , E_M-1) each receive an input data stream U (U₀, U₁, . . . , U_M-1) having input data or sample u(n) (u₀(n), u₁(n), . . . , u_M-1(n)) at sample period (time) n via the data input 218, and generate an output data stream Y (Y₀, Y₁, . . . , Y_M-1) having output data or sample y(n) (y₀(n), y₁(n), . . . , y_M-1(n)) at sample period n from a data output 220. In the illustrated embodiment, the equalizers E can be formed by a single processor or multiple processors. Specifically, the digital delay line 212 and the processing unit 214 can be formed by a single processor or multiple processors.

More specifically, in the illustrated embodiment, the digital delay line 212 is basically identical to the digital delay line 12 as illustrated in FIG. 2. In particular, the digital delay line 212 includes one or more (N: N is one or more) successive delays that are connected in series with respect to each other. The successive delays are coupled to transfer input data received via the data input 218 through the successive delays in response to a shift signal or clock. With this configuration, the digital delay line 212 holds a set of the input data stream inputted via the data input 218. In the illustrated embodiment, the digital delay line 212 holds a set of N input data u(n−N+1), . . . , u(n) in the past to form a N-tap digital filter. In particular, in the illustrated embodiment, the digital delay line 212 is a fixed delay whose length is the length of the filter. As illustrated in FIG. 6, the digital delay line 212 outputs the delayed output data (e.g., data values received via the data input 218) to the processing unit 214.

Furthermore, as illustrated in FIG. 5, the delayed output data of the digital delay lines 212 are each inputted to an input of a respective one of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1). With this configuration, the digital delay lines 212 of the equalizers E (E₀, E₁, . . . , E_M-1) are daisy-chained while the operation mode of the data stream processing device 200 is set to the “second mode,” which will be described in detail later. In particular, the digital delay lines 212 of the equalizers E (E₀, E₁, . . . , E_M-1) are daisy-chained such that as soon as the delayed output data exit one of the digital delay lines 212, it goes into the next one of the digital delay lines 212.

As also illustrated in FIG. 6, the data storage 213 stores a history of the input data stream inputted via the data input 218. For example, the data storage 213 stores a history of N input data u(n−N+1), . . . , u(n) in the past for calculating or updating N filter coefficients of the N-tap digital filter. In the illustrated embodiment, the data storage 213 includes a register or memory element, such as a shift register. Specifically, the data storage 213 has enough space for storing the complete set of the input data u(n−N+1), . . . , u(n), and additional space needed for calculating or updating the filter coefficient, such as space needed while the error is computed in an error calculation circuit 226 of the adaptation sub-section 224 (described later). Of course, the data storage 213 can be formed by one or more (N) successive delays having the same configuration as the digital delay line 12 (FIG. 2).

The processing unit 214 includes an FIR filter sub-section 222, a dot product calculator 223, and an adaptation sub-section 224. The FIR filter sub-section 222 calculate the complex conjugate of the filter coefficients [c₀(n), . . . , c_N-1(n)] that have been calculated or updated by the adaptation sub-section 224. The dot product calculator 223 calculates the dot product between the complex conjugate of the filter coefficients [c₀(n), . . . , c_N-1(n)] (i.e., conj([c₀(n), . . . , c_N-1(n)])) outputted from the FIR filter sub-section 222 and a vector of the delayed output data [u(n−N+1), u(n)] outputted from the digital delay line 212, and then outputs the dot product as the output data y(n) (y₀(n), y₁(n), . . . , y_M-1(n)) of the output data stream Y (Y₀, Y₁, . . . , Y_M-1). The operations to calculate the output data y(n) (y₀(n), y₁(n), . . . , y_M-1(n)) by the FIR filter sub-section 222 and the dot product calculator 223 are mathematically identical to the operations to calculate the output data y(n) (y₀(n), y₁(n), . . . , y_M-1(n)) by the processing unit 14 (FIG. 2). Thus, the FIR filter sub-section 222 and the dot product calculator 223 can be replaced by the processing unit 14 such that the processing unit 14 calculates an inner product between the vector of the input data [u(n−N+1), u(n)] outputted from the digital delay line 212 and the filter coefficients [c₀(n), . . . , c_N-1(n)] that have been updated by the adaptation sub-section 224 at sample period n to output the output data stream Y (Y₀, Y₁, . . . , Y_M-1). Specifically, in this case, the output data y(n) can be calculated as follows: y(n)=[u(n−N+1), u(n)][c₀(n), . . . , c_N-1(n)]^H, where [ ]^Hdenotes Hermitian transpose.

The adaptation sub-section 224 includes the error calculation circuit 226 and a coefficient update circuit 228. The error calculation circuit 226 computes the difference between known training sample d(n) and the output data y(n) to find an error e(n) at sample period n (e(n)=d(n)−y(n)). Furthermore, the coefficient update circuit 228 calculates an updated set of filter coefficients [c₀(n+1), . . . , c_N-1(n+1)] for sample period n+1 based on a coefficient update formula using the LMS algorithm. Specifically, the updated set of filter coefficients [c₀(n+1), . . . , c_N-1(n+1)] for sample period n+1 is computed as follows: [c₀(n+1), . . . , c_N-1(n+1)]=[c₀(n), . . . , c_N-1(n)]+μ[u(n−N+1), . . . , u(n)]conj(e(n)), based on a vector of the stored input data [u(n−N+1), . . . , u(n)] stored in the data storage 213, the current output data y(n), the complex conjugate of the error e(n), the current set of the filter coefficient at sampler period n [c₀(n), . . . c_N-1(n)], and an appropriate adaptation step size μ. The calculations of the error calculation circuit 226 and the coefficient update circuit 228 can be performed using the same clock as the clock applied to the digital delay line 212 or using multiple clocks different from the clock applied to the digital delay line 212. In the illustrated embodiment, the data storage 213 is configured as forming a flexible pipeline delay, and can hold the input data [u(n−N+1), . . . , u(n)] while the calculations of the error calculation circuit 226 and the coefficient update circuit 228 using the different clocks take time to finish.

As illustrated in FIG. 5, the data stream processing device 200 further includes a summation circuit (e.g., summation unit) P and an error calculation circuit ER. The summation circuit P calculates the sum (sum(M)) of the data values of the output data streams Y₀, Y₁, . . . , Y_M-1to generate an output data stream Y_shaving output data y_s(n) at sample period n. The error calculation circuit ER computes the difference between known training sample d_s(n) and the output data y_s(n) to find an error e_s(n) at sample period n (e_s(n)=d_s(n)−y_s(n)).

In the illustrated embodiment, the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) and the multiplexers Mux_C (Mux_C₀, . . . , Mux_C_M-1) are formed as a “switch”, “multiple input, single output switch,” “switching element”, “mux”, “signal selector”, or “data selector.” In the illustrated embodiment, each of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) and the multiplexers Mux_C (Mux_C₀, . . . , Mux_C_M-1) has first and second inputs. The first inputs of the multiplexers Mux_A₁, . . . , Mux_A_M-1receive the input data streams U₁, . . . , U_M-1, respectively. The second inputs of the multiplexers Mux_A₁, . . . , Mux_A_M-1are connected to the digital delay lines 212 of the equalizers E₀, E₁, . . . , E_M-2, respectively, to receive the past input data inputted via the input data stream U₀. On the other hand, the first inputs of the multiplexers Mux_C₀, . . . , Mux_C_M-1receive the errors e₀(n), . . . , e_M-1(n) from the error calculation circuits 226 of the equalizers E₀, E₁, . . . , E_M-2, respectively. The second inputs of the multiplexers Mux_C₀, . . . , Mux_C_M-1receive the error e_s(n) from the error calculation circuit ER.

In particular, the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) and Mux_C (Mux_C₀, . . . , Mux_C_M-1) each have a mode selection input MS for receiving a mode selection signal. The multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) each selectively couple either the first input or the second input to the data input 218 of a respective one of the equalizers E (E₁, . . . , E_M-1) in response to the mode selection signal. Furthermore, the multiplexers Mux_C (Mux_C₀, . . . , Mux_C_M-1) each selectively couple either the first input or the second input to an error input 230 of the coefficient update circuit 228 of a respective one of the equalizers E (E₁, . . . , E_M-1) in response to the mode selection signal.

The data stream processing algorithm (e.g., data stream processing method) of the data stream processing device 200 will be described by reference to FIG. 3. The data stream processing device 200 receives the input data stream U (step S10). In response, the digital delay lines 212 of the equalizers E output input data (e.g., data values) received via the data inputs 218, respectively. The control circuitry 211 determines whether the mode selection signal indicates the first mode (step S12). If the value of the mode selection signal is “0,” which indicates the first mode (Yes in step S12), then the data stream processing device 200 is operated in the first mode (step S14). On the other hand, if the value of the mode selection signal is “1,” which does not indicate the first mode (No in step S12), then the data stream processing device 200 is operated in the second mode (step S16).

In particular, if the value of the mode selection signal is “0,” which indicates the first mode, then the multiplexers Mux_A₁, . . . , Mux_A_M-1couple the first inputs to the data inputs 218 of the equalizers E₁, . . . , E_M-1, respectively, such that the input data streams U₁, . . . , U_M-1are inputted to the equalizers E₁, . . . , E_M-1via the data inputs 218 of the equalizers E₁, . . . , E_M-1, respectively. Furthermore, in this case, the multiplexers Mux_C₀, . . . , Mux_C_M-1couple the first inputs to the coefficient update circuits 228 of the equalizers E₁, . . . , E_M-1, respectively, such that the errors e₀(n), . . . , e_M-1(n) from the error calculation circuits 226 are inputted to the coefficient update circuits 228, respectively.

This allows the equalizers E₀, E₁, . . . , E_M-1to independently perform equalizing operations on the independent input data streams U₀, U₁, . . . , U_M-1, respectively. In this case, as illustrated in FIG. 5, the digital delay lines 212 of the equalizers E (E₀, E₁, . . . , E_M-1) output the input data u(n−N+1), u(n) of the input data streams U (U₀, U₁, . . . , U_M-1), respectively. More specifically, the digital delay line 212 of the equalizer E₀outputs a set of N input data u₀(n−N+1), . . . , u₀(n) of the input data stream U₀, the digital delay line 212 of the equalizer E₁outputs a set of N input data u₁(n−N+1), . . . , u₁(n) of the input data stream U₁, . . . , and the digital delay line 212 of the equalizer E_M-1outputs a set of N input data u_M-1(n−N+1), . . . , u_M-1(n) of the input data stream U_M-1. Furthermore, the data storages 213 of the equalizers E₀, E₁, . . . , E_M-1store the input data of the input data streams U₀, U₁, . . . , U_M-1, respectively. More specifically, the data storage 213 of the equalizer E₀stores a history of N input data u₀(n−N+1), . . . , u₀(n) of the input data stream U₀, the data storage 213 of the equalizer E₁stores a history of N input data u₁(n−N+1), . . . , u₁(n) of the input data stream U₁, . . . , and the data storage 213 of the equalizer E_M-1stores a history of N input data u_M-1(n−N+1), . . . , u_M-1(n) of the input data stream U_M-1. Furthermore, the processing units 214, the error calculation circuits 226 and the coefficient update circuits 228 of the equalizers E₀, E₁, . . . , E_M-1perform processing (calculations) illustrated in FIG. 5, respectively, to output the output data streams Y₀, Y₁, . . . , Y_M-1, respectively. Thus, in this case, the control circuitry 211 (the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1)) simultaneously provides data values of the independent input data streams U₀, U₁, . . . , U_M-1to the data inputs 218 of the equalizers E₀, E₁, . . . , E_M-1, respectively, such that the equalizers E₀, E₁, . . . , E_M-1independently perform equalizing operations on the independent input data streams U₀, U₁, . . . , U_M-1, respectively, and independently output the output data streams Y₀, Y₁, . . . , Y_M-1from the data outputs 220 of the equalizers E₀, E₁, . . . , E_M-1, respectively. Therefore, in this case, the equalizers E (E₀, E₁, . . . , E_M-1) form an array of M equalizers arranged in parallel to each other.

In other words, the corresponding pairs of the digital delay lines 212 and the processing units 214 form a plurality of equalizers for the input data streams U₀, U₁, . . . , U_M-1, respectively, in response to the mode selection signal indicating the first mode. Furthermore, in the illustrated embodiment, the processing units 214 update the coefficient values for generating the output data streams Y₀, Y₁, . . . , Y_M-1based on the errors (e.g., error values) received via the error inputs 230, respectively. The control circuitry 211 provides the errors e(n) (e.g., error values) between the output data y(n) of the processing units 214 and the training samples or reference signals d(n) to the error inputs 230 of the processing units 214, respectively, in response to the mode selection signal indicating the first mode. Furthermore, in the illustrated embodiment, the processing units 214 generate the output data streams Y₀, Y₁, . . . , Y_M-1by applying the coefficient values to the data values of the independent input data streams U₀, U₁, . . . , U_M-1, respectively, in response to the mode selection signal indicating the first mode.

On the other hand, if the value of the mode selection signal is “1,” which indicates the second mode (or does not indicate the first mode), then the multiplexers Mux_A₁, . . . , Mux_A_M-1couple the second inputs to the data inputs 218 of the equalizers E₁, . . . , E_M-1, respectively, such that the digital delay lines 212 of the equalizers E₀, E₁, . . . , E_M-1are cascaded to each other. Thus, the digital delay lines 212 of the equalizers E₀, E₁, . . . , E_M-1are serially connected to each other to output a set of MN input data u₀(n−MN+1), . . . , u₀(n) inputted via the input data stream U₀. In particular, the digital delay line 212 of the equalizer E₀outputs a set of N input data u₀(n−N+1), . . . , u₀(n) of the input data stream U₀, the digital delay line 212 of the equalizer E₁output a set of N input data u₀(n−2N+1), . . . , u₀(n−N) of the input data stream U₀, . . . , and the digital delay line 212 of the equalizer E_M-1outputs a set of N input data u₀(n−MN+1), . . . , u₀(n−(M−1)N) of the input data stream U₀. Similarly, the data storage 213 of the equalizer E₀stores a history of N input data u₀(n−N+1), . . . , u₀(n) of the input data stream U₀, the data storage 213 of the equalizer E₁stores a history of N input data u₀(n−2N+1), . . . , u₀(n−N) of the input data stream U₀, . . . , and the data storage 213 of the equalizer E_M-1stores a history of N input data u₀(n−MN+1), . . . , u₀(n−(M−1)N) of the input data stream U₀. Furthermore, in this case, the multiplexers Mux_C₀, . . . , Mux_C_M-1couple the second inputs to the coefficient update circuits 228 of the equalizers E₀, E₁, . . . , E_M-1, respectively, such that the coefficient update circuits 228 receive the error e_s(n) from the error calculation circuit ER.

In this case, in each of the equalizers E (E₀, E₁, . . . , E_M-1), the coefficient update circuit 228 independently calculates an updated set of filter coefficients [c₀(n+1), . . . , c_N-1(n+1)] based on the coefficient update formula using the LMS algorithm. Specifically, the updated set of filter coefficients [c₀(n+1), . . . , c_N-1(n+1)] is computed based on a vector of the stored input data stored in the data storage 213, the current output data y(n), the complex conjugate of the error e_s(n), the current set of the filter coefficient at sampler period n [c₀(n), . . . c_N-1(n)], and an appropriate adaptation step size μ. Also, in each of the equalizers E (E₀, E₁, . . . , E_M-1), the processing unit 214 independently calculates the current output data y(n) based on the same formula illustrated in FIG. 5, but using the updated set of filter coefficients updated by the coefficient update circuit 228 and the input data outputted from the digital delay line 212. Thus, in this case, the control circuitry 11 (the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1)) simultaneously provides a plurality of (M) successive groups of data values of the input data stream U₀to the data inputs 218 of the equalizers E₀, E₁, . . . , E_M-1. With this configuration, the equalizers E₀, E₁, . . . , E_M-1perform equalizing operations on the successive groups of data values of the input data stream U₀, which is equivalent to performing equalizing operation on the input data stream U₀with a combined equalizer of the equalizers E₀, E₁, . . . , E_M-1. Furthermore, the equalizers E₀, E₁, . . . , E_M-1output the output data streams Y₀, Y₁, . . . , Y_M-1from the data outputs 220 of the equalizers E₀, E₁, . . . , E_M-1, respectively. Moreover, in this case, the summation circuit P calculates the sum (sum(M)) of the data values of the output data streams Y₀, Y₁, . . . , Y_M-1to generate an output data stream Y_shaving output data y_s(n) at sample period n, which is an output of the combined equalizer with MN taps at sample period n. Therefore, in this case, the equalizers E (E₀, E₁, . . . , E_M-1) form a single equalizer.

In other words, the digital delay lines 212 and the processing units 214 form a single equalizer for the input data stream U₀in response to the mode selection signal not indicating the first mode. Furthermore, the control circuitry 211 provides the error e_s(n) (e.g., error value) between the sum of the data values (e.g., output data) of the output data streams Y₀, Y₁, . . . , Y_M-1of the processing units 214 and the training sample or reference signal d_s(n) to the error inputs 230 of the processing units 214, respectively, in response to the mode selection signal not indicating the first mode. Furthermore, in the illustrated embodiment, the processing units 214 generate the output data streams Y₀, Y₁, . . . , Y_M-1by applying the coefficient values to the successive groups of the data values of the input data stream U₀, respectively, in response to the mode selection signal not indicating the first mode.

In the illustrated embodiment, the M equalizers E (E₀, E₁, . . . , E_M-1) with N taps are combined into a single equalizer of MN taps in a manner similar to the first embodiment, in which an array of M FIR filters of N taps are combined into a single FIR filter of MN taps. To combine the M equalizers E (E₀, E₁, . . . , E_M-1), the output data from the equalizers E (E₀, E₁, . . . , E_M-1) are summed in a manner similar to the first embodiment, in which the M FIR filters are combined. In the illustrated embodiment, to combine the adaptation sub-sections 224 of the M individual equalizers E (E₀, E₁, . . . , E_M-1), a common error e_s(n) is produced from the difference between the training sample d_s(n) and the output data y_s(n) of the combined equalizer. This common error e_s(n) is then used to update all coefficient sets of all M equalizers E (E₀, E₁, . . . , E_M-1), which maintain their own independent storages for the input data or samples u(n) and the set of the filter coefficients [c₀(n), . . . , c_N-1(n)]. In the illustrated embodiment, the switch between the first mode in which the M equalizers E (E₀, E₁, . . . , E_M-1) operate independently and the second mode in which the M equalizers E (E₀, E₁, . . . , E_M-1) are combined into a single equalizer is effected by setting the value of the mode selection signal to “1” inputted to the multiplexors Mux_A and Mux_C.

With the data stream processing device 200, the equalizer configuration of the linear channel equalizers E (E₀, E₁, . . . , E_M-1) can be reconfigurable. Thus, the signal processing (equalizing) resources of the data stream processing device 200 can be efficiently utilized. In particular, an efficient implementation to combine the LMS linear channel equalizers E (E₀, E₁, . . . , E_M-1) having N-tap filter length into a single LMS linear channel equalizer with MN-tap length filter can be provided.

Thus, even if some of the signal processing (equalizing) resources of the data stream processing device 200 (e.g., some of the equalizers E (E₀, E₁, . . . , E_M-1)) cannot be utilized for processing many parallel data streams due to certain operating conditions, such as bandwidth allocation or network configuration, the unused signal processing (equalizing) resources can be efficiently reallocated to process a data stream by effectively multiplying the number of taps of the signal processing (equalizing) resources in use, which enhances equalizing performance with minimal increases in implementation complexity, area, and power. In particular, the liner channel equalizers generally not only have a filtering section, but also a coefficient update unit. Reallocating of the signal processing (equalizing) resources of unused parallel equalizers to a single equalizer multiplies both the tap length of the filtering section and the number of the coefficient update units. Thus, the performance can be improved without slowing down the rate of the coefficient update even when the number of the filter coefficients has multiplied.

Generally, when a plurality of liner channel equalizers are provided in a device, each equalizer computes an error between the output and the training sample or reference to update filter coefficients. If two equalizers are daisy-chained together, then each adaptive filter section needs to compute its own error, which is computationally inefficient. On the other hand, with the data stream processing device 200, two or more filtering sections can be combined into a single filtering section with a single coefficient set.

In the illustrated embodiment, the data stream processing device 200 only additionally needs a summation (sum(M)) by the summation circuit P, a difference computation for the error e_s(n) by the error calculation circuit ER, and a switching by the multiplexors Mux C to combine the M equalizers E (E₀, E₁, . . . , E_M-1) of N taps into a more powerful equalizer of MN taps. Thus, with the data stream processing device 200, the data storages and the digital delay lines for input data and all of the coefficient update facilities can be efficiently reused.

In the illustrated embodiment, the equalizers E (E₀, E₁, . . . , E_M-1) form a single equalizer in response to all of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) receiving the same mode selection signal with the same value “1”. However, the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) can be configured to independently receive different mode selection signals with different values “0” and “1” to form a plurality of groups of the equalizers E (E₀, E₁, . . . , E_M-1) each forming a single equalizer. For example, if all of the multiplexers Mux_A (Mux_A₁, . . . , Mux_A_M-1) receive the same mode selection signal with the same value “1” except for a k-th multiplexer Mux_A_k, and the k-th multiplexer Mux_A_kreceives the mode selection signal with the value “0,” then a first group of the equalizers E₀, . . . , E_k-1can form a first single equalizer for the input data stream U₀and a second group of the equalizers E_k, . . . , E_M-1can form a second single equalizer for the input data stream U_k. In this case, the summation circuit P calculates the sum of the data values of the output data streams Y₀, . . . , Y_k-1to generate a first output data stream Y_s1as an output of the first single equalizer with kN taps, and calculates the sum of the data values of the output data streams Y_k, . . . , Y_M-1to generate a second output data stream Y_s2as an output of the second single equalizer with (M−k)N taps. In this case, the equalizers E (E₀, E₁, . . . , E_M-1) are grouped into two groups, but can also be grouped into more than two groups.

The foregoing data stream processing devices 10, 100 and 200 can be included in a demodulator of a communication signal receiver. For example, the demodulator can be configured to receive communication signal, and perform analog-to-digital conversion of the communication signal to produce the input data streams U (U₀, U₁, . . . , U_M-1). Also, the foregoing data stream processing devices 10, 100 and 200 can be included in a modular of a communication signal transmitter. Likewise, the foregoing data stream processing devices 10, 100 and 200 can be included in a digital filter, such as, but not limited to, in a device for audio or radar signal processing.

The foregoing data stream processing devices 10, 100 and 200 can be implemented in an application specific integrated circuit (ASIC). The foregoing data stream processing devices 10, 100 and 200 can be implemented and/or included in a field programmable gate array (FPGA) device or other such programmable hardware logic device. A bitstream can include data therein which, when received by an FPGA device or other such programmable hardware logic device, causes the receiving device to implement and/or include one or more of the foregoing data stream processing devices 10, 100 and 200. A machine-readable medium can include such a bitstream recorded therein.

A machine-readable medium can include a netlist or other such hardware specification recorded thereon that when processed by suitable software and/or hardware to instantiate an ASIC device or configure an FPGA device or other such programmable hardware logic device, the device implements and/or includes one or more of the foregoing data stream processing devices 10, 100 and 200. A machine-readable medium can include a device specification in a hardware description language such as, but not limited to, VHDL or Verilog, recorded thereon that when processed by suitable software and/or hardware to instantiate an ASIC device or configure an FPGA device or other such programmable hardware logic device, the device implements and/or includes one or more of the foregoing data stream processing devices 10, 100 and 200. A computer-implemented method can automatically generate the foregoing netlist or other such hardware specification or device specification in a hardware description language. For example, the computer-implemented method can perform an algorithm to automatically determine appropriate couplings among elements of one or more of the foregoing data stream processing devices. Instructions can be recorded on a machine-readable medium which, when executed by one or more computer processors, cause the one or more computer processors to perform the computer-implemented method.

In understanding the scope of the present invention, the term “comprising” and its derivatives, as used herein, are intended to be open ended terms that specify the presence of the stated features, elements, components, groups, integers, and/or steps, but do not exclude the presence of other unstated features, elements, components, groups, integers and/or steps. The foregoing also applies to words having similar meanings such as the terms, “including”, “having” and their derivatives. Also, the terms “part,” “section,” “portion,” “member” or “element” when used in the singular can have the dual meaning of a single part or a plurality of parts. Also, the term “detect” as used herein to describe an operation or function carried out by a component, a section, a device or the like includes a component, a section, a device or the like that does not require physical detection, but rather includes determining, measuring, modeling, predicting or computing or the like to carry out the operation or function. The term “configured” as used herein to describe a component, section or part of a device includes hardware and/or software that is constructed and/or programmed to carry out the desired function. The terms of degree such as “substantially”, “about” and “approximately” as used herein mean a reasonable amount of deviation of the modified term such that the end result is not significantly changed.

While only selected embodiments have been chosen to illustrate the present invention, it will be apparent to those skilled in the art from this disclosure that various changes and modifications can be made herein without departing from the scope of the invention as defined in the appended claims. For example, the size, shape, location or orientation of the various components can be changed as needed and/or desired. Components that are shown directly connected or contacting each other can have intermediate structures disposed between them. The functions of one element can be performed by two, and vice versa. The structures and functions of one embodiment can be adopted in another embodiment. It is not necessary for all advantages to be present in a particular embodiment at the same time. Every feature which is unique from the prior art, alone or in combination with other features, also should be considered a separate description of further inventions by the applicant, including the structural and/or functional concepts embodied by such feature(s). Thus, the foregoing descriptions of the embodiments according to the present invention are provided for illustration only, and not for the purpose of limiting the invention as defined by the appended claims and their equivalents.

Number	Name	Date	Kind
4733403	Simone	Mar 1988	A
6473474	Wiegand	Oct 2002	B1
7756197	Ferguson	Jul 2010	B1
7793013	Esposito	Sep 2010	B1
8116820	Lee	Feb 2012	B2
8200181	Khlat	Jun 2012	B1
9954698	Huang et al.	Apr 2018	B1
10613205	Mortensen	Apr 2020	B2
20020161806	Shaikh	Oct 2002	A1
20030021340	McElroy	Jan 2003	A1
20030072363	McDonald	Apr 2003	A1
20030083583	Kovtun	May 2003	A1
20050105507	Clements	May 2005	A1
20060176947	Lim	Aug 2006	A1
20070053380	Graham	Mar 2007	A1
20080013638	Walton	Jan 2008	A1
20080165907	Sobchak	Jul 2008	A1
20080285640	McCallister	Nov 2008	A1
20140045541	Moshfeghi	Feb 2014	A1
20140126323	Li	May 2014	A1
20160065311	Winzer	Mar 2016	A1
20180013540	Giaconi	Jan 2018	A1
20180262371	Li	Sep 2018	A1
20190140706	Chang	May 2019	A1
20190348970	Skinner	Nov 2019	A1
20200136865	Huang	Apr 2020	A1

Data stream processing device with reconfigurable data stream processing resources and data stream processing method

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

CPC

International Classifications

Term Extension

Abstract

Description

Claims

US Referenced Citations (26)

Related Publications (1)