Information
-
Patent Application
-
20030126169
-
Publication Number
20030126169
-
Date Filed
December 28, 200123 years ago
-
Date Published
July 03, 200321 years ago
-
CPC
-
US Classifications
-
International Classifications
Abstract
Embodiments of a progressive 2D pyramid filter bank are disclosed.
Description
BACKGROUND
[0001] This disclosure is related to pyramid filter implementations.
[0002] A filter bank may comprise M different filters, where M is a finite number and larger than 1. Because the filter bank may generate M different output signal samples substantially simultaneously from the same input signal sample or samples, it allows the capability to select a desired signal sample output from M signal sample outputs in real-time. The application of a filter bank is, therefore, popular in reprographics systems, such as photocopying machines, for example. Unfortunately, since the computation of such a filter is complicated and the number of filters in a filter bank may also be large, the number of computations employed may be significant.
BRIEF DESCRIPTION OF THE DRAWINGS
[0003] Subject matter is particularly pointed out and distinctly claimed in the concluding portion of the specification. The claimed subject matter, however, both as to organization and method of operation, together with objects, features, and advantages thereof, may best be understood by reference of the following detailed description when read with the accompanying drawings in which:
[0004]
FIG. 1 is a diagram illustrating the coefficients for one-dimensional (ID) pyramid filters;
[0005]
FIG. 2 is a table of filter coefficients for a 5-filter pyramid filter bank;
[0006]
FIG. 3 is a table of input signal sample-filter coefficient products for a 5-filter pyramid filter bank;
[0007]
FIG. 4 is a table of column filter data for an embodiment of a 5-filter progressive pyramid filter bank;
[0008]
FIG. 5 is a schematic diagram of an embodiment of a 5-filter two-dimensional (2D) progressive pyramid filter bank;
[0009]
FIG. 6 is a schematic diagram showing a portion of the embodiment of FIG. 5 in greater detail; and
[0010]
FIG. 7 is a table comparing a traditional 2D filter bank with an embodiment of a progressive 2D filter bank.
DETAILED DESCRIPTION
[0011] In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the claimed subject matter. However, it will be understood by those skilled in the art that the claimed subject matter may be practiced without these specific details. In other instances, well-known methods, procedures, components and circuits have not been described in detail in order so as not to obscure the claimed subject matter.
[0012] An embodiment for an efficient implementation of a progressive pyramid 2D filter bank is presented. In this embodiment, the number of computations and the execution time is reduced over traditional two-dimensional (2D) pyramid filter bank implementations. Furthermore, hardware and software implementations of this particular embodiment are disclosed.
[0013] As is well-known, a pyramid filter is a special filter. Its coefficients form two arithmetic series and are symmetric to the center coefficient. For a one-dimensional finite pyramid filter with (2n−1) coefficients, its coefficients may be represented by a 1 x (2n−1) matrix [C1, C2, C3, . . . , Cn−1, Cn, Cn−1, . . . , C3, C2, C1]; where, n is a positive integer and C2−C1=C3−C2=. . . =Cn−Cn−1. For its corresponding two-dimensional separable filter, the coefficients may be represented by the product of a (2n−1)×1 and a 1×(2n−1) matrix using the elements from the one-dimensional filter.
[0014] Typically, a filter bank is comprised of a number of pyramid filters and their filter lengths are 3, 5, 7, 9, and so on. The coefficients of these filters also may have the following features:
[0015] 1. All are positive integers.
[0016] 2. The smallest coefficient is 1 and is the first and last coefficient in the coefficient series. The largest coefficient is dependent on the number of filter taps, or filter length, and is the center coefficient.
[0017] 3. The difference between any two consecutive coefficients is 1.
[0018]
FIG. 1 illustrates the coefficients of various one-dimensional pyramid filters in a filter bank. In this context, one-dimensional pyramid filters and two-dimensional separable pyramid filters are referred as lD-filters, and 2D-filters, respectively.
[0019] The 1D-filter output signals, in the form of signal samples, is the product of its input data and filter coefficients. Eq. 1 and Eq. 2, below, respectively show how to compute the output signal samples of a 3-tap column and a 3-tape row 1D-filter, respectively. Eq. 3, below, shows how to compute the output signal samples of a 3-tap 2D-filter, designated Yi,j, by using a 2D-filter. Eq. 4, below, shows how to generate the output signals of a 3-tap 2D-filter output, designated Zi,j, by using, instead, two ID-filters, one as a column filter and another as a row filter. This is possible because a 2D pyramid filter in this context comprises a separable function, which allows the computation of a 2D transformation in two operations, first by a row-wise ID transformation of rows followed by a column-wise ID transformation of columns, although the order of operations may also be reversed. Comparing Zi,j with Yi,j, they are the same; however, applying a two-ID-filter scheme has the following advantages:
[0020] 1. 1D-matrix multiplication is simpler than 2D-matrix multiplication.
[0021] 2. 1D-column-filter output signal samples are reusable for the next 1D-row-filter computations.
[0022] Since the column-filter data is reusable, to compute an N×N filter involves computing one new column-filter from raw data and one row-filter from the column-filter output signal samples as long as the other previous (N−1) column-filter output signal samples exist. As described in more detail hereinafter, this makes computation easier.
1
[0023] A traditional 2D-filtering scheme will typically use multiplication and addition, for software implementation, or a multiplier-and-accumulator (MAC), for hardware implementation, to implement column and row-filter computations.
[0024] Generally, a filter bank with M output signal samples has N=2M+1 input signal samples or coefficients. Here, the coefficients of a N-tap ID-filter are denoted C1, C2, . . . CN, the input data are denoted D1, D2, . . . DN, the column-filter data are denoted K1, K2, . . . KN, and the 2D output signal samples are denoted ON. In accordance with the description above, to get 2D-filter output data or signal samples, one column-filter output signal sample, KN, and one row-filter output signal sample, ON, are computed from the previous column-filter output signal samples, K1, K2, . . . KN−1. For example,
[0025] 1. Column filter:
[0026] Let KN=C1×D1+C2×D2 . . . CN×DN and push or store KN in a FIFO, such as one designated FIFO N in this example.
[0027] 2. Row filter:
[0028] Let ON=C1×K1+C2×K2 . . . CN×KN and output signal samples ON.
[0029] From this approach, an N-tap column-filter employs N multiplications and N−1 additions, for a software implementation, or one MAC in (2N−1) clocks, for a hardware implementation, with the assumption that the MAC takes two clocks to do one multiplication and one addition. The operations of row-filters are similar to that of the column-filter. Therefore, this doubles the number of software computations or hardware MACs, depending upon the implementation, in order to get 2D-filter output data, which is the row filter output signal samples.
[0030] Thus, for an implementation in software, a 3-tap 2D-filter employs 6 multiplications and 4 additions, a 5-tap 2D-filter employs 10 multiplications and 8 additions, and so on. Totally, the filter bank with M filters, thus, employs, for this embodiment:
[0031] 6+10+14+ . . . +2(2M+1)=2M2+4M multiplications,
[0032] and
[0033] 4+8+12+ . . . +2(2M+1−1)=2M2+2M additions.
[0034] For an implementation in hardware, a N-tap filter employs 2 MACs and (2N−1) clocks to produce the desired output signal samples for a 2D-filter. Totally, the filter bank with M filters, thus, employs, for this embodiment:
[0035] 2M MACs, with a MAC containing one multiplier and one 2-input adder, and
[0036] 2(2M+1)−1=(4M+1) clocks, which is also the number of clocks for the largest-tap filter, N=(2M+1), because all MACs can compute substantially simultaneously.
[0037] In alternative embodiments, it may also be possible to reduce the number of MACs but it will increase the number of clocks. Although the hardware embodiments discussed are those that execute in the smallest number of clock cycles, the claimed subject matter is not limited in scope in this respect.
[0038]
FIG. 2 provides the coefficients for a filter bank with 5 filters. The input data for separated filters is also listed. The number of input samples for such a filter is equal to the number of coefficients of that filter. An input data or signal sample and its corresponding coefficient are put in the same column in FIG. 2. For example, for the F—7, the input data is [D−3, D−2, D−1, D0, D1, D2, D3] and the corresponding coefficients are [1, 2, 3, 4, 3, 2, 1]. According to the Eqs. 1 or 2, the 1D-filtered data are the sum of products of an input data and its corresponding coefficient. In a shadowed box of FIG. 3, there is a single product of input data and coefficient. To get the filter output signal sample, add up the input-coefficient-products in a row of the table. For example, the output signal sample of F—7 is (D−3+2D−2+3D−1+4D+3D1+2D2+D3).
[0039] On one particular embodiment, he observation above regarding FIG. 3 provides a technique for producing the input and output signal samples of an individual filter by the following rules:
[0040] 1. Assume there is virtual filter, F—1, whose output signal sample is the input signal sample, D0.
[0041] 2. The sum of input signal samples of a filter is obtained by adding its first and last input signal samples to the sum of input samples of its next lower-tap filter.
[0042] 3. The output signal sample of a filter is the sum of its input signal samples and the output signal sample of its next lower-tap filter.
[0043] For example, the sum of input signal samples of F—3 is (D−1, +D0, +D1), the output signal sample of F 3 is (D−1+2D0+D1), and the first and last input signal samples of F—5 are D−2 and D2. From rule 2, the sum of input signal samples of F—5 is (D−2+D−1+D0+D1+D2), and then, from rule 3, the output signal sample of F—5 is (1D−2+2D−1+3D0+2D1+1D2).
[0044] Based on the rules just mentioned, computing becomes straight-forward by computing the filter output signal sample from the lowest-tap filter and progressing to the next higher-tap filters. As described in more detail hereinafter, a filter bank with 5 filters is used to demonstrate how to compute the output signal samples of the lD-filters, although this is merely an example and does not limit the scope of the claimed subject matter. For example, assume K3,F and S3 are, respectively, the output signal sample for column F and sum of the input signal samples of F—3, K5,F and S5 are, respectively, the output signal sample for column F and sum of input signal samples of F—5, and so on. Example pseudo code of a column-filter is as follows:
[0045] ColumnFilter(F)
[0046] Begin//totally, 15 additions are employed.
[0047] Let K1,F=D0 and S3=D−1+D0+D1;
[0048] Get K3,F=K1,F+S3 and S5=D2+S3+D2;
[0049] Get K5,F=K3,F+S5 and S7=D−3+S5+D3;
[0050] Get K7,F=K5,F+S7 and D−4+S7+D4;
[0051] Get K11,F=K9,F+S11;
[0052] End
[0053] From above, the column-filter data for column 5, K3,F, K5,F, K7,F, K9,F, and K11,F, is obtained by fifteen additions, for software, or one 2-input adder and one 3-input adder in six clocks, for hardware. The above data for a filter bank with M filters leads to M output signal samples by:
[0054] (3×M) additions, for a software implementation, or
[0055] One 2-input adder and one 3-input adder in (M+1) clocks, for a hardware implementation.
[0056] In one particular embodiment, KN,F are pushed into separated FIFOs as described in more detail hereinafter, although, of course, the claimed subject matter is limited in scope in this respect. For example, the contents of FIFOs, after completing 11 columns of raw data, are listed in FIG. 4. The FIFOs are employed to store data relevant to the filters, although, in alternative embodiments, other storage techniques or hardware may be employed other than FIFOs. For example, FIFO 3 in the table is for filter F—3, FIFO 5 is for F—5, and so on.
[0057] As described in more detail hereinafter, the KN,F will be passed, in this embodiment, to dedicated row filters for producing the 2D-filter output signal samples, although, again, the claimed subject matter is not limited in scope in this respect. For example, it is not necessary that dedicated row filters be employed. The computations for the row-filter are similar to that for the column-filter. However, unlike for the column-filter, the N-tap row-filter computes the center N data in thr FIFO. For example, the 3-tap row-filter employs [K3,−1, K3,0, K3,1]. Because an N-tap filter employs N input signal samples around the center, input signal samples beyond that range are ignored in this embodiment. This is depicted in FIG. 4 through the use of shading.
[0058] Assume O3 and O5 are 2D-output signal samples from 3-tap and 5-tap row-filters, respectively. Example pseudo code to produce these signal samples is provided below. For the other tap row-filters, output signal samples may be derived similarly:
1|
|
RowFilter(N)
Begin
switch(N)
{
case 3:// 3 additions in total
Let O3 = K3,0 and S3 = K3,−1 + K3,0 + K3,1;
Get O3 = O3 + S3; break;
case 5:// 6 additions in total
Let O5 = K5,0 and S5 = K5,−1 + K5,0 + K5,1;
Get O5 = O5+ S5 and S5 = K5,−2 +
K5,0 + K5,2;
Get O5 = O5+ S5; break;
case 7: . . . .
case 9: . . . .
. . . .
}
End.
|
[0059] Therefore, for a software implementation of a filter bank with M filters, the row-filters employ (3×1+3×2+ . . . +3×M)=3M(M+1)12 additions and the column-filter employs (3×M) additions. In total, 3M(M+3)/2 additions are employed in order to get M 2D-filter data for (2M+1)×(2M+1) matrix input signal samples. For a hardware implementation, a column or row filter contains a 2-input adder and a 3-input adder. Thus, the data of one column and M row-filters may be computed concurrently because they are independent from each other. In the other words, the total clocks employed for a filter bank with M filters may be the same as the number of clocks employed for the longest-tap filter, for example, the F—11 in the example above. Therefore, a hardware implementation may employ:
[0060] 1. (M+1) column and row filters, a filter containing one 2-input adder and one 3-input adder.
[0061] 2. (M+1) clocks to compute M 2D-filter output signal samples for (2M+1)×(2M+1) matrix input signal samples.
[0062] With the approach of the previously described embodiment, the computation for 2D-filter bank becomes relatively straight-forward. One possible software implementation is explained below in the form of pseudo-code, although, this is just an example, and the claimed subject matter is not limited in scope to this implementation. The progressive 2D-filter bank scheme is illustrated for this embodiment by the italics. In order to easily explain the pseudo-code, assume there are M filters in filter bank and the filter taps are 3, 5, . . . and (2M+1), respectively.
[0063] Begin
[0064] Set M=number of filters in filter bank;
[0065] Input (2M+1) rows of raw data and save them in buffer;
[0066] do{
[0067] Set K=the number of columns of raw data;
[0068] for(n=0; n<(2M+1); n++) ColumnFiter(n); //compute 1st (2M+1) column-filters
[0069] do{
[0070] RowFilter(3); //compute M 2D-filter output signal samples for 1 pixel.
[0071] RowFilte(5);
[0072] RowFilter(2M+1);
[0073] ColumnFilter(n++); //compute next column-filter for next M 2D-filter output signal samples
[0074] }while(n<K); I/repeat until complete all columns (all 2D-filter outputs in row)
[0075] Discard the 1st row data in the buffer;
[0076] Input one new row of raw data, save it as the last row data in the buffer;
[0077] } while (not last row); //repeat until complete all rows
[0078] End.
[0079]
FIG. 5 shows a block diagram of an embodiment of a progressive 2D-filter bank with 5 filters. The unit names of column and row FIFOs are [F5, F4, F3, F2, F1, F0, F−1, F−2, F−3, F−4, F−5]. The units with the same names but in different FIFOs contain different data. The detail of the controller is not shown in order not to obscure the claimed subject matter; however, in this embodiment, the controller is able to make the FIFOs and column raw data array repeat the following pattern:
[0080] 1. 1st clock: output data in F−1, F0, and F1.
[0081] 2. 2nd clock: output data in F−2, and F2,
[0082] 3. 3rd clock: output data in F3−, and F3,
[0083] 4. . . . and soon.
[0084] The circuitry in column filter and row filters are similar but the 3-tap row-filter outputs data at the 2nd clock, the 5-tap row-filter outputs data at the 3rd clock, and so on. The column-filter pushes its output data to FIFO-3 at the 2nd clock, to FIFO-5 at the 3rd clock and so on. The diagram of one possible embodiment of a column filter or row filter is shown in FIG. 6, where Dn represents the data output from the unit Fn in FIFO and n is an integer between −5 and 5. Of course, the claimed subject matter is not limited in scope to the a-column or row filter embodiment shown.
[0085] The previously described embodiment of a 2D-filter scheme for a pyramid filter bank has several advantages over the traditional 2D-filter in terms of the number of computations and computing speed for software and hardware implementations. Referring to FIG. 7, the advantages of the progressive scheme for filter bank include the following:
[0086] Reduced number of computations: For a software implementation, the previously described embodiment of a progressive 2D-filter bank reduces the numbers of computations. For example, the reducing ratio is about 2:1 (130:60) if M=5;
[0087] Increased computing speed: For a software implementation, the previously described embodiment of a progressive 2D-filter bank utilizes less time to calculate because of both a fewer number of computations and because additions are employed instead of multiplications. For a hardware implementation, its execution time is about one quarter (6:21) of that for the traditional one if M=5;
[0088] Reduction in number of gates: For a hardware implementation of a filter, the gate count of a multiplier is larger than that of an adder. As indicated, therefore, the previously described embodiment of a progressive 2D-filter may avoid using multipliers or multiplication and reduce gate count.
[0089] It will, of course, be understood that, although particular embodiments have just been described, the claimed subject matter is not limited in scope to a particular embodiment or implementation. For example, one embodiment may be in hardware, whereas another embodiment may be in software. Likewise, an embodiment may be in firmware, or any combination of hardware, software, or firmware, for example. Likewise, although the claimed subject matter is not limited in scope in this respect, one embodiment may comprise an article, such as a storage medium. Such a storage medium, such as, for example, a CD-ROM, or a disk, may have stored thereon instructions, which when executed by a system, such as a computer system or platform, or a computing system, for example, may result in an embodiment of a method in accordance with the claimed subject matter being executed, such as an embodiment of a method of filtering pixel data, for example, as previously described. For example, an image processing platform or an image processing system may include an image processing unit, an image or video input/output device and/or memory.
[0090] While certain features of the claimed subject matter have been illustrated and described herein, many modifications, substitutions, changes and equivalents will now occur to those skilled in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the claimed subject matter.
Claims
- 1. A method of implementing a pyramid filter bank comprising: first adding a first and a last input signal sample to a sum of input samples of a next lower-tap filter of a current filter to produce a sum of input signal samples for the current filter; and
second adding the sum of input signal samples for the current filter to an output signal sample of the next lower-tap filter of the current filter to produce an output signal sample for the current filter.
- 2. The method of claim 1, wherein the first and second adding is performed by different adders.
- 3. The method of claim 2, wherein the pyramid filter bank comprises a two-dimensional pyramid filter bank and the first and second adding is applied by column and by row.
- 4. The method of claim 3, wherein the column and the row adding is performed independently.
- 5. The method of claim 1, wherein the pyramid filter bank comprises a two-dimensional (2D) pyramid filter bank and the first and second adding is applied by column and by row.
- 6. The method of claim 5, wherein the first and second adding is performed progressively.
- 7. The method of claim 1, wherein the first and second adding is applied by row.
- 8. The method of claim 1, wherein the first and second adding is applied by column.
- 9. An article comprising: a storage medium, said storage medium having stored thereon instructions, that, when executed result in: first adding a first and a last input signal sample to a sum of input samples of a next lower-tap filter of a current filter to produce a sum of input signal samples for the current filter; and second adding the sum of input signal samples for the current filter to an output signal sample of the next lower-tap filter of the current filter to produce an output signal sample for the current filter.
- 10. The article of claim 9, wherein the instructions, when executed, further result in the first and second adding being performed by different adders.
- 11. The article of claim 10, wherein the instructions, when executed, further result in the current filter comprising a two-dimensional pyramid filter bank and the first and second adding being applied by column and by row.
- 12. The article of claim 11, wherein the instructions, when executed, further result in the column and the row adding being performed independently.
- 13. The article of claim 9, wherein the instructions, when executed, further result in the current filter comprising a two-dimensional (2D) pyramid filter bank and the first and second adding being applied by column and by row.
- 14. The article of claim 13, wherein the instructions, when executed, further result in the first and second adding being performed progressively.
- 15. The article of claim 9, wherein the instructions, when executed, further result in the first and second adding being applied by row.
- 16. The article of claim 9, wherein the instructions, when executed, further result in the first and second adding being applied by column.
- 17. An integrated circuit comprising:
digital logic circuit components coupled so that, during operation, a first and a last input signal sample are added to a sum of input samples of a next lower-tap filter of a current filter to produce a sum of input signal samples for the current filter and so that the sum of input signal samples for the current filter are added to an output signal sample of the next lower-tap filter of the current filter to produce a n output signal sample for the current filter.
- 18. The integrated circuit of claim 17, wherein the digital logic components include a multiplexer, two flip-flops, a two-input adder and a three-input adder.
- 19. The integrated circuit of claim 17, wherein, during operation, the current filter comprises a two-dimensional pyramid filter bank and the adding is applied by column and by row.
- 20. The integrated circuit of claim 19, wherein, during operation, the column and the row adding is performed independently.