This invention relates to the field of digital sample rate conversion. More particularly, the invention relates to a novel and improved way of sample rate converting digitally sampled audio or video signals without restrictions on the conversion ratios.
With the recent diversification in digital motion picture formats, such as DVD-video or digital TV broadcasting, the need for high-speed sampling-rate-change algorithm is growing. For example, video with 16:9 aspect ratio must be vertically squeezed to fit into conventional 4:3 display. Also, high-resolution signals such as DV (Digital Video), digital broadcast, or others often need to be scaled down before stored in storage such as HDD (Hard Disk Drive) or various rewritable DVD (Digital Versatile Disk). In addition, recent progresses in human-interfaces of players or displays have provoked the need for scaling down and up of motion pictures in many applications. Since, those applications are directly related to consumers, the method which realizes the scaling must be simple and relatively low cost to implement. These applications also require real time processing at video (or audio) rates, therefore the algorithm must be as fast as possible.
Some of the major methods used for sampling rate conversion are simple down-sampling, 0-th holding, interpolation, and oversampling. However, simple down sampling (removing of some portion of the input signals) can only be used for scaling down and 0-th holding (replicating parts of the input signals) only for scaling up. When the scaling ratio is not integral, 0-th holding and decimation must be sequentially applied. In both of these cases, the resulting signal must be filtered with a high quality low-pass filter to remove aliasing. The resultant quality is, to a large extent, dependent on the quality of the low pass filters.
Linear interpolation is an alternate method with a relatively high quality output, and fairly simple implementation. Although linear interpolation is much simpler than other interpolation methods, the conventional algorithms still lack either computational speed or flexibility.
In the linear interpolation method, the output pixel is produced using the intensities of the nearest two input pixels.
The conventional algorithm for linear interpolation consists of the following steps:
A division is required in step 2 and 3, resulting in an implementation that tends to be large and slow.
Another way to accomplish sample rate conversion with arbitrary ratios is the interpolation/decimation method shown in
In the high speed interpolation/decimation model, illustrated in
The zero-padded values are fed into a digital FIR low-pass filter to smooth or integrate the sequence, and limit the bandwidth of the filter output. The interpolated output signal shown by 203 has been quantized to a much finer time scale than the original sequence. The interpolated sequence is then passed to a zero-order hold functional block (physically implemented as a register) resulting in the output shown by 204 and then asynchronously resampled at the output sample frequency, resulting in the converted output shown by 205. This resampling can be thought of as a decimation operation since only a very few samples out of the great many interpolated samples are retained. The output values represent the “nearest” values, in a temporal sense, produced by the inter-polation operation. There is always some error in the output sample amplitude. However, this error can be made arbitrarily small by using a very large interpolation ratio. While this approach gives excellent results, the computational requirements tend to limit it to audio applications with today's technology.
The invention described herein describes a novel and much simpler algorithm for digital sample rate conversion with arbitrary ratios. The conversion is performed using addition and subtraction only, with a single division that may be precomputed. This enables a simple implementation that is capable of real time sample rate conversion at video rates.
These and other aspects of this invention are illustrated in the drawings, in which:
Assume the spacing between two adjacent input signals 301 and 302 is m, and the spacing between two adjacent output signals 303 and 304 is n as shown in
Let a counter for input pixels be i, and count it up by m for each input pixel. If i is larger than or equal to n, there is one output pixel between the input pixel and the previous pixel. (Here, the two nearest input signals are determined.) Then, subtract m from i. If i is still larger than or equal to n, there is another output pixel. Continue this until i becomes smaller than n. Then move to the next input pixel.
The distance between the output pixel and the nearest two input pixels can be expressed by simple numerical array. Since the first output pixel is at the same location as the first input pixel, fp(0) (the distance between the 0th output pixel and the previous input pixel) is O. Also, fn(0), (the distance between the 0th output pixel and the current input pixel) is n. Since the spacing between two adjacent output pixels is n, fp(1) is larger than fp(0) by n. Since fp(x) is smaller than m:
Using this method, factors used in interpolation can be easily calculated. Also, in this calculation, no division is used except (n mod m) which only need be calculated once at the beginning. This value can be precalculated or may be calculated when needed without time constrain or penalty. Implementation of this algorithm is straightforward, as shown in the following flow charts.
If the test in block 402 fails, thus indicating that there will be more than one output pixel for each input pixel, control passes to block 403 where the data input is disabled.
Control from blocks 405 and 403 pass to decision block 406, where the value of counter is tested whether it is equal to or greater than n. If it is, data output is enabled in block 407, and then next_counter is decremented by n in block 409. If the test in block 406 fails, data output is disabled in block 408.
Control from blocks 408 and 409 then pass to block 410 where counter is set to next_counter. Proceeding to block 411, the system enters a wait state. Upon receiving the next clock pulse, control is returned to decision block 402.
fn=(m−(kn mod m))
and
fp=(kn mod m)
where fn is calculated by first initializing fn to 0 in block 501. Block 502 then tests whether data output is enabled as previously shown in block 407 of
fn=fn+n′−m
If the test fails, the equation:
fn=fn+n′
is used. After either case, control passes to block 507 where the system waits until the next clock cycle is detected.
Next, interpolation coefficient fp is calculated, as shown on
fp=fp−n′
If the test fails, the equation:
fp=fp−n+m
is used. In either case, control passes to block 607 where the system waits for the next clock cycle.
Once the interpolation coefficients are calculated, the actual data flow is shown in
Block 701 initializes the system by setting both dz and d equal to the first data value.
Decision block 703 tests whether data input is enabled by the data flow controller shown in block 404 of
Block 706 tests if data output is enabled by the data flow controller in block 407 of
Output=d*fp+dz*fn
Control then passes to block 709, where the system waits for the next clock cycle. If the test fails, control from block 706 continues directly to block 709.
Although the invention has been described in detail with reference to its preferred embodiments, it is to be understood that this description is by way of example only and is not to be construed in a limiting sense.
Moreover, numerous changes in the details of the embodiments of the invention will be apparent to persons of ordinary skill in the art having reference to this description. It will also be apparent that the algorithms and methods described may be equally well implemented in hardware or software. It is contemplated that such changes and additional embodiments are within the spirit and true scope of the invention as claimed below.
This application claims priority under 35 U.S.C. §119(e)(1) of provisional application No. 60/498,555, filed Aug. 28, 2003.
Number | Name | Date | Kind |
---|---|---|---|
5475628 | Adams et al. | Dec 1995 | A |
5497152 | Wilson et al. | Mar 1996 | A |
5729483 | Brown | Mar 1998 | A |
5768311 | Betts et al. | Jun 1998 | A |
6141671 | Adams et al. | Oct 2000 | A |
6208671 | Paulos et al. | Mar 2001 | B1 |
7262716 | Yu et al. | Aug 2007 | B2 |
Number | Date | Country | |
---|---|---|---|
20050078020 A1 | Apr 2005 | US |
Number | Date | Country | |
---|---|---|---|
60498555 | Aug 2003 | US |