1. Field of the Invention
The present invention relates to a method, system and program for fractionally shifting data subject to a previous transformation.
2. Description of the Related Art
Digital images may use one or more bits to describe the color intensity at each pixel. The term “pixel” or “pel” as used herein refers to one or more intensity (luminance) and color (chrominance) inputs or bit values at a data point that represents data to be rendered (i.e., printed, displayed, etc.), where the data to be rendered may include, but is not limited to, images, text, composite images, graphs, collages, scientific data, video, etc. The pixels that represent an image in uncompressed format are arranged in a bit map data structure.
To conserve storage space and improve transmission speed, the chrominance component of image data is often reduced in the image because the eye is not that sensitive to color changes. However, luminance information may not be removed at the same rate as chrominance data because the eye is more sensitive to changes in the luminance intensity than the chrominance intensity. For example, to eliminate chrominance data, the digital image processor algorithm may express the chrominance for a group of neighboring pixels as a single chrominance value, which may comprise the average chrominance of surrounding pixels. This process is referred to as down sampling the chrominance.
In digital image processing, a shift operation is commonly used to shift a portion of the image by translating a group of pixels in one or more rows or columns by one or more whole pixels in the horizontal and/or vertical directions. With prior art chrominance downsampling methods, an image's chrominance data may be stored in an initial image file on a particular grid while a second image file representing the same image has the chrominance data stored on a different grid, which is not aligned with the initial grid but is rather a translation of the first grid by fractions of the original grid units.
The fractional shift operation is a technique used in digital image processing operations such as motion detection, arbitrary rotation, scaling and translation. Fractional shifting is used to convert between different image sampling formats and improve the appearance of color images that have experienced chrominance reduction as part of a reduction or downsampling operation. When reproducing an image that has experienced chrominance downsampling, the image processor must reconstruct the chrominance values for all the down sampled pixels, e.g., the black dots shown in
One implementation of fractional pel shifting is conversion between MPEG-1 chrominance data sampled with a 4:2:0 scheme as shown in
Since a fractional shift on two axes can be achieved by two separate one-dimensional fractional shifts, each on one axis, we consider only one-dimensional fractional shifts. Furthermore, a one-dimensional fractional shift of an entire two-dimensional grid of data can be achieved by performing a one-dimensional fractional shift on each row or column of data in that grid. Thus, we consider only the case where a fractional shift is desired on a row (column) of data. For the purposes of exposition, we assume that this row (column) of data samples can be grouped into m blocks, each block consisting of eight data samples. That is, the row (column) is divided into m 1 by 8 (or 8 by 1) vectors with the ith vector being referred to as Gi. Thus, the vector Gi includes the following elements (Gi,1, Gi,2 . . . Gi,8).
The fractional shift matrix operation utilizes up to three matrices A, B, C, whose values are shown in
Hk=A·Gk+B·Gk+1 for k=1, 2, . . . , m−1 (1)
Hm=C·Gm (2)
Additional implementations of a fractional pel shift different than those described above may also be utilized. One benefit of the above prior art fractional shift transform process is that a color image which has chrominance samples on a grid different than a specific implementation in a decoding program can have the chrominance samples first shifted to the appropriate grid and correctly decoded so as to avoid visually obvious shifts in the chrominance data which would otherwise occur.
Notwithstanding, there is a need in the art to provide improved techniques for performing a fractional shift transform process.
Provided is a method, system, and program for performing a fractional shift of transformed data. At least one fractional shift transform matrix is provided in non-volatile storage that is capable of fractionally shifting data by a shift factor. The transformed data is received and the at least one fractional shift transform matrix is applied to the transformed data to generate output transformed data that is fractionally shifted by the shift factor without inverse transforming the transformed data.
In further implementations, the transform is the Forward Discrete Cosine Transform (FDCT).
In further implementations, the input data stream includes data that has been downsampled and the first transform process performs upsampling on the downsampled data. Still further, the input data stream may comprise an image and the down sampling eliminated data.
The described implementations provide a technique for fractionally shifting transformed data by a shift factor without having to inverse transform the data from the transform domain before applying the fractional shift and then forward transform the fractionally shifted data.
Referring now to the drawings in which like reference numbers represent corresponding parts throughout:
In the following description, reference is made to the accompanying drawings which form a part hereof and which illustrate several embodiments of the present invention. It is understood that other embodiments may be utilized and structural and operational changes may be made without departing from the scope of the present invention.
The above described process to first decode the downsampled pixel data, then perform the fractional pel shift, and then encode the pixel data including the shifted chrominance values is computationally expensive. In fact, the steps in the process of
The described implementations improve the performance of the JPEG encoding and decoding operations on data subject to fractional shifting by completely eliminating the need to perform FDCT and quantization operations, as well as the IDCT and dequantization operations.
The variable Cj is equal to:
The fractional shifting matrices à 58a, {tilde over (B)} 58b, and {tilde over (C)} 58c are calculated as follows, according to equations (6), (7), and (8) below:
Ã=D·A·DT (6)
{tilde over (B)}=D·B·DT (7)
{tilde over (C)}=D·C·DT (8)
In the description herein a letter X representing a matrix with a tilde (“˜”), i.e., {tilde over (X)}, signifies a vector or matrix of DCT coefficients for the vector X. The D matrix represents the forward DCT operator and the matrix DT is the transpose of D. The matrices A, B, C used in equations (6), (7), and (8) are the matrices shown in
In certain implementations, the transform domain fractional shift unit 56 performs a fractional shift of quantized DCT coefficients directly in the DCT transform domain without having to transform the data back to the real domain by performing a dequantization and IDCT operation prior to fractional shifting and then a FDCT and quantization of the fractional shifted data back to the DCT transform domain. The transform domain fractional shift unit 56 includes logic to shift the already quantized DCT coefficients with the following equations (9) and (10) below:
{tilde over (H)}k=÷{tilde over (G)}k+{tilde over (B)}·{tilde over (G)}k+1 for k=1 . . . (m−1) (9)
{tilde over (H)}k={tilde over (C)}·{tilde over (G)}k, where k=m. (10)
Thus, the transform domain fractional shift unit 56 performs a fractional shift operation directly on the quantized DCT data {tilde over (G)} from the original compressed image, i.e., the data in the DCT transform domain, by using the fractional shift matrices à 58a, {tilde over (B)} 58b, and {tilde over (C)} 58c stored in the non-volatile memory 60 and each block {tilde over (G)}k of the row (column) {tilde over (G)} comprising the actual quantized, DCT data. Thus, the image processing unit 56 can perform the fractional shift using simple matrix multiplication and without having to return to the real domain and then back again to the DCT transform domain by performing both full decoding and full encoding of the data as shown in
The equations (9) and (10) that are used by the image processing unit 56 are derived as follows. As discussed above, the fractional shift operation involves equations (1) and (2) above using the fractional shift matrices A, B, C shown in
D·Hk=D·A·Gk+D·B·Gk+1 for k=1 . . . (m−1) (11)
D·Hm=D·C·Gm, where k=m. (12)
The matrix D of the one-dimensional FDCT operators is by definition an orthogonal matrix, such that the principle of equation (13) hold true.
D·DT=DT·D=I8 (13),
where D is an 8×8 matrix and Im is the identity matrix. Multiplying the identity matrix by any matrix returns that matrix. Because of this property, the matrix DT·D can be distributed into equations (11) and (12) as shown in parenthesis in equations (14) and (15) below:
D·Hk=D·A·(DT·D)·Gk+D·B·(DTD)·Gk+1, for k=1 . . . (m−1) (14)
D·Hk=D·C·(DTD)·Gk, where k=m. (15)
Equations (16) and (17) below comprise equations(14) and (15) with the parenthesis redistributed to show that equations (14) and (15) can be used to derive equations (9) and (10) used by the transformation unit 56.
D·Hk=(D·A·DT)·(D·Gk)+(D·B·DT)·(D·Gk+1), for k=1 . . . (m−1) (16)
D·Hk=(D·C·DT)·(D·Gk), where k=m. (17)
With respect to equation (16), using equations (6) and (7) above, (D·A·DT) can be replaced with the transform matrix à and (D·B·DT) can be replaced with the transform matrix {tilde over (B)}. Further, in equation (16), D·Gk can be replaced with {tilde over (G)}k, which are the DCT coefficients of the original data and D·Hk can be replaced with {tilde over (H)}k, the DCT coefficients of the fractional shifted data. The result of these equivalent substitutions into equation (16) is equation (9) above used by the image processing unit 56.
Similarly, with respect to equation (17), (D·C·DT) can be replaced with the transform matrix {tilde over (C)} and D·Gm can be replaced with {tilde over (G)}m, which is the DCT matrix of the last block. The result of these equivalent substitutions into equation (17) is equation (10) above used by the image processing unit 56.
Equation (9) and (10) describe one algorithmic implementation of transform domain fractional shift processing which assumes that the last sample in the row or column being fractionally shifted remains fixed as the last sample after the shift (collocation). In further implementations, collocation takes place on the first sample in each row or column which would yield equations similar to those in equations (i) and (10). In yet further implementations, the shift is accomplished by replicating the last sample and applying equation (9) for k=1, . . . , m according to equation (18) below:
{tilde over (G)}m+1=DGm+1DT, where Gm+1=(Gm,8, 0, 0, . . . , 0) (18)
Additionally, one could assume replication of the first sample in the first block to be fractionally shifted and proceed similarly to drive a different implementation.
Furthermore, the above implementations are not restricted to the four specific fractional shift algorithms described above. In fact, the method described above can be applied to any fractional shift algorithm which can be expressed as a linear system of equations. Additionally, the process of developing transform domain fractional shift algorithms can be implemented to develop algorithms for fractionally shifting any transformed data, {tilde over (S)}, which is obtained through a matrix operation of equation (19) below:
{tilde over (S)}=MSMT (19)
Where S is the real data set which is to be fractionally shifted and M is an orthogonal matrix, e.g., MTM=MMT=I.
Furthermore, it is possible to include de/re quantization factors as follows. Assuming that q1=(q11, . . . q18) is the quantization vector corresponding to incoming row (column) vectors G1, . . . , Gm, and that q2=(q21, . . . , q28) is the quantization vector corresponding to outgoing rows (columns) H1, . . . , Hm, then dequantization (component-wise multiplication of Gi by qi) and requantization (prerounding) (component-wise division of Hi by q2) can be simultaneously accomplished by component-wise multiplication of matrices Ã, {tilde over (B)}, {tilde over (C)} and the quantization matrix 80 shown in
The image processor 50 further includes an entropy encoder 62 (
The image processor 50 may be implemented in digital hardware, such as an ASIC. In such case, the non-volatile storage 60 maintaining the fractional shifting matrices 58a, b, c may be implemented in the hardware circuitry of the image processor 40 or within a separate read-only memory device (e.g., ROM, programmable ROM (PROM), electronically erasable PROM (EEPROM), etc.) that is accessible to the image processor 50. Alternatively, the image processor 50 logic may be implemented as software code that is executed by a processor. In such software implementations, the fractional shifting matrices 58a, b, c can be stored in a non-volatile memory device or within one or more files on a magnetic medium, such as a hard disk drive. The non-volatile storage 60 may store the three fractional shifting matrices 58a, b, c for different fractional shift factors γ.
In certain implementations, the image processor 50 may be implemented within a printer to perform fractional shifting on data that has been subject to downsampling as described above. In certain printer implementations, the data stream received by the printer may include certain commands instructing the image processor 50 to perform fractional shifting on the pixel data at a specified shift factor γ. In cases where there are fractional shift matrices for different shift factors γ, the image processor 50 would select the matrices implementing the fractional shift transform for the requested shift factor γ. In further implementations, the image processor 50 can be utilized in other types of devices that render images, such as an image viewer program, video card rendering output for a display monitor, etc. Still further, the output factional pel shifted JPEG image may be outputted to a storage device for later access.
The described image processor may be implemented as a method, apparatus or article of manufacture using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof. The term “article of manufacture” as used herein refers to code or logic implemented in hardware logic (e.g., an integrated circuit chip, Field Programmable Gate Array (FPGA), Application Specific Integrated Circuit (ASIC), etc.) or a computer readable medium (e.g., magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), optical storage (CD-ROMs, optical disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, firmware, programmable logic, etc.). Code in the computer readable medium is accessed and executed by a processor. The code in which preferred embodiments are implemented may further be accessible through a transmission media or from a file server over a network. In such cases, the article of manufacture in which the code is implemented may comprise a transmission media, such as a network transmission line, wireless transmission media, signals propagating through space, radio waves, infrared signals, etc. Of course, those skilled in the art will recognize that many modifications may be made to this configuration without departing from the scope of the present invention, and that the article of manufacture may comprise any information bearing medium known in the art.
Preferred embodiments were described with respect to transforming image data comprised of pixels. The image data may comprise still images or motion pictures. In alternative embodiments, the data stream subject to the transformation of the preferred implementations may comprise other types of data than image data.
The described implementations concerned a fractional shift transformation. However, the described transformation process may be applied to other transformations of data, whenever the transform is of the form given in equation (18) described above. In such case, a forward transform operation would be performed on the one or more transformation matrices to allow the transformation to be performed directly on the transform coefficients of the input data.
In implementations where the original data is downsampled, any down sampling technique known in the art may be used.
The described implementations concerned fractional pel shifting of JPEG data. However, the methods of the described implementations may apply to other compression schemes that utilize DCT operations as part of the compression process.
Still further, the described implementations are applicable to compression schemes that utilize other transformations, as long as the transformation is of the form of equation (18), discussed above. Transform equations can be derived by using the properties of the transform to generate the matrices which will perform the desired fractional shift operations in the transform domain.
Certain implementations were described with respect to performing the transform domain fractional shift on an 8×8 block of image data. However, the transform domain fractional shift described herein may be performed with respect to different m×n block sizes of image data through utilization of m×n matrices.
The described implementations concerned manipulation of transformed image data. However, the described implementations may be used to fractionally shift any type of transformed data, not just multimedia data.
The foregoing description of the preferred embodiments of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. It is intended that the scope of the invention be limited not by this detailed description, but rather by the claims appended hereto. The above specification, examples and data provide a complete description of the manufacture and use of the composition of the invention. Since many embodiments of the invention can be made without departing from the spirit and scope of the invention, the invention resides in the claims hereinafter appended.
Number | Name | Date | Kind |
---|---|---|---|
4912668 | Aubie et al. | Mar 1990 | A |
5274449 | Keesen | Dec 1993 | A |
5442399 | Asamura et al. | Aug 1995 | A |
5572236 | Feig et al. | Nov 1996 | A |
5774206 | Wasserman et al. | Jun 1998 | A |
6452969 | Yim | Sep 2002 | B1 |
Number | Date | Country |
---|---|---|
10003539 | Jan 1998 | JP |
Number | Date | Country | |
---|---|---|---|
20030152282 A1 | Aug 2003 | US |