Information
-
Patent Application
-
20010055336
-
Publication Number
20010055336
-
Date Filed
July 30, 199826 years ago
-
Date Published
December 27, 200123 years ago
-
CPC
-
US Classifications
-
International Classifications
Abstract
A compressed video decoder/encoder (reencoder) system for varying the compression ratio of a compressed video program. The composite reencoder system implements tightly coupled elements for decoding and encoding compressed video data implementing techniques of header forwarding and utilizing an architecture in which a shared motion compensator supports both decoding and encoding operations simultaneously. The reencoder system may be introduced in a statistical multiplexer for generating a compressed video data stream multiplex suitable for use in cable distribution and other video distribution systems.
Description
BACKGROUND OF THE INVENTION
[0001] 1. Field of the Invention
[0002] The present invention generally relates to the encoding and distribution of compressed video programs. More particularly, the present invention relates to varying the compression ratio of digitally encoded video.
[0003] 2. Background
[0004] The present invention relates to the encoding and distribution of compressed video programs. It is particularly suitable for use with a video compression technique known as variable bit-rate (VBR) encoding. VBR encoding can be used to overcome the well-known problem of most video compression encoders where the image quality tends to vary as a function of image complexity. Typically, a video program will contain a variety of scenes. Many of these scenes are lacking in motion or detail and are therefore easily compressed, while many other scenes contain complex details which are generally more difficult to compress, particularly when there is complex or random motion. Therefore, unless the available bandwidth is very high, the perceived quality of the decompressed and reconstructed images will tend to vary from one scene to the next. This problem becomes more serious as the available bandwidth is reduced until, eventually, the video quality becomes unacceptable, often because of just a few problem scenes.
[0005] VBR encoding overcomes this problem by allocating more bits to those scenes which are difficult to compress and fewer bits to those scenes which are more easily compressed. In this way the decompressed and reconstructed images can be made to appear consistently uniform in quality, and therefore superior to the reconstructed images derived from a constant bit-rate (CBR) encoder adjusted for the same average rate of compression. As a result, it is possible to compress a video program more efficiently when using the VBR encoding technique. This not only increases the number and variety of programs or program streams that can be delivered over a communications channel of a given capacity, but also reduces the storage capacity requirements at the head end or other site where the program library is maintained.
[0006] A disadvantage of the VBR encoding technique is that it makes it difficult to manipulate or edit a compressed data stream. In particular, it is difficult to efficiently utilize a fixed-capacity communication channel since the variable bit-rate stream may at times exceed the capacity of the channel, while at other times, it may utilize only a fraction of the available channel capacity.
[0007] One known technique that is used to alleviate this problem is to buffer the compressed bit stream at the transmission end of the communication channel in order to convert the variable bit-rate stream to a constant bit-rate stream. With this technique it is also necessary to buffer the signal received at the receiving end of the channel in order to recover the variable rate stream that is necessary for proper timing of the reconstructed video images. Unfortunately, the required amount of buffering for VBR-encoded streams would be prohibitively expensive and would introduce long delays into the distribution system. Moreover, existing video compression standards such as the Motion Picture Experts Group (MPEG) standards, a set of International Organization for Standardization/International Electrotechnical Commission (ISO/IEC) standards, specify limits on the amount of buffering needed for conforming decoders. Therefore, it is important that the received bit streams be decodable without exceeding these limits. MPEG is documented in ISO/IEC publications 11172 and 13818, respectively known as the MPEG-1 and MPEG-2 standards. As used hereafter, “MPEG” will be understood to refer to either MPEG-1 or MPEG-2.
[0008] One method of sending a VBR stream over a fixed-capacity channel is to convert the VBR stream to a CBR stream, where the CBR rate is at least as high as the peak variable rate. This can result in a very low channel utilization depending on the variance of the data rate. A technique that can be used to reduce the inefficiency of transmitting VBR encoded (and non-VBR encoded) programs over a fixed-capacity channel is to combine a plurality of program streams into a single multiplex. It is better to multiplex several VBR streams together and then convert the multiplex to a constant data rate than to individually convert each single stream to a constant data rate. Although each additional program stream will increase the overall data rate of the multiplex, the variance of the average per stream data rate of this multiplex will tend to decrease in approximate proportion to the number of program streams, assuming approximate statistical independence of the programs. Therefore, if the channel data rate is significantly greater than the average rate of a single program stream, then a large number of program streams can be combined and hence, the channel utilization can be significantly improved. This technique is known in the art as statistical multiplexing.
[0009] One method of assuring that buffer overflow does not occur when using buffering is to add feedback between the encoder buffer and the encoder. When the buffer approaches a full state, a buffer fullness signal from the buffer informs the encoder to increase the compression ratio so that the buffer does not overflow. When the buffer has more room, the feedback signal from the buffer to the encoder enables the encoder to reduce the compression ratio in order to maximize image quality. Such a feedback mechanism is particularly effective when combined with statistical multiplexing. Individual buffers may be provided at the output of each encoder before the inputs to the multiplexer, or a single buffer may be provided at the output of the multiplexer. In either case, a feedback signal, indicative of the level of fullness of the single buffer or the overall combined level of fullness of the individual buffers, is supplied as an input to each of the encoders. By adjusting the compression ratio as a function of the total data rate produced by the plurality of encoders, it becomes possible to reduce the size and frequency of the compression ratio adjustments.
[0010] The combination of encoder buffering with statistical multiplexing of multiple encoded program streams over fixed-capacity channels can be effective, but cannot always be applied. There are some situations where no feedback is possible between the output buffer(s) and the individual encoders. One such situation occurs when multiplexing already encoded data streams. Another occurs when the encoders are located in an area physically remote from the multiplexer. Both of these situations are referred to herein as remote encoding, indicating that encoding and multiplexing are carried out at separate times or places so that no feedback is possible from the multiplexer to the encoders of the program streams to be multiplexed.
[0011] The problem of matching the data rate of encoded programs to the maximum data rate of a broadcast channel is not limited to statistical multiplexing of multiple VBR-encoded programs. For example, a single program may be initially encoded at a first constant data rate (CBR) for distribution over a fixed-bandwidth channel. Then, if the compressed program is to be received at a particular location and subsequently redistributed over a second fixed-bandwidth channel, where the bandwidth of the second channel is less than that of the first, the data rate of the compressed program will need to be reduced. This is necessary to avoid exceeding the second constant data rate corresponding to the bandwidth of the second channel.
SUMMARY OF THE INVENTION
[0012] From the foregoing, it can be appreciated that it is desirable to provide a mechanism for matching the data rate of encoded video programs as closely as possible to a maximum data rate of a subsequent broadcast channel. The present invention introduces methods and apparatus wherein in one aspect of the present invention the data rate corresponding to one or more CBR or VBR data streams is reduced in order to avoid exceeding the maximum data rate of a subsequent fixed-bandwidth channel. In a second aspect of the present invention, the data rate corresponding to one or more CBR or VBR data streams is adjusted to deliver constant picture quality where the level of quality can be specified to satisfy the requirements of a particular application.
[0013] These and other objects of the present invention are provided in a compressed video reencoder system in which a compressed video program at a first data rate is translated to a second data rate in order to accommodate a channel bandwidth at the second data rate. A first, independent-component mechanism for varying the compression ratio of the digitally encoded video program utilizes independent decoder and encoder components. The received video program at the first data rate is processed by an independent compressed video decoder wherein the compressed video program is decoded. The decoded program is then encoded by a subsequent independent video encoder with the compression ratio adjusted in accordance with the desired output data rate.
[0014] The independent decoder/encoder reencoder system utilizing independent decoder and encoder, while effective, is an inelegant, expensive approach to varying the compression ratio of a compressed video program. In accordance with additional aspects of the present invention, exemplary reencoder systems, described for use with the MPEG compression standards, introduce successive improvements to the de-coupled component reencoder system. In a first improved aspect, it is recognized that header information and motion vectors in the recompressed video program may be identical to the header information and motion vectors received at the first compression rate. In such a case the header information and motion vectors are said to be static information, information that does not change during the reencoding process. Thus, a header forwarding path is introduced, coupling the decoder and encoder portions of the reencoder system during compression. This reduces the data processing required by the encoder portion of the system and eliminates header calculation circuitry from the encoder. Logic is introduced for recognizing the header information at the decoder and forwarding it to the encoder and with additional logic for reinserting the header information into the outgoing compressed bit stream.
[0015] A significant aspect of the present invention is a decoder/encoder combination suitable for use with MPEG compression techniques which is capable of utilizing only a single shared motion compensator. A significant portion of the cost associated with the independent decoder/encoder method is the memory components required by separate motion compensators for each stage. The improved reencoder system utilizes a shared motion compensator in a tightly coupled arrangement operating in a manner to adequately support both the decoding function and the reencoding function of the reencoder system while eliminating the expense and hardware required for a second motion compensator.
[0016] In accordance with another aspect of the present invention a reencoder system utilizing a shared motion compensator is taught wherein predictor subtraction is carried out in the DCT domain. In the MPEG implementation of this aspect of the present invention, an inverse discrete cosine transform (IDCT) unit is eliminated from the composite system, thus again reducing the cost and complexity of the reencoder system.
[0017] In accordance with another aspect of the present invention the composite reencoder system utilizing the shared motion compensator may be implemented in a manner to ensure a constant picture quality by varying the compression ratio in accordance with certain properties of a stream being recompressed. This is done by specifying a fixed quality desired and adjusting a parameter to the quantizer used for recompressing the encoded video data.
[0018] Another aspect of the present invention is realized by a statistical multiplexer which utilizes several of the composite reencoders of the present invention. The statistical multiplexer may receive numerous compressed data streams at a first data rate and desire that they be combined into a data stream multiplex. The statistical multiplexer system may vary the compression ratios of the outputs of the reencoders by measuring a buffer depth associated with the generated data stream multiplex.
[0019] In accordance with a final aspect of the present invention, a system architecture is introduced for a satellite uplink and cable distribution system in which a cable system head-end utilizes the statistical multiplexing techniques which incorporate the composite reencoder system of the present invention. Such a system would allow selected components of one or more statistical multiplexers of compressed video received from the satellite downlink to be recombined into different statistical multiplex combinations for distribution through the cable system.
BRIEF DESCRIPTION OF THE DRAWINGS
[0020] The objects, features and advantages of the present invention will be apparent from the following detailed description, in which:
[0021]
FIG. 1 illustrates a block diagram of a compressed video reencoder system utilizing independent decoder and encoder elements.
[0022]
FIG. 2 illustrates a compressed video reencoder system in which header and subheader information forwarding between a decoder component and an encoder component is implemented.
[0023]
FIG. 3 illustrates a flow diagram of the logic responsible for detecting and forwarding header and subheader information in accordance with the reencoder system of FIG. 2.
[0024]
FIG. 4 illustrates a flow diagram of the logic for inserting forwarded header and subheader information into a compressed video output data stream.
[0025]
FIG. 5 illustrates a block diagram of a portion of a compressed video motion compensator circuit demonstrating the interactions of the frame memories incorporated therein.
[0026]
FIG. 6 illustrates a compressed video program reencoder system implementing both header/subheader forwarding and a shared motion compensator architecture in accordance with one embodiment of the present invention.
[0027]
FIG. 7 illustrates an alternative reencoder system implementing header/subheader forwarding and a shared motion compensator architecture.
[0028]
FIG. 8 illustrates a block diagram of a quantizer for use in the reencoder system of the present invention for controlling the compression ratio of a compressed video output data stream.
[0029]
FIG. 9 illustrates a statistical multiplexer system which implements reencoder systems in accordance with the present invention.
[0030]
FIG. 10 illustrates a system architecture for a compressed video distribution system, including a satellite uplink system and a cable distribution system head end which may incorporate the reencoder system of the present invention.
[0031] FIGS. 11A-11D are used to illustrate a proof that the single shared motion compensator of the present invention can replace multiple motion compensators from other configurations.
DETAILED DESCRIPTION OF THE INVENTION
[0032] A method and apparatus are disclosed for modifying the compression ratio of digitally encoded video programs in a compressed video distribution system. Although the present invention is described predominantly in terms of compression techniques for video information encoded in accordance with the MPEG standard, the concepts and methods are broad enough to encompass video compression systems using other techniques. The present invention is applicable to both constant bit rate (CBR) and variable bit rate (VBR) encoded data streams. Throughout this detailed description, numerous specific details are set forth such as quantization levels and frame types, etc. in order to provide a thorough understanding of the present invention. To one skilled in the art, however, it will be understood that the present invention may be practiced without such specific details. In other instances, well-known control structures and encoder/decoder circuit components have not been shown in detail in order not to obscure the present invention.
[0033] In many instances, components implemented within the present invention are described at an architectural, functional level. Many of the elements are well known structures, particularly those designated as relating to MPEG compression techniques. Additionally, for logic to be included within the system of the present invention, flow diagrams are used in such a manner that those of ordinary skill in the art will be able to implement the particular methods without undue experimentation. It should also be understood that the techniques of the present invention may be implemented using numerous technologies. For example, the entire system could be implemented in software running on a computer system, or implemented in hardware via either a specially designed application specific integrated circuit (ASIC) or programmable logic devices. It will be understood by those skilled in the art that the present invention is not limited to any one particular implementation technique and those of ordinary skill in the art, once the functionality to be carried out by such components is described, will be able to implement the invention with various technologies without undue experimentation.
[0034] Referring now to FIG. 1 there is shown a reencoder system that implements an independent decoder/encoder method for varying the data rate of an encoded video program stream from a first rate to a second rate in accordance with the first and second compression ratios, respectively. In this case, a compressed video program is reencoded by coupling a decoder 110 with an encoder 150. That is, the composite reencoder system first decodes the previously encoded video program and then encodes it once again, using a different compression ratio. In most cases, the different compression ratio will be higher, but in an alternative embodiment, a lower compression ratio may be used. In the system of FIG. 1, both the decoder 110 and encoder 150 are compatible with the standards of the MPEG specifications. The decoder 110 first uses an Inverse-Variable-Length Coder (IVLC) 112 to translate variable-length codewords to quantized DCT coefficients. The quantized DCT coefficients are then processed by an Inverse Quantizer (IQ1) 114 and an Inverse Discrete Cosine Transformer (IDCT) 116 in order to recover the prediction errors corresponding to each pixel. The final decoding step is to reconstruct an approximation of the original pixels by summing the prediction errors at adder 117 and the pixel predictions provided by a Motion Compensator unit (MC) 118.
[0035] Once the data stream has been decoded and the pixels have been reconstructed, the image sequence can be compressed once again using a different compression ratio. The first step in the encoding process is to subtract the pixel predictions at subtractor 151 provided by a second Motion Compensator unit (MC) 152 from the reconstructed pixels to obtain a prediction error signal. This signal is then transformed by a Discrete Cosine Transform unit (DCT) 154 to a sequence of DCT coefficients, these coefficients are then quantized by Quantizer unit (Q2) 156, and finally, variable length codewords are assigned to the quantized DCT coefficients by the Variable Length Coder (VLC) 158. Additional encoder processing units, consisting of an Inverse Quantizer (IQ2) 160, an Inverse Discrete Cosine Transform Unit (IDCT) 162 and the Motion Compensator (MC) 152, are needed to duplicate the decoding process, thereby insuring that the same pixel predictions will be generated by this encoder 150 and all subsequent decoders as noted in the above-referenced ISO/IEC publication 13818-2. It will be understood that this architecture is necessary to maintain synchronization among MPEG encoders and decoders. In the embodiment described with respect to FIG. 1, it should be recognized that the quantizer is a combination of a quantizer and a scaler. That is, the DCT coefficients are first quantized and then the quantized results are normalized by dividing by the size of the quantization step. Similarly, the inverse quantizers reverse the normalization by multiplying the quantized and scaled coefficients by the same quantization step size.
[0036] The compression ratio for the encoder 150 is determined by the quantizer (Q2) 156 and the VLC 158. The efficiency of the VLC 158 is determined by the amplitude and pattern of the quantized DCT coefficients and since the tables specifying the VLC are generally fixed, the VLC cannot be used to vary the compression rate. Instead, the precision of the quantizer is varied, either by increasing the precision to increase the coefficient amplitude and therefore the data rate, or by reducing the precision to reduce the coefficient amplitude and hence, the data rate. In this way, the data rate at the output of the encoder 150 can be made less than the data rate at the input of the decoder 110 by adjusting the quantization precision of Q2 156.
[0037] The disadvantage of the reencoder system of FIG. 1 is its complexity and cost. Although decoders are becoming highly integrated and relatively inexpensive, encoders continue to remain complex and costly, not only because of smaller production quantities, but because of the computational complexity of the motion estimation process, and the need for complex algorithms to determine the best method of encoding to be used at various stages of the implementation. These steps are not required in the corresponding decoders.
[0038] One method of reducing the cost of the reencoder system of FIG. 1 is to utilize some of the same information that was derived during the original encoding process. Information specifying the motion vectors and the decisions made during the original encoding process can be extracted from the compressed data stream. For example, the same motion vectors may be utilized during the encoding step of FIG. 1, thereby eliminating the need for another motion estimation step. The same motion vectors as included in the original data stream may be utilized by the motion compensator (MC) 152 shown in the encoder 150 of FIG. 1. In addition, certain encoding decisions, such as the choice of intra-coding, forward prediction, backward prediction, bi-directional prediction, no motion compensation, and field or frame coding, may all be omitted during the reencoder's encoding stage, and instead performed in accordance with the modes selected during the original encoding process. Information specifying the motion vectors and the decisions made during the original encoding process can be extracted from the compressed data stream.
[0039] The encoding stage of the reencoder can also be simplified by omitting the step of encoding the high-level formatting data. For instance, the MPEG standards specify a hierarchy of header and header extension layers, each consisting of a unique startcode followed by a series of fixed length and/or variable-length codewords. These headers and header extensions precede each frame of picture data and provide information that is needed for proper decoding of the picture data. Generally, these headers and header extensions would not need be modified during the reencoding process. In the case of MPEG, one exception is a codeword in each picture header that is sometimes used by decoders to maintain proper synchronization of their corresponding channel buffers. Since the reencoding process can change the number of bits used to represent each frame, this codeword must be adjusted to insure that the output bit stream remains fully compliant with the MPEG standard. However, this codeword is not needed and remains constant when using VBR encoding, and in this case no adjustment of the header and header extensions are necessary. More information about MPEG headers can be found at Section 6.2 of ISO/IEC Spec. 13818-2.
[0040] Referring now to FIG. 2, there is illustrated a reencoder system incorporating the header (static information) forwarding aspect of the present invention. The reencoder system shown in FIG. 2 includes a header forwarding path or bus 230 for forwarding a copy of the header and header extension layers to the output side of the encoder portion 250 from the input side of the decoder portion 210. For hardware implementations, either a serial or parallel data path may be used in combination with a delay device, such as a First-In-First-Out (FIFO) memory. The FIFO can be used to compensate for the processing delay between the decoder 210 and the encoder 250.
[0041] The inverse variable length coder (IVLC) 212 detects these header layers and routes them onto the forwarding path 230. At the output, the variable length coder (VLC) 258 inserts the pre-encoded header information into the output stream at the appropriate time.
[0042] A flowchart describing the functions to be carried out by the logic to be incorporated into the IVLC 212 is shown in FIG. 3 as the IVLC Header Forwarding Procedure 300. In the MPEG example, the end of the header and extension layers and the beginning of the encoded picture data is signaled by the detection of the first slice header. In accordance with one embodiment of the present invention, all data is initially routed to the header forwarding path 230 until the start code corresponding to the first slice header is detected at step 320. The modified IVLC 212 then inserts a unique ‘end marker’ code into the forwarded stream at step 340 so that this point will be easily detected by the receiving VLC unit 258. This unique end marker may be chosen to be one of the reserved start codes that are not used by any of the MPEG standards. All subsequent data is then processed internally by the decoder portion 210 and not copied onto the header forwarding path 230 until the detection of the next start code that is not part of a slice header. Such a start code will not be detected until all of the picture data has been received.
[0043] A flowchart describing the functions to be carried out by the logic incorporated into the corresponding VLC unit 258 is shown in FIG. 4 as the VLC Header Insertion Procedure 400. The VLC 258 copies all data from the header forward path 230 to the output stream at step 430 until an end marker is detected at decision box 420. The end marker is then discarded and the VLC 258 begins to receive data from the primary stream at step 440. The VLC 258 continues to operate conventionally until the entire picture has been processed. After the picture is completed, the VLC 258 will again begin to accept data from the header forward path 230 at step 410 until another end marker is detected at decision box 420.
[0044] Although the functionality described with respect to FIGS. 3 and 4 is described as logic to be performed by circuitry incorporated into the IVLC 212 and VLC 258, respectively, it will be recognized by those of ordinary skill in the art that alternative embodiments may be implemented. For example, rather than incorporating logic for performing the functionality within the IVLC and VLC units, header detect and forward logic may be incorporated prior to the IVLC at the data receiving point for the decoder section 210. Similarly, header receipt and insertion logic may be incorporated external to the VLC unit for inserting the header information into the output data stream from the encoder section 250 of the composite reencoder system of FIG. 2. In another alternative embodiment, rather than incorporating header receipt and insertion logic, a delay line may be incorporated into the header forwarding path 230 such that the forwarded header information is timed to be received and inserted into the output stream when it is needed for output from the reencoder system.
[0045] A significant portion of the cost of the reencoder system shown in FIG. 2 is due to the memory components associated with each of the two motion compensators 118 and 152. Each motion compensator must contain at least one frame of storage if B-frames are not supported, and two frames of storage if B-frames are supported.
[0046] A simplified block diagram of a motion compensator 500 supporting B-frames is shown in FIG. 5. The sequencing of the control signals, consisting of a Write/Read selector (WR1)for frame memory 1 (505), a Write/Read selector (WR2) for frame 5 memory 2 (510), and multiplexer selector signals (SLCT_A, SLCT_B, and SLCT_C), is demonstrated in Table I below:
1TABLE I
|
|
Motion Compensator Control Signals
FrameFrame
DisplayProcessing
OrderOrderWR1WR2SLCT_ASLCT_BSLCT_C
|
B0I201001
B1B000010
I2B100010
B3P510111
B4B300100
P5B400100
B6P801001
B7B600010
P8B700010
B9P1110111
B10B900100
P11B1000100
B12P1401001
B13B1200010
P14B1300010
B15I1710111
B16B1500100
I17B1600100
B18P2001001
B19B1800010
P20B1900010
|
[0047] Frames which will be needed to predict other frames must be stored in one of the two frame memories. These frames are the I- and P-frames, since B-frames are never used for prediction. Therefore, only the I- and P-frames are transferred from the ‘Write Data’ port to one of the two frame memories. If the first of the two frame memories is selected for storing a particular I- or P-frame, then the second frame memory will be selected for storing the next I- or P-frame, and the selection will continue to toggle for each following I- or P-frame thereafter. Each arriving pixel is written to the location within the selected frame memory that is specified by the Write Address Generator 520. In this case, the addressing sequence is fixed and is synchronized with the sequence that the pixels are received at the Write Data port.
[0048] Motion compensation is performed during the process of reading data from the frame memories. When an I-frame is being processed, no prediction is necessary, and the output from the Read Data port of the motion compensator is discarded. When P-frames are received, forward prediction is performed using the frame memory containing the I- or P-frame that precedes the current frame. This frame memory is addressed by the first of the two Read Address Generators 530 using the motion vectors decoded from the received bit stream. This occurs at the same time that the incoming P-frame is written to the other frame memory under the control of the Write Address Generator 520.
[0049] When a B-frame is received, two frames are needed to generate a bi-directional prediction. In this case, the frame memory containing the preceding I- or P-frame is addressed by the first Read Address Generator 530, and the frame memory containing the following I- or P-frame is addressed by the second Read Address Generator 540. The Write Address Generator 520 is not used for B-frames, since B-frames are not used for prediction and therefore do not need to be stored.
[0050] The most frequent type of predictions used during B-frames are bi-directional. In this case, the forward prediction derived from the frame memory addressed by the first Read Address Generator 530 must be averaged with the backwards prediction derived from the other frame memory addressed by the second Read Address Generator 540. The control signals SLCT_A and SLCT_B will select the forward and backwards predictions, respectively, so that they may be averaged by the output adder 550.
[0051] The reencoder system of FIG. 2 may be constructed using only a single shared motion compensator instead of two independent motion compensators. The reencoder shown in FIG. 6 utilizing only a single motion compensator 630 is functionally compatible as a substitute for the reencoder system of FIG. 2. A proof supporting this novel and nonobvious replaceability is provided at the end of this detailed description.
[0052] In FIG. 6, a single, shared motion compensator 630 outputs the difference between the individual predictions provided by the decoder and encoder units respectively, of the reencoder system shown in FIG. 2. In this way, any errors due to the difference between the original quantization and the quantization performed by quantizer Q2 656 will be compensated for in all future frames which are either directly or indirectly derived from information in the current frame.
[0053] The shared motion compensator reencoder system shown in FIG. 6 can be further simplified, but not without sacrificing an important advantage for some implementations in that the resulting reencoder could no longer be realized as an adaptation of an existing decoder architecture. The most complex portion of this structure is similar to a basic decoder. This means that a decoder may be adapted to this application by adding a DCT, an IDCT, a quantizer, an inverse quantizer, and a VLC, either in the same IC device or in one or more external devices. Ideally, the decoder would be modified to output a header forwarding stream directly to the VLC unit. The decoder could also be modified to accept its input to the motion compensator from an external subtractor, but this is not essential, since the subtractor could write its output directly into the motion compensator's frame memory. Recall that only the I- or P-frames need to be stored at the motion compensator, and during such frames only one of the two frame memories is used for prediction, hence leaving the second frame memory available to accept the new frame. Even in highly integrated implementations, the frame memories are generally assigned to separate IC packages. Note that as an alternative to performing the subtraction prior to the motion compensator 630, the subtraction may be performed prior to the IDCT 162. This case is illustrated by the dashed line in FIG. 6.
[0054] An alternative shared motion compensator reencoder system 700 is illustrated in FIG. 7. The reencoder system 700 has been further modified by performing the prediction subtraction in the DCT domain instead of in the pixel domain. This structure is derived from the reencoder implementation of FIG. 6. As in the previous implementation, the pixel errors are stored in the frame memories of the motion compensator, although in this case, only one IDCT 762 is required instead of two.
[0055] The shared motion compensator reencoder system 700 shown in FIG. 7, illustrates that the quantizer Q2 756 can accept an external quality-level parameter to control the accuracy of the video signal after reconstruction. In one implementation, the process of re-quantizing the reconstructed DCT coefficients is performed based on a constant picture quality criterion, and in this case, the quality-level parameter remains fixed or changes only slightly during the entire reencoding process. However, in some applications it is more important that the output data rate remain constant. In such cases, the quality-level parameter can be adjusted as needed to insure that this output data rate is maintained. As will be described below with respect to FIG. 8, the quality-level parameter affects the resulting perceived image quality, which is subjective in nature, but may not have a one-to-one correspondence with the resulting accuracy of the reproduction.
[0056] One method for implementing a quantizer that delivers a specified level of picture quality is shown in FIG. 8. Typically, an MPEG quantizer accepts a parameter, referred to herein as the quantizer-scale code. The quantizer 830 maps the quantizer-scale code to a scaling factor which is used as an input to a multiplier acting upon the incoming stream of DCT coefficients. The quantizer-scale code could be derived directly from the input quality-level parameter, but this would not account for the viewer's variation in perceptual sensitivity to different types of scenes. For example, during complex moving scenes, a viewer is less likely to notice quantization errors than in simple scenes with very little movement. Therefore, it is advantageous to permit relatively large quantization errors in the complex moving regions of a picture and relatively small quantization errors in the simpler, stationary regions. The Macroblock Analyzer 810 shown in FIG. 8 identifies such complex and simple regions and outputs a signal that is indicative of the region's error masking qualities. In a typical MPEG implementation, the regions would be defined by non-overlapping blocks of 16×16 pixels, also referred to as macroblocks. There are numerous systems known in the art for analyzing the complexity and other characteristics of an incoming data stream and determining an acceptable quantizer-scale code to provide to a standard MPEG quantizer. Accordingly, the details for such an implementation are not discussed in detail herein.
[0057] Frequently, the ideal quantizer-scale code that would normally be derived from the quality-level parameter received as an input to the quantizer, and the scene complexity indicator received from the Macroblock Analyzer 810, would be similar, but not identical to the quantizer-scale code used during the most recent encoding process. In such cases, it is advantageous to use the same quantizer-scale code as used during the last encoding process since this will minimize the build-up of quantization errors which normally occurs when multiple encoding processes are applied. Likewise, using a quantizer-scale code that is an integer multiple of the one used during the original encoding process will also minimize the build-up of quantization errors. The previous quantizer-scale code can be easily extracted from the input data stream by Data Stream Parser 840. The Look-Up Table (LUT) 820 then assigns a new quantizer-scale code based on the quality level parameter received as input to the quantizer, the scene complexity indicator received from the Macroblock Analyzer 810, and the last quantizer-scale code received from Data Stream Parser 840. The quantization is then performed in a conventional manner by the Quantizer 830 using the new quantizer-scale code received from LUT 820.
[0058] In correcting the data rate by adjusting the quantizer-scale code, it is advantageous in one embodiment to begin by editing just the B-frames. Because B-frames are never used for predicting other frames, any errors introduced by changing the quantization for B-frames will not be propagated to other frames. If editing just B-frames does not sufficiently correct the data rate, then P-frames should be edited next, resorting to editing I-frames only if that still proves insufficient.
[0059] It is possible to implement a statistical multiplexing system using multiple reencoders such as those described with respect to FIGS. 2, 6 and 7. This is particularly advantageous when no feedback is possible between the multiplexer and the encoders, as is the case when some or all of the programs are pre-encoded or encoded in a remote location that is inaccessible to the multiplexer. An example of such a statistical multiplexing system 900 is shown in FIG. 9. Each of the xN reencoders 901-9xN regenerates a corresponding bit stream using a quality-level parameter derived by a device which monitors the level of fullness of the reencoder output buffers. In this example, the device is a Look-Up Table (LUT) 910 which monitors the time duration corresponding to the data in one of the xN output buffers. It is assumed in this case that the multiplexing is based on an algorithm which selects the next packet from one of the N output buffers based on the order in which the packets are to be decoded. When such an algorithm is used, each buffer will tend to contain an amount of data corresponding to the same time interval, and therefore only one of the xN buffers will need to be monitored. An example of such a statistical multiplexer system is described in a co-pending application entitled “Method and Apparatus for Multiplexing Video Programs for Improved Channel Utilization” Ser. No. 08/560,219, filed Nov. 21, 1995 and assigned to the assignee of the present invention.
[0060]
FIG. 10 illustrates one embodiment of a complete system architecture for a compressed video distribution system in which the reencoder-based statistical multiplexer of FIG. 9 may be implemented. In this system, a number of video program sources (only one subset of one subset are shown) are each encoded with a desirable video compression standard. The video sources Al to AN 1001-100n may be videocassette or laser disk players playing different programs, or the same program with different start times. Each program is respectively encoded by video encoders 1011, 1012 to 101n. The outputs of the video encoder systems are combined by a first multiplexer 1020 to generate a first multiplex stream 1025 of encoded video programs. There can be numerous bundles of encoded video programs, each being multiplexed respectively by compressed video multiplexers 1020, 1030 to 10M0 to generate compressed video multiplexes 1025, 1035 to 10M5, respectively.
[0061] The compressed video program multiplexes 1025-10M5 may then be forwarded to a satellite transmitter system 1040 and uplinked to a satellite 1050 for redistribution. In one embodiment of the distribution system, each multiplex 1025, 1035 to 10M5, would be uplinked to a difference transponder of the satellite 1050. The satellite 1050 downlinks the compressed video multiplexes to any of a plurality of distribution head-end systems such as the head-end system 1100. The head-end systems such as the head-end system 1100 may be widely distributed within the downlink range of the satellite 1050.
[0062] The exemplary head-end system 1100 includes a satellite receiver 1110 for receiving the downlink signals from one or more transponders of the satellite 1050. The downlink signals are then provided to the various tuner/demodulators 1120, 1130 to 11M0. These tuner/demodulators recover the multiplexes of compressed video programs from the modulated signals carried by the one or more transponders of satellite 1050 and provide the multiplexes to selector/demultiplexers 1200, 1210 to 12Q0. The selector/demultiplexers 1200, 1210 to 12Q0 are responsible for demultiplexing any number or combination of selected video programs from the various multiplexes received from tuner/demodulators 1120, 1130 to 11M0. The demultiplexed, previously encoded and compressed video programs 1 to Ni are provided to statistical multiplexers 900, 910 to 9Q0, such as the one described above with respect to FIG. 9. The statistical multiplexers combine the desired video programs into statistically multiplexed data streams for distribution through various distribution systems 1310, 1320 to 13Q0. Each of these distribution systems may carry different combinations of statistically multiplexed video program streams selected from the originally encoded video sources. These may then be selectively distributed to subscribers 1401 to 140I (I may, of course, be greater than 9) who each will have a video program decoder suitable for decoding the compressed video information by the standard used for encoding the data, such as the MPEG decoders described above.
[0063] In the head-end system 1100, there is incorporated the statistical multiplexing system 900, such as the one described with respect to FIG. 9. As noted above, the statistical multiplexer system 900 may advantageously incorporate numerous of the shared motion compensator reencoder systems of the present invention to reduce the complexity and cost of such as statistical multiplexing system. Thus, the shared motion compensator systems described with respect to FIGS. 2, 6 and 7 are advantageously incorporated into a complete compressed video distribution system.
[0064] Implementing a Single, Shared Motion Compensator
[0065] The following, with reference to FIGS. 11A-11D, demonstrates the effectiveness of the single, shared motion compensator architecture of the present invention.
[0066] A portion of FIG. 2 is reproduced in FIG. 11A with labels a through e attached to key points. It is useful to express the signals occurring at points b, c, and d in mathematical terms:
b=a+MC
(b) (1)
c=b−d
(2)
d=MC
(d+e) (3)
[0067] where MC() is the motion compensation operator, implemented by delaying the input signal by one frame interval and spatially rearranging the pixels of that frame according to a given set of motion vectors. In this case, the motion compensators in (1) and (3) use the same set of motion vectors and therefore both MC operations are identical.
[0068] Substituting (1) and (3) into (2) yields:
c=a+MC
(b)−MC(d+e). (4)
[0069] Since the motion vectors are the same, the motion compensator is a linear function and therefore:
c=a+MC
(b−d−e). (5)
[0070] If (2) is substituted into (5) then:
c=a+MC
(c−e). (6)
[0071] Therefore this result can be realized using the structure shown in FIG. 11B and it may be concluded that this structure is functionally equivalent to the one depicted in FIG. 11A when the signal is observed at point C or at any other point between point C and point E. On this basis, it may also be concluded that the structures shown in FIG. 2 and FIG. 6 are also functionally equivalent when compared at their respective outputs.
[0072] The same result could also be realized using the structure shown in FIG. 11C, which was derived from FIG. 11B. In this case, the subtraction is performed in the DCT domain instead of in the pixel domain.
[0073] To further simplify FIG. 11C the adder is moved from the output to the input of the IDCT. This can be done simply by inserting an additional DCT at the output of the motion compensator. Then, however, the series IDCT and DCT at the output of the adder negate each other and therefore both blocks can be eliminated, as shown in FIG. 11D. This resulting structure is now identical to the one shown in FIG. 7.
[0074] There has thus been described an advantageously-implemented method and apparatus for a compressed video reencoder system which uses only a single shared motion compensator. Such a shared motion compensator system may be advantageously incorporated into a complete system architecture for a compressed video distribution system. Although the present invention has been described with respect to certain exemplary and implemented embodiments, it should be understood that those of ordinary skill in the art will readily appreciate various alternatives to the present invention. Accordingly, the spirit and scope of the present invention should be measured by the terms of the claims which follow.
Claims
- 1. An apparatus for modifying the compression ratio of a digitally encoded video program comprising:
a compressed-video decoder for receiving and decoding a compressed-video program previously encoded at a first compression ratio, said compressed-video decoder for generating a decoded video stream; a video encoder in communication with said compressed video decoder for receiving said decoded video stream from the decoder and reencoding said video program at a second compression ratio and providing a compressed-video output stream; and a forwarding data path coupled between said decoder and encoder for forwarding previously encoded static information from said compressed-video program to be inserted into said compressed-video output stream.
- 2. The apparatus of claim 1 further comprising static-information detection logic for detecting said static-information in said compressed-video program and forwarding circuitry in communication with said static-information detection logic for copying said static information onto said forwarding data path.
- 3. The apparatus of claim 2 wherein said static-information detection logic further includes logic for detecting the end of said static information and circuitry for inserting an end marker onto said forwarding data path.
- 4. The apparatus of claim 2 wherein said decoder comprises an MPEG decoder including an inverse variable length coder (IVLC) and inverse quantizer (IQ), an inverse discrete cosine transform unit (IDCT) and a Motion Compensator (MC).
- 5. The apparatus of claim 4 wherein said static-information detection logic is incorporated into said IVLC.
- 6. The apparatus of claim 1 further comprising static-information insertion logic for detecting forwarded static information on said forwarding data path and insertion circuitry for inserting said static information into said compressed-video output stream.
- 7. The apparatus of claim 6 wherein said static-information insertion logic further includes end marker detection circuitry for detecting an end marker on said forwarding data path and for ending the insertion of static information into said compressed-video output stream.
- 8. The apparatus of claim 6 wherein said encoder comprises an MPEG encoder including a discrete cosine transform unit (DCT), a quantizer and a Variable Length Coder (VLC).
- 9. The apparatus of claim 8 wherein said static-information insertion logic is incorporated into said VLC for inserting MPEG header information into said compressed-video output stream.
- 10. A compressed-video reencoder for modifying the compression ratio of a compressed-video program from a first compression ratio to a second compression ratio comprising a data forwarding path for forwarding a copy of data that remains unchanged in a reencoding process.
- 11. The compressed-video reencoder of claim 10 wherein said compressed video program comprises MPEG data and said unchanged data comprises static information including header and header extension data.
- 12. The compressed-video reencoder of claim 11 wherein said static information further comprises motion vectors associated with said compressed-video program.
- 13. A method for reencoding a compressed-video program from a first compression ratio to a compressed output stream at a second compression ratio comprising the steps of:
detecting static information in the compressed-video program; forwarding a copy of the static information; reencoding the remainder of the compressed video program data in accordance with said second compression ratio; and combining the forwarded static information with the reencoded remainder data to yield said compressed output stream.
- 14. The method according to claim 13 wherein said compressed-video program is encoded in accordance with the MPEG standard, said detecting step comprising the step of detecting static information including MPEG header information in said compressed-video program.
- 15. The method according to claim 14 wherein said static information further comprises motion vectors associated with said compression-video program.
- 16. The method according to claim 15 wherein said forwarding step comprises the step of copying said header information onto a header forwarding data path.
- 17. The method according to claim 16 wherein said combining step comprises the step of inserting said forwarded header information into said compressed output stream.
- 18. A compressed-video reencoder for modifying the compression ratio of a compressed video program from a first compression ratio to a second compression ratio comprising:
decoder circuitry for receiving and decoding said compressed-video program; a shared motion compensator circuit coupled to communicate with said decoder circuitry and for providing motion compensation for decoding said compressed-video program; and encoder circuitry coupled to receive decoded video data from said decoder circuitry and for encoding said video data in accordance with said second compression ratio, said encoder circuitry coupled to communicate with said shared motion compensator circuit, said shared motion compensator circuit providing motion compensation for both said encoder circuitry and for said decoder circuitry.
- 19. The compressed-video reencoder of claim 18 further comprising a header forwarding data path coupled between said decoder circuitry and said encoder circuitry for forwarding static information including previously encoded header information to an output of said encoder circuitry.
- 20. The compressed-video reencoder of claim 19 wherein said static information further comprises motion vectors associated with said compressed-video program.
- 21. The compressed-video reencoder of claim 18 wherein said decoder circuitry includes:
an inverse variable length coder (IVLC) unit coupled to receive said compressed-video program and for translating variable-length codewords contained therein to quantized DCT coefficients; an inverse quantizer coupled to receive and process said quantized DCT coefficients from said IVLC unit; an inverse discrete cosine transform (IDCT) unit coupled to said inverse quantizer for recovering prediction errors corresponding to each pixel information contained in said compressed-video program; and an adder coupled to said IDCT unit and to said shared motion compensator for summing prediction errors and the output of said shared motion compensator.
- 22. The compressed-video reencoder of claim 21 wherein said encoder circuitry comprises:
a discrete cosine transform (DCT) unit coupled to said adder to provide DCT coefficients; a quantizer coupled to said DCT unit to quantize said DCT coefficients; a variable-length coder (VLC) unit coupled to said quantizer to translate quantized DCT coefficients to variable-length code words; a second inverse quantizer coupled to said quantizer to process requantized DCT coefficients; a second IDCT unit; and a subtractor to produce an output that is representative of the difference between a prediction corresponding to said first compression ratio and a prediction corresponding to said second compression ratio.
- 23. The compressed-video reencoder of claim 22 wherein said second IDCT is coupled to said second inverse quantizer and said subtractor is coupled to receive inputs from said second IDCT and from said adder and coupled to provide input to said shared motion compensator.
- 24. The compressed-video reencoder of claim 22 wherein said subtractor is coupled to receive inputs from said second inverse quantizer and from said DCT and coupled to provide input to said second IDCT and wherein said second IDCT is coupled to provide input to said shared motion compensator.
- 25. The compressed-video reencoder of claim 22 wherein said quantizer operates at a quantization level responsive to a quantization parameter (quantizer-scale code).
- 26. The compressed-video reencoder of claim 25 further including circuitry for varying said quantization parameter responsive to an externally-specified quality setting (quality-level parameter).
- 27. The compressed-video reencoder of claim 26 further comprising circuitry for varying said quantization parameter responsive to a visual model which includes a quantification of the complexity of one or more regions of an image.
- 28. The compressed-video reencoder of claim 27 wherein said one or more regions of an image are non-overlapping blocks of pixels, said non-overlapping blocks collectively comprising a video frame.
- 29. The compressed-video reencoder of claim 26 further comprising circuitry for inhibiting the varying of said quantization parameter.
- 30. The compressed-video reencoder of claim 18 wherein said decoder circuitry comprises:
an inverse variable length coder (IVLC) unit coupled to receive said compressed-video program; an inverse quantizer coupled to said IVLC unit; and an adder coupled to said inverse quantizer.
- 31. The compressed-video reencoder of claim 30 wherein said encoder circuitry comprises:
a quantizer coupled to said adder; a variable-length coder coupled to said quantizer; a second inverse quantizer coupled to said quantizer; a subtractor coupled to said second inverse quantizer and said adder; an inverse discrete cosine transform unit coupled between said subtractor and said shared motion compensator; and a discrete cosine transform unit coupled between said shared motion compensator and said adder
- 32. The compressed-video reencoder of claim 31 wherein said quantizer operates at a quantization level responsive to a quantization parameter (quantizer-scale code).
- 33. The compressed-video reencoder of claim 32 further including circuitry for varying said quantization parameter responsive to an externally-specified quality setting (quality-level parameter).
- 34. The compressed-video reencoder of claim 33 further comprising circuitry for varying said quantization parameter responsive to a visual model which includes a quantification of the complexity of one or more regions of an image.
- 35. The compressed-video reencoder of claim 34 wherein said one or more regions of an image are non-overlapping blocks of pixels, said non-overlapping blocks collectively comprising a video frame.
- 36. The compressed-video reencoder of claim 33 further comprising circuitry for inhibiting the varying of said quantization parameter.
- 37. A compressed-video reencoder for modifying the compression ratio of an encoded video program from a first compression ratio to a second compression ratio utilizing a single shared motion compensator for providing motion compensation for both decoding the encoded video program and for reencoding the video program in accordance with the second compression ratio.
- 38. A statistical multiplexer comprising:
a first compressed-video reencoder coupled to receive a first compressed data stream at a first compression ratio and for modifying said first compressed data stream to a first reencoded data stream at a second compression ratio; a second compressed-video reencoder coupled to receive a second compressed data stream at a third compression ratio and for modifying said second compressed data stream to a second reencoded data stream at a fourth compression ratio; and data stream multiplexing logic for selectively combining said reencoded data streams into a data stream multiplex.
- 39. The statistical multiplexer of claim 38 further comprising an additional plurality of compressed-video reencoders for receiving an additional plurality of compressed data streams for generating an additional plurality of reencoded data streams to be provided to said data stream multiplexing logic.
- 40. The statistical multiplexer of claim 38 wherein each of said compressed-video reencoders comprise decoding circuitry, encoding circuitry and a single shared motion compensator for providing motion compensation for both decoding and encoding circuitries.
- 41. The statistical multiplexer of claim 39 wherein each of said compressed-video reencoders comprise decoding circuitry, encoding circuitry, and a data forwarding path for forwarding static information from said decoding circuitry to said encoding circuitry.
- 42. The statistical multiplexer of claim 39 further comprising:
output buffers coupled between said reencoders and said data stream multiplexing logic; and means for measuring the fullness of said output buffers and for providing a quality-level parameter to said reencoders for effecting the compression ratios of said reencoded data streams.
- 43. A compressed-video head-end distribution system for use in a satellite video distribution system comprising:
a satellite receiver for receiving downlink signals from transponders of a satellite; a plurality of tuner/demodulators coupled to receive the downlink signals from the satellite receiver for recovering multiplexes of compressed-video programs from modulated signals carried by one or more transponders of a satellite; a plurality of selector/demultiplexers each coupled to each of said tuner/demodulators for selectively demultiplexing a selected combination of video programs from said multiplexes; and a statistical multiplexer for combining desired video programs into statistically multiplexed data streams for distribution, said statistical multiplexer including a plurality of compressed-video reencoders for modifying a received compressed data stream at a first compression ratio to a reencoded data stream at a second compression ratio.
- 44. The compressed-video head-end distribution system of claim 43 wherein each of said compressed-video reencoders comprise decoding circuitry, encoding circuitry and a single shared motion compensator for providing motion compensation for both decoding and encoding circuitries.
- 45. The compressed-video head-end distribution system of claim 44 wherein each of said compressed-video reencoders comprise decoding circuitry, encoding circuitry, and a data forwarding path for forwarding static information from said decoding circuitry to said encoding circuitry.
- 46. The compressed-video reencoder of claim 25 further including circuitry for setting said quantization parameter to an integer multiple of a previous quantization parameter.
- 47. The compressed-video reencoder of claim 32 further including circuitry for setting said quantization parameter to an integer multiple of a previous quantization parameter.
Divisions (1)
|
Number |
Date |
Country |
Parent |
08631084 |
Apr 1996 |
US |
Child |
09127071 |
Jul 1998 |
US |