Claims
- 1. In a coder for coding frames of input video information, each frame comprising an ordered plurality, i=1, 2, . . . , n, of slices, each slice comprising an ordered sequence j=1, 2, . . . , m of macroblocks, each macroblock comprising a plurality of input signal values, the coder having a plurality of modes for coding macroblocks, a method for selecting a coding mode, M.sub.i,j for the jth macroblock in the ith slice, the method comprising
- determining values for a cost function, J.sub.i,j, a quantizing parameter QP and a Lagrangian multiplier .lambda., where J.sub.i,j (.)=D.sub.i,j (.)+.lambda.R.sub.i,j (.), D.sub.i,j is the distortion for the jth macroblock in the ith strip, and R.sub.i,j is the rate for coding the jth macroblock in the ith strip, and
- determining a set of coding modes for each macroblock that satisfy ##EQU3##
- 2. The method of claim 1 wherein said rates and distortions are determined using a normalized rate-distortion model.
- 3. The method of claim 2 wherein said macroblocks are characterized by the variance of input values in respective macroblocks.
- 4. The method of claim 2 wherein said macroblocks are characterized by the mean of input values in respective macroblocks.
- 5. The method of claim 2 wherein said rates and distortions determined using said normalized rate-distortion model are obtained from values stored in a memory.
- 6. The method of claim 2 wherein said minimums are determined using a trellis search.
- 7. The method of claim 6 wherein said minimums for QP and .lambda. are determined by an exhaustive trellis search using values for said cost function determined using said normalized rate-distortion model.
- 8. The method of claim 6 wherein said minimums for QP and .lambda. are approximated using a suboptimal search method.
- 9. The method of claim 8 wherein said suboptimal search method comprises determining the relationships where a, b, .alpha., and .beta. are model parameters which are computed in the coding mode optimization process.
- 10. The method of claim 9 wherein c and .gamma. are set to 1.0 and 0.7, respectively.
- 11. The method of claim 1 where said minimums are achieved approximately, using a suboptimal method, the suboptimal method comprising
- A. determining the target number of bits, R.sub.T for encoding the n.sup.th frame X(.sub.n), which is a function of the buffer fullness.
- B. determining a suboptimal QP by performing the method comprising
- (i) setting .lambda.=.lambda..sub.overall and QP=QP.sub.0 =QP.sub.(n-), where .lambda..sub.overall is the average of .lambda.s over the previous 10 frames and QP.sub.(n-) is QP for the previous frame;
- (ii) performing coding mode optimization with (QP.sub.0, .lambda..sub.overall) to obtain the corresponding overall distortion D.sub.0 and the required number of bits R.sub.0 ;
- (iii) If R.sub.0 <R.sub.T, set QP.sub.1 =QP.sub.0 -.delta., otherwise set QP.sub.1 =QP.sub.0 +.delta., where .delta. is a non-negative small integer;
- (iv) determine D.sub.1 and R.sub.1 with QP=QP.sub.1 ;
- (v) model the R-QP curve by calculating a and b in ##EQU4## from (QP.sub.0, R.sub.0) and (QP.sub.1, R.sub.1);
- (vi) using said model, calculate the suboptimal QP* corresponding to R.sub.T ;
- C. model the R-D curve in ##EQU5## using (D.sub.2, R.sub.2) obtained with (QP*,.lambda..sub.overall) to obtain a suboptimal .lambda.*;
- D. performing rate-distortion coding mode optimization with (QP*, .lambda.*).
- 12. The method of claim 11 wherein c and .gamma. are 1.0 and 0.7, respectively.
- 13. In a coder for coding frames of input video information, each frame comprising an ordered plurality, i=1, 2, . . . , n, of slices, each slice comprising an ordered sequence j=1, 2, . . . , m of macroblocks, each macroblock comprising a plurality of input signal values, the coder having a plurality of modes for coding macroblocks, a system for selecting a coding mode, M.sub.i,j for the jth macroblock in the ith slice, the system comprising
- means for determining values for a cost function, J.sub.i,j, a quantizing parameter QP and a Lagrangian multiplier .lambda., where J.sub.i,j (.)=D.sub.i,j (.)+.lambda.R.sub.i,j (.), D.sub.i,j is the distortion for the jth macroblock in the ith strip, and R.sub.i,j is the rate for coding the jth macroblock in the ith strip, and
- means for determining a set of coding modes for each macroblock that satisfy ##EQU6##
- 14. The system of claim 13 wherein said means for determining a set of coding modes comprises means for determining rates and distortions using a normalized rate-distortion model.
- 15. The system of claim 14 wherein said macroblocks are characterized by the variance of input values in respective macroblocks.
- 16. The system of claim 14 wherein said macroblocks are characterized by the mean of input values in respective macroblocks.
- 17. The system of claim 14 wherein said means for determining a set of coding modes comprises means for obtaining said rates and distortions from values stored in a memory.
- 18. The system of claim 14 wherein said means for determining a set of coding modes comprises means for performing a trellis search.
- 19. The system of claim 18 wherein said means for performing a trellis search comprises means for performing an exhaustive trellis search using values for said cost function determined using said normalized rate-distortion model.
- 20. The system of claim 18 wherein said means for performing a trellis search comprises means for determining minimum values for QP and .lambda. approximately using a suboptimal search method.
- 21. The system of claim 20 wherein said means for determining minimum values for QP and .lambda. approximately comprises means for determining the relationships where a, b, .alpha., and .beta. are model parameters which are computed by said means for determining rates and distortions using a normalized rate-distortion model.
- 22. The system of claim 13 wherein c and .gamma. are set to 1.0 and 0.7, respectively.
- 23. The system of claim 13 where said minimums are achieved approximately by a suboptimal search system, the suboptimal search system comprising
- A. means for determining the target number of bits, R.sub.T for encoding the n.sup.th frame X(.sub.n), which is a function of the buffer fullness.
- B. means for determining a suboptimal QP by performing the method comprising
- (i) means for setting .lambda.=.lambda..sub.overall and QP=QP.sub.0 =QP.sub.(n-), where .lambda..sub.overall is the average of .lambda.s over the previous 10 frames and QP.sub.(n-) is QP for the previous frame;
- (ii) means for performing coding mode optimization with (QP.sub.0, .lambda..sub.overall) to obtain the corresponding overall distortion D.sub.0 and the required number of bits R.sub.0 ;
- (iii) means for setting QP.sub.1 =QP.sub.0 -.delta. if R.sub.0 <R.sub.T, and otherwise setting QP.sub.1 =QP.sub.0 +.delta., where .delta. is a non-negative small integer;
- (iv) means for determining D.sub.1 and R.sub.1 with QP=QP.sub.1 ;
- (v) modeling the R-QP curve by calculating a and b in ##EQU7## from (QP.sub.0, R.sub.0) and (QP.sub.1, R.sub.1);
- (vi) means for calculating the suboptimal QP* corresponding to R.sub.T using said model,
- C. means for modeling the R-D curve in ##EQU8## using (D.sub.2, R.sub.2) obtained with (QP*,.lambda..sub.overall), to obtain a suboptimal .lambda.*;
- D. means for performing rate-distortion coding mode optimization with (QP*, .lambda.*).
- 24. The system of claim 23 wherein c and .gamma. are 1.0 and 0.7, respectively.
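The trellis search over coding modes recited in claims 6 and 18 can be illustrated with a minimal sketch. It is not the patented implementation: the function name `select_modes`, the callable `cost`, and its signature `cost(j, prev_mode, mode) -> (distortion, rate)` are assumptions made for illustration, and the sketch further assumes that the cost of a macroblock depends only on its own mode and on the mode of the preceding macroblock in the same slice. For a fixed QP and .lambda., it uses a Viterbi-style dynamic program to find the mode sequence that minimizes the sum of J = D + .lambda.R over one slice.

```python
from typing import Callable, Hashable, List, Optional, Sequence, Tuple

# Hypothetical cost callback: returns (distortion, rate) for macroblock j
# coded in `mode`, given the mode of the previous macroblock (None for j == 0).
CostFn = Callable[[int, Optional[Hashable], Hashable], Tuple[float, float]]


def select_modes(
    num_macroblocks: int,
    modes: Sequence[Hashable],
    cost: CostFn,
    lam: float,
) -> Tuple[List[Hashable], float]:
    """Return (mode sequence, total cost) minimizing sum of D + lam * R in one slice."""
    if num_macroblocks == 0:
        return [], 0.0

    # best[k]: minimum accumulated cost of any path ending in modes[k].
    best = []
    for mode in modes:
        d, r = cost(0, None, mode)
        best.append(d + lam * r)

    # back[j - 1][k]: index of the best predecessor of modes[k] at macroblock j.
    back: List[List[int]] = []
    for j in range(1, num_macroblocks):
        new_best = []
        pointers = []
        for mode in modes:
            candidates = []
            for p_idx, prev_mode in enumerate(modes):
                d, r = cost(j, prev_mode, mode)
                candidates.append((best[p_idx] + d + lam * r, p_idx))
            total, p_idx = min(candidates)
            new_best.append(total)
            pointers.append(p_idx)
        best = new_best
        back.append(pointers)

    # Trace the cheapest path back from the last macroblock to the first.
    k = min(range(len(modes)), key=lambda i: best[i])
    total = best[k]
    path = [k]
    for pointers in reversed(back):
        k = pointers[k]
        path.append(k)
    path.reverse()
    return [modes[i] for i in path], total
```

With m macroblocks and K modes the search evaluates on the order of m times K squared candidate transitions, rather than the K to the power m sequences of a brute-force enumeration. Under the assumptions above, an outer loop over candidate (QP, .lambda.) pairs would call such a routine once per pair.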
RELATED APPLICATION
The present application is a continuation-in-part of the pending patent application by A. E. Jacquin, N. S. Jayant, and K. H. Yang, entitled Normalized Model for Quantization of Digital Signals, Ser. No. 09/178,088 filed Oct. 23, 1998, which is assigned to the assignee of the present application. The cited parent application is hereby incorporated by reference as if set forth in its entirety herein.
US Referenced Citations (6)
Non-Patent Literature Citations (3)
Entry |
Lei et al. "Rate Control in DCT Video Coding for Low-Delay Communications" IEEE Transactions on Circuits and Systems for Video Tech, vol. 9, No. 1 pp. 172-185, Feb. 1999. |
Hsu et al. "A Lagrangian optimization approach to rate control for delay-constrained video transmission over burst-error channels" Proc. of the 1998 IEEE International Conference on Acoustic, Speech and Signal Processing, vol. 5, pp 2989-2992, Feb. 1999. |
Wiegand, T., M. Lightstone, D. Mukherjee, T.G. Campbell and S.K. Mitra, "Rate-Distortion Optimized Mode Selection for Very Low Bit Rate Video Coding and the Emerging H.263 Standard," IEEE Trans. Circuits and Systems for Video Technology, vol. 6, No. 2, Apr., 1996. |
Continuation in Parts (1)
| | Number | Date | Country |
| Parent | 178088 | Oct 1998 | |