This application claims the benefit, under 35 U.S.C. §365 of International Application PCT/US2005/002335, filed Jan. 24, 2007 published in accordance with PCT Article 21(2) on Aug. 3, 3006 in English.
This invention relates to a technique for detecting errors in a video bit stream.
The implementation of present day video compression standards, such as MPEG, H.263 and JVT (also known as ISO/ITU H.264) result in the division of video frames into macroblocks and slices. A slice generally contains a given number of consecutive macroblocks. However, the use of the Flexible Macroblock Ordering option for video compressed in accordance with the JVT compression standard can result in the macroblocks in a slice are not necessary being adjacent to one another in display order. Variable Length Coding (VLC) techniques can enhance compression efficiency, at the expense of error resiliency. An error in a bit stream can propagate undetected and corrupt all subsequent macroblocks in a slice.
To enable error recovery, the header of each slice contains a resynchronization marker. Such markers constitute the earliest point at which proper decoding can take place following a bit error. Slice headers begin on a byte boundary so as to be byte aligned whereas individual macroblocks are not byte aligned. When using Context-based Adaptive Binary Arithmetic Coding in the connection with the coding of a video bit stream in accordance with the JVT compression standard, multiple symbols can be coded as a single bit, so that macroblock boundaries are not even bit aligned. In this regard, precise identification of when bit errors occur the decoder becomes difficult to variable length code words. Most bit patterns in a corrupted bit stream represent valid code words, even if they don't represent correct values. In some cases, identification of invalid code words can occur if the decoded value reaches beyond a legitimate range, but this will not always become apparent until incorrect decoding of a variable, unknowable amount. Thus all macroblocks in a slice following an error remain suspect for corruption.
Video error detection can occur by decoding a video frame and then checking the frame for spatial discontinuities at macroblock and slice boundaries, in the pixel domain. Various proposals now exist for video error detection and concealment. Follow detection of errors in particular macroblocks, error concealment can occur on the corrupt macroblocks. Concealed macroblocks will generally appear perceptually better than unconcealed corrupt macroblocks, but perceptually worse than correctly decoded macroblocks. However, present day concealment algorithms consume significant resources, making then computationally expensive. Moreover, such algorithms are also prone to false positives (concealing correctly decoded macroblocks) and false negatives (failure to conceal corrupted macroblocks.)
One approach to overcoming the problem of corrupted macroblocks recommends transmitting an entire slice in a single IP packet. Should a slice become lost during transmission, all macroblocks in the slice become marked as missing and thus require concealment. As previously mentioned, all data in a packet with a known transmission error will be marked as corrupted, discarded and concealed. Thus, an entire slice will now become lost in the case of a detected transmission error. Assigning relatively few macroblocks per slice will improve error resilience, but at a significant cost of coding and transport efficiency.
Thus, a need exists for a technique for detecting bit stream errors enjoys a relatively low computational complexity.
Briefly, in accordance with a preferred embodiment, there is provided a method for detecting errors in a coded video stream. The method commences upon the receipt of a coded video stream along with a first parity code calculated over a group of N macroblocks in the stream prior to transmission. The stream undergoes decoding to form a group of decoded macroblocks. A second parity calculation occurs over the group of decoded macroblocks, and a comparison is made between the first and second parity codes to determine whether the group of decoded macroblocks contains an error.
Briefly, in accordance with another preferred embodiment of the present principles, there is provided a technique for detecting errors during transmission of a video bit stream comprised of compressed macroblocks. A Cyclic Redundancy Check (CRC) parity code is calculated over a group of the compressed macroblocks forming the video stream prior to transmission. Upon receipt of the group of compressed macroblocks, a second CRC parity calculation occurs. The CRC codes undergo a comparison. If the CRC codes match, then the no error has occurred, and normal decoding or other processing of the group of blocks can take place. In the event of an error, the macroblocks in the group undergo error concealment. When the groups of macroblocks undergo compression in accordance with the JVT video compression standard, the CRC parity codes can be transmitted as SEI messages.
The number of bits or bytes over which a CRC is applied must be known upon reception, in order to perform error detection. CRCs can be applied to fixed length blocks of data, where a pre-determined fixed length is known at the transmitter and the receiver. Alternatively, CRCs can be applied to variable length packets, which contain a length field in their headers, so that the receivers can determine the length over which to apply the CRC checking computation. If the CRC check upon data reception reveal a corrupt packet, all of the data in the packet is generally discarded, even if a portion of valid data exists because the CRC does not provide information about where in the packet the corruption occurred.
To overcome the aforementioned disadvantage, transmitter 10 and receiver 20 of
A selector block 14 within the transmitter 10 selects N compressed macroblocks within each slice encoded by the encoder 12. To the extent that less than N macroblocks remain in the slice, the selector 14 will bundle such remaining macroblocks in a group. A CRC parity calculator 18 calculates a CRC parity code over the bits or bytes representing the N compressed macroblocks. The parity codes that are calculated are transmitted along with the compressed video data, either in a separate packet from the slice's compressed video data, or a the beginning of the same packet at the slice compressed video data. The value of N may either be pre-determined and known in advance at both transmitter 10 and the receiver 20 of
A CRC parity comparator 26 within the receiver 20 of
Advantageously, the decoder 22 of
As compared to prior art methods where a single CRC code applies to an entire slice, the error detection technique of the present principles affords the advantage that those macroblocks in groups preceding a transmission error will have no errors and do not need be concealment or testing for discontinuities. Non-corrupt macroblocks that improperly undergo concealment generally have lower visual quality than correctly decoded macroblocks. Because of inter-frame coding, lower visual quality can persist for longer periods of time.
The error detection technique of the present principles compares favorably to methods that apply spatial continuity measures to the decoded video to detect the location of transmission errors. It has the advantage of greatly reduced computational complexity, because the calculation of a CRC parity code and the comparison to a received value incurs much less complexity than measuring spatial discontinuities. The technique of the present principles has the advantage of avoiding false positives, where uncorrupted macroblocks undergo inappropriate concealment, and avoiding false negatives, where corrupted macroblocks do not undergo concealment but should do so.
When applied to the MPEG-2 video compression standard, the CRC parity codes, and optionally the value of N, can undergo transmission in user data fields. When applied to the JVT video compression standard, the CRC parity codes, and optionally the value of N, can undergo transmission in one or more SEI messages or in user data.
The selection of N, the number of macroblocks over which calculation of the CRC parity code occurs, will depend on the expected error rate of the network 15. Choosing a large value of N (fewer groups of macroblocks, more macroblocks per group) reduces the overhead for transmitting the CRC parity codes, but the precision of error detection will suffer. Choosing a small value of N (more groups of macroblocks, fewer macroblocks per group) will increase the overhead of transmitting the CRC parity codes, but improves the precision of error location. For small values of N, assuming uniformly distributed errors and uniform compression for all macroblocks in the slice, the ratio of macroblocks that can be identified as being not corrupted will approach ½.
Although the error detection technique described above makes use of a CRC code, any type of error detection or correction polynomial can be used. CRC codes are generally calculated and applied over bytes. Further, the present technique does not require calculation and transmission of CRC parity codes for all macroblocks in a slice. In some instances, sending parity codes for earlier macroblocks in the slice than for the later macroblocks can prove advantageous, as once detection of an error occurs in a compressed slice, the remainder of the macroblocks in the slice cannot undergo correction. This remains true even if they belong to a group of macroblocks that was received without any transmission errors.
The foregoing describes a technique for detecting errors in a coded video stream.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2005/002335 | 1/24/2005 | WO | 00 | 7/17/2007 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2006/080910 | 8/3/2006 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4126764 | Downey et al. | Nov 1978 | A |
4394762 | Nabeshima | Jul 1983 | A |
5778191 | Levine et al. | Jul 1998 | A |
5872577 | Perrin | Feb 1999 | A |
6029265 | Itol et al. | Feb 2000 | A |
6229854 | Kikuchi et al. | May 2001 | B1 |
6332003 | Matsuura et al. | Dec 2001 | B1 |
6445742 | Yoo et al. | Sep 2002 | B1 |
6609225 | Ng | Aug 2003 | B1 |
6611561 | Hannuksela et al. | Aug 2003 | B1 |
7020203 | Horowitz et al. | Mar 2006 | B1 |
7230986 | Wise et al. | Jun 2007 | B2 |
7607070 | Clark et al. | Oct 2009 | B2 |
20010000543 | Kajala et al. | Apr 2001 | A1 |
20020146074 | Ariel et al. | Oct 2002 | A1 |
20020157058 | Ariel et al. | Oct 2002 | A1 |
20030108011 | Lee et al. | Jun 2003 | A1 |
20040190609 | Watanabe | Sep 2004 | A1 |
20050123207 | Marpe et al. | Jun 2005 | A1 |
20050174768 | Conner | Aug 2005 | A1 |
20050175093 | Haskell et al. | Aug 2005 | A1 |
20050286024 | Gupta | Dec 2005 | A1 |
20060092337 | Howard | May 2006 | A1 |
20060139580 | Conner | Jun 2006 | A1 |
20060221305 | Magarill | Oct 2006 | A1 |
Number | Date | Country |
---|---|---|
7038854 | Feb 1995 | JP |
10116204 | May 1998 | JP |
11164296 | Jun 1999 | JP |
2002268731 | Jun 2001 | JP |
2002374535 | Dec 2002 | JP |
200318595 | Jan 2003 | JP |
2003209837 | Jul 2003 | JP |
2005121960 | May 2005 | JP |
2005242364 | Sep 2005 | JP |
2005300712 | Oct 2005 | JP |
2007524976 | Aug 2007 | JP |
2008537788 | Sep 2008 | JP |
Entry |
---|
Song, Jie, and KJ Ray Liu. “A data embedded video coding scheme for error-prone channels.” Multimedia, IEEE Transactions on 3.4 (2001): 415-423. |
Wu, Feng, et al. “SMART: an efficient, scalable, and robust streaming video system.” EURASIP Journal on Applied Signal Processing 2004: 192-206. |
Chung-How, J.T.H. et al.: “Loss resilient H.263 + video over the Internet”, Signal Processing Image Communication, Elsevier Science Publishers, Amsterdam, NL., vol. 16, No. 9, Jun. 2001, pp. 891-908, XP004245274. |
Papadakis, V. et al.: “Syntax based error concealment” Circuits and Systems, 1998, ISCAS 98. Proceedings of the 1998 IEEE International Symposium in Monterey, CA, USA, May 31-Jun. 3, 1998, IEEE, US, vol. 4, pp. 265-268, XP010289439. |
International Search Report, dated Jul. 8, 2005. |
Khalifa et al., “Cyclic Redundancy Encoder for Error Detection in Communication Channels”, 2004 IEEE, 2004 RF and Microwave Conference, Subang, Selangor, Malaysia, Oct. 5-6, 2004, pp. 224-226. |
Number | Date | Country | |
---|---|---|---|
20080298470 A1 | Dec 2008 | US |