The present invention generally relates to data communications systems, and more particularly to the transmission of data with time diversity.
Many transmission systems, such as mobile wireless broadcast systems are subject to a difficult physical channel. In addition to fading and Doppler effects, the signal may be entirely obstructed by buildings, trees, poles, and overpasses, among other things. Such conditions can easily cause signal loss for a period of a second or more at a receiver.
To combat these problems, mobile systems frequently use time diversity techniques, such as: interleaving; Long block codes, such as low density parity codes (LDPC) or Turbo codes; convolutional codes; and Multi-Protocol Encapsulation combined with forward error correction (MPE-FEC). Unfortunately, these systems generally incur a delay that is proportional to the time diversity. A user typically perceives this delay in the form of long channel change times, which is highly objectionable to the user.
A type of time diversity technique often used in the transmission of streams of data, such as video data, is staggercasting. Staggercasting offers a method of protection against signal loss by transmitting a secondary, redundant stream which could be time-shifted with respect to a primary stream. This allows a receiver to pre-buffer packets of the secondary stream to replace packets of the primary stream lost in transmission.
Various staggercasting techniques exist that differ in the types of redundant data sent in the secondary stream. For example, the secondary stream may simply be an exact copy of the primary stream staggered with some time offset.
Another staggercasting technique involves the transmission of a secondary stream that is separately encoded from the primary stream. When scalable video coding is not available (for example, with a specification or standard that does not offer a scalable video codec), this secondary stream is completely independent from the primary stream and is simply a separately encoded stream representing the same source video. Because video decoders must typically maintain state data, such as previously decoded reference frames that must be available for decoding future frames, such a staggercasting arrangement requires a receiver to maintain two separate decoder states for each of the streams, placing additional memory burdens on the receiver.
We have observed that temporal scalability techniques exist that allow a video stream to be coded such that the stream may be decoded and displayed at multiple frame rates. A video stream that is temporally scalably encoded contains reference frames, from which other frames may be predicted, and additional, non-reference frames. The non-reference frames, commonly referred to as “disposable” frames, are unnecessary for decoding other frames of the video stream and thus may be discarded, resulting in a lower display frame rate for the video.
In an exemplary embodiment of the present invention, staggercasting and temporal scalability techniques are combined to transmit a secondary coded video stream in addition to a primary coded video stream such that the secondary stream contains a subset of video frames from the primary stream. The two streams are transmitted at some time offset (i.e. “staggered”) allowing the secondary stream to be pre-buffered by a receiver to substitute for near future losses of the primary stream.
The impact of losing coded video data is alleviated by transmitting video reference frames in both the primary and secondary streams. Disposable video frames while transmitted in the primary stream, need not be transmitted in the secondary stream if bandwidth is limited. Staggering the two streams in time reduces the likelihood of the secondary stream data being lost along with the primary stream data. A receiver can buffer the secondary stream so that it may fall back on this data when loss of the primary stream occurs.
In a further exemplary embodiment of the invention, different levels of protection can be used in the staggered streams. For example, the secondary stream, which transports the more critical elements, can be provided with a high level of error protection, whereas the primary stream can be provided with a lower level of error protection or no error protection at all.
In view of the above, and as will be apparent from reading the detailed description, other embodiments and features are also possible and fall within the principles of the invention.
Some embodiments of apparatus and/or methods in accordance with embodiments of the present invention are now described, by way of example only, and with reference to the accompanying figures in which:
Other than the inventive concept, the elements shown in the figures are well known and will not be described in detail. For example, other than the inventive concept, familiarity with Discrete Multitone (DMT) transmission (also referred to as Orthogonal Frequency Division Multiplexing (OFDM) or Coded Orthogonal Frequency Division Multiplexing (COFDM)) is assumed and not described herein. Also, familiarity with television broadcasting, receivers and video encoding is assumed and is not described in detail herein. For example, other than the inventive concept, familiarity with current and proposed recommendations for TV standards such as NTSC (National Television Systems Committee), PAL (Phase Alternation Lines), SECAM (SEquential Couleur Avec Memoire) and ATSC (Advanced Television Systems Committee) (ATSC), Chinese Digital Television System (GB) 20600-2006 and DVB-H is assumed. Likewise, other than the inventive concept, other transmission concepts such as eight-level vestigial sideband (8-VSB), Quadrature Amplitude Modulation (QAM), and receiver components such as a radio-frequency (RF) front-end (such as a low noise block, tuners, down converters, etc.), demodulators, correlators, leak integrators and squarers is assumed. Further, other than the inventive concept, familiarity with protocols such as the File Delivery over Unidirectional Transport (FLUTE) protocol, Asynchronous Layered Coding (ALC) protocol, Internet protocol (IP) and Internet Protocol Encapsulator (IPE), is assumed and not described herein. Similarly, other than the inventive concept, formatting and encoding methods (such as Moving Picture Expert Group (MPEG)-2 Systems Standard (ISO/IEC 13818-1)) for generating transport bit streams are well-known and not described herein. It should also be noted that the inventive concept may be implemented using conventional programming techniques, which, as such, will not be described herein. Finally, like-numbers on the figures represent similar elements.
Note that while the embodiment described is for a video implementation, the principles of the present invention can be applied to systems handling a variety of temporally scalable data, such as for example, audio data.
The staggercast transmission from the transmitter 103 to the MUX 105 comprises two streams. One stream, the primary stream 10, corresponds to the original stream from the source 101 and the other stream, the secondary stream 20, can be a copy of all or a portion of the primary stream. The secondary stream 20 can be time-shifted or staggered relative to the primary stream 10, in which case it may also be referred to as a “staggered” stream. Staggering allows the receiver 109 to pre-buffer data units of the secondary stream 20 so that they may replace corresponding data units in the primary stream 10 that may have been lost or corrupted in transmission.
In the exemplary arrangement of
In the illustrative scenario of
In the example shown in
In an exemplary embodiment, the primary and secondary streams may be provided with error protection (e.g., turbo codes, forward error correction, etc.) Both or only the secondary stream may be provided with error protection. The two streams may also be provided with different levels of error protection, with the secondary stream preferably being provided with a higher level of protection. It would be possible to reduce the overhead of an error protection scheme by applying it only to the secondary stream. This also offers the advantage of allowing the receiver to immediately decode and play the unprotected primary stream. Since the secondary stream is preferably received before the primary stream, there should be sufficient time to correct errors in any secondary stream data units before they may be needed to replace any lost primary stream data units.
Referring again to
Note that in the arrangement shown in
Moreover, in some applications, a secondary stream as contemplated by the present invention may already be available, as opposed to being generated by a stagger transmitter, as shown. For example, a specification may define multiple profiles for the transmission of content to mobile devices. These profiles can vary from very low resolution/frame rate/bitrate streams for viewing on simple mobile phones with small screens to higher resolution/frame rate/bitrate streams for mobile devices better capable of presenting video (having a larger screen, more powerful decoder, etc.) A system may simultaneously transmit a given video program in both profiles on the same channel so that users of either type of device may receive video that is optimal for their respective devices. In such an application, an embodiment of the present invention would allow the more powerful device to use the simpler stream as the secondary stream in order to provide substitute video for time periods of data loss. This would entail identifying the simpler stream as the secondary stream and transmitting it at a time offset relative to the primary stream. Such an implementation has the benefit of not requiring any additional bandwidth on the channel since the secondary stream already exists.
Referring now to
The receiver 109, using the redundant secondary stream data units contained in the received stream 30′, can reconstruct all or some of the primary stream data units for which copies were included in the secondary stream. In
Upon processing the sequence of reconstructed data units 40, the decoder 111 presents the frames of decoded video to the display 113, as represented by the sequence 50. Sequence 50 shows the reconstructed sequence of data units containing the frames of video data presented for display and indicates the timing with which the frames are displayed. As indicated by the sequence 50, for the duration of the lost data, the frame rate is approximately cut in half, which typically may be perceived by a viewer as a short period of less fluid motion. The visual impact of missing disposable frames should be minimal given that the disposable frames are, by definition, not used as reference frames and thus their loss does not impact any other frames in the stream, as a missing reference frame would. Although it should not be necessary to reconstruct or conceal the missing frames, implementation of the present invention, however, does not preclude such measures.
As can be appreciated, the performance of the above-described arrangement will depend on a variety of factors, including the degree of redundancy provided in the secondary stream and the duration of data loss events. For example, the frame rate of the video generated from the reconstructed sequence 40 of data units will depend on the degree of redundancy provided by the secondary stream 20 in the combined stream 30. If, for example, the secondary stream 20 contains copies of all data units in the primary stream 10, i.e., disposable as well as non-disposable units, it is possible that all data units lost in transmission can be reconstructed at the receiver 109, so long as no staggered copy of a lost data unit is lost as well. For applications in which a temporarily reduced frame rate can be tolerated, only non-disposable data units can be provided in the secondary stream 20, as described above.
Embodiments of the present invention enjoy several advantages over known approaches. As mentioned above, one staggercasting method involves the transmission of a secondary stream that is separately encoded from the primary stream. When scalable video coding is not available (for example, with a specification or standard that does not offer a scalable video codec), this secondary stream is completely independent from the primary stream and is simply a separately encoded stream representing the same source video. Typical video decoders must maintain state data, such as previously decoded reference frames that must be available for decoding future frames that are predicted from them. Where the primary and secondary streams are independent, a receiver would need to maintain two separate decoder states for each of the streams, placing additional memory burdens on the receiver. The exemplary arrangement of the present invention described above can be implemented with only one decoder and associated state memory given that the two streams are related; i.e., the secondary stream is a subset of the primary stream.
The principles of the present invention may be combined with other staggercasting methods. For example, staggercasting may be used with spatially scalable video streams such that both the lower resolution base layer stream and the higher resolution enhancement layer stream are provided in the primary stream while the lower resolution base layer stream is provided in the staggered secondary stream. This would provide protection for the base layer while saving on bandwidth by not duplicating the enhancement layer as well.
In view of the above, the foregoing merely illustrates the principles of the invention and it will thus be appreciated that those skilled in the art will be able to devise numerous alternative arrangements which, although not explicitly described herein, embody the principles of the invention and are within its spirit and scope. For example, the inventive concept may be implemented in a stored-program-controlled processor, e.g., a digital signal processor, which executes associated software for carrying out a method in accordance with the principles of the invention. Further, the principles of the invention are applicable to other types of communications systems, e.g., satellite, Wireless-Fidelity (Wi-Fi), cellular, etc. Indeed, the inventive concept is also applicable to stationary as well as mobile receivers. It is therefore to be understood that numerous modifications may be made to the illustrative embodiments and that other arrangements may be devised without departing from the spirit and scope of the present invention.
This application claims the benefit under 35 U.S.C. §119(e) of U.S. Provisional Application No. 61/123,916, filed Apr. 11, 2008, the entire contents and file wrapper of which are hereby incorporated by reference for all purposes into this application.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US09/02168 | 4/7/2009 | WO | 00 | 10/2/2010 |
Number | Date | Country | |
---|---|---|---|
61123916 | Apr 2008 | US |