Splicing compressed, local video segments into fixed time slots in a network feed

Abstract
A method for seamlessly splicing a local commercial segment into an existing network time slot, without decoder buffer overflow or underflow. The vbv_delay of the commercial segment is manipulated (e.g., for a minimum delay or a maximum delay). The pictures from the commercial segment are output for at least a portion of the associated network time slot duration. A determination is made regarding the number of pictures remaining in a stored portion of the incoming network feed or the commercial segment and the output rate is adjusted as required. The vbv_delay of the stored network feed or the vbv_delay of the local commercial segment is adjusted to match the vbv_delay of the incoming network feed.
Description


[0002] The present invention relates to the generation of digital video signals. In particular, the invention is directed to improvements to video splicing in order to simplify the design requirements of a downstream receiver, especially by lowering the required processing speed.


[0003] The goal of splicing compressed bit streams is to change from one compressed source to a second compressed source with no disruption in the decoded program, while maintaining bitstream compliance through the transition. In general, input streams are de-multiplexed to the packetized elementary stream (PES) level before being processed by individual elementary stream type processors. Program video is spliced at access unit (picture) boundaries, and a continuous flow of time stamped video (and audio frames), without timing discontinuities, is maintained in the output stream.


[0004] Video splicing techniques can include the examination of incoming streams to extract stream parameters that are used to determine stream entry and exit points and calculates values required by the outgoing stream. Exit points are found in the current output stream while entry points are found in the next output stream.


[0005] Seamless entry/exit point indicators can be found by analysis of the types of the neighboring pictures. Specifically a seamless exit from a stream can be made at the end of a picture preceding an anchor picture. This identifies a naturally occurring exit point in the original stream. Seamless entry points can be identified by (1) the start of a closed GOP or (2) an Intra coded (I) picture followed by an anchor picture or (3) an I picture followed by a predictive picture using only backward prediction or intra coding.


[0006] The difficulty of switching from one compressed stream to a second compressed stream, by finding suitable exit and entry point, is eased by the fact that the actual switch point can generally be within a picture or two of the switching command (in order to maintain the seamless aspect) without noticeable visual effect. However, when a rigorously defined network time slot, possibly occupied by a network commercial, is replaced by an equally rigorously defined local insertion, care must be taken to correctly fill the slot, because segment replacement must be exact.


SUMMARY OF THE INVENTION

[0007] The invention is directed to a method for seamlessly splicing a local commercial segment into an existing network time slot, without decoder buffer overflow or underflow. In particular, the invention encompasses a method of splicing an incoming network feed having a network time slot duration and an associated vbv_delay with a commercial slot duration having an associated vbv_delay. The vbv_delay of the commercial slot vbv_delay is manipulated between one of a minimum delay and a maximum delay. The pictures from the compressed commercial slot are output for at least a portion of the network time slot duration. The number of pictures remaining is then determined (i.e., the remaining pictures from either a stored portion of the incoming network feed or the commercial slot). The output rate (of the remaining pictures from either the stored portion of the incoming network feed or the commercial slot) is adjusted as required to output the commercial slot. The vbv_delay of either the stored network feed or the vbv_delay of the local commercial slot is then adjusted to match the vbv delay of the incoming network feed.


[0008] In a preferred embodiment, the commercial slot vbv_delay is manipulated for a maximum delay. Pictures from the compressed commercial slot are output for the network time slot duration. Any remaining pictures from the commercial slot are output by, storing at least a portion of the incoming network feed, outputting the remaining pictures at an increased output rate and then outputting the stored portion of the network feed. The network time slot vbv_delay is also adjusted until the vbv_delay of stored network feed matches the vbv_delay of the incoming network feed.


[0009] In yet another preferred embodiment, the commercial slot vbv_delay is manipulated for a minimum delay. Pictures from the compressed commercial slot are output for at least a portion of the network time slot duration. The number of pictures remaining from the commercial slot are determined and the output rate is adjusted as required to complete the network time slot duration. The local commercial slot vbv_delay is also adjusted to match the vbv_delay of the incoming network feed.


[0010] In yet another preferred embodiment at least a portion of the incoming network feed is stored and delayed. The network time slot duration is determined based on a Decode Time Stamp and a network time slot duration time tolerance. The commercial slot vbv_delay is manipulated so that the commercial slot duration substantially matches the network time slot duration. The incoming network feed is output after completion of the network time slot duration.







BRIEF DESCRIPTION OF THE DRAWINGS

[0011]
FIG. 1 shows network and local time slot arrangement in accordance the invention.


[0012]
FIG. 2 shows the lapsed time from DTS and vbv_delay parameters in accordance with the invention.


[0013]
FIG. 3 shows the relative duration of the network slot and a commercial insert with the vbv_delay of the commercial insert manipulated to a maximum value in accordance with the invention.


[0014]
FIG. 4 shows the relative duration of the network slot and a commercial insert with the vbv_delay of the commercial insert manipulated to a minimum value in accordance with the invention.







DETAILED DESCRIPTION OF THE INVENTION

[0015] Consider a network Elementary Steam, N, shown in FIG. 1, where a sequence of S Network Access Units (pictures), numbered 1 through S, are embedded in the stream. This sequence represents a network time slot, possibly a network advertisement, that is to be replaced with a sequence of C Local Access Units, numbered 1 through C, representing a local time slot, possibly a locally inserted commercial.


[0016] The Out Point from the network is at the start of Network Access Unit 1, which coincides with the start of Local Access Unit 1. Similarly, the In Point of the return to the network occurs after Network Access Unit S. The Out Point from the local commercial occurs after Local Access Unit C.


[0017] Knowing the start time and the final presentation duration of a commercial is insufficient to determine how to insert a compressed local commercial into the stream in the time slot provided by the network when the compressed streams are not further constrained. The time slot in the stream, Ts, is a variable equal to the presentation time, Ps of the slot plus or minus some tolerance, Δs. Similarly, the compressed commercial, stored locally, has a variable time slot, Tc, equal to its presentation time, Pc plus or minus some tolerance, Δc. That is,




T


s


=P


s
+/−Δs and Tc=Pc+/−Δc



[0018] Normally, the number of network Access Units equals the number of local commercial Access Units if both the network slot and the commercial slot have the same frame rate, i.e., S=C. However, when S≠C, as in the case of inserting a 24 frame/second segment into a 30 frame/second slot, the number of Access Units in each segment must be proportionally arranged so that Ts≡Tc.


[0019] Determining the delta tolerances involved with the time slot duration is critical to understanding the invention of splicing a local segment into an existing network time slot.


[0020] Determining the Variability


[0021] In an elementary stream, n of MPEG2 compressed video data, shown in FIG. 2, an Access Unit, j, is stamped with two critical pieces of information, namely Video Buffering Verifier Delay (vbv_delay) and Decode Time Stamp (DTS).


[0022] The MPEG2 definition of vbv delay is “the number of periods of a 90 KHz clock derived from the 27 MHz system clock that the Video Buffering Verifier (VBV) shall wait after receiving the final byte of the picture start code before decoding the picture.” In MPEG2 terminology, with parenthetical remarks inserted for clarity, VBV is “a hypothetical (video) decoder (including a video buffer) that is conceptually connected to the output of the (video) encoder. It's purpose is to provide a constraint on the variability of the data rate that an encoder or editing process may produce (to avoid the video decoder's buffer from overflowing or underflowing).” The value of vbv delay is placed in the Picture Header.


[0023] In MPEG2 the DTS, placed in the Packetized Elementary Stream (PES) header of the jth Access Unit, “indicates the decoding time, tdn(j), in the system target decoder of an Access Unit j of elementary stream n”. Specifically, for an Access Unit j, This DTS can be denoted as DTS(j). The value of DTS is also specified in units of the number of periods of a 90 KHz clock derived from the 27 MHz system clock.


[0024] The time of day at any instant is obtained from the Program Clock Reference (PCR), which is derived from the 27 MHz system clock. The 90 KHz component of the PCR is the Program Clock Reference Base (PCRB). In particular, the time of day at the instant that the vbv_delay occurs in Access Unit j is denoted as PCRB(j). FIG. 2 shows the relationships of DTS (part of the PES header), and vbv_delay, (part of the Picture Header), in Access Unit j, followed by the next Access Units up to Access Unit k, in an Elementary Stream, n.


[0025] The value of the vbv_delay in any Access Unit is related to the DTS in that Access Unit, and the time of day (the wall clock so to speak) at the time that vbv_delay is present in the Elementary Stream by the formula:




vbv_delay=


DTS−PCRB




[0026] Dividing all terms by 90,000 yields values in seconds.


[0027] Solving for time yields:




T=PCRB=DTS−vbv_delay




[0028] In particular, for Access Unit j, the time of occurrence is:




t
(j)=PCRB(j)=DTS(j)−vbv_delay(j)



[0029] Likewise, for Access Unit (k), the time of occurrence is:




t
(k)=PCRB(k)=DTS(k)−vbv_delay(k)



[0030] Lapsed time between two Access Units within a transport stream can be found by:


Lapsed time=t(k)−t(j)


[0031] Where t(j) occurs before t(k), That is t(j)<t(k).


[0032] Lapsed time, TL, therefore equals:




TL=t
(k)−t(j)=[DTS(k)−vbv_delay(k)]−[DTS(j)−vbv_delay(j)]



[0033] Regrouping yields:




TL=[DTS
(k)−DTS(j)]+[vbv_delay(j)−vbv_delay(k)]



[0034] This final equation is the key element in understanding how to splice a local segment into a slot in the network stream.


[0035] Observe that [DTS(k)−DTS(j)] is the duration of the sequence of Access Units shown in FIG. 2, which represents the slot and the commercial playtime, or presentation time (perhaps a 30-second spot). The term [vbv_delay(j)−vbv_delay(k)] represents the variability or time tolerance of the slot duration.


[0036] As a numerical example, consider that the DTS value increments by 3000 from one Access Unit to the next one in a typical 30 Hz system. This is so because the DTS decode times are in units of the presentation picture rate. Therefore, in a typical commercial slot of 30 seconds, the difference between the two values of DTS {i.e., [DTS(k)−DTS(j)]} would be 2,700,000 which when divided by 90,000 equals 30 seconds. When multiplexed into transport stream, variability as to when an Access Unit is present in the stream is introduced by the difference between the two values of vbv_delay {i.e., [vbv_delay(j)−vbv_delay(k)]}. If the vbv_delay values were identical then the Access Units would be spaced in time by the exact difference between the respective DTS values. Determining the absolute worst case (maximum) variability is the next step


[0037] Boundary Limits on Variability


[0038] Unconstrained streams allow any value between zero and 45,000 for the vbv_delay. A time slot for a fixed presentation length commercial insert can thus vary by +/−45,000 periods of the 90 KHz clock. In terms of time, this represents +/−0.5 seconds when observed in real time in the transport stream. For example, a 30 second commercial slot that presents the decoded commercial to a viewer in exactly 30 seconds, will appear in the transport stream for a period of time offset from the nominal 30 seconds intended. The offset is determined by the difference between the vbv_delay after the last picture and the vbv_delay of the first picture, which, as derived, is +/−0.5 seconds.


[0039] When the vbv_delay at the first Access Unit (picture) of a sequence is equal to the vbv_delay after the last Access Unit (picture) in the sequence, the time slot within the transport stream will be equal to the differences between the ending and starting DTS values. This is the same amount of time that is finally presented when the stream is decoded. For example, a 30 second commercial slot that presents the decoded commercial to a viewer in exactly 30 seconds, would appear in the transport stream for exactly 30 seconds, when the vbv_delay after the last picture of the commercial is exactly equal to the vbv_delay of the first picture of the commercial. No specific values for vbv_delays are required, only that the two values are the same.


[0040] As mentioned earlier, knowing the start time and the final presentation duration of a commercial is insufficient to determine how to insert a compressed local commercial into the stream in the time slot provided by the network, when the compressed streams are not further constrained. The time slot in the stream is a variable equal to the presentation length of the slot plus or minus up to 0.5 seconds. Similarly the compressed commercial stored locally is of variable stream length equal to the presentation time plus or minus up to 0.5 seconds.


[0041] The Solutions


[0042] Several strategies can be undertaken to match the network Elementary Stream slot duration with the commercial slot duration or vice versa. The first strategy involves fixing the problem after the fact. This works when the local commercial is arranged to have the longest stream duration relative to the network slot. That is, if the network slot is nominally T seconds, then the commercial stream time is arranged, by vbv_delay manipulation, to have T+0.5 seconds duration. For example, a local compressed commercial would have the vbv_delays adjusted such that 30.5 seconds of compressed stream time are required when the presentation duration is nominally 30 seconds. It is understood that the nominal network slot duration an/or nominal commercial slot duration can vary without limitation (e.g., 15 sec., 30 sec., 1 min., 10 min., 30 min., 1 hr., multiple hours, etc. etc.). Referring to FIG. 3, a nominal 30-second commercial has the vbv_delay adjusted so that the commercial duration is the longest possible, namely 30.5 seconds. The ideal case is that the +/−0.5-second variation of time in the network spot causes the spot to also be 30.5 seconds, so that the commercial fits in exactly. For all other shorter network slot duration, this strategy guarantees that the network feed will want to start playing program material before the time the local commercial has been fully multiplexed into the network stream. Since the network slot can end before the local commercial is finished, the network program must be stored in the splicer as is the case of normal splicer operation for two real-time streams. Once the network slot has ended, the remaining pictures from the local commercial can be multiplexed into the output transport stream at a higher rate. This closes the gap between the end of the network slot and the end of the local commercial. At the end of the commercial a splice returns the stream to the splicer stored network program. The vbv_delay is adjusted over multiple pictures of the resumed network stream until the vbv_delay in the outgoing stream matches the incoming network feed values.


[0043] A second strategy is to fix the problem before the fact. This works when the local commercial is arranged to have the shortest stream duration relative to the network slot. That is, if the network slot is nominally T seconds, then the commercial stream time is arranged, by vbv_delay manipulation, to have T—0.5 seconds duration. For example, a local compressed commercial would have the vbv_delays adjusted such that 29.5 seconds of compressed stream time are required when the presentation duration is nominally 30 seconds. Referring to FIG. 4, a nominal 30-second commercial has the vbv_delay adjusted so that the commercial duration is the shortest possible, namely 29.5 seconds. The ideal case is that the +/−0.5-second variation of time in the network spot causes the spot to also be 29.5 seconds, so that the commercial fits in exactly. For all other longer network slot durations, the network slot is monitored for the number of pictures remaining in the slot. When 29 seconds of the commercial have been output, fifteen pictures of commercial remain to be output and nominally 30 pictures of network slot remain. The actual number of network slot pictures remaining is determined from the difference between the slot duration and the number of pictures that have passed in the network stream since the start of the commercial. The output rate of the commercial pictures (Rn) is adjusted (lowered) to meet the expected completion time of the network slot. The expected completion time of the network slot equals the number of network slot pictures remaining multiplied by the picture rate. The output rate is adjusted on a picture by picture basis as the commercial pictures are output. The vbv_delay is adjusted over multiple pictures of the inserted commercial data stream until the vbv_delay in the outgoing inserted stream matches the incoming network feed values.


[0044] This strategy is the preferred embodiment.


[0045] A third strategy is a combination of the first two strategies. The stream from a remote source can be delayed by a fixed amount. This delay can provide a look ahead at the incoming stream. This look ahead provides opportunity to determine the duration of a stream time slot prior to its arrival into the internal splicing block. The vbv_delay of local commercials are adjusted to occupy the same amount of time in the transport stream as the presentation time. A nominal 30-second commercial would occupy the same time as the network slot despite the variations +/−0.5 seconds) of either the slot or the commercial. After splicing of the commercial, the delay buffer is monitored to determine the arrival of the end of the network slot. The local commercial vbv_delays can then be modified to cause the local commercial to end just after the network slot has ended, regardless of its time variation. This permits a smooth splice back to the network program that follows the network slot.


[0046] While this invention has been described with an emphasis upon preferred embodiments, it will be obvious to those of ordinary skill in the art that variations in the preferred devices and methods may be used and that it is intended that the invention may be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications encompassed within the spirit and scope of the invention as defined by the claims that follow.


Claims
  • 1. A method of splicing an incoming network feed having a network time slot duration and an associated vbv_delay with a commercial slot duration having an associated vbv_delay comprising: (a) manipulating the commercial slot vbv_delay between one of a minimum delay and a maximum delay, (b) outputting pictures from the compressed commercial slot for at least a portion of the network time slot duration, (c) determining the number of pictures remaining from one of a stored portion of the incoming network feed and the commercial slot and adjusting the output rate as required to output the commercial slot, and (d) adjusting one of the vbv_delay of stored network feed and the vbv_delay of the local commercial slot to match the vbv_delay of the incoming network feed.
  • 2. The method of claim 1 comprising manipulating the commercial slot vbv_delay for a maximum delay.
  • 3. The method of claim 2 comprising outputting pictures from the compressed commercial slot for the network time slot duration.
  • 4. The method of claim 3 comprising outputting any remaining pictures from the commercial slot by, storing at least a portion of the incoming network feed, outputting the remaining pictures at an increased output rate and then outputting the stored portion of the network feed and adjusting network time slot vbv_delay until the vbv_delay of stored network feed matches the vbv_delay of the incoming network feed.
  • 5. The method of claim of claim 2 wherein the compressed commercial slot duration is manipulated to have a 30.5 second duration.
  • 6. The method of claim 5 wherein the network time slot duration is based on a Decode Time Stamp and a network time slot duration time tolerance and wherein the stored portion of the network feed is at least as large as the difference between the 30.5 seconds and the network time slot duration.
  • 7. The method of claim 1 comprising manipulating the commercial slot vbv_delay for a minimum delay.
  • 8. The method of claim 7 comprising outputting pictures from the compressed commercial slot for at least a portion of the network time slot duration.
  • 9. The method of claim 8 comprising determining the number of pictures remaining from the commercial slot and adjusting the output rate as required to complete the network time slot duration, and adjusting the local commercial slot vbv_delay to match the vbv_delay of the incoming network feed.
  • 10. The method of claim of claim 7 wherein the compressed commercial slot duration is manipulated to have a 29.5 second duration.
  • 11. The method of claim 10 wherein the network time slot duration is based on a Decode Time Stamp and a network time slot duration time tolerance and the compressed commercial slot is output for 29 seconds such that 15 pictures remain from the commercial slot, wherein the output rate of the 15 remaining pictures is determined based on the difference between the network time slot duration and 29.5 seconds.
  • 12. A method of splicing an incoming network feed having a network time slot duration and an associated vbv_delay with a commercial slot duration having an associated vbv_delay comprising: (a) manipulating the commercial slot vbv_delay for a maximum delay, (b) outputting pictures from the compressed commercial slot for the network time slot duration, and (c) outputting any remaining pictures from the commercial slot by, storing at least a portion of the incoming network feed, outputting the remaining pictures at an increased output rate and then outputting the stored network feed and adjusting network time slot vbv_delay until the vbv_delay of stored network feed matches the vbv_delay of the incoming network feed.
  • 13. The method of claim of claim 12 wherein the compressed commercial slot duration is manipulated to have a 30.5 second duration.
  • 14. The method of claim 13 wherein the network time slot duration is based on a Decode Time Stamp and a network time slot duration time tolerance and wherein the stored portion of the network feed is at least as large as the difference between the 30.5 seconds and the network time slot duration.
  • 15. A method of splicing an incoming network feed having a network time slot duration and an associated vbv_delay with a commercial slot duration having an associated vbv_delay comprising: (a) manipulating the commercial slot vbv_delay for a minimum delay, (b) outputting pictures from the compressed commercial slot for at least a portion of the network time slot duration, and (c) determining the number of pictures remaining from the commercial slot and adjusting the output rate as required to complete the network time slot duration, and adjusting the local commercial slot vbv_delay to match the vbv_delay of the incoming network feed.
  • 16. The method of claim of claim 15 wherein the compressed commercial slot duration is manipulated to have a 29.5 second duration.
  • 17. The method of claim 16 wherein the network time slot duration is based on a Decode Time Stamp and a network time slot duration time tolerance and the compressed commercial slot is output for 29 seconds such that 15 pictures remain from the commercial slot, wherein the output rate of the 15 remaining pictures is determined based on the difference between the network time slot duration and 29.5 seconds.
  • 18. A method of splicing an incoming network feed having a network time slot duration and an associated vbv_delay with a commercial slot duration having an associated vbv_delay comprising: (a) storing and delaying at least a portion of the incoming network feed, (b) determining the network time slot duration based on a Decode Time Stamp and a network time slot duration time tolerance, (c) manipulating the commercial slot vbv_delay so that the commercial slot duration substantially matches the network time slot duration, and (d) outputting the incoming network feed after completion of the network time slot duration.
Parent Case Info

[0001] This application claims the benefit of Provisional Patent Application Serial No. 60/220,671, filed Jul. 25, 2000.

Provisional Applications (1)
Number Date Country
60220671 Jul 2000 US