SYSTEM AND METHOD FOR TRACKING CONTENT TIMELINE IN THE PRESENCE OF PLAYBACK RATE CHANGES

Description

FIELD OF INVENTION

The present invention generally relates to watermarking digital content and more particularly to using watermarks to track content timeline in the presence of playback rate changes.

BACKGROUND

This section is intended to provide a background or context to the disclosed embodiments that are recited in the claims. The description herein may include concepts that could be pursued but are not necessarily ones that have been previously conceived or pursued. Therefore, unless otherwise indicated herein, what is described in this section is not prior art to the description and claims in this application and is not admitted to be prior art by inclusion in this section.

A video watermarking system which embeds ancillary information into a video signal is found in the ATSC standard A/335. In such systems it is sometimes necessary to playback auxiliary content which is synchronized to a watermark timeline recovered from the received content in cases where the recovered timeline has a non-linear mapping to real time.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 Illustrates exemplary non-linear timelines resulting from the user's operation of STB remote control trickplay functions in response to a sequence of user commands in accordance with an embodiment of the disclosure.

FIG. 2 Illustrates exemplary non-linear timelines resulting from the user's operation of STB remote control trickplay functions showing that occasionally two input frames are skipped resulting in an overall rate of ˜2.08× as shown in Figure

FIG. 3 Illustrates exemplary non-linear timelines resulting from the user's operation of STB remote control trickplay functions in response to starting 1× playback, then hitting the ‘>>’ button three times in succession, resulting in ‘1×’, ‘2×’, ‘8×’, ‘32×’ playback in accordance with an embodiment of the disclosure.

FIG. 4 Illustrates exemplary non-linear timelines resulting from the user's operation of STB remote control trickplay functions in response to a series of ‘skip-forward’ and ‘skip-back’ commands, resulting in short pauses prior to the skip, then an immediate return to 1× playback.

FIG. 5 Illustrates a block diagram of a device that can be used for implementing various disclosed embodiments.

SUMMARY OF THE INVENTION

This section is intended to provide a summary of certain exemplary embodiments and is not intended to limit the scope of the embodiments that are disclosed in this application.

Disclosed embodiments relate to method for synchronizing auxiliary content to a watermark timeline recovered from a received content when the recovered timeline has a non-linear mapping to real time. The method includes receiving video content having a video watermark embedded therein and decoding video frames from the received video content. A Detector Engine is used to receive the decoded video frames and extract a time-offset field, a VP1 payload, and a Cyclic Redundance Check (CRC) field in each video frame. A Content Timeline Tracker is used to monitor and analyze the output of the Detector Engine, to produce a piecewise linear approximation of the content timeline, wherein the playback rate changes by a user in an upstream device can be tracked, thereby enabling the playback of auxiliary content which is synchronized to a watermark timeline recovered from the received content when the recovered timeline has a non-linear mapping to real time.

These and other advantages and features of disclosed embodiments, together with the organization and manner of operation thereof, will become apparent from the following detailed description when taken in conjunction with the accompanying drawings.

DETAILED DESCRIPTION OF CERTAIN EMBODIMENTS

In the following description, for purposes of explanation and not limitation, details and descriptions are set forth in order to provide a thorough understanding of the disclosed embodiments. However, it will be apparent to those skilled in the art that the present invention may be practiced in other embodiments that depart from these details and descriptions.

Additionally, in the subject description, the word “exemplary” is used to mean serving as an example, instance, or illustration. Any embodiment or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments or designs. Rather, use of the word exemplary is intended to present concepts in a concrete manner.

Introduction

This disclosure describes the logic that uses video watermarks specified in the ATSC 3.0 Standards, Video Watermark Emission (A/335), Doc. A335:2016, 20 Sep. 2016, which is incorporated by reference, and Content Recovery in Redistribution Scenarios (A/336), Doc. A/336:2019, 3 Oct. 2019, which is incorporated by reference, in order to detect and measure trick-play action on upstream devices such as Set Top Box (STB), such as pause, speed-up, slow-down and skip. In particular it is based on detecting eVP1 messages specified in the A/336 standard, which comprises 8-bit time_offset field, 50-bit VP1 payload and 32-bit Cyclic Redundancy Check (CRC) field in each video frame.

The time_offset field is incremented by one every 1/30 s within a message group that lasts 1.5 s, i.e., it can have values 0, 1, 2, . . . 44 within each message group. The VP1 payload (P) is divided into four fields: Domain Type (DT), Server Code (SC), Interval Code (IC), and Query Flag (QF). DT is a one-bit field (0=“small domain”, 1=“large domain”). For “small domain”, the SC field consists of 31 bits and the IC field consists of 17 bits. For “large domain”, the SC field consists of 23 bits and the IC field consists of 25 bits. The QF field is always one bit, and its toggling signals a dynamic event that requires new signaling recovery. The IC field is incremented by one for each subsequent message group.

The CRC field is used to confirm correctness of the extracted data, as is well known to those skilled in the art. It is assumed that there is a detector engine that will receive decoded video frames and extract 8-bit time_offset field, 50-bit VP1 payload and 32-bit CRC field in each video frame based on A/335 and A/336. The details of detector engine design are not part of this disclosure.

CRC Matching

The CRC matching logic compares the CRC fields extracted from the current frame with CRC field extracted from the previous frame and sets the CRC repetition flag to TRUE if they match and otherwise sets it to FALSE. This process is done regardless of whether the extracted CRC field matches the calculated CRC field based on the extracted data. Even if extracted CRC field may have bit errors and the actual data cannot be retrieved, we still want to know if the consecutive CRC fields are repeated. This information can be later used to discriminate between actual payload repetition, such as time_offset repetition in high frame-rate video or fragment repetition, or frame repetition in pause-and-seek playback rate change, skip and pause, as described below.

Content Timeline Tracker

The Content Timeline Tracker (“Tracker”) monitors the output of the detector engine, and analyzes frame_counter, interval_code, time_offset, and CRC repetition flag values and to produce estSpeed, a piecewise linear approximation of the content timeline which can track playback rate changes initiated by a user on an upstream device (e.g., STB).

Overview

Some applications require playback of auxiliary content which is synchronized to the watermark timeline recovered from the main content. For normal viewing the recovered timeline is real-time, meaning that an elapsed interval of media time occurs in an equal duration interval of real time. Other times, such as when the user is controlling playback of main content using ‘trick play’, the recovered timeline has a non-linear mapping to real time.

To play content, Media Player APIs typically expose a command to start (or continue) to play from a specific frame at a specific speed. A sufficiently fast enough player could track, frame-by-frame, the recovered timeline in all modes of play, but most current players cannot respond quickly enough to be able to precisely seek to and render a frame within one frame's duration.

A goal of the Tracker is to quickly recognize where playback rate changes are initiated by the user, and provide a piecewise-linear estimate of the playback speed which can then be used in controlling a replacement media player, minimizing the number of seek commands required to track the main content.

TABLE

Tracker Details

Parameter Name
Input/Output

Frame Counter (Fc)
Input

Interval Code (P)
Input

time_offset (To)
Input

CRC repetition flag
Input

(crf)

Frame rate (fps)
Input

estSpeed
Output

fSpeedUpdated
Output

Tracker inputs and outputs are summarized in the Tracker Details Table below.

TABLE

Tracker Variables

Parameter Name
Type
Description

controlSegmentStartMediaTime
Float
Media time of current

Control Segment start.

Init value = 0.0

controlSegmentStartClockTime
Float
Clock time of current

Control Segment start

currentMediaTime
Float
Media time as

calculated using ic

and To. Unit: Seconds

prevMediaTime
Float
Value of

currentMediaTime last

time tracker ( ) was

called. Unit: Seconds

deltaMediaTime
Float
Time since last time

tracker ( ) was called.

Unit: Seconds

currentClockTime
Float
Local receiver clock

derived from counting

samples. Unit: Seconds

prevClockTime
Float
Value of

currentClockTime last

time tracker ( ) was

called. Unit: Seconds

prevOffset
Int
Value of time_offset

from previous call to

track ( )

pauseCounter
Int
Counts successive

frames with same media

time.

estSpeed
Float
Estimated playback

speed. Init value = 0.0

fSpeedUpdated
boolean
true if a new control

segment was detected

during the last call

to track ( ). Init

value = false.

TABLE

Tracker Configuration Constants

Parameter Name
Type
Description

pauseCount Threshold
Int
=11; make this larger

than the largest number

of frames encountered

during pause-seek trick

play.

stableStateCount
Int
=5; stability

threshold for counting

pause or 1x events.

speedLimit
Float
=32; clip speed

estimates to +/−

speedLimit

speedRatioReportingThreshold
Float
=1.005; estSpeed

ratio change reporting

threshold

ToQuantization
Float
=1/30 sec;

quantization of T0

during embedding

Control Segments

A Control Segment represents a period of time between two upstream user transport control commands which modify playback speed. The media timeline detected with the watermark might be a smooth rendition of the user's command (e.g., 2× resulting in regular frame decimation), or it might be a pause-seek stepwise approximation to the user's command (e.g., 32× in FIG. 1).

The Control Segment is initialized with the currentMediaTime and currentClockTime.

func controlSegmentInit ( ) {

csStartMediaTime = currentMediaTime

csStartClockTime = currentClockTime

}

An initial speed estimate uses the most recent deltaMediaTime

func getInitJumpingSpeed ( ) -> Float {

let estSpeed = deltaMediaTime / frameDurationSec

return min(max (estSpeed,

−c.speedLimit) , c.speedLimit)

}

Occasionally the speedEstimate is updated in the middle of a Control Segment as the slope of the expanding control segment line becomes a better estimator for media speed. getCurrentCSSpeed( ) calculates the current slope and clips the value to speedlimit.

func getCurrentCSSpeed ( ) -> Float {

let speed = (currentMediaTime − csStartMediaTime) /

(currentClockTime − csStartClockTime)

let clippedSpeed = min(max(speed, −speedLimit),

speedLimit)

return clippedSpeed

}

Tracker States

The Tracker implements a state machine to help recognize patterns in the recovered timeline and estimate the control segment boundaries. The states are shown in the tracker States Table below.

State Name
Description

Init
Initial tracker state

Paused
Paused

PauseSeek
Pause-Seek

OneXOnset
First 1x frame spacing

detected. This state provides

a one frame delay before

making decision on next state

OneXPlay
Playback at speed less than or

equal to 2.0. which might

include some individual

repeated or skipped frames.

JumpOnset
First Jumping spacing detected

(not paused and not 1x) . This

state provides a one frame

delay before making decision

on next state

Jumping
In the middle of a sequence of

non-pause, non-1x play spaced

frames

Tracker Events/Tracker Main

track( ) is called with parameters frame_counter, interval_counter, time_offset and CRC repetition flag. It generates events which drive the Tracker state machine. The events are:

Event Name
Description

pauseDetected ( )

play1xDetected ( )

discontinuityDetected ( )

track( ) is called once for every detected frame.

Two successive calls to tracker with the same IC and time offset might mean that content is paused, but this can also happen for frame rates higher that 30 fps because time offset is quantized to 1/30 sec (TOQuantization). These duplicate frames caused by To quantization should be discarded by the tracker, and this is done by looking at the deltaClockTime to determine if two successive calls are spaced less than 1/30 sec. Note that deltaMediaTime might not be zero even if two successive calls are spaced closer than 1/30 sec because of upstream trick play, and these samples should not be discarded.

Two successive calls to Track( ) might be spaced further than 1/fps seconds apart if intervening frames did not have time_offset available. The number of skipped frames is calculated in skippedFrames and used to test for 1× play speed.

The CRC repetition flag crf is used to indicate paused state when the time_offset is not available; in this case the previous value of the time offset is used.

When the fps is different than 1/ToQuantization, there will be an error in the calculation of delta media time. This kind of jitter is tolerated using a threshold in the calculation:

- frameJitterThresholdSec=0.99/fps

Pseudo-code for the track( ) function of the Tracker:

func track(frame_counter:Int, ic:Int, time_offset:int,

crf:Boolean, fps:Float) {

trackingTimelineTimetick(frame_counter)

if ( (time_offset != −1) && (ic != −1) ) | | crf {

if crf {

let currentOffset = prevOffset

if (ic != −1) {

prevIC = ic

} else {

ic = prevIC

}

else { // (time_offset != −1) && (ic != −1)

let currentOffset = time_offset

prevOffset = time_offset

prevIC = ic

}

let clockTime = frame_counter / fps

let mediaFrameOffsetTime = (ic * 1.5) +

(currentOffset * ToQuantization )

deltaClockTime = clockTime − prevClockTime

deltaMediaTime = mediaFrameOffsetTime −

prevMediaTime

if ! ( (deltaClockTime < ToQuantization) &&

(deltaMediaTime == 0) ) {

skippedFrames = Int ( (deltaClockTime *

fps) .rounded ( ) )

currentClockTime = clockTime

prevClockTime = currentClockTime

currentMediaTime = mediaFrameOffsetTime

prevMediaTime = currentMediaTime

if deltaMediaTime == 0 {

pauseDetected( )

} else if deltaMediaTime < skippedFrames *

frameDurationSec + frameJitterThresholdSec &&

deltaMediaTime > skippedFrames *

frameDurationSec − frameJitterThresholdSec {

play1xDetected( )

} else {

discontinuityDetected( )

}

trackingTimelineUpdate( )

}

}

}

Pause Detected Event Handler

This event is triggered when successive frames show no advance in media 10 time. This could be because the content is paused, or it might part of content playback at speed not equal 1×, such as part of a ‘Pause-Seek’ operation for speed >2.0, or part of frame interpolation for speed <1.0.

A goal is to recognize as quickly as possible that pause is occurring to ensure that a tracking media player is responsive to user commands.

The main decision to be made in the event handlers is whether to start a new or update the current control segment. For example, new control segments should not be started in the middle of a sequence of pause-seeks, but the existing speed estimate should be updated.

func pauseDetected( ) {

if state == . JumpOnset {

if prevState == . PauseSeek { / / in trickplay. Dont

reanchor control segment; just update speed

setEstSpeed(getCurrentCSSpeed( ) )

} else { / / This is a new jump, so reanchor

control segment;

setEstSpeed(getInitJumpingSpeed( ) )

controlSegment Init( )

}

pauseCounter = 0

state = . PauseSeek

} else if (state == .PauseSeek) | | (state == .Jumping)

{

/ / This might be a real pause, or it might be part

of a

/ / pause-seek: Only update if real pause

if pauseCounter > pauseCountThreshold {

setEstSpeed (0)

controlSegmentInit( )

state = .Paused

pauseCounter = 0;

}

} else if state == .OneXPlay | | state == .OneXOnset {

/ / Start a new control segment only if coming from

continuous

/ / (non-jumping) 1x playback . Ignore transient

duplicated frames during jumping.

if pauseCounter > stableStateCount | | onexCounter >

stableStateCount {

setEstSpeed (1) / / Do not zero speed entering

pause-seek. This is not a real pause.

controlSegmentInit( )

state = .PauseSeek

pauseCounter = 0;

} else { / / This could be oscilation between pause

and 1x seen in speeds < 1

setEstSpeed (getCurrentCSSpeed( ) )

}

} else { / / state == .Paused or state == .Init

state = .Paused

if estSpeed != 0 {

setEstSpeed(0.0)

}

}

discCounter = 0;

pauseCounter = pauseCounter + 1

onexCounter = 0

}

play1× Detected Event Handler

play1× Detected might be part of normal 1× play, or it might be part of a sequence of frames where playback speed is <2×. A goal is to recognize as quickly as possible that normal 1× play is occurring to ensure that a tracking media player is responsive to user commands.

func play1xDetected( ) {

if state == .JumpOnset {

state = .OneXOnset

} else if state == .Jumping {

controlSegmentInit( )

state = .OneXOnset

} else if state == .OneXOnset {

setEstSpeed (1.0)

controlSegmentInit( )

state = .OneXPlay

} else if state == .Paused | | state == .Init {

setEstSpeed(1.0)

controlSegmentInit( )

state = .OneXPlay

} else if state == .PauseSeek { / / might be <2x, so

don't reanchor segment

state = .OneXOnset

} else if state == .OneXPlay {

if onexCounter == stableStateCount { / / Fail safe

(in presense of CRC errors).

setEstSpeed(1)

controlSegment Init( ) / / Establish

1x control segment

}

}

discCounter = 0;

pauseCounter = 0;

onexCounter = onexCounter + 1

}

Discontinuity Detected Event Handler

A discontinuity is any jump in the recover timeline, which is not a pause or frames spaced 1/fps apart. These might be part of a pause-seek (a ‘big’ jump below), or result from playback speeds estSpeed <2.0 && estSpeed >1.0.

func discontinuityDetected( ) {

if state == .Paused | | state == .PauseSeek | | state ==

.Init {

/ / first jump afer a pause; wait for next frame to

establish slope for estSpeed

/ / (esp useful for 1x play after skip)

state = .JumpOnset

} else if state == .JumpOnset {

state = .Jumping

setEstSpeed(getInitJumpingSpeed( ) )

controlSegment Init( )

} else if state == .OneXPlay | | state == .OneXOnset {

if abs(deltaMediaTime) > 2.2 * skippedFrames *

frameDurationSec | | discounter > 2 { / / if this is a big

skip

controlSegmentInit( )

state = .JumpOnset / / wait a frame before

tsUpdated to get better speed estimate

} else { / / a small jump could be part of speed <

2, so stay in .OneXPlay

setEstSpeed(getCurrentCSSpeed( ) )

}

} else if state == .Jumping {

if abs(deltaMediaTime) > 2.2 * Float(skippedFrames)

* frameDurationSec { / / if this is a big skip

state = .JumpOnset

} / / else, don't controlSegmentInit via .JumpOnset

setEstSpeed(getCurrentCSSpeed( ) )

}

discCounter = discounter + 1;

pauseCounter = 0;

onexCounter = 0

}

Tracking Timeline

estSpeed represents the slope of an idealized control segment. In reality, it is a noisy signal that is influenced by the imperfect nature of trick play media transports. A trackingTimeline is created with logic to try to remove this noise and produce sparsely spaced fSpeedUpdated events that delineate constant slope (constant speed) control segments.

The timeline is parametrized by a tt.speed and tt.mediaTime, and can be quantized in time to correspond to the underlying video frame rate. For each processed video frame, trackingTimelineTimetick( ) is called to update the timeline by extrapolating the mediaTime using tt.speed. The timeline can also be resynchronized to the video watermark timeline in trackingTimelineUpdate( ) which is also called every processed video frame. trackingTimelineUpdate( ) selectively calls trackingTimelineSetTimeAndSpeed (time, speed) which updates the tracking timeline and sets the fSpeedUpdated Boolean.

trackingTimelineUpdate( ) does not always update tt.speed and tt.mediaTime and uses thresholding logic and other heuristics to avoid too frequent updates to fSpeedUpdated. This can be important if, for example, fSpeedUpdated is used to trigger the seeking of a media player which playing alternate content synchronized to the incoming watermarked content.

trackingTimelineUpdate( ) analyzes the differences between tt.speed and the estSpeed which is estimated from the recovered watermarks. If there is any transition between pause and play (i.e., if (estSpeed==0.0∥estSpeed==1.0∥tt.speed==0∥tt.speed==1.0) && (tt.speed˜=estSpeed), the tracking timeline is immediately updated.

If tt.speed and estSpeed have opposite signs, the tracking timeline is also immediately updated so that overshoot is reduced in tracking devices. If the signs are the same then the tracking timeline is only updated if the ratio of tt.speed and estSpeed is outside of a thresholded window. This avoids constant fSpeedUpdated triggers that might be due to small estimation errors in estSpeed and other system noise.

If none of the speed analysis conditions are true, trackingTimelineUpdate( ) analyzes the differences between tt.mediaTime and the currentMediaTime. If this difference is above a threshold, then the tracking timeline is updated. The threshold is adjusted based on the estSpeed, so that there is a greater tolerance to time errors when operating at fast trick play speeds. In most cases the tracking timeline is updated using the currentMediaTime and estSpeed; however, if such an update would reverse the sign of the speed when the time difference is relatively small and the difference is diverging, this is recognized as normal tracking of a pause-seek trick play source, so the tracking timeline is updated to pause at currentMediaTime to wait for the next seek in the pause seek sequence.

TABLE

Tracking Timeline Variables

Parameter Name
Type
Description

tt.mediaTime
Float
Current time along the

tracking timeline

tt.speed
Float
Current speed along the

tracking timeline

tt.currentMediaTimeError
Float
Difference between the

tt.mediaTime and estSpeed

tt.prevFrameNumber
Int
curentFrameNumber last

time

trackingTimelineTimeTick

was called

tt.prevMediaTimeError
Float
Difference between the

tt. mediaTime and estSpeed

as measured in the last

frame

TABLE

TrackingTimeline Constants

Parameter Name
Type
Description

tt.timeErrorThreshold
Float
=0.5 Seconds (mediaTime)

error before issuing a

correction in jump mode.

Smaller number results in

less error but more speed

updates

TimeTracker Functions

func trackingTimelineInit( ) {

tt.speed = 0.0

tt.mediaTime = 0.0

tt.currentMediaTimeError = 0. 0

tt. prevMediaTimeError = 0.0

tt. prevFrameNumber = 0

}

func trackingTimelineTimetick(current FrameNumber) {

let skippedFrames = current FrameNumber −

tt. prevFrameNumber

tt. prevFrameNumber = current FrameNumber

tt.mediaTime = tt.mediaTime + tt.speed *

skippedFrames * frameDurationSec

tt.prevMediaTimeError = tt.currentMediaTimeError

tt. currentMediaTimeError = tt.mediaTime −

currentMediaTime

fSpeedUpdated = 0

}

func trackingTimelineSetTimeAndSpeed(time, speed) {

tt.speed = speed

tt.mediaTime = time

fSpeedUpdated = 1

}

func trackingTimelineUpdate( ) {

fCorrected = false;

/ / First check speed difference

if (estSpeed == 0.0 | | estSpeed == 1.0 | | tt.speed

== 0 | | tt.speed == 1.0) && ( tt.speed ~= estSpeed ) {

trackingTimelineSetTimeAndSpeed(currentMediaTime,

estSpeed)

fCorrected = true;

} elseif sign(tt.speed) == sign(estSpeed) ) {

if ( tt.speed ~= estSpeed ) {

if abs(estSpeed) > abs(tt.speed ) {

ratio = tt.speed/ estSpeed

} else {

ratio = estSpeed/tt.speed

}

if ratio < 0.5 {

trackingTimelineSetTimeAndSpeed

(currentMediaTime, estSpeed)

fCorrected = true

}

}

} elseif tt.speed * state.estSpeed ~= 0 / / / / update

if speeds are opposite signs and non-zero

trackingTimelineSetTimeAndSpeed

(currentMediaTime, estSpeed)

fCorrected = true

}

/ / Second, check time difference

if fCorrected == false {

thresh = tt.timeErrorThreshold

if abs(estSpeed) > 2 {

thresh = tt.timeErrorThreshold *

abs(estSpeed)

}

if abs(tt.currentMediaTimeError) >=

abs(deltaMediaTime) &&

abs(tt.currentMediaTimeError) >= thresh {

/ / do not make small direction

reversals when correcting

if sign(tt.currentMediaTimeError) ~=

sign (state .estSpeed) | |

sign (tt.speed) ~= sign (state.estSpeed) {

/ / in this case a correction will not

reverese direction

trackingTimelineSetTimeAndSpeed(currentMediaTime,

estSpeed)

} elseif abs(tt.currentMediaTimeError) >

abs(tt.prevMediaTimeError) &&

(abs (tt.currentMediaTimeError)/abs (stat

e.estSpeed) ) < 5

/ / in this case a correction will

reverse directions,

/ / so if we're diverging pause media

player at its current position

trackingTimelineSetTimeAndSpeed (currentMediaTime,

0)

} else {

trackingTimelineSetTimeAndSpeed (currentMediaTime,

estSpeed)

}

}

}

}

Trickplay Timeline Examples

Examples of non-linear timelines resulting from the user's operation of STB remote control trickplay functions are shown below. These are selected from a set of test vectors that can be used to validate implementations of this algorithm.

In these examples, the user input is a sparse sequence of button pushes to change playback speed or skip through content. The STBs main media player responds by seeking in the content and using custom frame decimation and interpolation to play the content at the commanded speed. A typical algorithm is ‘Pause-Seek’, where a frame is repeated (‘Pause’) while the player seeks to an appropriate frame to play next.

1×->2×->8× Playback

FIG. 1 shows the results of a sequence of user commands to a ChannelMaster DVR: starting 1× playback, then hitting the ‘>>’ button at frame 40 results in 2× playback, until frame 90, when a the second ‘>>’ command results in a brief pause, a slight regression in time, then a succession of pause-seek intervals. The pause-seek interval timing is regularly spaced, with slight variations (e.g. Pause for 6 frames then a jump of 36 frames). Even though the display overlay says ‘8×’ the actual average is approx. 5.4×.

A closer look at the 2× playback section in FIG. 1 shows that it is not simply discarding every other frame. Occasionally two input frames are skipped resulting in an overall rate of ˜2.08× as shown in FIG. 2.

Similarly, playback rates between 1.0 and 2.0 consisting of periods of 1× playback interspersed with jumps of 2 frames. Playback rates <1.0 consist of repeated frames interspersed with 1× frame increments.

ChannelMaster 32× Playback

FIG. 3 shows the results of starting 1× playback, then hitting the ‘>>’ button three times in succession, resulting in ‘1×’, ‘2×’, ‘8×’, ‘32×’ playback. Notice that at 32× the pause-seek steps are no longer uniform.

ChannelMaster Skip Ahead/Skip Back

FIG. 4 shows the result of a series of ‘skip-forward’ and ‘skip-back’ commands, resulting in short pauses prior to the skip, then an immediate return to 1× playback.

It is understood that the various embodiments of the present invention may be implemented individually, or collectively, in devices comprised of various hardware and/or software modules and components. These devices, for example, may comprise a processor, a memory unit, an interface that are communicatively connected to each other, and may range from desktop and/or laptop computers, to consumer electronic devices such as media players, mobile devices, and the like. For example, FIG. 4 illustrates a block diagram of a device 1000 within which the various disclosed embodiments may be implemented. The device 1000 comprises at least one processor 1002 and/or controller, at least one memory 1004 unit that is in communication with the processor 1002, and at least one communication unit 1006 that enables the exchange of data and information, directly or indirectly, through the communication link 1008 with other entities, devices and networks. The communication unit 1006 may provide wired and/or wireless communication capabilities in accordance with one or more communication protocols, and therefore it may comprise the proper transmitter/receiver antennas, circuitry and ports, as well as the encoding/decoding capabilities that may be necessary for proper transmission and/or reception of data and other information.

Referring back to FIG. 4 the device 1000 and the like may be implemented in software, hardware, firmware, or combinations thereof. Similarly, the various components or sub-components within each module may be implemented in software, hardware, or firmware. The connectivity between the modules and/or components within the modules may be provided using any one of the connectivity methods and media that is known in the art, including, but not limited to, communications over the Internet, wired, or wireless networks using the appropriate protocols.

Various embodiments described herein are described in the general context of methods or processes, which may be implemented in one embodiment by a computer program product, embodied in a computer-readable medium, including computer-executable instructions, such as program code, executed by computers in networked environments. A computer-readable medium may include removable and non-removable storage devices including, but not limited to, Read Only Memory (ROM), Random Access Memory (RAM), compact discs (CDs), digital versatile discs (DVD), etc. Therefore, the computer-readable media that is described in the present application comprises non-transitory storage media. Generally, program modules may include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Computer-executable instructions, associated data structures, and program modules represent examples of program code for executing steps of the methods disclosed herein. The particular sequence of such executable instructions or associated data structures represents examples of corresponding acts for implementing the functions described in such steps or processes.

The foregoing description of embodiments has been presented for purposes of illustration and description. The foregoing description is not intended to be exhaustive or to limit embodiments of the present invention to the precise form disclosed, and modifications and variations are possible in light of the above teachings or may be acquired from practice of various embodiments. The embodiments discussed herein were chosen and described in order to explain the principles and the nature of various embodiments and its practical application to enable one skilled in the art to utilize the present invention in various embodiments and with various modifications as are suited to the particular use contemplated. The features of the embodiments described herein may be combined in all possible combinations of methods, apparatus, modules, systems, and computer program products.

Claims

1. A method comprising: receiving video content having a video watermark embedded therein;decoding video frames from the received video content;using a detector engine to receive the decoded video frames and extract at least a time-offset, a payload, and an error detection field in a plurality of the decoded video frames, wherein the detector engine generates an output for the plurality of decoded video frames that includes a time-offset and a payload derived from the video watermark; andusing a content timeline tracker to monitor and analyze the output of the detector engine, to produce a piecewise linear approximation of the content timeline, wherein the playback rate changes by a user in an upstream device can be tracked, thereby enabling the playback of auxiliary content which is synchronized to a watermark timeline recovered from the received content when the recovered timeline has a non-linear mapping to real time.
2. The method of claim 1 further comprising using matching logic to compare the watermark data extracted from the current frame with at least some watermark data extracted from the previous frame and to set a data repetition flag to TRUE if they match and otherwise set it to FALSE.
3. The method of claim 2 wherein the matching logic determines if consecutive watermark data are repeated, wherein this information can be later used to discriminate between actual payload repetition, such as time offset repetition in high frame-rate video or fragment repetition, or frame repetition in pause-and-seek playback rate change, skip and pause.
4. The method of claim 1 wherein the content timeline tracker analyzes at least the time_offset and data repetition flag values to produce estSpeed, a piecewise linear approximation of the content timeline which can track playback rate changes initiated by a user on an upstream device.
5. The method of claim 1 wherein the Content Timeline Tracker implements a state machine to recognize patterns in the recovered timeline and estimate the control segment boundaries.
6. The method of claim 1 wherein the piecewise linear approximation of the content timeline includes an estimate of the playback speed and further comprising using the estimate of the playback speed in controlling a replacement media player, whereby the number of seek commands required to track main content in minimized.
7. The method of claim 1 wherein the content timeline tracker recognizes one or more patterns selected from the group consisting of: initial tracker state; pause; pause seek; oneXOnset; oneXplay; JumpOnset; and Jumping.
8. A device comprising: a detector engine receiving decoded video having a video watermark embedded therein, the detector engine extracting at least a time-offset, a payload, and an error detection field in a plurality of the decoded video frames, wherein the detector engine generates an output for the plurality of decoded video frames that includes a time-offset and a payload derived from the video watermark; anda content timeline tracker that monitors and analyzes the output of the detector engine and produces a piecewise linear approximation of the content timeline, wherein the playback rate changes by a user in an upstream device can be tracked, thereby enabling the playback of auxiliary content which is synchronized to a watermark timeline recovered from the received content when the recovered timeline has a non-linear mapping to real time.
9. The device of claim 8 wherein the detector engine uses matching logic to compare the watermark data extracted from the current frame with at least some watermark data extracted from the previous frame and to set a data repetition flag to TRUE if they match and otherwise set it to FALSE.
10. The device of claim 9 wherein the matching logic determines if consecutive watermark data are repeated, wherein this information can be later used to discriminate between actual payload repetition, such as time_offset repetition in high frame-rate video or fragment repetition, or frame repetition in pause-and-seek playback rate change, skip and pause.
11. The method of claim 8 wherein the content timeline tracker analyzes at least the time_offset and data repetition flag values to produce estSpeed, a piecewise linear approximation of the content timeline which can track playback rate changes initiated by a user on an upstream device.
12. The device of claim 8 wherein the content timeline tracker implements a state machine to recognize patterns in the recovered timeline and estimate the control segment boundaries.
13. The device of claim 8 wherein the piecewise linear approximation of the content timeline includes an estimate of the playback speed and wherein the content timeline tracker uses the estimate of the playback speed to control a replacement media player, whereby the number of seek commands required to track main content in minimized.
14. The device of claim 8 wherein the content timeline tracker recognizes one or more patterns selected from the group consisting of: initial tracker state; pause; pause seek; oneXOnset; oneXplay; JumpOnset; and Jumping.
15. A computer program product embodied on one or more non-transitory computer readable media, comprising: program code for receiving video content having a video watermark embedded therein;program code for decoding video frames from the received video content;program code for using a Detector Engine to receive the decoded video frames and extract at least a time-offset, a payload, and an error detection field in a plurality of the decoded video frames, wherein the Detector Engine generates an output for the plurality of decoded video frames that includes a time-offset and a payload derived from the video watermark; andprogram code for using a Content Timeline Tracker to monitor and analyze the output of the Detector Engine, to produce a piecewise linear approximation of the content timeline, wherein the playback rate changes by a user in an upstream device can be tracked, thereby enabling the playback of auxiliary content which is synchronized to a watermark timeline recovered from the received content when the recovered timeline has a non-linear mapping to real time.
16. The computer program product of claim 15 further comprising using matching logic to compare the watermark data extracted from the current frame with at least some watermark data extracted from the previous frame and to set a data repetition flag to TRUE if they match and otherwise set it to FALSE.
17. The computer program product of claim 16 wherein the matching logic determines if consecutive watermark data are repeated, wherein this information can be later used to discriminate between actual payload repetition, such as time_offset repetition in high frame-rate video or fragment repetition, or frame repetition in pause-and-seek playback rate change, skip and pause.
18. The computer program product of claim 15 wherein the Content Timeline Tracker analyzes at least the time_offset and data repetition flag values to produce estSpeed, a piecewise linear approximation of the content timeline which can track playback rate changes initiated by a user on an upstream device.
19. The computer program product of claim 15 wherein the Content Timeline Tracker implements a state machine to recognize patterns in the recovered timeline and estimate the control segment boundaries.
20. The computer program product of claim 15 wherein the piecewise linear approximation of the content timeline includes an estimate of the playback speed and further comprising using the estimate of the playback speed in controlling a replacement media player, whereby the number of seek commands required to track main content in minimized.

REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. Nonprovisional patent application Ser. No. 17/667,464, filed Feb. 8, 2022, which claims priority to U.S. Provisional Patent Application No. 63/147,122, filed Feb. 8, 2021, and U.S. Provisional Patent Application No. 63/225,381, filed Jul. 23, 2021 the entirety of which are hereby incorporated by reference.

Provisional Applications (2)

	Number	Date	Country
	63147122	Feb 2021	US
	63225381	Jul 2021	US

Continuations (1)

	Number	Date	Country
Parent	17667464	Feb 2022	US
Child	18344792		US

SYSTEM AND METHOD FOR TRACKING CONTENT TIMELINE IN THE PRESENCE OF PLAYBACK RATE CHANGES

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

REFERENCE TO RELATED APPLICATIONS

Provisional Applications (2)

Continuations (1)