A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
Closed Captioning is used in television programming to provide visible text (generally a transcription of speech and sounds) for the television viewer. Closed Captioning (CC) is especially helpful for the hearing impaired. CC data is provided in television programming using techniques defined by various technical standards including various versions and implementations defined in technical standards CEA-608 and CEA-708.
Unfortunately, with the emergence of three dimensional (3D) stereoscopic television (3DTV), conventional techniques do not provide the optimum mechanism for one to view captioning in the 3DTV environment since the captioning may generally appear in an inconvenient z-axis location (depth) relative to other video objects which may appear to be in front of or behind the captions.
Certain illustrative embodiments illustrating organization and method of operation, together with objects and advantages may be best understood by referencing the detailed description that follows taken in conjunction with the accompanying drawings in which:
While this invention is susceptible of embodiment in many different forms, there is shown in the drawings and will herein be described in detail specific embodiments, with the understanding that the present disclosure of such embodiments is to be considered as an example of the principles and not intended to limit the invention to the specific embodiments shown and described. In the description below, like reference numerals are used to describe the same, similar or corresponding parts in the several views of the drawings.
The terms “a” or “an”, as used herein, are defined as one or more than one. The term “plurality”, as used herein, is defined as two or more than two. The term “another”, as used herein, is defined as at least a second or more. The terms “including” and/or “having”, as used herein, are defined as comprising (i.e., open language). The term “coupled”, as used herein, is defined as connected, although not necessarily directly, and not necessarily mechanically. The term “program” or “computer program” or similar terms, as used herein, is defined as a sequence of instructions designed for execution on a computer system. A “program”, or “computer program”, may include a subroutine, a function, a program module, a script, a procedure, an object method, an object implementation, in an executable application, an applet, a servlet, a source code, an object code, a shared library/dynamic load library and/or other sequence of instructions designed for execution on a computer system.
The term “program”, as used herein, may also be used in a second context (the above definition being for the first context). In the second context, the term is used in the sense of a “television program”. In this context, the term is used to mean any coherent sequence of audio video content such as those which would be interpreted as and reported in an electronic program guide (EPG) as a single television program, without regard for whether the content is a movie, sporting event, segment of a multi-part series, news broadcast, etc. The term may also be interpreted to encompass commercial spots and other program-like content which may not be reported as a program in an electronic program guide.
Reference throughout this document to “one embodiment”, “certain embodiments”, “an embodiment”, “an implementation”, “an example” or similar terms means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of such phrases or in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments without limitation.
The term “or” as used herein is to be interpreted as an inclusive or meaning any one or any combination. Therefore, “A, B or C” means “any of the following: A; B; C; A and B; A and C; B and C; A, B and C”. An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.
Implementations consistent with embodiments of the present invention generally relate to the transport and format of data used for the rendering of closed captions over 3D video.
When closed captioning is rendered in the display of a 3D video program, as described above, the CC window and associated text is likely to be rendered in the plane of the screen unless steps are taken to render the window and text such that they appear at a different, more appropriate, perceived depth. An object in a scene within 3D content may be presented such that it appears to the viewer to be some distance in front of the plane of the display screen. If a captioning window positioned in depth at the plane of the display were to be placed in front of that object, a “depth violation” would occur. In such a case, the viewer is presented with conflicting depth cues, a situation that causes eye fatigue and discomfort. Since captioning may intersect in the z-axis with content in the scene if it is simply positioned at the screen plane, the presentation of captioning is preferably individually authored to the subject matter of the video presentation. In order to accomplish this, extra information can be sent along with the captions to define the perceived placement on the z-axis (a designated distance in front or behind the plane of the screen) of a window containing the caption text for effective presentation and to avoid interference with objects in the scene. A number of techniques can be devised to provide this information, but many have disadvantages.
Stereoscopic 3D television involves delivery to the display screen of separate views for the left and right eyes, coupled with a method to allow each of the viewer's eyes to see only the image intended for that eye. The illusion of depth is achieved when, for a given object, the left- and right-eye views differ in the horizontal position of that object's placement. An overview of the basic technology relating to three dimensional display systems is described in “3D Display Systems” by Dr. Nick Holliman, Department of Computer Science, University of Durham, Science Laboratories, South Road, Durham DH1 3LE, UK (see particularly pp. 10-11 and FIG. 5), which is hereby incorporated by reference in the document's entirety.
The term “disparity” as used herein can be more formally referred to as “screen disparity” (i.e., as measured on the screen) as distinguished from “retinal disparity” and is defined as the difference of the physical x coordinates of corresponding points in the right and left images in a pair of aligned stereo images displayed on a screen. The less formal term “disparity” will be used herein for simplicity. By convention, when disparity is negative (e.g. when the left-eye image is rendered on the screen to the right of the right-eye image), the object is perceived as being in front of the plane of the screen. When disparity is positive (e.g. when the left-eye image is rendered on-screen to the left of the right-eye image), the object is perceived as being behind the screen plane.
The term “disparity data” can refer to any data indicating the disparity value to be used in rendering a given graphical object, such as a caption window and its associated text. The term can also be used more generally to refer to data reflecting the z-axis positions of objects in the current scene. The scene area can be mapped out into regions, with the z-axis position of the object in each region that is closest to the viewer recorded as a coordinate in the map. Such a map may be called a “disparity map” or a “depth map.” Disparity maps can change on a frame-by-frame basis and can be represented in any number of ways. It is noted that disparity is a measure of the horizontal offset of left eye and right eye images, but the offset need not be in an integer number of pixels as fractional pixel offsets are perfectly acceptable.
Within the context of delivery of 3D video and CC content for display, disparity is generally represented as a percentage of the width of the accompanying video. As such, it is a dimensionless number. For example, the signaling scheme may specify that one unit of disparity is equal to 1/1920 of the width of the video content (which is generally rendered to match the width of the display screen). But, a disparity of 1/1920 is not the minimum increment in actual disparity even with a screen width of 1920. With this definition, a disparity of 7 refers to a distance equal to 7/1920 of the width of the video content. Again with this definition, in the special case of a screen with 1920 pixels of horizontal resolution, this works out such that one full unit of disparity is the same width as one display pixel, but this should be thought of as a special case. If a display screen has fewer than 1920 horizontal pixels, e.g., 1600 horizontal pixels, then disparity is scaled accordingly and a disparity of 7 equates to (7/1920)(1600)=5.83 pixels. Hence, disparity should most properly be viewed as the difference in the physical location on the screen along the x axis (horizontal) of corresponding points in left eye and right eye images in a pair of aligned stereoscopic images. It is also noteworthy that the CC window will generally be a two dimensional window which is positioned along the z-axis and which is perceived to be in a plane parallel to the plane of the display screen.
A proposal has been made to a standards committee for an approach to adding disparity data to set the perceived z-axis, or depth, for specific CEA-708 caption windows using CEA-708-compatible syntax. One approach provides disparity data for any given CEA-708 caption window by using a command extension method in CEA-708 called the ETX1 method. This proposal has drawbacks, but several other possibilities can be devised for transport of the disparity data. In one example, the caption disparity data can be placed into a separate picture user_data( ) packet, thus disconnecting it from the buffering and timing concerns that it would have to face if it were carried inside the caption stream. This is depicted in the diagram shown in
Referring now to
The encoded data from the physical channel is received at a television receiver device (e.g., a television or a set top box) as depicted in
The closed caption-related data is processed at a caption processor 316. The caption processor renders caption text using the disparity data that defines the x-axis separation of left eye and right eye images to produce the perceived z-axis position of the caption text and its associated window. The resulting caption images are then composited at compositor 320 with the video data so as to produce 3D video with captions placed according to the data in the caption data packets in the x, y and z plane. Such data may place the captions in a static manner (wherein the z-axis position stays constant during the display of the given caption segment) or dynamically (where the z-axis position moves in time during display) in accordance with the authoring of the caption data. The caption data can then be rendered along with the program video on a video display (not shown).
A receiver device is depicted in greater detail in
A/V content may also be received via the Internet 422 via a network interface 426 for IP television content decoding. Additionally, storage 430 can be provided for non-real time (NRT) stored content. The NRT content can be played by demultiplexing at 406 in a manner similar to that of other sources of content. The receiver generally operates under control of a processor such as CPU 438 which is interconnected to working memory 440 and program memory 442 as well as a graphics subsystem 444 via one or more buses such as 450.
The CPU 438 receives closed caption data from the demultiplexer 406 as well as the disparity data by whatever transport mechanism (e.g., via the caption data packets as described above or other mechanism as referenced below) and determines by parsing the data what horizontal separation of left eye and right eye positions are to be used to create the perception of z-axis position as well as x and y position to locate the caption data. This information is passed to the graphics subsystem 444 and the images are composited at the compositor 460 to produce an output suitable for display on a video display.
Thus, a method of communication of closed caption data to a television receiver device involves packaging closed caption text in a digital television caption channel stream of packets; transporting and delivering disparity data that defines a z-axis position for the closed captioning text for each frame of video displayed with closed captions within a stream of digital television user data packets; transmitting the digital television caption channel stream of packets to the television receiver device as a part of a transport stream that carries audio/video data associated with the closed caption text; and transmitting the stream of digital television user data packets containing the disparity data to the television receiver device outside of the transport stream that carries the audio/video data associated with the closed caption text.
And, a method of processing closed caption data in a television receiver device involves receiving a digital television caption channel stream of packets to the television receiver device as a part of a transport stream that carries audio/video data associated with the closed caption text; receiving a stream of digital television user data packets containing the disparity data to the television receiver device outside of the transport stream that carries the audio/video data associated with the closed caption text; and placing the closed caption text in a closed caption window for display on a three dimensional video display with the closed caption window positioned along a z-axis in a position defined by the disparity data.
In certain implementations consistent with embodiments of the invention, the amount of data used to define the z-position can be minimized and either transported along with the caption text and the video portion of the DTV transport stream or in separate user_data( ) packets as described above.
When the disparity data is delivered outside the DTV Transport Channel, the caption disparity data does not impact the DTVCC buffer model or data rates, and can be multiplexed into the Video separately from caption data. Note: there are other types of user data defined in the standards, such as active format description (AFD) and Bar Data. In accord with certain embodiments consistent with the present invention, the caption disparity data can be transported as a type of picture user data. However, this should not be considered limiting since the transport mechanism described above is only one of several mechanisms for transport of the disparity data encoded in the manner described later. For example, the disparity data can be transported using the techniques described in U.S. patent application Ser. No. 13/022,828, filed concurrently herewith entitled “Service Linkage to Caption Disparity Transport” to Eyer, or U.S. patent application Ser. No. 13/022,810, filed concurrently herewith entitled “Extended Command Stream for Closed Caption Disparity” to Eyer, both of which are hereby incorporated by reference, could also be used without limitation.
In accord with certain preferred embodiments, the amount of data used to transport the disparity data can be dramatically reduced to minimize impact on the available bandwidth for caption service regardless of the transport mechanism as will be described later.
In order to define the data structure as one would portray the definition in a technical standard specification, as depicted below, an example data structure which fits into the framework of ATSC A/72 Part 1, AVC Video (for ATSC) and A/53 is provided. Text is copied here from A/72 Part 1 and A/53 Part 4. Portions of the specification that are modified to accommodate the presentation of disparity data for this example implementation of the present invention are depicted in the following portions:
With these changes in mind, a specification for delivery of caption data can be developed that is similar to that presented below as a Sample Standard and used as the basis for standardizing delivery of captions in accord with the discussion above:
Example Caption Disparity Data syntax is shown in Table 5.
In the above EXAMPLE 1, the disparity data are defined and encoded as an offset with one- or two-unit precision. However, this level of precision in the disparity is not as precise as one might desire on a frame-by-frame basis. While an initial placement at a disparity value with a precision of 1/1920 of the screen width is probably sufficient, smooth motion in the depth plane requires a higher resolution.
In the method of this first example, the caption disparity data is transported within a user_data( ) packet using a unique value of user_data_identifier_code (Table 4). The present example it is useful as an illustration of one type of disparity data that can be transported outside the DTVCC transport channel as user_data( ) to transport the caption disparity data 110 as depicted in
The above example provides for encoding and transport of caption disparity data, but unfortunately, for caption windows that move in the z-axis direction over time, the caption disparity data is not encoded in the most efficient manner and may require substantial amounts of bandwidth to transmit. Additionally, a disparity resolution of 1/1920 of the screen width has been found to be too coarse for frame-by-frame samples of disparity. Finer granularity is needed when specifying the disparity for objects that move smoothly in depth over time.
In general, one may wish to author the disparity and caption data somewhat simultaneously so as to permit placement of the captioning in an appropriate position in the z axis in the video frame so that other objects do not intersect the caption display. In other words, for proper rendering of the scene, a caption window should appear closer to the viewer than any object it overlays. Intersection in the z axis of the caption window with the object it overlays causes viewer fatigue, as the depth cues given by the object and the caption window are in conflict with each other.
The captions may be generally rendered based on the dialog or other sounds that are represented in the video. So, for example, each segment of dialog (e.g., from one actor) may appear in a separate caption text window. That window may be rendered in a static location if the receiver is incapable of adjusting the z-axis position dynamically (due to, for example, insufficient CPU or graphics processing power).
Preferably, the caption data is authored so as to be represented with a z-axis position for each individual frame associated with each segment of dialog. In this manner, during the time of its display, a given closed caption window and its associated text may move closer to or farther away from the viewer in a manner desired by the caption author. The x and y position of a given closed caption window can be defined according to the CEA-708-D standard, but the z-position may be changed dynamically by adoption of a mechanism that defines the perceived movement along the z-axis as will be described.
If the perceived z-axis position over time (over a number of sequential frames of video) is graphed as depicted in
As will be apparent from later disclosure, certain lower-capability receivers may not be outfitted with the ability to dynamically adjust the disparity in a manner that tracks the authored disparity curve of 600. In those instances, a fixed disparity value such as 604 may be provided as an “alternate disparity” which is statically located in the z-axis for a particular caption window in a manner as described later. The alternate disparity can be set for a perceived distance of the caption window represented by 604 to be approximately at the closest perceived level of disparity curve 600 to assure no interference between the caption window and other screen objects in perceived depth. This fixed alternate disparity value can be provided as will be described later. In accord with certain implementations consistent with the present invention, both a dynamically changing set of data representing authored disparity and an alternate static disparity value can both be transmitted to and received by the TV receiver. In this manner, the TV receiver device can utilize the static alternate disparity if, for example, it does not have enough processing power to deal with a frame by frame disparity change. It is also noted that captions, whether the display position is dynamically changing or static, are displayed in a manner that matches dialog and other sounds or commentary in the accompanying video. So, the x-y and z positions for the CC window may change on a phrase by phrase or other frequent basis as it accompanies A/V content and each CC window will only generally be displayed for a few seconds or so. Thus, each window of CC data will have its own sequence of data describing dynamically authored disparity and an alternate static disparity.
The data of curve 600 may be approximated as shown in
It is convenient to represent the slope change with a small number to minimize the amount of transmitted data. For example, a 5-bit 2's complement signed integer in the range −16 to +15 that indicates the amount of change in slope to apply over the designated number of frames can be used.
Again for ease of implementation, it may be convenient to represent one unit of disparity as 1/1920 of the screen width. With this choice, when the display resolution is 1920 pixels horizontally, one unit of disparity is equal to the width of one pixel, but this is not to say that rendered disparity values always have one pixel of resolution.
As noted above, the representation of
The example below provides for the disparity data to be sent using an extended service that is mapped from the main service number for the caption text. This mapping is discussed in greater detail in U.S. patent application Ser. No. 13/022,828 filed of even date herewith to Eyer, et al. entitled “Service Linkage to Caption Disparity Transport” which is hereby incorporated by reference.
Augmentation Services
Caption disparity data for a given caption service shall be delivered in an “Augmentation Service.” An Augmentation Service shall be considered to be a caption service that provides additional commands and data for the caption service it is associated with. The Augmentation Service associations shall be as shown in Table I. As shown, commands and data transported in Extended Service #49 are considered to be effectively part of Service #1, commands and data transported in Extended Service #50 are considered to be part of standard Service #2, etc.
Augmentation services are only defined for Standard caption Services #1 through #6.
Name: SetDisparity—Specify disparity for 3D captioning
Format: Variable-length
Command Coding: EXT1, 90h, caption_disparity_data( )
Caption Disparity Data
The syntax of Caption_disparity_data( ) shall be as shown in Table II.
In this example the slope of the disparity curve is defined as starting at a value of zero. In the example, after two frames, the slope changes to −7 disparity units per 32 frames. After 10 frames, the disparity changes again to −16 disparity units per 32 frames. Eight frames later it changes back to −7, and 10 frames after that it changes back to zero.
The disparity at each frame can be computed precisely using binary arithmetic carrying five binary digits to the right of the decimal point (precision of 1/32 disparity unit). Receivers that do not support 1/32 disparity unit precision in rendering are expected to round the results at each frame to whatever rendering precision they do support, such as 1/16, ⅛, ¼ pixel, etc. To avoid accumulated error, the computation of the disparity at each frame is expected to be preferably performed at full precision ( 1/32 unit), with rounding (if any) performed afterwards.
It is also noted that some receivers may be provided which do not support the dynamic time-varying change in disparity. For this reason, it is desirable to also specify a fixed disparity as shown in
In order to accomplish an implementation of this concept, a data structure is adopted represent disparity data. Table II below shows an implementation of this data structure. Within 30 bytes, this data structure accommodates two windows with up to ten segments: 2 bytes (win_info)+2 bytes (seg_info)+segment_count*1.25 bytes (frm_info)=16.5 bytes per window.
The disparity data as described above can be delivered in any suitable manner without departing from teachings consistent with implementations of certain embodiments consistent with the present invention. In one such implementation, it is desirable to map service numbers to extended services numbers or augmentation service numbers, while in other implementations, the disparity data can be conveyed in any manner including contemporaneously with the closed caption text and associated video or in advance as one or more blocks of data either in-band or out of band.
An overall process consistent with certain example implementations of the disparity encoding is depicted in the flow chart 802 of
The process carried out at the receiver can be represented in greater detail using an example process such as that shown as process 900 of
Once the text is available and the window is visible, the caption window and text can be rendered using the value of disparity calculated at each frame, 950. At each successive frame, 960, the next value of calculated disparity is rendered until the last frame is reached (test 970). When the last frame is reached, or the caption window is deleted or set to invisible (not shown), the process ends at 980.
It will be understood that these events may represent a continuous flow with text and caption data being received on an ongoing basis, so that the present representation is not to be construed as accurately depicting time. The order of the various operations in
Many variations in this process will be apparent to those skilled in the art upon consideration of the present teachings.
Thus, a method of transforming segments of authored closed captioning disparity data that defines a z-axis position for a window containing closed captioning text (a two dimensional window parallel to the screen) for display on a display into an encoded approximation of the disparity data involves approximating the segments of authored three-dimensional closed captioning disparity data as a piecewise linear approximation comprised of an ordered sequence of linear segments applicable over a sequence of frames with each segment connected to a subsequent segment at breakpoints; at an encoder, encoding the piecewise-linear approximation as a designated number of frames over which each linear segment is to be applied and a change in slope from the slope of a preceding segment in the sequence to produce the encoded approximation of the disparity data; and where a first segment in the sequence presumes a predetermined slope as a reference for change in slope.
The predetermined initial slope is preferably zero, but could be any other starting point either by convention or by conveyed information. The slope can be given as a representation of a fraction's numerator where the denominator of the fraction is a predetermined value of 2N, where N is an integer, so that only the numerator is required to convey the value of the fraction. In one preferred implementation, N=5, and the numerator can be represented by a value from −16 to +15, and one unit of slope is defined as a change of one unit of disparity over a time interval of 32 frames. In that case 2N is 32 and a 5-bit 2's complement signed integer in the range −16 to +15 can be used to indicate the amount of change in slope to apply over the designated number of frames.
The method may further involve packaging the closed caption text in a digital television caption channel stream of packets; packaging the encoded closed caption disparity data within a stream of digital television user data packets that carries video data associated with the closed caption text. Those skilled in the art, upon consideration of the present teachings, will appreciate that the encoding using number of frames over which a slope is to be applied can be transported to the receiver in any suitable manner without departing from implementations consistent with the present invention.
The method may further involve establishing an alternate fixed disparity value that establishes a fixed z-axis position for the closed caption text and packaging the alternate fixed disparity value along with the encoded closed caption disparity data for use as an alternative z-axis position.
The method may also involve receiving the encoded disparity data and low pass filtering the encoded three dimensional disparity data to smooth transitions at the breakpoints.
The method may further involve packaging the closed caption text in a digital television caption channel stream of packets; and packaging the encoded closed caption disparity data as a command extension within a stream of digital television transport stream packets that carries audio/video data associated with the closed caption text or as user data. The method may also involve transmitting the digital television caption channel stream of packets and the encoded closed caption disparity data and the closed caption text to a television receiver device as a part of a transport stream that carries audio/video data associated with the closed caption text in any suitable manner.
Thus, in certain implementations, a method of decoding closed caption disparity data involves at a television receiver device, receiving closed caption disparity data encoded as a sequence of one or more frame interval/slope change pairs, with the frame interval representing a designated number of frames over which each linear function representing disparity is to be applied paired with a value indicating a change in slope from the slope of a preceding segment. A a first segment in the sequence of disparity values presumes a slope of zero as a reference for subsequent changes in slope, and receiving a sequence of caption data packets where the sequence of caption data packets specify a closed caption display window containing closed caption text, where closed caption data is packaged in a digital television caption channel stream of packets. The disparity value is used in each frame in a 3D rendering of left and right eye views of the closed caption display to establish a horizontal offset that establishes the window's perceived z-axis position. The method further receives an alternate fixed disparity value that establishes a fixed disparity for the closed caption display packaged along with the encoded three-dimensional closed caption disparity data for use as an alternative perceived z-axis position. Low-pass filtering is used to filter the encoded three dimensional disparity data to smooth transitions in disparity values. The caption text and disparity data are processed to produce an output suitable for defining the rendering of a three dimensional image on a display of the caption text at a perceived z-axis position defined by the disparity data.
Another method of decoding closed caption disparity data involves at a television receiver device, receiving closed caption disparity data encoded as a sequence of one or more frame interval/slope change pairs, with the frame interval representing a designated number of frames over which each linear function representing disparity is to be applied paired with a value indicating a change in slope from the slope of a preceding segment. A first segment in the sequence of disparity values presumes a predetermined slope as a reference for subsequent changes in slope. A sequence of caption data packets is received, where the sequence of caption data packets specify a closed caption display window containing closed caption text. The disparity value is used in each frame to determine a horizontal offset of left and right eye views of the closed caption display that establishes the closed caption display window's perceived z-axis position.
In certain implementations, the predetermined slope is zero. In certain implementations, the slope change is given as a value representing a change in disparity over 2N frames where N is an integer. In certain implementations, N=5. In certain implementations, the slope change is signaled as a value in the range −16 to +15 that indicates the amount of change in slope to apply over the designated frame interval. In certain implementations, the method involves receiving an alternate fixed disparity value that establishes a fixed perceived z-axis position for the closed caption display packaged along with the encoded three-dimensional closed caption disparity data for use as an alternative z-axis position. In certain implementations, the method involves low pass filtering the encoded three dimensional disparity data to smooth transitions in disparity values. In certain implementations, the method involves processing the caption text and disparity data to produce an output suitable for defining a rendering of a three-dimensional image on a display of the caption text at a perceived z-axis position defined by the disparity data.
A computer readable electronic storage device such as a non-transitory storage medium can be used for storing instructions that when executed on one or more programmed processors carry out any of the processes described herein.
A television receiver device that processes disparity data for closed captions in a manner consistent with certain implementations has a decoder that receives closed caption disparity data encoded as a sequence of one or more frame interval/slope change pairs, with the frame interval representing a designated number of frames over which each linear function representing disparity is to be applied paired with a value indicating a change in slope from the slope of a preceding segment. A a first segment in the sequence of disparity values presumes a predetermined slope as a reference for subsequent changes in slope. A parser parses the disparity data and closed caption text from the one or more signals. A compositor receives the disparity data and the closed caption text and generates a sequence of video frame signals including a closed caption display window containing closed caption text with the closed caption display window has a perceived z-axis position in each frame defined by the disparity values.
In certain implementations, the predetermined slope is zero. In certain implementations, the frame interval is given as a predetermined value of 2N frames, where N is an integer. In certain implementations, the slope change is given as a value representing a change in disparity over 32 frames. In certain implementations, the slope change is signaled as a value in the range −16 to +15 that indicates the amount of change in slope to apply over the designated frame interval. In certain implementations, the slope change is signaled as a value in the range −16 to +15 that indicates the amount of change in slope to apply over the designated frame interval. In certain implementations, the decoder further receives an alternate fixed disparity value that establishes a fixed z-axis position for a portion of the closed caption display packaged along with the encoded closed caption disparity data for use as an alternative perceived z-axis position. In certain implementations, a low-pass filter processes the encoded three dimensional disparity data to smooth transitions in disparity values. In certain implementations, the compositor further processes the caption text and disparity data to produce an output suitable for defining a rendering of a three-dimensional image on a display of the caption text at a perceived z-axis position defined by the disparity data.
Those skilled in the art will recognize, upon consideration of the above teachings, that certain of the above exemplary embodiments are based upon use of a programmed processor. However, the invention is not limited to such exemplary embodiments, since other embodiments could be implemented using hardware component equivalents such as special purpose hardware and/or dedicated processors. Similarly, general purpose computers, microprocessor based computers, micro-controllers, optical computers, analog computers, dedicated processors, application specific circuits and/or dedicated hard wired logic may be used to construct alternative equivalent embodiments.
Those skilled in the art will appreciate, upon consideration of the above teachings, that the program operations and processes and associated data used to implement certain of the embodiments described above can be implemented using disc storage as well as other forms of storage such as non-transitory storage devices including as for example Read Only Memory (ROM) devices, Random Access Memory (RAM) devices, network memory devices, optical storage elements, magnetic storage elements, magneto-optical storage elements, flash memory, core memory and/or other equivalent volatile and non-volatile storage technologies without departing from certain embodiments of the present invention. The term non-transitory does not suggest that information cannot be lost by virtue of removal of power or other actions. Such alternative storage devices should be considered equivalents.
Certain embodiments described herein, are or may be implemented using a programmed processor executing programming instructions that are broadly described above in flow chart form that can be stored on any suitable electronic or computer readable storage medium. However, those skilled in the art will appreciate, upon consideration of the present teaching, that the processes described above can be implemented in any number of variations and in many suitable programming languages without departing from embodiments of the present invention. For example, the order of certain operations carried out can often be varied, additional operations can be added or operations can be deleted without departing from certain embodiments of the invention. Error trapping can be added and/or enhanced and variations can be made in operational flow, user interface and information presentation without departing from certain embodiments of the present invention. Such variations are contemplated and considered equivalent.
While certain illustrative embodiments have been described, it is evident that many alternatives, modifications, permutations and variations will become apparent to those skilled in the art in light of the foregoing description.
This application is a continuation application of U.S. application Ser. No. 13/022,817 filed Feb. 8, 2011, which claims priority benefit of U.S. Provisional Application No. 61/415,924 filed Nov. 22, 2010 entitled “Service Linkage to Caption Disparity Data Transport” to Mark Eyer, et al.; U.S. Provisional Application No. 61/415,457 filed Nov. 19, 2010 entitled “Disparity Data Signaling and Transport for 3D Captioning” to Mark Eyer, et al.; U.S. Provisional Application No. 61/378,792 filed Aug. 31, 2010 entitled “Efficient Transport of Frame-by-Frame Change in Captioning Disparity Data” to Mark Eyer; U.S. Provisional Application No. 61/346,652 filed May 20, 2010 entitled “Disparity Data Transport” to Mark Eyer, et al.; U.S. Provisional Application No. 61/316,733 filed Mar. 23, 2010 entitled “Extended Command Stream for CEA-708 Captions” to Mark Eyer, et al.; and U.S. Provisional Application No. 61/313,612 filed Mar. 12, 2010 entitled “Disparity Data Transport” to Mark Eyer, et al., each of which are hereby incorporated by reference in their entirety. This application is also related to International Application No. PCT/US11/26291 filed Feb. 25, 2011.
Number | Name | Date | Kind |
---|---|---|---|
5200823 | Yoneda et al. | Apr 1993 | A |
5418782 | Wasilewski | May 1995 | A |
5519443 | Salomon | May 1996 | A |
5534941 | Sie | Jul 1996 | A |
5818439 | Nagasaka et al. | Oct 1998 | A |
5923837 | Matias | Jul 1999 | A |
6043838 | Chen | Mar 2000 | A |
6252596 | Garland | Jun 2001 | B1 |
6266094 | Taylor, Jr. | Jul 2001 | B1 |
6295093 | Park | Sep 2001 | B1 |
6480819 | Boman | Nov 2002 | B1 |
7035292 | Giorgetta et al. | Apr 2006 | B1 |
7312891 | Tsuchiya et al. | Dec 2007 | B2 |
7463257 | Martin | Dec 2008 | B2 |
7664108 | Bahattab | Feb 2010 | B2 |
7848425 | Cho et al. | Dec 2010 | B2 |
7999820 | Weitkemper et al. | Aug 2011 | B1 |
8421914 | Hirasawa et al. | Apr 2013 | B2 |
8595783 | Dewa | Nov 2013 | B2 |
8704874 | Suh | Apr 2014 | B2 |
8705933 | Eyer | Apr 2014 | B2 |
8730301 | Eyer | May 2014 | B2 |
9258544 | Suh | Feb 2016 | B2 |
20010019357 | Ito et al. | Sep 2001 | A1 |
20020044536 | Izumi | Apr 2002 | A1 |
20020184446 | Kagan | Dec 2002 | A1 |
20030035063 | Orr | Feb 2003 | A1 |
20030053661 | Magarey | Mar 2003 | A1 |
20030066081 | Barone | Apr 2003 | A1 |
20030093485 | Dougall | May 2003 | A1 |
20030156188 | Abrams | Aug 2003 | A1 |
20040010807 | Urdang et al. | Jan 2004 | A1 |
20040016003 | Lee | Jan 2004 | A1 |
20040017831 | Shen | Jan 2004 | A1 |
20040100588 | Hartson | May 2004 | A1 |
20040125877 | Chang et al. | Jul 2004 | A1 |
20050025361 | Gadre et al. | Feb 2005 | A1 |
20050060758 | Park | Mar 2005 | A1 |
20050190180 | Jin | Sep 2005 | A1 |
20060020959 | Masuda | Jan 2006 | A1 |
20060182051 | Lee | Aug 2006 | A1 |
20060184994 | Eyer et al. | Aug 2006 | A1 |
20060268788 | Harris | Nov 2006 | A1 |
20070022461 | Lee | Jan 2007 | A1 |
20070076122 | Modi et al. | Apr 2007 | A1 |
20070076960 | Takamori et al. | Apr 2007 | A1 |
20070124768 | Morris | May 2007 | A1 |
20070176914 | Bae | Aug 2007 | A1 |
20070180460 | Park | Aug 2007 | A1 |
20070263731 | Yamada | Nov 2007 | A1 |
20070294494 | Conti | Dec 2007 | A1 |
20070296863 | Hwang et al. | Dec 2007 | A1 |
20080098433 | Hardacker | Apr 2008 | A1 |
20080180572 | Pickett et al. | Jul 2008 | A1 |
20080180575 | Park | Jul 2008 | A1 |
20080205851 | Sakai | Aug 2008 | A1 |
20080212838 | Frigerio | Sep 2008 | A1 |
20080225164 | Park | Sep 2008 | A1 |
20080262996 | Yogeshwar et al. | Oct 2008 | A1 |
20080316303 | Chiu et al. | Dec 2008 | A1 |
20090003377 | Andersson | Jan 2009 | A1 |
20090015710 | Hirasawa et al. | Jan 2009 | A1 |
20090034633 | Rodirguez et al. | Feb 2009 | A1 |
20090059031 | Miyakoshi | Mar 2009 | A1 |
20090060044 | Suh et al. | Mar 2009 | A1 |
20090060055 | Blanchard et al. | Mar 2009 | A1 |
20090128572 | MacInnis et al. | May 2009 | A1 |
20090207305 | Fujita et al. | Aug 2009 | A1 |
20090228917 | Yun | Sep 2009 | A1 |
20090231490 | Chen et al. | Sep 2009 | A1 |
20090244371 | Park | Oct 2009 | A1 |
20090244372 | Petronelli et al. | Oct 2009 | A1 |
20090244373 | Park | Oct 2009 | A1 |
20090315979 | Jung et al. | Dec 2009 | A1 |
20090315981 | Jung | Dec 2009 | A1 |
20100002134 | Eyer et al. | Jan 2010 | A1 |
20100021141 | Yamashita et al. | Jan 2010 | A1 |
20100039558 | Detore | Feb 2010 | A1 |
20100114576 | Sundararajan | May 2010 | A1 |
20100157025 | Suh | Jun 2010 | A1 |
20100165079 | Yamada | Jul 2010 | A1 |
20100171812 | Kim et al. | Jul 2010 | A1 |
20100177243 | Park | Jul 2010 | A1 |
20100177244 | Park | Jul 2010 | A1 |
20100183278 | Black | Jul 2010 | A1 |
20100188572 | Card | Jul 2010 | A1 |
20100225808 | Mears | Sep 2010 | A1 |
20100238267 | Izzat et al. | Sep 2010 | A1 |
20100238268 | Tulkki | Sep 2010 | A1 |
20100269065 | Uchimura | Oct 2010 | A1 |
20100295970 | Suzuki | Nov 2010 | A1 |
20100303158 | Lin | Dec 2010 | A1 |
20100328427 | Sakano | Dec 2010 | A1 |
20110002533 | Inoue | Jan 2011 | A1 |
20110033170 | Ikeda | Feb 2011 | A1 |
20110063414 | Chen et al. | Mar 2011 | A1 |
20110081133 | Chen et al. | Apr 2011 | A1 |
20110088075 | Eyer | Apr 2011 | A1 |
20110128351 | Newton et al. | Jun 2011 | A1 |
20110142309 | Zhang et al. | Jun 2011 | A1 |
20110149035 | Tsukagoshi | Jun 2011 | A1 |
20110149154 | Park | Jun 2011 | A1 |
20110221863 | Eyer | Sep 2011 | A1 |
20110221873 | Eyer | Sep 2011 | A1 |
20110243536 | Eyer | Oct 2011 | A1 |
20110246488 | Eyer | Oct 2011 | A1 |
20110247028 | Eyer | Oct 2011 | A1 |
20110273532 | Kitazato | Nov 2011 | A1 |
20110279644 | Suh | Nov 2011 | A1 |
20110285817 | Tsukagoshi | Nov 2011 | A1 |
20110292189 | Newton et al. | Dec 2011 | A1 |
20110298981 | Eyer | Dec 2011 | A1 |
20110299827 | Eyer | Dec 2011 | A1 |
20110302599 | Eyer | Dec 2011 | A1 |
20110302611 | Eyer | Dec 2011 | A1 |
20110307920 | Blanchard et al. | Dec 2011 | A1 |
20110317068 | Park | Dec 2011 | A1 |
20120044418 | Eyer | Feb 2012 | A1 |
20120047531 | Eyer | Feb 2012 | A1 |
20120050473 | Suh | Mar 2012 | A1 |
20120050619 | Kitazato et al. | Mar 2012 | A1 |
20120050620 | Kitazato | Mar 2012 | A1 |
20120054214 | Yamagishi et al. | Mar 2012 | A1 |
20120054235 | Kitazato et al. | Mar 2012 | A1 |
20120054267 | Yamagishi et al. | Mar 2012 | A1 |
20120054268 | Yamagishi | Mar 2012 | A1 |
20120054783 | Yamagishi | Mar 2012 | A1 |
20120054784 | Kitazato et al. | Mar 2012 | A1 |
20120060197 | Kitahara et al. | Mar 2012 | A1 |
20120063508 | Hattori et al. | Mar 2012 | A1 |
20120072965 | Dewa | Mar 2012 | A1 |
20120081508 | Kitazato | Apr 2012 | A1 |
20120081607 | Kitazato | Apr 2012 | A1 |
20120082266 | Kitazato et al. | Apr 2012 | A1 |
20120082440 | Kitazato | Apr 2012 | A1 |
20120084802 | Kitazato | Apr 2012 | A1 |
20120084824 | Kitazato | Apr 2012 | A1 |
20120084829 | Kitazato | Apr 2012 | A1 |
20120147141 | Sasaki | Jun 2012 | A1 |
20120154676 | Park | Jun 2012 | A1 |
20120154677 | Park | Jun 2012 | A1 |
20120162510 | Park | Jun 2012 | A1 |
20120185888 | Eyer et al. | Jul 2012 | A1 |
20120253826 | Kitazato et al. | Oct 2012 | A1 |
20120274848 | Kitahara et al. | Nov 2012 | A1 |
20120275764 | Eyer | Nov 2012 | A1 |
20130002833 | Card, II | Jan 2013 | A1 |
20130010062 | Redmann | Jan 2013 | A1 |
20130010189 | Park | Jan 2013 | A1 |
20130010190 | Park | Jan 2013 | A1 |
20130010191 | Park | Jan 2013 | A1 |
20130010192 | Park | Jan 2013 | A1 |
20130014180 | Park | Jan 2013 | A1 |
20130016651 | Zhang et al. | Jan 2013 | A1 |
20130024894 | Eyer | Jan 2013 | A1 |
20130024897 | Eyer | Jan 2013 | A1 |
20130031569 | Kitazato et al. | Jan 2013 | A1 |
20130036440 | Eyer | Feb 2013 | A1 |
20130055313 | Eyer | Feb 2013 | A1 |
20130103716 | Yamagishi | Apr 2013 | A1 |
20130145414 | Yamagishi | Jun 2013 | A1 |
20130167171 | Kitazato et al. | Jun 2013 | A1 |
20130191860 | Kitazato et al. | Jul 2013 | A1 |
20130198768 | Kitazato | Aug 2013 | A1 |
20130201399 | Kitazato et al. | Aug 2013 | A1 |
20130205327 | Eyer | Aug 2013 | A1 |
20130212634 | Kitazato | Aug 2013 | A1 |
20130215327 | Kitazato et al. | Aug 2013 | A1 |
20130222691 | Park | Aug 2013 | A1 |
20130222692 | Park | Aug 2013 | A1 |
20130222693 | Park | Aug 2013 | A1 |
20130227614 | Park | Aug 2013 | A1 |
20130232529 | Park | Sep 2013 | A1 |
20130250173 | Eyer | Sep 2013 | A1 |
20130254824 | Eyer | Sep 2013 | A1 |
20130258191 | Eyer | Oct 2013 | A1 |
20130282870 | Dewa et al. | Oct 2013 | A1 |
20130283311 | Eyer | Oct 2013 | A1 |
20130283328 | Kitazato | Oct 2013 | A1 |
20130291022 | Eyer | Oct 2013 | A1 |
20130291049 | Kitazato | Oct 2013 | A1 |
20130300927 | Park | Nov 2013 | A1 |
20130300928 | Park | Nov 2013 | A1 |
20130300930 | Park | Nov 2013 | A1 |
20130300931 | Park | Nov 2013 | A1 |
20130300932 | Park | Nov 2013 | A1 |
20130340007 | Eyer | Dec 2013 | A1 |
20140013347 | Yamagishi | Jan 2014 | A1 |
20140013379 | Kitazato et al. | Jan 2014 | A1 |
20140020038 | Dewa | Jan 2014 | A1 |
20140020042 | Eyer | Jan 2014 | A1 |
20140040965 | Kitazato et al. | Feb 2014 | A1 |
20140040968 | Kitazato et al. | Feb 2014 | A1 |
20140043540 | Kitazato et al. | Feb 2014 | A1 |
20140053174 | Eyer et al. | Feb 2014 | A1 |
20140067922 | Yamagishi et al. | Mar 2014 | A1 |
20140099078 | Kitahara et al. | Apr 2014 | A1 |
20140122528 | Yamagishi | May 2014 | A1 |
Number | Date | Country |
---|---|---|
101180658 | May 2008 | CN |
101653011 | Feb 2010 | CN |
0 230 704 | Dec 1990 | EP |
2004-274125 | Sep 2004 | JP |
WO 2008115222 | Sep 2008 | WO |
WO 2009029278 | Mar 2009 | WO |
WO 2009139740 | Nov 2009 | WO |
WO 2010010499 | Jan 2010 | WO |
WO 2010064853 | Jun 2010 | WO |
WO 2010085074 | Jul 2010 | WO |
Entry |
---|
Extended European Search Report issued May 27, 2014 in Patent Application No. 11753792.8. |
Combine Office Action and Search Report issued Sep. 24, 2014 in Chinese Patent Application No. 201180012449.4 (with English translation). |
U.S. Appl. No. 13/905,721, filed May 30, 2013, Kitahara et al. |
U.S. Appl. No. 13/930,814, filed Jun. 28, 2013, Fay. |
U.S. Appl. No. 13/930,880, filed Jun. 28, 2013, Fay et al. |
U.S. Appl. No. 13/927,667, filed Jun. 26, 2013, Yamagishi. |
U.S. Appl. No. 13/934,549, filed Jul. 3, 2013, Fay et al. |
U.S. Appl. No. 13/934,615, filed Jul. 3, 2013, Eyer. |
U.S. Appl. No. 13/934,924, filed Jul. 3, 2013, Yamagishi. |
U.S. Appl. No. 13/934,473, filed Jul. 3, 2013, Yamagishi. |
U.S. Appl. No. 13/955,130, filed Jul. 31, 2013, Fay. |
U.S. Appl. No. 13/963,138, filed Aug. 9, 2013, Fay et al. |
U.S. Appl. No. 13/963,111, filed Aug. 9, 2013, Fay et al. |
U.S. Appl. No. 14/046,543, filed Oct. 4, 2013, Fay et al. |
U.S. Appl. No. 14/046,566, filed Oct. 4, 2013, Fay et al. |
U.S. Appl. No. 14/196,432, filed Mar. 4, 2014, Eyer. |
U.S. Appl. No. 14/253,474, filed Apr. 15, 2014, Kitazato et al. |
U.S. Appl. No. 14/275,231, filed May 12, 2014, Eyer. |
Office Action issued Apr. 2, 2014, in Chinese Patent Application No. 201180012449.4 (with English-language Translation). |
Holliman, “3D Display Systems,” Draft v3, Department of Computer Science, University of Durham, Science Laboratories, South Road, Durham, DH1 3LE, Mar. 28, 2003, pp. 1-45. |
Office Action issued Jan. 2, 2013 in Korean Patent Application No. 10-2011-7010282 (with English translation). |
Written Opinion of the International Searching Authority issued May 10, 2011 in PCT/US2011/026291. |
Written Opinion of the International Searching Authority issued Apr. 29, 2011 in PCT/US2011/026475. |
Written Opinion of the International Searching Authority issued May 5, 2011 in PCT/US2011/026698. |
Search Report issued Nov. 25, 2013 in Japanese Patent Application No. 2012-556174 (with English translation). |
Supplementary European Search Report and Opinion issued May 23, 2014, in European Patent Application No. 11753806.6. |
Supplementary European Search Report and Opinion issued May 23, 2014, in European Patent Application No. 11753798.5. |
CEA Standard: Digital Television (DTV) Closed Captioning, CEA-708 D, pp. 18-22, Aug. 2008. |
Proposed SMPTE Standard for Television—DTV Closed-Caption Server to Encoder Interface; SMPTE 333M; XP-000877922; SMPTE Journal, Nov. 1999; 8 pgs. |
Number | Date | Country | |
---|---|---|---|
20140253683 A1 | Sep 2014 | US |
Number | Date | Country | |
---|---|---|---|
61415924 | Nov 2010 | US | |
61415457 | Nov 2010 | US | |
61378792 | Aug 2010 | US | |
61346652 | May 2010 | US | |
61316733 | Mar 2010 | US | |
61313612 | Mar 2010 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13022817 | Feb 2011 | US |
Child | 14282572 | US |