The present invention relates to the delivery of multimedia content and more specifically to streaming video content encoded at multiple resolutions optimized for a variety of scaled display resolutions.
The term streaming media describes the playback of media on a playback device, where the media is stored on a server and continuously sent to the playback device over a network during playback. Typically, the playback device stores a sufficient quantity of media in a buffer at any given time during playback to prevent disruption of playback due to the playback device completing playback of all the buffered media prior to receipt of the next portion of media. Adaptive bit rate streaming or adaptive streaming involves detecting the present streaming conditions (e.g. the user's network bandwidth and CPU capacity) in real time and adjusting the quality of the streamed media accordingly.
In adaptive streaming systems, the source media is typically stored on a media server as a top level index file pointing to a number of alternate streams that contain the actual video and audio data. Each stream is typically stored in one or more container files. Different adaptive streaming solutions typically utilize different index and media containers. The Matroska container is a media container developed as an open standard project by the Matroska non-profit organization of Aussonne, France. The Matroska container is based upon Extensible Binary Meta Language (EBML), which is a binary derivative of the Extensible Markup Language (XML). Decoding of the Matroska container is supported by many consumer electronics (CE) devices. The DivX Plus file format developed by DivX, LLC of San Diego, Calif. utilizes an extension of the Matroska container format, including elements that are not specified within the Matroska format.
Systems and methods for the selection of resolutions for seamless resolution switching of multimedia content in accordance with embodiments of the invention are disclosed. In one embodiment of the invention, a source encoder includes a processor configured by a source encoder application to receive multimedia content, where the multimedia content comprises video data having a primary resolution and a primary sample aspect ratio and encode the video data as a set of alternative streams, where a plurality of the streams in the set of alternative streams have different maximum bitrates and resolutions, the resolution of each of the plurality of streams comprises a width and height that are both an integer number of pixels, and both the width and height of each of the plurality of streams is a common fraction of the width and height of the corresponding primary resolution.
In an additional embodiment of the invention, a plurality of the streams in the set of alternative streams has the same aspect ratio and different maximum bitrates and resolutions.
In still another embodiment of the invention, a plurality of the streams in the set of alternative streams has the same aspect ratio and different maximum bitrates, sample aspect ratios and resolutions.
In still another embodiment of the invention, the primary resolution is 1920×1080, and the primary aspect ratio is 16:9 and a plurality of the streams in the set of alternative streams are encoded with the following resolution, and sample aspect ratios:
In yet still another embodiment of the invention, the primary resolution is 1920×1040, and the primary aspect ratio is 1.85:1 and a plurality of the streams in the set of alternative streams are encoded with the following resolution, and sample aspect ratios:
In yet another additional embodiment of the invention, wherein the primary resolution is 1920×816, and the primary aspect ratio is 2.35:1 and a plurality of the streams in the set of alternative streams are encoded with the following resolution, and sample aspect ratios:
In still another additional embodiment of the invention, the primary resolution is 1920×800, and the primary aspect ratio is 2.40:1 and a plurality of the streams in the set of alternative streams are encoded with the following resolution, and sample aspect ratios:
Still another embodiment of the invention includes encoding multimedia content comprising video data having a primary resolution and a primary sample aspect ratio as a plurality of alternative streams including receiving multimedia content comprising video data having a primary resolution and a primary sample aspect ratio using a source encoder and encoding the video data as a set of alternative streams using a source encoder, where a plurality of the streams in the set of alternative streams have different maximum bitrates and resolutions, wherein the resolution of each of the plurality of streams comprises a width and height that are both an integer number of pixels and both the width and height of each of the plurality of streams is a common fraction of the width and height of the corresponding primary resolution.
In yet another embodiment of the invention, the numerator and denominator of the fraction are each less than 100.
Still another embodiment of the invention includes a playback device which includes a processor configured to communicate with a memory, where the memory contains a client application, wherein the client application configures the processor to obtain an index file, where the index file describes a plurality of alternative video streams having the same aspect ratio and different bitrates, resolutions and at least two of the plurality of alternative video streams have different sample aspect ratios, configure a video decoder to decode video having a resolution, and sample aspect ratio of a first of the plurality of alternative video streams, where decoding the video comprises scaling the decoded video to the resolution of the display device, and request a portion of the first stream.
Still another embodiment of the invention includes playing back content including obtaining an index file using a playback device, where the index file describes a plurality of alternative video streams having the same aspect ratio and different bitrates, resolutions and at least two of the plurality of alternative video streams have different sample aspect ratios, configuring a video decoder on the playback device to decode video having a resolution, and sample aspect ratio of a first of the plurality of alternative video streams, where decoding the video comprises scaling the decoded video to the resolution of the display device, and requesting a portion of the first stream using the playback device.
Still another embodiment of the invention includes determining a set of sub-resolutions for encoding a primary video stream having a primary resolution, and primary aspect ratio as a set of lower resolution streams including selecting a set of sub-resolutions, where the height and width of each sub-resolution in the set of sub-resolutions is an integer number of pixels and less than the corresponding width and height of the primary resolution and selecting sub-resolutions from the set of sub-resolutions, where the width and height of the selected sub-resolutions have the same aspect ratio as the primary aspect ratio.
In yet another additional embodiment of the invention, the width and height for each sub-resolution in the set of sub-resolutions is a multiple of 8.
In still another additional embodiment of the invention, the set of sub-resolutions comprises sub-resolutions having a square sample aspect ratio.
In yet still another additional embodiment of the invention, the set of sub-resolutions is selected by iteratively subtracting a column width from the width of the primary resolution and multiplying the resulting width by the primary aspect ratio to determine corresponding heights and selecting the sub-resolution where the width and corresponding height are integer numbers of pixels.
In yet another embodiment of the invention, selecting the set of sub-resolutions further includes iteratively subtracting a row height from the height of the primary resolution and multiplying the resulting height by the primary aspect ratio to determine corresponding widths and selecting the sub-resolutions where the height and corresponding width are integer number of pixels.
In still another embodiment of the invention, the width and height for each sub-resolution in the set of sub-resolutions is a multiple of 8.
In yet still another embodiment of the invention, the set of sub-resolutions comprises sub-resolutions having a square sample aspect ratio.
In yet another additional embodiment of the invention, the set of sub-resolutions comprises sub-resolutions having a square sample aspect ratio, and the width and height for each sub-resolution is a multiple of 8.
In still another additional embodiment of the invention, the sub-resolutions for the primary resolutions of 1920×1080, 1920×1040, 1920×816, and 1920×800 are calculated.
In yet still another additional embodiment of the invention, the set of sub-resolutions comprises sub-resolutions having a common width that is repeated for a majority of the primary resolutions, and wherein a sub-resolution is selected for any primary resolution that does not share the common width value.
In yet another embodiment of the invention, the sub-resolutions selected for the primary resolutions that do not have the common width value are generated by using the width value and selecting a height value having an aspect ratio identical to the aspect ratio of the primary resolution.
In still another embodiment of the invention, the selected height values are rounded to the closest multiple of 8 pixels.
In yet still another embodiment of the invention, the selected height values is rounded down to the nearest multiple of 8.
In yet another additional embodiment of the invention, the set of sub-resolutions is selected by iteratively subtracting a row height from the height of the primary resolution and multiplying the resulting height by the primary aspect ratio to determine corresponding widths and selecting the sub-resolutions where the height and corresponding width are integer number of pixels.
Still another embodiment of the invention includes a machine readable medium containing processor instructions, where execution of the instructions by a processor causes the processor to perform a process including receiving multimedia content comprising video data having a primary resolution and a primary sample aspect ratio and encoding the video data as a set of alternative streams, where a plurality of the streams in the set of alternative streams have different maximum bitrates and resolutions, wherein the resolution of each of the plurality of streams comprises a width and height that are both an integer number of pixels and both the width and height of each of the plurality of streams is a common fraction of the width and height of the corresponding primary resolution.
In yet another additional embodiment of the invention, the numerator and denominator of the fraction are each less than 100.
Still another embodiment of the invention includes a machine readable medium containing processor instructions, where execution of the instructions by a processor causes the processor to perform a process including obtaining an index file, where the index file describes a plurality of alternative video streams having the same aspect ratio and different bitrates, resolutions and at least two of the plurality of alternative video streams have different sample aspect ratios, configuring a video decoder to decode video having a resolution, and sample aspect ratio of a first of the plurality of alternative video streams, where decoding the video comprises scaling the decoded video to the resolution of a display device, and requesting a portion of the first stream.
Turning now to the drawings, systems and methods for adaptive bitrate streaming of streams encoded at different resolutions in accordance with embodiments of the invention are illustrated. Adaptive bitrate streaming systems in accordance with embodiments of the invention are configured to stream multimedia content encoded at different maximum bitrates and resolutions over a network, such as the Internet. Multimedia content typically includes video and audio data, subtitles, and other related metadata. Video data is created at a certain resolution and encoded to achieve a target maximum bitrate. Video data at a particular resolution can be encoded at multiple bitrates; however, the subjective quality of the video data at a particular bitrate depends in part on the resolution of the video data. For example, video data at a high resolution may have subjectively poor quality when encoded at a low bitrate due to the information stored in the high resolution video being lost in the encoding. Likewise, video data at a lower resolution may not show any improved subjective quality when encoded at gradually higher bitrates; however, the subjective quality of a lower resolution video may be acceptable at a lower bitrate relative to a higher resolution encoding. Adaptive bitrate streaming systems in accordance with embodiments of the invention contain multimedia sources containing video data of varying video resolutions and maximum bitrates. In order to provide the highest quality video experience independent of the network data rate, adaptive bitrate streaming systems are configured to switch between the available sources of video data throughout the delivery of the video data according to a variety of factors, including, but not limited to, the available network data rate, and video decoder performance. Systems and methods for switching between video streams during playback are disclosed in U.S. patent application Ser. No. 13/221,682 entitled “Systems and Methods for Adaptive Bitrate Streaming of Media Stored in Matroska Container Files Using Hypertext Transfer Protocol” to Braness et al., filed Aug. 30, 2011, the disclosure of which is incorporated by reference herein in its entirety.
Although streamed multimedia content can be encoded at any of a number of different resolutions, the process of decoding video for display typically involves scaling the decoded video to the resolution of the display device. Where the resolution differs from that of the display device, each pixel of the decoded video is scaled to correspond to one or more pixels of the display. In some cases, it may be preferable that the scaling ratio between the streamed multimedia content and the display resolution be a whole-number fraction, such as 1:2, 2:3, 3:4, 7:8, etc. Adaptive bitrate streaming systems are designed to transition between streams in response to changes in streaming conditions. Transitions between streams encoded at resolutions that are scaled by different ratios for display are often visibly disruptive. In a number of embodiments, the adaptive bitrate streaming system is configured so that content is encoded in such a way that each stream of video is encoded with an identical aspect ratio that is the same as the aspect ratio of the source video. In this way, each pixel of the encoded video scales in a uniform fashion when decoded and displayed on a display device receiving the video. Adaptive bitrate streaming systems configured to stream video encoded at multiple resolutions with the same aspect ratio as the source video in accordance with embodiments of the invention are discussed further below.
System Overview
An adaptive bitrate streaming system in accordance with an embodiment of the invention is illustrated in
In the illustrated embodiment, playback devices include personal computers 110, CE players 108, and mobile phones 112. In other embodiments, playback devices can include consumer electronics devices such as DVD players, Blu-ray players, televisions, set top boxes, video game consoles, tablets, and other devices that are capable of connecting to a server via HTTP and playing back encoded media. In the illustrated embodiment, a variety of playback devices use HTTP or another appropriate stateless protocol to request portions of a top level index file and the container files via a network 102 such as the Internet. Prior to a playback device performing adaptive bitrate streaming using portions of media from alternative streams contained within the container files, a bandwidth probe can be performed by the playback device to determine available bandwidth. Once the bandwidth probe has been completed, the playback device can utilize data within the top level index including (but not limited to) the maximum bitrate of each of the available streams to determine the initial streams from which to commence requesting portions of encoded media as part of an adaptive streaming process.
Once playback of content from the initial set of streams commences, the playback device utilizes the top level index to perform adaptive bitrate streaming of the content in response to changes in streaming conditions. In an adaptive bitrate streaming system that utilizes streams encoded at different resolutions, the playback device can progress through a series of operational phases in which the playback device responds differently in each phase to changes in the streaming conditions. In a number of embodiments, stability in streaming conditions or improving streaming conditions can result in a transition to a phase in which the playback device assumes stable operating conditions, buffers more content, and is less responsive to fluctuations in streaming conditions. In many embodiments, a deterioration in streaming conditions results in a stream switch to a set of streams at a lower resolution utilizing less bandwidth and resulting in the playback device transitioning to a phase in which the playback device assumes unstable operating conditions, buffers less content, and responds rapidly to variations in streaming conditions.
In the illustrated embodiment, the adaptive bitrate streaming system includes computer systems capable of delivering multimedia content with variable data rates utilizing a variety of video resolutions. In many embodiments, adaptive bitrate streaming systems can be implemented using any device capable of delivering encoded streams of multimedia, where the streams are encoded at different maximum bitrates and resolutions. The basic architecture of an adaptive streaming system source encoder in accordance with an embodiment of the invention is illustrated in
Selection of Resolutions for a Primary Resolution
As discussed above, adaptive bitrate streaming systems involve switching between different video streams encoded at different bitrates depending on streaming conditions. In an adaptive streaming system that utilizes alternative video streams encoded at different resolutions, the resolution of each stream can be selected based upon the playback quality of the encoded video at a given bitrate. A generalized process for selecting resolutions for a video stream in accordance with embodiments of the invention is illustrated in
Once the set of resolutions is determined using both width and height, the lists are combined (216) and any duplicate width and heights removed. Once the duplicate resolutions are removed, the calculated width and heights are compared to the primary width and height to determine which resolutions are selected (218). In many embodiments of the invention, those calculated resolutions where both the height and width can be represented as whole-number fractions of the height and width of the primary resolution are selected (218), where the numerator or the denominator are less than a predetermined number such as 100. One example of a set 500 of resolutions and ratios calculated in accordance with an embodiment of the invention utilizing the process described above are shown in
Although a specific process for selecting resolutions for use in a multiple resolution adaptive streaming system is discussed above, any of a variety of processes that involve calculating resolutions based on a primary resolution, including those where resolutions higher than the primary resolution are calculated, can be utilized in accordance with embodiments of the invention. Systems and methods for selecting resolutions given a combination of primary resolutions are discussed further below.
Selection of Resolutions for a Combination of Primary Resolutions
Video data contained in multimedia content is often created at a particular resolution and aspect ratio. However, it is desirable to display multimedia content on a variety of display devices; these display devices may not be of the same resolution or aspect ratio as the video data. In an adaptive streaming system, the video data may be encoded at a variety of primary resolutions with differing aspect ratios in order to maintain a level of consistency in the display of the source multimedia content. A generalized process for selecting resolutions for a video stream with multiple aspect ratios in accordance with embodiments of the invention is illustrated in
An example set 600 of resolutions calculated in accordance with an embodiment of the invention is shown in
Although the present invention has been described in certain specific aspects, many additional modifications and variations would be apparent to those skilled in the art. It is therefore to be understood that the present invention may be practiced otherwise than specifically described, including playback devices where the set of streaming switching conditions utilized by the playback device are continuously changing, without departing from the scope and spirit of the present invention. Thus, embodiments of the present invention should be considered in all respects as illustrative and not restrictive. Accordingly, the scope of the invention should be determined not by the embodiments illustrated, but by the appended claims and their equivalents.
The current application is a continuation of U.S. application Ser. No. 13/430,032, filed Mar. 26, 2012, which application claims priority to U.S. Provisional Patent Application No. 61/529,216, filed Aug. 30, 2011, the disclosures of which are incorporated herein by reference in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
3919474 | Benson | Nov 1975 | A |
4974260 | Rudak | Nov 1990 | A |
5912710 | Fujimoto | Jun 1999 | A |
6005621 | Linzer | Dec 1999 | A |
6157410 | Izumi et al. | Dec 2000 | A |
6430354 | Watanabe | Aug 2002 | B1 |
6920179 | Anand et al. | Jul 2005 | B1 |
8818171 | Soroushian et al. | Aug 2014 | B2 |
9467708 | Soroushian et al. | Oct 2016 | B2 |
9510031 | Soroushian et al. | Nov 2016 | B2 |
9955195 | Soroushian | Apr 2018 | B2 |
20030142872 | Koyanagi | Jul 2003 | A1 |
20040150747 | Sita | Aug 2004 | A1 |
20040208245 | Macinnis et al. | Oct 2004 | A1 |
20050052294 | Liang et al. | Mar 2005 | A1 |
20050089091 | Zhao et al. | Apr 2005 | A1 |
20050157948 | Lee | Jul 2005 | A1 |
20050210145 | Kim et al. | Sep 2005 | A1 |
20060015813 | Chung | Jan 2006 | A1 |
20060039481 | Shen | Feb 2006 | A1 |
20060126717 | Boyce et al. | Jun 2006 | A1 |
20070024706 | Brannon, Jr. | Feb 2007 | A1 |
20070053293 | Mcdonald et al. | Mar 2007 | A1 |
20080030614 | Schwab | Feb 2008 | A1 |
20080052306 | Wang et al. | Feb 2008 | A1 |
20080137848 | Kocher et al. | Jun 2008 | A1 |
20080196076 | Shatz et al. | Aug 2008 | A1 |
20080253454 | Imamura et al. | Oct 2008 | A1 |
20080266522 | Weisgerber | Oct 2008 | A1 |
20090116821 | Shibamiya et al. | May 2009 | A1 |
20090300204 | Zhang et al. | Dec 2009 | A1 |
20100002069 | Eleftheriadis et al. | Jan 2010 | A1 |
20100262712 | Kim et al. | Oct 2010 | A1 |
20100278271 | Maclnnis | Nov 2010 | A1 |
20110022432 | Ishida et al. | Jan 2011 | A1 |
20110082924 | Gopalakrishnan | Apr 2011 | A1 |
20110099594 | Chen et al. | Apr 2011 | A1 |
20110126104 | Woods et al. | May 2011 | A1 |
20110164679 | Satou et al. | Jul 2011 | A1 |
20110170408 | Furbeck et al. | Jul 2011 | A1 |
20110280307 | MacInnis | Nov 2011 | A1 |
20120173751 | Braness et al. | Jul 2012 | A1 |
20120177101 | van der Schaar | Jul 2012 | A1 |
20120179834 | van der Schaar et al. | Jul 2012 | A1 |
20120281767 | Duenas et al. | Nov 2012 | A1 |
20130051767 | Soroushian et al. | Feb 2013 | A1 |
20130051768 | Soroushian et al. | Feb 2013 | A1 |
20130058393 | Soroushian | Mar 2013 | A1 |
20130169863 | Smith | Jul 2013 | A1 |
20140355958 | Soroushian et al. | Dec 2014 | A1 |
20170041604 | Soroushian et al. | Feb 2017 | A1 |
20180278975 | Soroushian | Sep 2018 | A1 |
Number | Date | Country |
---|---|---|
1662952 | Aug 2005 | CN |
1756359 | Apr 2006 | CN |
1787422 | Jun 2006 | CN |
101461149 | Jun 2009 | CN |
102138327 | Jul 2011 | CN |
103858419 | Jun 2014 | CN |
103875248 | Jun 2014 | CN |
103875248 | Sep 2018 | CN |
108989847 | Dec 2018 | CN |
1335603 | Aug 2003 | EP |
1195183 | Feb 2018 | HK |
1260329 | Dec 2019 | HK |
2009508452 | Feb 2009 | JP |
20140056317 | May 2014 | KR |
101928910 | Dec 2018 | KR |
10-1936142 | Jan 2019 | KR |
10-1981923 | May 2019 | KR |
10-2020764 | Sep 2019 | KR |
10-2074148 | Jan 2020 | KR |
10-2086995 | Mar 2020 | KR |
2010150470 | Dec 2010 | WO |
2011053658 | May 2011 | WO |
2011059291 | May 2011 | WO |
2011087449 | Jul 2011 | WO |
2011093835 | Aug 2011 | WO |
2011102791 | Aug 2011 | WO |
2013033334 | Mar 2013 | WO |
2013033335 | Mar 2013 | WO |
2013033458 | Mar 2013 | WO |
2013033458 | May 2013 | WO |
Entry |
---|
International Preliminary Report on Patentability for International Application No. PCT/US2012/053223, Report Issued Mar. 4, 2014, 7 pgs. |
International Search Report and Written Opinion for International Application No. PCT/US2012/053052, International Filing Date Aug. 30, 2012, Report Completed Oct. 25, 2012, dated Nov. 16, 2012, 9 pgs. |
International Search Report and Written Opinion for International Application No. PCT/US2012/053223, International Filing Date Aug. 30, 2012, Report Completed Dec. 7, 2012, dated Mar. 7, 2013, 10 pgs. |
International Search Report and Written Opinion for International Application PCT/US2012/053053, dated Oct. 23, 2012, 11 pgs. |
International Search Report for International Application No. PCT/SE2011/050166, Search completed Mar. 3, 2011, dated Mar. 3, 2011, 5 Pgs. |
Extended European Search Report for European Application EP12828956.8,Report Completed Feb. 18, 2015, dated Mar. 2, 2015, 13 Pgs. |
“Adaptive HTTP Streaming in PSS—Client Behaviour”, S4-AHI129, 3GPP TSG-SA4 Ad-Hoc Meeting, Dec. 14-16, 2009, Paris, France; section 12.6.1. |
“Adaptive HTTP Streaming in PSS—Data Formats for HTTP—Streaming excluding MPD”, S4-AHI128, 3GPP TSGSA4 Ad-Hoc Meeting, Dec. 14-16, 2009, Paris, France; sections 12.2.1 and 12.2.4.2.1. |
“Adaptive HTTP Streaming in PSS—Discussion on Options”, S4-AHI130, 3GPP TSG-SA4 Ad-Hoc Meeting, Dec. 14-16, 2009, Paris, France; sections 1, 2.7-2.8, and 2.16-2.19. |
“Fragmented Time Indexing of Representations”, S4-AHI126, 3GPP TSG-SA4 Ad-Hoc Meeting, Dec. 14-16, 2009, Paris, France. |
“Transparent end-to-end packet switched streaming service (PSS); 3GPP file format (3GP) (Release 9)”, 3GPP TS 26.244 V9.0.0 (Dec. 2009),sections 7.1-7.4. |
Fecheyr-Lippens, A., “A Review of HTTP Live Streaming”, Jan. 25, 2010, X P002638990, Retrieved from the Internet: URL:http://issuu.com/andruby/docs/http_live_streaming Retrieved on May 24, 2011, 38 Pages. |
Watson, Mark, “Input for Dash EE#1 (CMP): Pixel Aspect Ratio”, 94. MPEG Meeting, Oct. 11-15, 2010, Guangzhou, Motion Picture Expert Group or ISO/IEC JTC1/SC29/WG11, No. M18498, Oct. 28, 2010, XP030047088, 4 Pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2012/053052, dated Mar. 4, 2014, 8 pgs. |
Number | Date | Country | |
---|---|---|---|
20170026445 A1 | Jan 2017 | US |
Number | Date | Country | |
---|---|---|---|
61529216 | Aug 2011 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13430032 | Mar 2012 | US |
Child | 15287608 | US |