The following relates to systems and methods for applying post-processing to a video stream.
A computing device, such as a mobile device, uses resources, such as a processor, to perform tasks. Each task inherently consumes a certain percentage of the overall resources of the device. It is well known that mobile devices generally have fewer resources than, for example, personal computers (PCs). Many tasks, often referred to as non-interactive tasks, are fixed tasks that are scheduled by a scheduling algorithm. Other tasks, often referred to as interactive tasks, in some way relate to recent input/output (I/O) traffic or user related tasks, such as user input or user directed output. The scheduling algorithm typically aims to schedule interactive tasks for optimal low latency and non-interactive tasks for optimal throughput.
An example of a non-interactive task is video decoding, which is done in the background, and an example of an interactive task is a keystroke or status bar update that is to be presented to a user on the display of the mobile device. The video content expected to be played on a mobile device often pushes the capabilities of mobile devices. Video decoding can be part of decoding a multimedia file including both audio content and video content. Devices for playing multimedia content are often referred to as media players. In some circumstances, the mobile device cannot decode video sufficiently quickly to maintain synchronization with playback of the audio content. This can provide a poor viewing experience to the user.
Example implementations will now be described with reference to the appended drawings wherein:
Example mobile devices and methods performed thereby are now described for dynamic post-processing of a video stream. The methods can be performed according to video processing lag from a rendered audio stream. The methods and devices can provide increased video processing speed. This can result in decreased video rendering lag of audio rendering, and increased synchronization of video and audio playback.
Referring now to
The mobile device 10a shown in
The mobile device 10b shown in
The mobile device 10b also comprises a menu or option button 24 that loads a menu or list of options on display 12b when pressed, and a cancel or escape button 16b to exit, “go back” or otherwise escape from a feature, option, selection or display. The mobile device 10b as illustrated in
The reduced QWERTY keyboard 22 comprises a plurality of multi-functional keys and corresponding indicia including keys associated with alphabetic characters corresponding to a QWERTY array of letters A to Z and an overlaid numeric phone key arrangement. The plurality of keys that comprise alphabetic and/or numeric characters total fewer than twenty-six (26). In the implementation shown, the number of keys that comprise alphabetic and numeric characters is fourteen (14). In this implementation, the total number of keys, including other functional keys, is twenty (20). The plurality of keys may comprise four rows and five columns of keys, with the four rows comprising in order a first, second, third and fourth row, and the five columns comprising in order a first, second, third, fourth, and fifth column. The QWERTY array of letters is associated with three of the four rows and the numeric phone key arrangement is associated with each of the four rows.
The numeric phone key arrangement is associated with three of the five columns. Specifically, the numeric phone key arrangement may be associated with the second, third and fourth columns. The numeric phone key arrangement may alternatively be associated with keys in the first, second, third, and fourth rows, with keys in the first row including a number “1” in the second column, a number “2” in the third column, and a number “3” in the fourth column. The numeric phone keys associated with keys in the second row include a number “4” in the second column, a number “5” in the third column, and a number “6” in the fourth column. The numeric phone keys associated with keys in the third row include a number “7” in the second column, a number “8” in the third column, and a number “9” in the fourth column. The numeric phone keys associated with keys in the fourth row may include a “*” in the second column, a number “0” in the third column, and a “#” in the fourth column.
The physical keyboard may also include a function associated with at least one of the plurality of keys. The fourth row of keys may include an “alt” function in the first column, a “next” function in the second column, a “space” function in the third column, a “shift” function in the fourth column, and a “return/enter” function in the fifth column.
The first row of five keys may comprise keys corresponding in order to letters “QW”, “ER”, “TY”, “UI”, and “OP”. The second row of five keys may comprise keys corresponding in order to letters “AS”, “DF”, “GH”, “JK”, and “L”. The third row of five keys may comprise keys corresponding in order to letters “ZX”, “CV”, “BN”, and “M”.
It will be appreciated that for the mobile device 10, a wide range of one or more positioning or cursor/view positioning mechanisms such as a touch pad, a joystick button, a mouse, a touchscreen, set of arrow keys, a tablet, an accelerometer (for sensing orientation and/or movements of the mobile device 10 etc.), or other whether presently known or unknown may be employed. Similarly, any variation of keyboard 20, 22 may be used. It will also be appreciated that the mobile devices 10 shown in
Movement, navigation, and/or scrolling with use of a cursor/view positioning device 14 (e.g. trackball 14b or positioning wheel 14a) is beneficial given the relatively large size of visually displayed information and the compact size of display 12, and since information and messages are typically only partially presented in the limited view of display 12 at any given moment. As previously described, positioning device 14—positioning wheel 14a and trackball 14b, are helpful cursor/view positioning mechanisms to achieve such movement. Positioning device 14, which may be referred to as a positioning wheel or scroll device 14a in one example implementation (
Mobile station 32 will normally incorporate a communication subsystem 34 which includes a receiver 36, a transmitter 40, and associated components such as one or more (preferably embedded or internal) antenna elements 42 and 44, local oscillators (LOs) 38, and a processing module such as a digital signal processor (DSP) 46. As will be apparent to those skilled in field of communications, particular design of communication subsystem 34 depends on the communication network in which mobile station 32 is intended to operate.
Mobile station 32 may send and receive communication signals over a network after required network registration or activation procedures have been completed. Signals received by antenna 44 through the network are input to receiver 36, which may perform such common receiver functions as signal amplification, frequency down conversion. filtering, channel selection, and like, and in example shown in
Network access is associated with a subscriber or user of mobile station 32. In one implementation, mobile station 32 uses a Subscriber Identity Module or “SIM” card 74 to be inserted in a SIM interface 76 in order to operate in the network. SIM 74 is one type of a conventional “smart card” used to identify an end user (or subscriber) of the mobile station 32 and to personalize the device, among other things. Without SIM 74, the mobile station terminal in such an implementation is not fully operational for communication through a wireless network. By inserting SIM 74 into mobile station 32, an end user can have access to any and all of his/her subscribed services. SIM 74 generally includes a processor and memory for storing information. Since SIM 74 is coupled to a SIM interface 76, it is coupled to processor 64 through communication lines. In order to identify the subscriber, SIM 74 contains some user parameters such as an International Mobile Subscriber Identity (IMSI). An advantage of using SIM 74 is that end users are not necessarily bound by any single physical mobile station. SIM 74 may store additional user information for the mobile station as well, including datebook (or calendar) information and recent call information. It will be appreciated that mobile station 32 may also be used with any other type of network compatible mobile device 10 such as those being code division multiple access (CDMA) enabled and should not be limited to those using and/or having a SIM card 74.
Mobile station 32 is a battery-powered device so it also includes a battery interface 70 for receiving one or more rechargeable batteries 72. Such a battery 72 provides electrical power to most if not all electrical circuitry in mobile station 32, and battery interface 70 provides for a mechanical and electrical connection for it. The battery interface 70 is coupled to a regulator (not shown) which provides a regulated voltage V to all of the circuitry.
Mobile station 32 in this implementation includes a processor 64 which controls overall operation of mobile station 32. It will be appreciated that the processor 64 may be implemented by any processing device. Communication functions, including at least data and voice communications are performed through communication subsystem 34. Processor 64 also interacts with additional device subsystems which may interface with physical components of the mobile device 10. Such additional device subsystems comprise a display 48 (the display 48 can be the display 12 including 12a and 12b of
Processor 64, in addition to its operating system functions, preferably enables execution of software applications on mobile station 32. A predetermined set of applications which control basic device operations, including at least data and voice communication applications, as well as the inventive functionality of the present disclosure, will normally be installed on mobile station 32 during its manufacture. A preferred application that may be loaded onto mobile station 32 may be a personal information manager (PIM) application having the ability to organize and manage data items relating to user such as, but not limited to, e-mail, calendar events, voice mails, appointments, and task items. Naturally, one or more memory stores are available on mobile station 32 and SIM 74 to facilitate storage of PIM data items and other information.
The PIM application preferably has the ability to send and receive data items via the wireless network. In the present disclosure, PIM data items are seamlessly integrated, synchronized, and updated via the wireless network, with the mobile station user's corresponding data items stored and/or associated with a host computer system thereby creating a mirrored host computer on mobile station 32 with respect to such items. This is especially advantageous where the host computer system is the mobile station user's office computer system. Additional applications may also be loaded onto mobile station 32 through network, an auxiliary subsystem 54, serial port 56, short-range communications subsystem 66, or any other suitable subsystem 68, and installed by a user in RAM 52 or preferably a non-volatile store (such as Flash memory 50) for execution by processor 64. Such flexibility in application installation increases the functionality of mobile station 32 and may provide enhanced on-device functions, communication-related functions, or both. For example, secure communication applications may enable electronic commerce functions and other such financial transactions to be performed using mobile station 32.
In a data communication mode, a received signal such as a text message, an e-mail message, or web page download will be processed by communication subsystem 34 and input to processor 64. Processor 64 will preferably further process the signal for output to display 48 or alternatively to auxiliary I/O device 54. A user of mobile station 32 may also compose data items, such as e-mail messages, for example, using keyboard 58 in conjunction with display 48 and possibly auxiliary I/O device 54. Keyboard 58 is preferably a complete alphanumeric keyboard and/or telephone-type keypad. These composed items may be transmitted over a communication network through communication subsystem 34.
For voice communications, the overall operation of mobile station 32 is substantially similar, except that the received signals would be output to speaker 60 and signals for transmission would be generated by microphone 62. Alternative voice or audio I/O subsystems, such as a voice message recording subsystem, may also be implemented on mobile station 32. Although voice or audio signal output is preferably accomplished primarily through speaker 60, display 48 may also be used to provide an indication of the identity of a calling party, duration of a voice call, or other voice call related information, as some examples.
Serial port 56 in
Short-range communications subsystem 66 of
As shown in
Returning to
It will also be appreciated that the multimedia file 120 may be streaming content that is provided to or otherwise obtained by the mobile device 10.
Referring to
Some multimedia files 120 do not have timestamps 129a in the multimedia file 120. For example, a file 120 may include audio data 127 and video data 125 and a target frame rate. The target frame rate is the preferred frame rate for the multimedia file 120, such as for example 25 fps. The frame rate in combination with the video frames and audio frames 127 provides the information provided by a timestamp. For example, at 25 fps frames are to be played at 40 ms intervals. Thus, if the first frame is to be played at 0 ms then the fifth frame is to be played at 160 ms, and the respective timestamps of the first frame and the fifth frame would be 0 ms and 160 ms.
The multimedia file 120 can be encoded using MPEG encoding, for example MPEG-4; it will be appreciated, however, that the principles discussed herein are equally applicable to other encoding/decoding schemes. A further example of an encoding format for the multimedia file 120 is H.264. Decoding H.264 is particularly time consuming and can benefit significantly from application of the principles described herein.
In MPEG video encoding, a group of pictures is used to specify the order in which intra-frame and inter-frames are arranged, wherein the group of pictures is a stream of encoded frames 126 in the video data 125. The frames 126 in MPEG encoding are of the following types: An I-frame (intra coded) corresponds to a fixed image and is independent of other picture types. Each group of pictures begins with this type of frame. A P-frame (predictive coded) contains difference information from the preceding I or P-frame. A B-frame (bidirectionally predictive coded) contains difference information from the preceding and/or following I or P-frame. D frames may also be used, which are DC direct coded pictures that serve the fast advance. In the following examples, video data 125 having I, B and P frames is used. It will be appreciated that the dynamic post-processing discussed below may be applied on a frame by frame basis or for every group of pictures.
Referring to
The media player client 904 displays the media player interface 132 (
The parser 900 parses the multimedia file 120 incoming as a multimedia stream 910 into an audio stream 920 and a video stream 922. The output of the parser 900 is a compressed video stream 923a and a compressed audio stream 921a. The audio stream 920 is decoded, including decompressions, to a decoded audio stream 921b. Then the audio stream 920 is rendered to a rendered audio stream 921c. Each of the compressed audio stream 921a, the decoded audio stream 921b, and the rendered audio stream 921c form part of the audio stream 920. The video stream 922 is decoded, including decompression, to a decoded video stream 923b. Then the video stream 922 is rendered to a rendered video stream 923c. Each of the compressed video stream 923a, the decoded video stream 923b, and the rendered video stream 923c form part of the video stream 922.
Referring to
Referring again to
The audio renderer 918 queues the audio frames 128 according to the timestamps 924. Depending on the implementation, the audio renderer 918 can perform additional rendering functions, such as digital to analog conversion using a digital to analog converter (DAC), not shown, and the rendered data stream is an analog stream that could be played directly by speakers as sound system 912. Alternatively, where the sound system 912 has additional processing capability then the renderer 918 can queue the frames 128 and transmit the frames 128 to the sound system 912. The sound system 912 is then responsible for any digital to analog conversion. When the renderer 918 sends pcm data for a frame 128, either internally in the multimedia player 898 for further processing, or externally to the sound system 912, the audio renderer 918 updates the media clock 902 to the corresponding audio timestamp 926 for the frame 128. The audio renderer 918 can modify the audio timestamp 926 prior to sending the timestamp 926 to the media clock 902 to take into account anticipated delays introduced by the sound system 912. The media clock 902 stores the timestamp 926 as a media time 919.
Similarly, the video player 88 has a video decoder 140, sometimes referred to as a video codec, and a video renderer 940. The video decoder 140 decodes the video stream 923a. For example, an MPEG compressed video stream can be decompressed to frames in a standard NTSC TV transmission format (yuv420). The decoded video stream 921b is passed from the video decoder 140 to the video renderer 940. The video renderer 940 queues the decoded video frames 126 for transmission to the display 48. Depending on the capabilities of display 48, the frames 126 can remain in digital format, or can be converted to analog format, by the video renderer 940.
Video player 88 has a video scheduler 950. The video scheduler 950 can have access to the video stream 923b. The video scheduler 950 controls transmission of the decoded video stream 923b to the video renderer 940.
The video player 88 includes one or more video post-processing blocks 960. The video scheduler 950 commands operation of the post-processing blocks 960. The video post-processing blocks 960 can perform such functions as deblocking, color conversion, scaling, and rotation among other things. As an example, the post-processing blocks 960 in video player 88 include a deblocking filter module 144 and a color conversion module 962.
The post-processing blocks are not required to be physically or logically grouped together. For example, the deblocking filter module 144 and the video decoder 140 can be grouped together in a video decode module 980. Decoding of an H.264 video stream is typically architected to have a decode module 980 that incorporates both a video decoder 140 and a deblocking module 144. H.264 is a block oriented encoding. The strength of a deblocking filter 124, or turning off (disabling) deblocking (not using a filter 124) can be specified in the multimedia file 120, possibly in the video content itself and in the resulting video stream 922. Deblocking can be specified by a flag encoded into the video stream. Turning off deblocking can cause the deblocking module 144 to ignore (override) the flag and skip deblocking.
The decode module 980 controls passage of the video stream from the video decoder 140 to the deblocking module 144 and to an output of the decode module 980. Similarly, the color conversion module can be grouped together with the video renderer 940 in a video render module 982. An output of the decode module 140 can be a deblocked decoded video stream 923b, which would form the input to the video render module 982. An output of the render module can be a color converted rendered video stream 923c for transmission to display 48. The render module 940 controls passage of the video stream 923c to an output of the render module 982.
Where a decode module 980 and render module 982 architecture are used the video scheduler 950 may only have access to the video stream 923b between the decode module 980 and the render module 982. The video scheduler 950 can command operation of the deblocking module 144 via control 984. As an example, where decode module 980 is implemented incorporating a computer program, deblocking module 144 can be exposed to the video scheduler through an application programming interface (API). The API can allow control of simple functionality such as deblocking on and deblocking off. Alternatively, the deblocking filter module can have a plurality of deblocking filters 124 (FILTER 1 through FILTER N) and a filter selection module 142. The control can allow selection of a particular filter 124 through the filter selection module 124.
Similarly, the video scheduler 950 can command operation of the color conversion module 962 via control 986. As an example, where render module 982 is implemented incorporating a computer program, color conversion module 962 can be exposed to the video scheduler 950 through an application programming interface (API). The API can allow control of simple functionality such as color conversion on and color conversion off. It is noted that turning color conversion off is likely not a practical option in most circumstances as frames 126 will not be rendered sufficiently well for viewing. Alternatively, the color conversion module 962 can have a plurality of color conversion converters 988 (CONVERTER 1 through CONVERTER N) and a converter selection module 990. Each converter 988 is based on a different color space conversion algorithm resulting in different quality and processing complexity. The control 986 can allow selection of a particular converter 988 through the converter selection module 990.
The video scheduler 950 has access to video timestamps 928 of video frames 126 decoded by the video decoder 140. For example, the video scheduler 950 can access decoded video timestamps 930 by accessing the video stream 923b after the video decoder 140. In the decode module 980 and render module 982 architecture described above, the video scheduler 950 can access the video stream 923b output from the decode module 980.
In alternate implementations the video scheduler 950 could access video timestamps 930 of the decoded video stream 923b elsewhere in the video stream 923b after the video decoder 140. For example, the video scheduler 950 could access video timestamps 930 from the decoded video stream 923b after the decoder 140 and prior to the video post-processing blocks 960. As a further example, video timestamps 930 could be accessed between post-processing blocks 960, or between the post-processing blocks 960 and the video renderer 940. As another example, the video scheduler could access the video timestamps 930 directly from the video decoder 140 as represented by command line 983 between the video decoder 140 and the video scheduler 950. Direct access of the video timestamps 930 from the video decoder 140 after decoding of the video stream 922 is accessing the decoded video stream 923b.
The video scheduler 950 could access timestamps 126 from multiple locations in the video stream 923b during processing; however, there is a trade-off between overhead required to perform access in multiple locations and the benefits derived thereby. It has been found that accessing the video timestamps 126 in a single location after the video decoder 140 can be sufficient for the purposes described herein.
Video post-processing improves the quality of the decoded video stream 923b from the video decoder 140 before the decoded video stream 923b reaches the video renderer 940. Video post-processing includes any step within the video player 88 post decoding (after decoding by the video decoder 140). Although video post-processing can improved the picture quality of the video frames 126 when viewed on the display 48, video post-processing can be resource intensive leading to a lag between the audio stream 920 and the video stream 922.
The video scheduler 950 has access to the media clock 902 via a request line 996. Through the media clock 902 the video scheduler 950 has access to the media time 919. The video scheduler 950 can request the media time 919 from the media clock 902. The video scheduler 950 can compare the media time to the video timestamp 930.
The video scheduler 950 has one or more thresholds 988 (THRESHOLD 1 to THRESHOLD N). The video scheduler 950 can use the thresholds 988 to trigger adaptive post-processing to increase decoded video stream 923b processing speed. The video scheduler 950 can trigger multiple post-processing adaptations based on the different thresholds. The post-processing adaptations can escalate in accordance with escalating thresholds 988.
The multimedia player 898 can be implemented in a variety of architectures. As described herein, the multimedia player 898 is a combination of the media player application 86 running on processor 64 to carry out the functions described herein. The processor 64 can be a general purpose processor without dedicated hardware functionality for multimedia processing. Such a general purpose processor becomes a special purpose processor when executing in accordance with the multimedia player application 86. Alternatively, any components of the multimedia player 898 can be implemented entirely or partially within dedicated hardware. For example, audio renderer 918 can comprise a DAC as described elsewhere herein. As a further example, such hardware can be incorporated into a single integrated circuit, or distributed among a plurality of discrete integrated circuits. Each of these hardware configurations is understood to be a processor 64, whether or not the functionality is distributed among various hardware components.
Referring to
Assuming the target frame rate is 40 ms then the video decoding is lagging the audio rendering by 40 ms. As the video timestamp 930 is for decoded video and the rendered audio timestamp 926 is for rendered audio, a lag of one frame 126 should be acceptable as there should be a rendered video frame 126 ahead of the decoded video frame 126, and the video stream 922 is synchronized with the audio stream 920. Accordingly, a lag of one frame (or 40 ms in the example) between the media time 919 and the decoded video timestamp 930 provides a first threshold for determining unacceptable lag.
Referring to
Referring to
Escalating thresholds, examples of which are discussed above, can be used to assess the degree of video lag 996. Escalating action can be taken as higher thresholds are reached.
Referring to
At 814, if the video timestamp 930 is within a first threshold of the media time 919 then, at 816, default post-processing is allowed to continue. Default post-processing could include all post-processing blocks with best quality. For example, for decoding of H.264 encoding default processing could include enabling (turning on) deblocking Again at 814, if the video timestamp 930 is above a first threshold of the media time 919 then, at 818, default post-processing is adapted to increase video stream 922 processing speed. At 820, following post-processing at 816 or at 818 the video stream 923b is rendered. As will later be described herein, adapting post-processing can involve skipping, or discarding, a frame 126 in the video stream 923b so that the frame 126 is not post-processed at all and is not rendered, or is partially post-processed and not rendered. An example of a circumstance where a frame 126 could be partially processed, but not rendered could include decoding an H.264 encoded frame and deblocking the frame, and then discarding the frame 126.b. Preferably a frame 126 that is to be discarded would be discarded at an early a stage after decoding as possible; although, accessing frames 126 prior to some post-processing may not be practical in all circumstances.
Referring to
As adaptive post-processing including discarding frames 126 will apply from one checking of the media time 919 to the next, it will likely not be desirable to discard all frames. Rendered video could be blank or jittery, and content could be lost. As an alternative to discarding all frames 126, frames 126 could be regularly discard. For example, every third frame could be discarded. If a chosen period for discarding frames 126 is insufficient to synchronize rendering of the video stream 922 and audio stream 920 then the period could be decreased (making the period shorter), for example from every third frame 126 to every second frame 126. Discarding frames periodically reduces the actual frame rate. For example if the target frame rate for a multimedia file is 30 fps then discarding every third frame 126 will result in an actual frame rate of 20 fps, and discarding every second frame will result in an actual frame rate of 15 fps. Depending on the actual video lag, the percentage of discarded frames can be increased to a maximum of 100%.
Referring to
Referring to
Referring to
It is noted that the methods described herein can be combined with other triggers for increasing video processing speed, such as processor 64 load. For example, video scheduler 950 can be connected to processor 64 as indicated by command line 998 or some other component of the device 10 that provides an indication of processor 64 usage by the multimedia player 898 or components thereof. Depending on the level of processor 64 usage, the video scheduler 950 can apply adaptive post-processing including multiple thresholds 999 based on processor 64 usage to trigger escalating post-processing adaptations, examples of which have been described herein. Example structures and methods to access processor load are disclosed in US Pat. Pub. No. 2009/0135918 of Mak-Fan et al having a Pub. Date of May 28, 2009, and in US Pat. Pub. No. 2009/0052555 of Mak-Fan et al having a Pub. Date of Feb. 26, 2009, the contents of which publications is incorporated by reference herein.
The video scheduler 950 can use the thresholds 988 to trigger adaptive post-processing to increase decoded video stream 923b processing speed. The video scheduler 950 can trigger multiple post-processing adaptations based on the different thresholds. The post-processing adaptations can escalate in accordance with escalating thresholds 999. Specific numbers of thresholds and values for those thresholds to be triggered based on processor 64 usage will depend on the performance desired.
Referring to
It is noted that the multiple adaptations in the example shown in
Turning now to
In some themes, the home screen 100 may limit the number icons 102 shown on the home screen 100 so as to not detract from the theme background 106, particularly where the background 106 is chosen for aesthetic reasons. The theme background 106 shown in
One or more of the series of icons 102 is typically a folder 112 that itself is capable of organizing any number of applications therewithin.
The status region 104 in this implementation comprises a date/time display 107. The theme background 106, in addition to a graphical background and the series of icons 102, also comprises a status bar 110. The status bar 110 provides information to the user based on the location of the selection cursor 18, e.g. by displaying a name for the icon 102 that is currently highlighted.
Accordingly, an application, such as the media player application 88 may be initiated (opened or viewed) from display 12 by highlighting a media player icon 114 using the positioning device 14 and providing a suitable user input to the mobile device 10. For example, media player application 88 may be initiated by moving the positioning device 14 such that the contacts icon 114 is highlighted as shown in
Turning now to
Various functions of the multimedia player 898 are shown in the FIGS. and described herein with reference to distinct components of the multimedia player 898. It is recognized that the functions of the components herein can be combined with other components in the media player, and the particular layout of components in the FIGS. are provided by way of example only.
An aspect of an implementation of this description provides a method of operating a media player. The method includes decoding an audio stream of the media player and rendering the decoded audio stream in the media player, and while decoding and rendering the audio stream, decoding a video stream and checking to determine if decoded video stream lag of the rendered audio stream within the media player is within a threshold and if not, then adapting post-processing of the video stream to decrease video stream post-processing time.
An aspect an implementation of this description provides a method of operating a multimedia player. The method includes decoding an audio stream of the multimedia player and rendering the decoded audio stream in the multimedia player, updating the media time of the media player with an audio timestamp of the rendered audio stream as the audio stream is rendered, and, while decoding and rendering the audio stream, decoding a video stream and checking the media clock to determine if a video timestamp of the decoded video stream is within a threshold of the media clock time, and if not then adapting post-processing of the video stream to decrease video stream post-processing time.
The audio stream can be decoded and rendered frame by frame from the audio stream, the media clock can be updated as each frame is rendered from the audio stream, the video stream can be decoded frame by frame, and the media clock can be checked for each decoded video frame.
Adapting post-processing of the video stream to decrease video stream post-processing time can include degrading color space conversion processing. Adapting post-processing of the video stream to decrease video stream post-processing time can include frame discarding. Adapting post-processing of the video stream to decrease video stream post-processing time can include degrading deblock filtering. Degrading deblock filtering can include turning deblocking off.
Prior to decoding the audio stream and decoding the video stream, the audio stream and the video stream can be separated from a multimedia stream by a parser. The video timestamp of the video stream and the audio timestamp of the audio stream can be provided as a multimedia timestamp of the multimedia stream. The video timestamp of the video stream and the audio timestamp of the audio stream can be generated by the parser when separating the audio stream and the video stream from the multimedia stream.
In another aspect an implementation of this description provides a computer readable medium, such as memory, can include computer executable instructions, such as media player configured for causing a mobile device, such as mobile device, to perform the methods described herein.
In an aspect an implementation of this description provides a multimedia player for use to render a multimedia file to render audio and video content, the multimedia player including an audio decoder and audio renderer configured to output an audio timestamp and a rendered audio stream from the multimedia file, a video decoder to output a decoded video stream comprising a video timestamp, a video renderer configured to output a rendered video stream from the multimedia file, one or more post-processing blocks, a video scheduler configured to check if the decoded video stream is within a threshold of the audio stream, and if not then adapting post-processing of the video stream to decrease video stream post-processing time.
In a further aspect an implementation of this description provides a multimedia player for use to render a multimedia file including audio and video content, the multimedia player including an audio decoder and audio renderer configured to output an audio timestamp and a rendered audio stream from the multimedia file, a video decoder to output a decoded video stream comprising a video timestamp, a video renderer configured to output a rendered video stream from the multimedia file, one or more post-processing blocks, a media clock configured to receive the audio timestamp from the audio renderer and store the received audio timestamp as a media time, and a video scheduler configured to check the media clock for the media time to determine if a video timestamp of the decoded video stream is within a threshold of the media time, and if not then adapting post-processing of the video stream to decrease video stream post-processing time.
In another further aspect an implementation of this description provides a mobile device including a processor and a multimedia player application stored on a computer readable medium accessible to the processor, the multimedia player application comprising instructions to cause the processor to decode an audio stream of the media player and render the decoded audio stream in the media player, update a media time of the media player with an audio timestamp of the rendered audio stream as the audio stream is rendered, and while decoding and rendering the audio stream, decode a video stream and check the media time to determine if a video timestamp of the decoded video stream is within a threshold of the media time, and if not then adapting post-processing of the video stream to decrease video stream post-processing time.
In yet a further aspect an implementation of this description provides a method of operating a media player executing on a mobile device includes decoding a video stream and checking if processing of the video stream is within a plurality of thresholds, and if processing of the video stream is not within one of a plurality of thresholds then adapting post-processing of the video stream to decrease video stream post-processing, such adapting in different post-processing steps depending on the threshold exceeded by the processing of the video stream.
The plurality of threshold can include one or more thresholds based on video lag between the video stream and the audio stream within the media player. The plurality of thresholds can include one or more thresholds based on processor usage by processing of the video stream.
Other aspects and implementations of those aspects, and further details of the above aspects and implementations, will be evident from the detailed description herein.
Application of one or more of the above-described techniques may provide one or more advantages. For example, a user of a media player may experience a more pleasant rendition of audio and video output. Audio and video data may seem to be more in synchronization. Further, a media player may be better able to support multimedia files of a range of complexities.
It will be appreciated that the particular options, outcomes, applications, screen shots, and functional modules shown in the FIGS. and described above are for illustrative purposes only and many other variations can be used according to the principles described.
Although the above has been described with reference to certain specific implementations, various modifications thereof will be apparent to those skilled in the art as outlined in the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2,684,678 | Nov 2009 | CA | national |
This application claims the benefit of the filing date of U.S. Provisional Patent Application No. 61/257,680 filed Nov. 3, 2009 and Canadian Patent Application No. 2,684,678 filed Nov. 3, 2009 under the title SYSTEM AND METHOD FOR DYNAMIC POST-PROCESSING ON A MOBILE DEVICE. The contents of the above applications are hereby expressly incorporated by reference into the detailed description hereof.
Number | Date | Country | |
---|---|---|---|
61257680 | Nov 2009 | US |