METHOD AND SYSTEM FOR ERROR ROBUST AUDIO PLAYBACK TIME STAMP REPORTING

Abstract
A method and system for resynchronizing an embedded multimedia system using bytes consumed in an audio decoder. The bytes consumed provides a mechanism to compensate for bit error handling and correction in a system that does not require re-transmission. The audio decoder keeps track of the bytes consumed and periodically reports the bytes consumed. A host microprocessor indexes the actual bytes consumed since bit errors may have been handled or corrected to a predetermined byte count to determine whether resynchronization is necessary.
Description

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing summary, as well as the following detailed description of preferred embodiments of the invention, will be better understood when read in conjunction with the accompanying drawings. For the purpose of illustrating the invention, there is shown in the drawings embodiments which are presently preferred. It should be understood, however, that the invention is not limited to the precise arrangement shown. In the drawings:



FIG. 1 illustrates a block diagram of a multimedia system in accordance with the present invention;



FIG. 2A illustrates a block diagram of the multimedia system in FIG. 1 with details of the audio decoder shown in phantom;



FIG. 2B illustrates a block diagram of a multimedia system in FIG. 1 with details of the host microprocessor shown in phantom;



FIG. 3 illustrates a flowchart of an error robust audio playback time stamp reporting method by the audio decoder in accordance with the present invention;



FIG. 4 illustrates a flowchart of the system resynchronization method in accordance with the present invention;



FIG. 5 illustrates a flowchart of the host microprocessor control for the error robust audio playback time stamp reporting method in accordance with the present invention;



FIG. 6 illustrates a block diagram of an alternate embodiment of the multimedia system in accordance with the present invention with details of the audio decoder shown in phantom.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

While this invention is susceptible of embodiments in many different forms, this specification and the accompanying drawings disclose only some forms as examples of the use of the invention. The invention is not intended to be limited to the embodiments so described, and the scope of the invention will be pointed out in the appended claims.


The preferred embodiment of the embedded multimedia system according to the present invention is described below with a specific application to a variable length coding scheme. The multimedia system may employ a media play back application. The audio decoder may be compatible with windows media audio, an advanced audio coding (AAC) decoder, an AAC Plus decoder, an enhanced AAC plus decoder (eAAC+), MP3, and Real Audio bitstream formats. However, it will be appreciated by those of ordinary skill in the art that the present invention is also well adapted for other types of audio decoders including for Game Audio applications.


Referring now to the drawings in detail, wherein like numerals are used to indicate like elements throughout, there is shown in FIGS. 1 and 2A-2B an embodiment of an embedded multimedia system, generally designated at 10, according to the present invention.


In general, the embedded multimedia system 10 includes a plurality of media outputs such as audio through speaker 44 and video through video display 46. The system 10 further outputs a playback time display 48 (shown in phantom) which may be integrated in or separate from the video display 46. While the embedded multimedia system 10 illustrated includes both audio and video, any one user-selected program may include only audio or both audio and video.


In the embedded multimedia system 10, the host microprocessor 12 includes program instructions which assemble a packed audio bitstream in a bitstream assembling module 14 and sends the packed audio bitstream to the audio decoder 32 in the DSP 30, as best seen in FIG. 2B. The audio decoder 32 includes program instructions which decodes the packed audio bitstream into a waveform which is played out through speaker 44. In general, each frame in the packed audio bitstream of a program has associated therewith a predetermined audio playback time and byte count which is constructed by a Look-Up-Table (LUT) constructing module 15 and subsequently stored in a Look-Up-Table (LUT) 22.


In the exemplary embodiment, the predetermined audio playback times for the frames FR1, FR2 . . . FRX of a program are denoted as 24A, 24B, 24X. Furthermore, the LUT 22 has a predetermined byte count 26A, 26B, . . . , 26X associated with each frame's predetermined audio playback time 24A, 24B, . . . 24X.


The host microprocessor 12 further includes program instructions operable to function as an error-robust playback time stamp report cross referencing module 16 which cross references the stored predetermined audio playback time 24A, 24B . . . 24X for a particular frame in the LUT 22. Moreover for resynchronization operations, the host microprocessor 12 further includes program instruction operable as a resynchronization evaluation module 18 and a resynchronization module 20.


The audio decoder 32 includes program instructions operable to function as an error-robust audio playback time stamp report generator 40. The error-robust audio playback time stamp report generator 40 includes a report interval comparer 34, sample counter 36, and a byte counter 38, as best seen in FIG. 2A.


The report interval comparer 34 compares a call back interval 54 with time so that the error-robust audio playback time stamp report generator 40 can send the report. The report interval comparer 34 may receive and extract one or more of a call back intervals 54 stored in shared memory 50, the call back interval 54 being written by the host microprocessor 12. The audio decoder 32 can use the call back interval for use throughout the program.


The error-robust audio playback time stamp report generator 40 creates an error-robust audio playback time stamp report which is sent to the host microprocessor 12. The error-robust audio playback time stamp report includes a sample count from the sample counter 36 and the byte count from the byte counter 38.


In operation, the host microprocessor 12 will set and/or send a call back interval 54 in terms of a number (N) of audio samples played out to instruct the audio decoder 32 as to how frequently the error-robust audio playback time stamp report should be sent back to the host microprocessor 12. The call back interval 54 can be used to assist the host microprocessor 12 in constructing the LUT 22 by the LUT constructing module 15. The audio decoder 32 extracts the call back interval 54 from the shared memory 50. Alternately, the call back interval 54 may be sent in a host command to the DSP 30. Furthermore, the audio decoder 32 keeps track of how many bytes of the bitstream are consumed in byte counter 38, and how many samples are decoded/sent out to the speaker 44 via the sample counter 36. The error-robust audio playback time stamp report generator 40 reports back both the sample count of the sample counter 36 and the byte count of byte counter 38 at the requested call back interval 54 or integer multiples of the call back interval 54.


The host microprocessor 12 maintains a look up table (LUT) 22 that maps the number of bitstream bytes (byte count) 26A, 26B, . . . , 26X with the predetermine audio playback time 24A, 24B, . . . , 24X from the start of the playback. When the host microprocessor 12 receives the error-robust audio playback time stamp report, the host microprocessor 12 recognizes that the number of samples may not be an accurate indication of timing due to the possibility of at least one bitstream error. Therefore, the host microprocessor 12 uses the number of bytes consumed (byte count) from the report to index into the LUT 22 to find the predetermined audio playback time 24A, 24B, . . . , 24X. Preferably, the “index into the LUT 22” should use a closest match search to find the entry in the LUT 22 that is the closest to the byte count sent in the report, since any errors in the packed audio bitstream may result in the byte count being temporarily not aligned with a frame boundary.


With the byte count and sample count known, the host microprocessor 12 can determine where the audio is in playout and syncs up at least one of the audio, the video to associated with the audio, and playback time display.


When a command for the media (audio or video) playback to rewind, fast forward or stop is received, the host microprocessor 12 instructs the audio decoder 32 to do so by sending a corresponding command. The audio decoder 32 will reset the sample and byte counters 36 and 38. Furthermore, the host microprocessor 12 will re-construct the LUT 22 via the LUT constructing module 15. Sample and byte commands may be used to reset the sample and byte counters 36 and 38 to zero. Alternately, the sample and byte commands may simply advance forward or backward, the sample and byte counters 36 and 38.


The host microprocessor 12 and/or DSP 30 know when to reset the byte/sample counter based on the determinations made at steps S172A, 172B, S172C of FIG. 5. The resynchronization is controlled by the host microprocessor 12 such as to delete a video frame or delay a video frame since the audio playback is in real time and human ears are very sensitive to dropped or added audio samples. With video playback, a frame can be dropped or played twice.


Various bitstream error detecting and handling schemes are available in audio decoders. The audio decoder 32 could either silence out the output or conceal the output pulse code modulation (PCM) when it detects or hits an error bitstream. Since audio decoding has been well established no further explanation of decoding and error handling are set forth.


The error robust audio playback time stamp reporting method 100 and multimedia system 10 take advantage of the fact that the bit stream length remains the same even under hostile channel conditions. The host microprocessor 12 uses the audio byte count to cross reference the predetermined audio playback time through the packed bitstream position at an audio bitstream frame boundary and therefore can determine accurate “actual” audio playback time, regardless of any errors in the bitstream such as for resynchronization of system 10.


Referring now to FIG. 3, the error robust audio playback time stamp reporting method 100 enables error robust time stamp reporting that can be implemented in embedded multimedia systems 10 to facilitate in the host's resynchronization of the system 10 via method 150 (FIG. 4). The method 100 begins with step S102 where the audio decoder 32 decodes the packed audio bitstream from the host microprocessor 12. Step S102 is followed by steps S104 and S106. At step S104 as decoding takes place, the byte counter 38 counts the bytes consumed by the decoder at step S104. Returning again to step S106, step S106 determines whether an error is detected during the decoding process. If the determination is “NO,” step S106 is followed by step S109 where the samples which are played out through speaker 44 at step S110 are counted.


Returning again to step S106, if the determination at step S106 is “YES” (meaning an error has been detected), then step S106 is followed by step S108 where the error is handled using known error correction and handling techniques suitable for the error detected. Step S108 is also followed by step S109 where the samples which are played out at step S110 (step S110 follows step S109) are counted.


As can be appreciated, steps S102, S106, and S108 are part of the decoding process where the audio decoder 32 decodes and corrects or handles bit errors in the received bitstream before being playout (step S110) through the speaker 44.


Returning again to step S109, step S109 is followed by step S112 to determine if the call back interval 54 or an integer multiple of the call back interval 54 has been reached. The call back interval 54 is a function of the sample count. If the determination is “NO,” step S112 returns to step S109. If the determination at step S112 is “YES,” step S112 is followed by step S114 where the error-robust audio playback time stamp report with the byte and sample counts are sent to the host microprocessor 12. Steps S114 is followed by step S116 where a determination is made whether there is any more programming. If the determination at step S120 is “YES,” then step S120 returns to step S102. Otherwise, if the determination is “NO” at step S120, then the method 100 ends.


Referring now to FIG. 4, the system resynchronization method 150 is shown. The method 150 begins with step S152 where the host microprocessor 12 receives the error-robust audio playback time stamp report with the byte count and sample count. Step S152 is followed by step S154 where the predetermined audio playback time 24A, 24B, . . . , 24X for the frame(s) of the packed bitstream is looked up or searched in the LUT 22. The closest match search can be used to find the entry for the predetermined audio playback time which is closest to the byte count. Step S154 is performed by the error-robust playback time stamp report cross referencing module 16.


Step S154 is followed by S156 where the actual audio playback time is calculated using the byte count and the cross referenced predetermined audio playback time. Step S156 is followed by step S158 where a determination is made whether resynchronization is needed based on the result of the calculation in step S156. If the determination is “NO,” the method 150 ends. Steps S156 and S158 are performed by the resynchronization evaluation module 18.


On the other hand, if the determination at step S158 is “YES,” step S158 is followed by steps S160 and S162. At step S162, the resynchronization module 20 will synchronize the playback time display. At step S160, a determination is made whether the program also includes video. If the determination is “NO,” step S160 is followed by step S164 where the audio-audio sync is synchronized. On the other hand, if the determination is “YES” at step S160, the program includes both video and audio. Therefore, the audio-video sync needs to be synchronized at step S166. Steps S162, S164 and S166 end the resynchronization method 150. If resynchronization is done with insertion or delaying of a video frame or other manipulation of the video, the LUT 22 does not need to be modified. Thus, the resynchronization at any point in time of the video would realign the LUT 22 so that the next reporting of the sample count and the byte count would be aligned with the LUT 22 (assuming no error occurred). On the other hand, if resynchronization required an audio frame to be dropped or otherwise manipulated, the LUT 22 may require reconstruction.


The embedded multimedia system 10 may require the synchronization information for other media applications not mentioned above since to mention each specific application would be prohibitive.


Referring now to FIG. 5, a flowchart of the host microprocessor control method 170 of the audio decoder 32 for the error robust audio playback time stamp reporting is shown. The method 170 begins with steps S172A, 172B, and 172C. In operation, the host microprocessor 12 can receive a user command for playback of a new program at step S172A, fast forward the program at step S172B, and rewind the program at step S172C.


Steps S172A, 172B, and/or 172C are followed by step S174 where the host microprocessor 12 constructs or re-constructs the LUT 22 with a new byte count S174 and the predetermined audio playback time 24A, 24B, . . . , 24X associated therewith. Step S174 is followed by step S176 (shown in phantom) where the sample counter 36 is reset. Step S176 (shown in phantom) is followed by step S178 (shown in phantom) where the byte counter 38 is reset. The LUT 22 is constructed using a new or reset byte count. In one embodiment, the sample counter 36 and byte counter 38 may be reset to zero (0) or some other number determined by the host microprocessor 12 and/or DSP 30. Step S178 (shown in phantom) is followed by optional step S180 (shown in phantom) where a call back interval 54 or counter for the call back interval 54 is set or reset. The call back interval 54 can be sent as deemed appropriate. For example, the call back interval 54 at step S180 may only be set when a new program is started for playback and not at other times.


The steps S176 and S178 are shown in phantom to indicate, that as the result of the host microprocessor 12 receiving a user input for a new program selection, fast-forward or rewind, the interface between the host microprocessor 12 and the DSP 30 would cause the sample counter 36, the byte counter 38 to be set or reset. Likewise, the call back interval 54 may be set or reset. Moreover, the resetting of the sample counter 36 and the byte counter 38 may also be done automatically by the DSP 30.


In the embodiment of FIGS. 1, 2A, and 2B, the interface between the host microprocessor 12 and the DSP 30 to communicate the call back interval 54 is a shared memory 50. In this embodiment, the host microprocessor 12 would write the call back interval 54 in the shared memory 50. The DSP 30 would read the call back interval 54 from the shared memory 50 for use by the error-robust audio playback time stamp report generator 40. Alternately, the interface between the host microprocessor 12 and the DSP 30 to communicate the call back interval 54 could be a command sent from the host microprocessor 12 to the DSP 30.


Referring now to FIG. 6, a block diagram of an alternate embodiment of the multimedia system 10′ with details of the DSP 30 (shown in phantom) is shown. In this embodiment, the interface between the host microprocessor 12 and the DSP 30 to communicate the call back interval 54 is a shared memory 50′. The interface between the DSP 30 to the host microprocessor 12 to send the sample count 52 and the byte count 53 of the error-robust audio playback time stamp report includes writing, by the audio decoder 32′, the sample count 52 and the byte count 53 in memory locations of the shared memory 50′. The host microprocessor 12 would read the sample count 52 and the byte count 53 from the shared memory 50′. Furthermore, the error-robust audio playback time stamp report generator 40′ sends an error-robust audio playback time stamp interrupt to the host microprocessor 12. The interrupt notifies the host microprocessor 12 of the availability of the sample count 52 and the byte count 53 in the shared memory 50′.


In view of the above, the present invention provides an error robust reporting method and embedded multimedia system 10 which achieves accurate time stamp reporting back to the host microprocessor for system synchronization. The method and system 10 also compensate for the cumulative nature of error in an audio bitstream to synchronize the system.


It will be appreciated by those of ordinary skill in the art that by the embedded multimedia system, method and program instructions disclosed herein, the error robust reporting achieves accurate time stamp reporting and compensates for the cumulative nature of errors in an audio bitstream. The error robust reporting also compensates for the cumulative nature of errors when the protocol does not require re-transmission. For example, the protocol may be the UDP Lite or other “best-effort” (un-reliable) protocols. The error robust reporting also allows improved synchronization.


The foregoing description of the embodiments of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and modifications and variations are possible in light of the above teachings or may be acquired from practice of the invention. The embodiments were chosen and described in order to explain the principles of the invention and its practical application to enable one skilled in the art to utilize the invention in various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the claims appended hereto, and their equivalents.

Claims
  • 1. A multimedia system having multiple processors comprising: an audio decoder operable to decode an audio bitstream, count bytes consumed by the audio decoder, and report the count of the bytes consumed; anda host microprocessor operable to resynchronize the system based on the count of the bytes consumed.
  • 2. The system of claim 1; wherein the audio decoder further comprising a sample counter operable to count a number of decoded samples of the audio bitstream played out by a speaker.
  • 3. The system of claim 2; wherein the audio decoder comprises an error-robust audio playback time stamp report generator operable to generate the report, the report further comprising the count of the sample counter.
  • 4. The system of claim 3; further comprising a shared memory wherein the host microprocessor includes a bitstream assembling module operable to assemble and communicate a bitstream, and write a call back interval in the shared memory for use by the audio decoder, the call back interval being a function of the number of the decoded samples and indicates a frequency in which the report is sent to the host microprocessor; and the error-robust audio playback time stamp report generator writes the count of the sample counter and the bytes consumed of the report in the shared memory for use by the host microprocessor.
  • 5. The system of claim 1; wherein the host microprocessor includes a look-up-table (LUT) having a predetermined byte count and a predetermined audio playback time associated with the predetermined byte count; and an error-robust playback time stamp report cross referencing module which cross references the count of the bytes consumed and the predetermined byte count indexed in the LUT.
  • 6. The system of claim 5; wherein the host microprocessor includes a LUT constructing module for constructing the LUT for a predetermined audio program.
  • 7. The system of claim 6: wherein the host microprocessor is responsive to a user input command to fast forward, rewind, or stop, wherein in response to the user input command to fast forward or rewind, the host microprocessor is operable to reconstruct the LUT and the audio decoder is operable to reset the bytes consumed.
  • 8. The system of claim 1; wherein the audio decoder is further operable to interface with the host microprocessor to receive a call back interval and reset the bytes consumed.
  • 9. The system of claim 1; wherein when the host microprocessor resynchronizes the system, a playback time display indicative of an audio playout of the decoded audio bitstream is resynchronized.
  • 10. The system of claim 1; wherein when the host microprocessor resynchronizes the system, a video stream related to the audio bitstream is resynchronized.
  • 11. The system of claim 1; wherein the audio decoder includes program instructions executed by a digital signal processor (DSP) and the host microprocessor includes program instructions executed by an Advance RISC Machine (ARM).
  • 12. The system of claim 1; wherein the audio decoder is compatible with at least one of windows media audio, an advanced audio coding (AAC) decoder, an AAC Plus decoder, an enhanced AAC plus decoder (eAAC+), MP3, and Real Audio bitstream formats.
  • 13. A multimedia system comprising: decoding means for decoding an audio bitstream, count bytes consumed during decoding and reporting the count of the bytes consumed; andprocessing means for processing the bytes consumed and resynchronizing the system based on the count of the bytes consumed.
  • 14. The system of claim 13; wherein the decoding means further comprises sample counting means for counting a number of decoded samples of the audio bitstream played out.
  • 15. The system of claim 14; wherein the decoding means further comprises reporting means for reporting the count of the sample counting means.
  • 16. The system of claim 15; further comprising storing means for storing a call back interval wherein the processing means includes means for assembling and communicating a bitstream and writing the call back interval to the storing means for use by the decoding means, the call back interval being a function of the number of the decoded samples and indicates a frequency in which the reporting means writes the count of the sample counting means and the bytes consumed to the storing means for use by the processing means.
  • 17. The system of claim 13; wherein the processing means includes a look-up-table (LUT) having a predetermined byte count and a predetermined audio playback time associated with the predetermined byte count; and cross referencing means for cross referencing the count of the bytes consumed and the predetermined byte count indexed in the LUT.
  • 18. The system of claim 17; wherein the processing means includes constructing means for constructing the LUT for a predetermined audio program.
  • 19. The system of claim 18: wherein the processing means is responsive to a user input command to fast forward, rewind, or stop, wherein in response to the user input command to fast forward or rewind, the processing means the constructing means includes reconstructing means for reconstructing the LUT and the decoding means includes resetting means for resetting the bytes consumed.
  • 20. The system of claim 13; wherein the decoding means further comprises interfacing means for interfacing with the processing means to receive a call back interval and reset the bytes consumed.
  • 21. The system of claim 13; wherein the processing means includes resynchronizing means for resynchronizing a playback time display indicative of an audio playout of the decoded audio bitstream.
  • 22. The system of claim 13; wherein the processing means includes resynchronizing means for resynchronizing a video stream related to the audio bitstream.
  • 23. The system of claim 13; wherein the decoder means is compatible with at least one of windows media audio, an advanced audio coding (AAC) decoder, an AAC Plus decoder, an enhanced AAC plus decoder (eAAC+), MP3, and Real Audio bitstream formats.
  • 24. A method of resynchronizing a multimedia system comprising the steps of: decoding an audio bitstream by an audio decoder and counting bytes consumed during the decoding;reporting a count of the bytes consumed to a host microprocessor;processing a reported bytes consumed; andresynchronizing the system based on the reported bytes consumed.
  • 25. The method of claim 24; further comprising the step of: during the decoding step, counting a number of decoded samples of the audio bitstream played out.
  • 26. The method of claim 25; wherein the reporting step further comprises the step of reporting a count of the number of decoded samples.
  • 27. The method of claim 26; further comprising the steps of assembling and communicating a bitstream; and setting a call back interval for use during the decoding step wherein the call back interval is a function of the number of the decoded samples and indicates a frequency of the reporting step.
  • 28. The method of claim 27; wherein the reporting step comprises the steps of writing the bytes consumed and the count of the decoded sampled by the audio decoder in memory for use by the host microprocessor.
  • 29. The method of claim 24; wherein the processing step comprises: cross referencing the count of the bytes consumed with a predetermined byte count in a look-up-table (LUT) having the predetermined byte count and a predetermined audio playback time associated with the predetermined byte count;.
  • 30. The method of claim 29; further comprising the step of prior to the decoding step, constructing the LUT for a predetermined audio program.
  • 31. The method of claim 24; wherein resynchronizing step comprises the step of resynchronizing a playback time display indicative of an audio playout of the decoded audio bitstream.
  • 32. The method of claim 24; wherein the resynchronizing step includes the step of resynchronizing a video stream related to the audio bitstream.
  • 33. The method of claim 24; wherein the decoding step is compatible with at least one of windows media audio, an advanced audio coding (AAC) decoder, an AAC Plus decoder, an enhanced AAC plus decoder (eAAC+), MP3, and Real Audio bitstream formats.
  • 34. An audio decoder for use in an embedded multimedia system and which is operable to decode an audio bitstream, count bytes consumed, and periodically report the count of the bytes consumed to a host microprocessor for resynchronizing the system.
  • 35. The decoder of claim 34; wherein the audio decoder is further operable to count a number of decoded samples of the audio bitstream played out by a speaker.
  • 36. The decoder of claim 35; wherein the audio decoder is further operable to report the count of the sample counter.
  • 37. The decoder of claim 36; wherein the audio decoder is further operable to write the count of the sample counter and the bytes consumed to memory and retrieve a call back interval from the memory.
  • 38. The decoder of claim 34; wherein the audio decoder is compatible with at least one of windows media audio, an advanced audio coding (AAC) decoder, an AAC Plus decoder, an enhanced AAC plus decoder (eAAC+), MP3, and Real Audio bitstream formats.
  • 39. Program instructions executable by multiple processors of an embedded multimedia system, the program instructions upon execution being operable to decode an audio bitstream, count bytes consumed during decoding, and report the count of the bytes consumed; andresynchronize the system based on the count of the bytes consumed.
  • 40. The program instructions of claim 39; wherein the program instructions are further operable upon execution to count a number of decoded samples of the audio bitstream played out by a speaker.
  • 41. The program instructions of claim 40; wherein the program instruction operable to report are further operable upon execution to report the count of the number of decoded samples.
  • 42. The program instructions of claim 39; wherein the program instructions upon execution are further operable to assemble and communicate a bitstream and write a call back interval to memory wherein the call back interval is a function of the number of the decoded samples and indicates a frequency in which report by the instructions operable to decode.
  • 43. The program instructions of claim 39; wherein the program instructions upon execution are further operable to construct a look-up-table (LUT) having a predetermined byte count and a predetermined audio playback time associated with the predetermined byte count; and cross reference the count of the bytes consumed and the predetermined byte count indexed in the LUT.
  • 44. The program instructions of claim 43; wherein the program instructions upon execution are responsive to a user input command to fast forward, rewind, or stop, wherein in response to the user input command to fast forward or rewind, the program instructions upon execution are operable to communicate reset the count of the bytes consumed and reconstruct the LUT.