Transmission of text, images, voice and speech signals across communication channels, including the Internet, is increasing rapidly, as is the provision of multimedia services capable of accommodating various types of information, such as text, images and music. Multimedia signals, including speech and music signals, require a broad bandwidth at the time of transmission. Therefore, to transmit multimedia data, including text, images and audio, it is highly desirable that the data is compressed.
Compression of digital speech and audio signals is well known. Compression is generally required to efficiently transmit signals over a communications channel, or to store compressed signals on a digital media device, such as a solid-state memory device or computer hard disk.
A fundamental principle of data compression is the elimination of redundant data. Data can be compressed by eliminating redundant temporal information such as where a sound is repeated, predictable or perceptually redundant. This takes into account human insensitivity to high frequencies.
Generally, compression results in signal degradation, with higher compression rates resulting in greater degradation. A bit stream is called scalable when parts of the stream can be removed in a way that the resulting sub-stream forms another valid bit stream for some target decoder, and the sub-stream represents the source content with a reconstruction quality that is less than that of the complete original bit stream but is high when considering the lower quantity of remaining data. Bit streams that do not provide this property are referred to as single-layer bit streams. The usual modes of scalability are temporal, spatial, and quality scalability. Scalability allows the compressed signal to be adjusted for optimum performance over a band-limited channel.
Scalability can be implemented in such a way that multiple encoding layers, including a base layer and at least one enhancement layer, are provided, and respective layers are constructed to have different resolutions.
While many encoding schemes are generic, some encoding schemes incorporate models of the signal. In general, better signal compression is achieved when the model is representative of the signal being encoded. Thus, it is known to choose the encoding scheme based upon a classification of the signal type. For example, a voice signal may be modeled and encoded in a different way to a music signal. However, signal classification is generally a difficult problem.
An example of a compression (or “coding”) technique that has remained very popular for digital speech coding is known as Code Excited Linear Prediction (CELP), which is one of a family of “analysis-by-synthesis” coding algorithms. Analysis-by-synthesis generally refers to a coding process by which multiple parameters of a digital model are used to synthesize a set of candidate signals that are compared to an input signal and analyzed for distortion. A set of parameters that yield the lowest distortion is then either transmitted or stored, and eventually used to reconstruct an estimate of the original input signal. CELP is a particular analysis-by-synthesis method that uses one or more codebooks that each essentially comprises sets of code-vectors that are retrieved from the codebook in response to a codebook index.
In modern CELP coders, there is a problem with maintaining high quality speech and audio reproduction at reasonably low data rates. This is especially true for music or other generic audio signals that do not fit the CELP speech model very well. In this case, the model mismatch can cause severely degraded audio quality that can be unacceptable to an end user of the equipment that employs such methods.
The accompanying figures, in which like reference numerals refer to identical or functionally similar elements throughout the separate views and which together with the detailed description below are incorporated in and form part of the specification, serve to further illustrate various embodiments and to explain various principles and advantages all in accordance with the present invention.
Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the elements in the figures may be exaggerated relative to other elements to help to improve understanding of embodiments of the present invention.
Before describing in detail embodiments that are in accordance with the present invention, it should be observed that the embodiments reside primarily in combinations of method steps and apparatus components related to selective signal coding base on model fit. Accordingly, the apparatus components and method steps have been represented where appropriate by conventional symbols in the drawings, showing only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein.
In this document, relational terms such as first and second, top and bottom, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. The terms “comprises,” “comprising,” or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. An element preceded by “comprises . . . a” does not, without more constraints, preclude the existence of additional identical elements in the process, method, article, or apparatus that comprises the element.
It will be appreciated that embodiments of the invention described herein may comprise one or more conventional processors and unique stored program instructions that control the one or more processors to implement, in conjunction with certain non-processor circuits, some, most, or all of the functions of selective signal coding base on model fit described herein. Alternatively, some or all functions could be implemented by a state machine that has no stored program instructions, or in one or more application specific integrated circuits (ASICs), in which each function or some combinations of certain of the functions are implemented as custom logic. Of course, a combination of the two approaches could be used. Thus, methods and means for these functions have been described herein. Further, it is expected that one of ordinary skill, notwithstanding possibly significant effort and many design choices motivated by, for example, available time, current technology, and economic considerations, when guided by the concepts and principles disclosed herein will be readily capable of generating such software instructions and programs and ICs with minimal experimentation.
After passing through the channel, a second reconstructed signal 118 is produced by passing the received core layer encoded signal 106′ through a second core layer decoder 120. The second core layer decoder 120 performs the same function as the first core layer decoder 112. If the enhancement layer encoded signal 114 is also passed through the channel 116 and received as signal 114′, it may be passed to an enhancement layer decoder 122. The enhancement layer decoder 122 also receives the second reconstructed signal 118 as an input and produces a third reconstructed signal 124 as output. The third reconstructed signal 124 matches the original signal 102 more closely than does the second reconstructed signal 118.
The enhancement layer encoded signal 114 comprises additional information that enables the signal 102 to be reconstructed more accurately than second reconstructed signal 118. That is, it is an enhanced reconstruction.
One advantage of such an embedded coding system is that a particular channel 116 may not be capable of consistently supporting the bandwidth requirement associated with high quality audio coding algorithms. An embedded coder, however, allows a partial bit-stream to be received (e.g., only the core layer bit-stream) from the channel 116 to produce, for example, only the core output audio when the enhancement layer bit-stream is lost or corrupted. However, there are tradeoffs in quality between embedded vs. non-embedded coders, and also between different embedded coding optimization objectives. That is, higher quality enhancement layer coding can help achieve a better balance between core and enhancement layers, and also reduce overall data rate for better transmission characteristics (e.g., reduced congestion), which may result in lower packet error rates for the enhancement layers.
While many encoding schemes are generic, some encoding schemes incorporate models of the signal. In general, better signal compression is achieved when the model is representative of the signal being encoded. Thus, it is known to choose the encoding scheme based upon a classification of the signal type. For example, a voice signal may be modeled and encoded in a different way to a music signal. However, signal classification is a difficult problem in general.
Although core layer decoder 112 is shown to receive core layer encoded signal 106 that is correspondingly sent to channel 116, the physical connection between elements 104 and 106 may allow a more efficient implementation such that common processing elements and/or states could be shared and thus, would not require regeneration or duplication.
Each enhancement layer encoder 206 receives the original signal 102 and the first reconstructed signal as inputs (or a signal, such as a difference signal, derived from these signals), and the selected encoder produces an enhancement layer encoded signal 208. In one embodiment, the enhancement layer encoder 206 encodes an error signal that is the difference between the reconstructed signal 110 and the input signal 102. The enhancement layer encoded signal 208 contains additional information based on a comparison of the signals s(n) (102) and sc(n) (110). Optionally, it may use parameters from the core layer decoder 104. The core layer encoded signal 106, the enhancement layer encoded signal 208 and the selection signal 204 are all passed to channel 116. The channel represents a medium, such as a communication channel and/or storage medium.
After passing through the channel, a second reconstructed signal 118 is produced by passing the received core layer encoded signal 106′ through a second core layer decoder 120. The second core layer decoder 120 performs the same function as the first core layer decoder 112. If the enhancement layer encoded signal 208 is also passed through the channel 116 and received as signal 208′, it may be passed to an enhancement layer decoder 210. The enhancement layer decoder 210 also receives the second reconstructed signal 118 and the received selection signal 204′ as inputs and produces a third reconstructed signal 212 as output. The operation of the enhancement layer decoder 210 is dependent upon the received selection signal 204′. The third reconstructed signal 212 matches the original signal 102 more closely than does the second reconstructed signal 118.
The enhancement layer encoded signal 208 comprises additional information, so the third reconstructed signal 212 matches the signal 102 more accurately than does second reconstructed signal 118.
While the input and reconstructed signal components may differ significantly in amplitude, a significant increase in amplitude of a reconstructed signal component is indicative of a poorly modeled input signal. As such, a lower amplitude reconstructed signal component may be compensated for by a given enhancement layer coding method, whereas, a higher amplitude (i.e., poorly modeled) reconstructed signal component may be better suited for an alternative enhancement layer coding method. One such alternative enhancement layer coding method may involve reducing the energy of certain components of the reconstructed signal prior to enhancement layer coding, such that the audible noise or distortion produced as a result of the core layer signal model mismatch is reduced.
Referring to
It will be apparent to those of ordinary skill in the art that other measures of signal energy may be used, such as the absolute value of the component raised to some power. For example, the energy of a component Sc(k) may be estimated as |Sc(k)|P, and the energy of a component S(k) may be estimated as |Sc(k)|P, where P is a number greater than zero.
It will be apparent to those of ordinary skill in the art that error energy E_err may be compared to the total energy in the input signal rather than the total energy in the reconstructed signal.
The encoder may be implemented on a programmed processor. An example code listing corresponding to
In this example the threshold values Thresh1 and Thresh2 are set at 0.49 and 0.264, respectively. Other values may be used dependent upon the types of enhancement layer encoders being used and also dependent upon which transform domain is used.
A hysteresis stage may be added, so the enhancement layer type is only changed if a specified number of signal blocks are of the same type. For example, if encoder type 1 is being used, type 2 will not be selected unless two consecutive blocks indicate the use of type 2.
In a test over 22,803 frames of a speech signal, the type 2 enhancement layer encoder was selected in only 227 frames, that is, only 1% of the time. In a test over 29,644 frames of music, the type 2 enhancement layer encoder was selected in 16,145 frames, that is, 54% of the time. In the other frames the core encoder happens to work well for the music and the enhancement layer encoder for speech was selected. Thus, the comparator/selector is not a speech/music classifier. This is in contrast to prior schemes that seek to classify the input signal as speech or music and then select the coding scheme accordingly. The approach here is to select the enhancement layer encoder dependent upon the performance of the core layer encoder.
In this embodiment, the enhancement layer encoder is responsive to an error signal, however, in an alternative embodiment, the enhancement layer encoder is responsive the input signal and, optionally, one or more signals from the core layer encoder and/or the core layer decoder. In a still further embodiment, an alternative error signal is used, such as a weighted difference between the input signal and the reconstructed signal. For example, certain frequencies of the reconstructed signal may be attenuated prior to formation of the error signal. The resulting error signal may be referred to as a weighted error signal.
In another alternative embodiment, the core layer encoder and decoder may also include other enhancement layers, and the present invention comparator may receive as input the output of one of the previous enhancement layers as the reconstructed signal. Additionally, there may be subsequent enhancement layers to the aforementioned enhancement layers that may or may not be switched as a result of the comparison. For example, an embedded coding system may comprise five layers. The core layer (L1) and second layer (L2) may produce the reconstructed signal Sc(k). The reconstructed signal Sc(k) and input signal S(k) may then be used to select the enhancement layer encoding methods in layers three and four (L3, L4). Finally, layer five (L5) may comprise only a single enhancement layer encoding method.
The encoder may select between two or more enhancement layer encoders dependent upon the comparison between the reconstructed signal and the input signal.
The encoder and decoder may be implemented on a programmed processor, on a reconfigurable processor or on an application specific integrated circuit, for example.
In the foregoing specification, specific embodiments of the present invention have been described. However, one of ordinary skill in the art appreciates that various modifications and changes can be made without departing from the scope of the present invention as set forth in the claims below. Accordingly, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of the present invention. The benefits, advantages, solutions to problems, and any element(s) that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as a critical, required, or essential features or elements of any or all the claims. The invention is defined solely by the appended claims including any amendments made during the pendency of this application and all equivalents of those claims as issued.
Number | Name | Date | Kind |
---|---|---|---|
4560977 | Murakami et al. | Dec 1985 | A |
4670851 | Murakami et al. | Jun 1987 | A |
4727354 | Lindsay | Feb 1988 | A |
4853778 | Tanaka | Aug 1989 | A |
5006929 | Barbero et al. | Apr 1991 | A |
5067152 | Kisor et al. | Nov 1991 | A |
5327521 | Savic et al. | Jul 1994 | A |
5394473 | Davidson | Feb 1995 | A |
5956674 | Smyth et al. | Sep 1999 | A |
6108626 | Cellario et al. | Aug 2000 | A |
6236960 | Peng et al. | May 2001 | B1 |
6253185 | Arean et al. | Jun 2001 | B1 |
6263312 | Kolesnik et al. | Jul 2001 | B1 |
6304196 | Copeland et al. | Oct 2001 | B1 |
6453287 | Unno et al. | Sep 2002 | B1 |
6493664 | Udaya Bhaskar et al. | Dec 2002 | B1 |
6504877 | Lee | Jan 2003 | B1 |
6593872 | Makino et al. | Jul 2003 | B2 |
6658383 | Koishida et al. | Dec 2003 | B2 |
6662154 | Mittal et al. | Dec 2003 | B2 |
6691092 | Udaya Bhaskar et al. | Feb 2004 | B1 |
6704705 | Kabal et al. | Mar 2004 | B1 |
6775654 | Yokoyama et al. | Aug 2004 | B1 |
6813602 | Thyssen | Nov 2004 | B2 |
6940431 | Hayami | Sep 2005 | B2 |
6975253 | Dominic | Dec 2005 | B1 |
7031493 | Fletcher et al. | Apr 2006 | B2 |
7130796 | Tasaki | Oct 2006 | B2 |
7161507 | Tomic | Jan 2007 | B2 |
7180796 | Tanzawa et al. | Feb 2007 | B2 |
7212973 | Toyama et al. | May 2007 | B2 |
7230550 | Mittal et al. | Jun 2007 | B1 |
7231091 | Keith | Jun 2007 | B2 |
7414549 | Yang et al. | Aug 2008 | B1 |
7461106 | Mittal et al. | Dec 2008 | B2 |
7596486 | Ojala et al. | Sep 2009 | B2 |
7761290 | Koishida et al. | Jul 2010 | B2 |
7801732 | Park et al. | Sep 2010 | B2 |
7840411 | Hotho et al. | Nov 2010 | B2 |
7885819 | Koishida et al. | Feb 2011 | B2 |
7996233 | Oshikiri | Aug 2011 | B2 |
8015017 | Sung et al. | Sep 2011 | B2 |
8060363 | Ramo et al. | Nov 2011 | B2 |
8069035 | Yoshida | Nov 2011 | B2 |
8160868 | Kawashima et al. | Apr 2012 | B2 |
8195454 | Muesch | Jun 2012 | B2 |
8306827 | Yamanashi et al. | Nov 2012 | B2 |
8315863 | Oshikiri | Nov 2012 | B2 |
20020052734 | Unno et al. | May 2002 | A1 |
20030004713 | Makino et al. | Jan 2003 | A1 |
20030009325 | Kirchherr et al. | Jan 2003 | A1 |
20030220783 | Streich et al. | Nov 2003 | A1 |
20040252768 | Suzuki et al. | Dec 2004 | A1 |
20050261893 | Toyama et al. | Nov 2005 | A1 |
20060022374 | Chen et al. | Feb 2006 | A1 |
20060047522 | Ojanpera | Mar 2006 | A1 |
20060173675 | Ojanpera | Aug 2006 | A1 |
20060190246 | Park | Aug 2006 | A1 |
20060241940 | Ramprashad | Oct 2006 | A1 |
20060265087 | Philippe et al. | Nov 2006 | A1 |
20070171944 | Schuijers et al. | Jul 2007 | A1 |
20070239294 | Breuckner et al. | Oct 2007 | A1 |
20070271102 | Morii | Nov 2007 | A1 |
20080065374 | Mittal et al. | Mar 2008 | A1 |
20080120096 | Oh et al. | May 2008 | A1 |
20090024398 | Mittal et al. | Jan 2009 | A1 |
20090030677 | Yoshida | Jan 2009 | A1 |
20090076829 | Ragot et al. | Mar 2009 | A1 |
20090083041 | Yoshida | Mar 2009 | A1 |
20090094024 | Yamanashi et al. | Apr 2009 | A1 |
20090100121 | Mittal et al. | Apr 2009 | A1 |
20090112607 | Ashley et al. | Apr 2009 | A1 |
20090231169 | Mittal et al. | Sep 2009 | A1 |
20090234642 | Mittal et al. | Sep 2009 | A1 |
20090276212 | Khalil et al. | Nov 2009 | A1 |
20090306992 | Ragot et al. | Dec 2009 | A1 |
20090326931 | Ragot et al. | Dec 2009 | A1 |
20100088090 | Ramabadran | Apr 2010 | A1 |
20100169087 | Ashley et al. | Jul 2010 | A1 |
20100169099 | Ashley et al. | Jul 2010 | A1 |
20100169100 | Ashley et al. | Jul 2010 | A1 |
20100169101 | Ashley et al. | Jul 2010 | A1 |
20110161087 | Ashley et al. | Jun 2011 | A1 |
Number | Date | Country |
---|---|---|
0932141 | Jul 1999 | EP |
1483759 | Aug 2004 | EP |
1533789 | May 2005 | EP |
1619664 | Jan 2006 | EP |
1818911 | Aug 2007 | EP |
1845519 | Oct 2007 | EP |
1912206 | Apr 2008 | EP |
1959431 | Aug 2008 | EP |
1959431 | Jun 2010 | EP |
9715983 | May 1997 | WO |
03073741 | Sep 2003 | WO |
2008063035 | May 2008 | WO |
2010003663 | Jan 2010 | WO |
Entry |
---|
Ramo et al. “Quality Evaluation of the G.EV-VBR Speech Codec” Apr. 4, 2008. |
Jelinek et al. “Itu-T G.EV-VBR Baseline Codec” Apr. 4, 2008. |
Jelinek et al. “Classification-Based Techniques for Improving the Robustness of Celp Coders” 2007. |
Fuchs et al. “A Speech Coder Post-Processor Controlled by Side-Information” 2005. |
J. Fessle. “Chapter 2 Discrete-time signals and systems” 2004. |
Tancerel, L. et al., “Combined Speech and Audio Coding by Discrimination,” In Proceedings of IEEE Workshop on Speech Coding, pp. 154-156, (2000). |
Kyung Tae Kim et al.: “A new bandwidth scalable wideband speech/audio coder”, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings, (ICASSP), Orlando FL. May 13-17, 2002, [IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)], New York, NY, IEEE, US vol. 1, May 13, 2002, pp. 1-657. |
Kovesi B et al.: “A scalable speech and audio coding scheme with continuous bitrate flexiblity”, Acoustics, Speech, and Signal Processing, 2004, Proceedings, (ICASSP '04), IEEE International Conference on Montreal, Quebec, Canada May 17-21, 2004, Piscataway, NJ, USA, IEEE, Piscataway, NJ, USA, vol. 1, May 17, 2004, pp. 273-276. |
Daniele Cadel, et al. “Pyramid Vector Coding for High Quality Audio Compression”, IEEE 1997, pp. 343-346, Cefriel, Milano, Italy and Alcatel Telecom, Vimercate Italy. |
“Enhanced Variable Rate Codec, Speech Service Options 3, 68 and 70 for Wideband Spread Spectrum Digital Systems”, 3GPP2 TSG-C Working Group 2, XX, XX, No. C S0014-C, Jan. 1, 2007, pp. 1-5. |
Virette et al “Adaptive Time-Frequency Resolution in Modulated Transform at Reduced Delay” ICASSP 2008; pp. 3781-3784. |
Edler “Coding of Audio Signals with Overlapping Block Transform and Adaptive Window Functions”; Journal of Vibration and Low Voltage fnr; vol. 43, 1989, Section 3.1. |
Ido Tal et al.: “On Row-by-Row Coding for 2-D Constraints”, Information Theory, 2006 IEEE International Symposium on, IEEE, PI, Jul. 1, 2006, pp. 1204-1208. |
Greiser, Norbert: “The International Search Report and The Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed: Feb. 26, 2010, mailed Mar. 10, 2010, all pages. |
Winkler, Gregor: “The International Search Report and the Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed: Jul. 21, 2009, mailed Jul. 28, 2009, all pages. |
Winkler, Gregor: “The International Search Report and The Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed: Jul. 8, 2009, mailed: Jul. 20, 2009, all pages. |
Chat C. Do: “The International Search Report and the Written Opinion of the International Searching Authority”, US Patent Office, completed: May 22, 2008, mailed Jul. 23, 2008, all pages. |
Zimmermann, Elko: “The International Search Report and The Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed: Nov. 14, 2008, mailed: Dec. 15, 2008, all pages. |
Greiser, Norbert: “The International Search Report and The Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed Feb. 25, 2010, mailed: Mar. 5, 2010, all pages. |
Greiser, Norbert: “The International Search Report and The Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed: Mar. 2, 2010, mailed: Mar. 15, 2010, all pages. |
Greiser, Norbert: “The International Search Report and the Written Opinion of the International Searching Authority”, European Patent Office, Rijswijk, completed: Mar. 8, 2010, mailed: Mar. 15, 2010, all pages. |
Ramprashad: “Embedded Coding Using a Mixed Speech and Audio Coding Paradigm” International Journal of Speech Technology Kluwer Academic Publishers Netherlands, Vo. 2, No. 4, May 1999, pp. 359-372. |
Hung et al., Error-Resilient Pyramid Vector Quantization for Image Compression, IEEE Transactions on Image Processing, 1994 pp. 583-587. |
Princen et al., “Subband/Transform Coding Using Filter Sank Designs Based on Time Domain Aliasing Cancellation” IEEE 1987; pp. 2181-2164. |
Ramprashad, “High Quality Embedded Wideband Speech Coding Using an Inherently Layered Coding Paradigm,” Proceedings of International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000, vol. 2, Jun. 5-9, 2000, pp. 1145-1148. |
Ramprashad, “A Two Stage Hybrid Embedded Speech/Audio Coding Structure,” Proceedings of Internationnal Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, May 1998, vol. 1, pp. 337-340, Seattle, Washington. |
International Telecommunication Union, “G.729.1, Series G: Transmission Systems and Media, Digital Systems and Networks, Digital Terminal Equipments—Coding of analogue signals by methods other than PCM,G.729 based Embedded Variable bit-rate coder: An 8-32 kbit/s scalable wideband coder bitstream interoperable with G.729,” ITU-T Recomendation G.729.1, May 2006, Cover page, pp. 11-18. Full document available at: http://www.itu.int/rec/T-REC-G.729.1-200605-l/en. |
Mittal et al., Coding unconstrained FCB excitation using combinatorial and Huffman codes, Speech Coding 2002 IEEE Workshop Proceedings, Oct. 1, 2002, pp. 129-131. |
Ashley et al., Wideband coding of speech using a scalable pulse codebook, Speech Coding 2000 IEEE Workshop Proceedings, Sep. 1, 2000, pp. 148-150. |
Mittal et al., Low complexity factorial pulse coding of MDCT coefficients using approximation of combinatorial functions, Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, Apr. 1, 2007, pp. I-289-I-292. |
3rd Generation Partnership Project, Technical Specification Group Service and System Aspects;Audio codec processing functions;Extended Adaptive Multi-Rate—Wideband (AMR-WB+) codec; Transcoding functions (Release 7), V7.0.0, Mar. 1, 2007. |
Chan et al., “Frequency domain postfiltering for multiband excited linear predictive coding of speech”, In Electronics Letters, pp. 1061-1063, Feb. 27, 1996. |
Chen et al., “Adaptive postfiltering for quality enhancement of coded speech”, In IEEE Transactions on Speech and Audio Processing, vol. 3, No. 1, pp. 59-71, Jan. 1, 1995. |
Andersen et al., “Reverse water-filling in predictive encoding of speech”, In 1999 IEEE Workshop on Speech Coding Proceedings, pp. 105-107, Jun. 20, 1999. |
Makinen et al., “AMR-WB+: a new audio coding standard for 3rd generation mobile audio service”, In 2005 Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. ii/1109-ii/1112, Mar. 18, 2005. |
Faller et al., “Technical advances in digital audio radio broadcasting”, Proceedings of the IEEE, vol. 90, No. 8, pp. 1303-1333, Aug. 1, 2002. |
Salami et al., “Extended AMR-WB for High-Quality Audio on Mobile Devices”, IEEE Communications Magazine, pp. 90-97, May 1, 2006. |
Hung et al., Error-resilient pyramid vector quantization for image compression, IEEE Transactions on Image Processing, vol. 7, No. 10, Oct. 1, 1998. |
Markas T. et al.: “Multispectral image compression algorithms”, Data Compression Conference, 1993, DCC'93, Snowbird, UT, USA Mar. 30-Apr. 2, 1993, Los Alamitos, CA, USA, IEEE Compt. Soc. US, Mar. 30, 1993, pp. 391-400. |
Boris Ya Ryabko et al.: “Fast and Efficient Construction of an Unbiased Random Sequence”, IEEE Transactions on Information Theory, IEEE, US, vol. 46, No. 3, May 1, 2000, ISSN: 0018-9448, pp. 1090-1093. |
Ratko V. Tomic: “Quantized Indexing: Background Information”, May 16, 2006, URL: http://web.archive.org/web/20060516161324/www.1stworks.com/ref/TR/tr05-0625a.pdf, pp. 1-39. |
Chinese Patent Office (SIPO), 1st Office Action for Chinese Patent Application No. 200980153318.0 dated Sep. 12, 2012, 6 pages. |
United States Patent and Trademark Office, “Non-Final Office Action” for U.S. Appl. No. 12/844,199 dated Aug. 31, 2012, 13pages. |
United States Patent and Trademark Office, “Non-Final Office Action” for U.S. Appl. No. 12/187,423 dated Sep. 30, 2011, 9 pages. |
Mexican Patent Office, 2nd Office Action, Mexican Patent Application MX/a/2010/004479 dated Jan. 31, 2002, 5 pages. |
United States Patent and Trademark Office, “Non-Final Office Action” for U.S. Appl. No. 12/345,165 dated Sep. 1, 2011, 5 pages. |
United State Patent and Trademark Office, Non-Final Rejection for Patent U.S. Appl. No. 12/196,414 dated Jun. 4, 2012, 9 pages. |
Ratko V. Tomic: “Fast, Optimal Entropy Coder” 1stWorks Corporation Technical Report TR04-0815, Aug. 15, 2004, pp. 1-52. |
Patent Cooperation Treaty, “PCT Search Report and Written Opinion of the International Searching Authority” for International Application No. PCT/US2011/0266400 Aug. 5, 2011, 11 pages. |
Neuendorf, et al., “Unified Speech Audio Coding Scheme for High Quality oat Low Bitrates” ieee International Conference on Accoustics, Speech and Signal Processing, 2009, Apr. 19, 2009, 4 pages. |
Bruno Bessette: “Universal Speech/Audio Coding using Hybrid ACELP/TCX Techniques”, Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference, Mar. 18-23, 2005, ISSN : III-301-III-304, Print ISBN: 0-78. |
United States Patent and Trademark Office, “Non-Final Rejection” for U.S. Appl. No. 12/047,632 dated Mar. 2, 2011, 20 pages. |
Patent Cooperation Treaty, “PCT Search Report and Written Opinion of the International Searching Authority” for International Application No. PCT/US2011/026660 Jun. 15, 2011, 10 pages. |
Udar Mittal et al., Encoder for Audio Signal Including Generic Audio and Speech Frames, U.S. Appl. No. 12/844,199, filed Jul. 27, 2010. |
Udar Mittal et al., “Decoder for Audio Signal Including Generic Audio and Speech Frames”, U.S. Appl. No. 12/844,206, filed Sep. 9, 2010. |
Office Action for U.S. Appl. No. 12/345,141, mailed Sep. 19, 2011. |
Office Action for U.S. Appl. No. 12/345,165, mailed Sep. 1, 2011. |
Office Action for U.S. Appl. No. 12/047,632, mailed Oct. 18, 2011. |
Office Action for U.S. Appl. No. 12/187,423, mailed Sep. 30, 2011. |
European Patent Office, Supplementary Search Report for EPC Patent Application No. 07813290.9 dated Jan. 4, 2013, 8 pages. |
Cover, T.M., “Enumerative Source Encoding” IEEE Transactions on Information Theory, IEEE Press, USA vol. IT-19, No. 1; Jan. 1, 1973, pp. 73-77. |
Mackay, D., “Information Theory, Inference, and Learning Algorithms” In: “Information Theory, Inference, and Learning Algorithms”, Jan. 1, 2004; pp. 1-10. |
Korean Intellectual Property Office, Notice of Preliminary Rejection for Korean Patent Application No. 10-2010-0725140 dated Jan. 4, 2013. |
The Federal Service for Intellectual Property, Patents and Trade Marks (Rospatent), Decision on Grant, Aug. 12, 2013, all pages. |
Number | Date | Country | |
---|---|---|---|
20090259477 A1 | Oct 2009 | US |