A typical home theater system comprises a display unit, a DVD player or other signal source, an audio video control receiver (AVR), and multiple speakers. The speakers may comprise a center speaker, a subwoofer and two or more pairs (left and right) of surround speakers. The home theater system includes a home theater decoder that creates multiple digital audio signals, which are assigned to the speakers respectively. Each digital audio signal may be composed of a succession of 24-bit samples at a rate of 48 kSa/s. A complete multi-channel audio sample comprises one sample for each channel. Thus, in the case of an eight channel home theater audio system, one multi-channel sample comprises eight 24-bit samples.
The home theater decoder supplies the digital audio signals to the AVR, which converts the digital audio signals to analog form for driving the speakers over wired speaker connections between the AVR and the respective speakers. Alternatively, however, the AVR may include a radio transmitter that transmits the digital audio data wirelessly to radio receivers incorporated in the speakers. In this case, the AVR supplies the digital audio signals to a packetizer, which constructs a transmission packet having a payload that contains the digital audio data for several, e.g. 48, multi-channel audio samples. In the case of the eight channel system, the transmission packet may contain the data for six multi-channel samples.
The radio transmitter employs the transmission packet data bits to modulate a carrier at the frequency of a selected communication channel and transmits the modulated signal via an antenna. In each speaker, a radio receiver receives the modulated signal and detects the modulating transmission packet data bits. An audio processor included in the speaker recovers the digital audio signal assigned to that speaker from the successive transmission packets, converts the digital audio signal to analog form, amplifies the analog audio signal and supplies the audio signal to the audio driver.
It is possible for one or more packets to be lost or damaged in transmission from the AVR to a speaker. This leaves a gap in the sound information and degrades the quality of the reproduction of the audio signal. It is possible to reduce the degradation through various signal processing techniques such as interleaving packets and interpolating between lost or damaged data points.
A signal to be transmitted is input. The elements are interleaved to minimize the impact of data loss. After transmission and reception, the signal is examined for missing or damaged elements, determined if folding is required, and such elements are estimated using an interpolation process.
The foregoing and other objectives, features, and advantages of the invention will be more readily understood upon consideration of the following detailed description of the invention taken in conjunction with the accompanying drawings.
It is possible to minimize errors and maximize the quality of sound transmitted through a noisy and/or lossy channel through the use of certain processing techniques. Interleaving a digital signal spreads out a loss over several packets so that it is diluted by good data. A checksum process detects missing or damaged data, making it possible to fill it in by interpolation. “Folding” a signal makes interpolation more accurate for high frequencies by reversibly converting them to lower frequencies, which are easier to interpolate.
Interleaving a signal consists of breaking down packets into their constituent parts and recombining them into new packets comprising parts from each of the packets to be interleaved. In audio transmission, interleaving four packets has been found to provide a good balance between minimizing losses and minimizing processing time. The actual interleaving algorithm may take many forms, but one common and effective form is to divide each packet into the same number of sections as there are packets to be interleaved, then create a new set of interleaved packets, each with one section from each of the old packets.
In one embodiment, using block interleaving, the first new packet may be constructed out of the first section of each old packet, the second new packet may be made from the second section of each old packet, and so on. The process may be represented symbolically, as shown:
In another embodiment, using recursive interleaving, the process would be represented symbolically at the symbol level as:
Interpolation is most effective when performed on DC or low-frequency signals, where large swings between data points do not occur. For sufficiently high frequencies, interpolation is highly unreliable and may not be better than simply omitting a data point. For signal in this range, it is necessary to reversibly convert the signal to a low frequency one to permit accurate interpolation. When interleaving is combined with frequency conversion and polynomial interpolation, high fidelities at reasonable processing speeds are possible.
The interleaved packets are then transmitted, and the reverse process is performed where the packets are received. The de-interleaved packets match the original transmitted packets, assuming no transmission errors. In this way, if an interleaved packet is lost or damaged in transmission, rather than losing all data in one transmission packet, each de-interleaved packet loses only a portion of the data it would normally contain. Thus the damage to accurate sound reproduction is spread out over a longer time period than would otherwise be the case and becomes less noticeable.
Sound quality can be improved still further if it is possible to replace the missing or damaged section of the de-interleaved packet when it is detected. Lost or damaged data can be detected using a checksum process. “Checksum” as used in this application should be construed to encompass error detection regimes that rely on a mathematical manipulation of the data to produce a single datum (in the form of a bit, byte, or word) which can be transmitted along with the data for comparison on the receiving end after the same manipulation is performed. Other possible systems to detect missing or damaged data include, but are not limited to, longitudinal parity checks, Fletcher's checksum, Adler-32, cyclic redundancy checks (CRC), and MD5, all of which are well known in the art of data transmission and error correction.
When the checksum fails and the missing or damaged interleaved packet is identified, the missing or damaged section of each de-interleaved packet can also be identified. Referring to
Interpolation of any kind is most effective on curves which change only slightly between points, or in the world of audio, temporal representations of low frequencies. At one extreme, a DC signal requires no interpolation at all. At another extreme, a high-frequency signal might have data points that literally alternate between maxima and minima of the signal, leaving the interpolator with little hope of finding an accurate value using a curve fit. For this reason, “folding” the frequency spectrum at a point where interpolation is still successful is desirable.
This “folding” is accomplished by multiplying the sequence of data points in the signal by the binary sequence that is most closely correlated to the signal. A DC signal will be converted to a high-frequency signal swinging strongly between extremes, as
“Folding” low frequencies will make them harder to interpolate. Therefore, the optimal interpolation will be achieved when only high frequencies are folded and low frequencies are not. Digital audio signals are normally time-domain signals, with the frequencies undifferentiated, and therefore it is impossible to fold only certain frequencies. Instead, the entire signal must be folded. For this reason, the decision to fold or not fold must be made on an ongoing basis, depending on the predominant frequencies in the signal. This decision to fold or not fold can is optimally made relatively frequently, after no more than 100 samples have gone by since the last decision, for a signal having about 48,000 samples per second. The decision is therefore made approximately every 0.002 seconds.
Preferably, the signal is examined by the decision-making circuit before it is passed into the rest of the circuit for reproduction. If high frequencies predominate in a given interval, that interval will be folded before interleaving and transmission. If low frequencies predominate, then it will not be folded. Experimentation has shown that interpolation is very successful up to approximately 10 kHz and degrades in quality after that point. Therefore, in a preferred embodiment, “low” frequencies are below 10 kHz and “high” frequencies are above it.
In one embodiment, the interpoler showed dramatic improvement in fidelity when tested at 48 kHz with a radio link of 18 Mbps. The results of this test are shown at Table 1. Under the older, known methods, signal fidelity began to deteriorate at −76 dBm and suffered step degradation thereafter and decreasing power levels. In contrast, the new interpolator maintained much higher signal fidelity though decreasing power levels, as shown below.
The rapid degradation using the older method is also shown in Table 2.
At an output power of −77 dBm, the signal-to-noise and distortion ratio (SINAD) improves from approximately 90 dB to 140 dB. At −80 dBm, the SINAD improves from approximately 10 dB to 80 dB. This improvement is a 300-1000 times increase in power over known methods.
In another embodiment the interpoler showed dramatic improvement in fidelity when tested at 96 kHz with a radio link of 36 Mbps. The results of this test are shown at Table 3. Under the older, known methods, signal fidelity began to deteriorate at −70 dBm and suffered step degradation thereafter and decreasing power levels. In contrast, the new interpolator maintained much higher signal fidelity though decreasing power levels, as shown below.
The rapid degradation using the older method is also shown in Table 4.
The improvement in SINAD here is even more dramatic. At −75 dBm, the SINAD improves from approximately 40 dB under known methods to approximately 130 dB.
Terms and expressions which have been employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims which follow.
Number | Name | Date | Kind |
---|---|---|---|
5386478 | Plunkett | Jan 1995 | A |
5534933 | Yang | Jul 1996 | A |
5666424 | Fosgate et al. | Sep 1997 | A |
6523147 | Kroeger et al. | Feb 2003 | B1 |
7123731 | Cohen et al. | Oct 2006 | B2 |
7155017 | Kim et al. | Dec 2006 | B2 |
7158643 | Lavoie et al. | Jan 2007 | B2 |
7234095 | Lee et al. | Jun 2007 | B2 |
7272073 | Pellegrini et al. | Sep 2007 | B2 |
7319641 | Goudie et al. | Jan 2008 | B2 |
7496004 | Sako et al. | Feb 2009 | B2 |
7676044 | Sasaki et al. | Mar 2010 | B2 |
7873120 | Kroeger | Jan 2011 | B2 |
8199941 | Hudson et al. | Jun 2012 | B2 |
20040240573 | Yuasa et al. | Dec 2004 | A1 |
20050251725 | Huang et al. | Nov 2005 | A1 |
20070133813 | Morishima | Jun 2007 | A1 |
20110107176 | Song et al. | May 2011 | A1 |
20140314177 | Choi et al. | Oct 2014 | A1 |
Number | Date | Country |
---|---|---|
2006131893 | Dec 2006 | WO |
Number | Date | Country | |
---|---|---|---|
20150100857 A1 | Apr 2015 | US |