This invention relates to parametric loudspeakers which utilize the non-linearity of air when excited by high frequency or ultrasonic waves for reproducing frequencies in the audible range. In particular, this invention relates to signal processing and modulators for parametric loudspeakers.
A parametric array in air results from the introduction of sufficiently intense, audio modulated ultrasonic signals into an air column. Self demodulation, or down-conversion, occurs along the air column resulting in an audible acoustic signal. This process occurs because of the known physical principle that when two sound waves with different frequencies are radiated simultaneously in the same medium, a sound wave having a wave form including the sum and difference of the two frequencies is produced by the non-linear interaction (parametric interaction) of the two sound waves. So, if the two original sound waves are ultrasonic waves and the difference between them is selected to be an audio frequency, an audible sound is generated by the parametric interaction. However, due to the non-linearities in the air column down-conversion process, distortion is introduced in the acoustic output. The distortion can be quite severe and 30% or greater distortion may be present for a moderate modulation level. Lowering the modulation level lowers the distortion, but at the expense of both a lower output volume and a lower power efficiency.
In 1965, Berktay formulated that the secondary resultant output (audible sound) from a parametric loudspeaker is proportional to the second time derivative of the square of the modulation envelope. It was shown by Berktay that the demodulated signal, p(t), in the far-field is proportional to the second time derivative of the modulation envelope squared.
The earliest use of this relationship for parametric loudspeakers in air was a modulator design for parametric loudspeakers in 1985. This advancement included the application of a square root function to the modulation envelope. Using the square root function compensates for the natural squaring function which distorts the envelope of the modulated sideband signal emitted to the air. Those skilled in the art have also shown that the square root double sideband signal can theoretically produce a low distortion system but at the cost of infinite system and transducer band width. It is not practical to produce any device that has an infinite bandwidth capability. Further, the implementation of any significant bandwidth means that the inaudible ultrasonic primary frequencies will, on the lower sideband, extend down into the audible range and cause new distortion which is at least as bad as the distortion eliminated by the infinite bandwidth square root pre-processing system.
In a typical application, the desired signal is amplitude modulated (AM) modulated on an ultrasonic carrier of 30 kHz to 50 kHz, then amplified, and applied to an ultrasonic transducer. If the ultrasonic intensity is of sufficient amplitude, the air column will perform a demodulation or down-conversion over some length (the length depends, in part, on the carrier frequency and column shape). The prior art, such as U.S. Pat. No. 4,823,908 to Tanaka, et al., teaches that the modulation scheme to achieve parametric audio output from an ultrasonic emission uses a double sideband signal with a carrier frequency and sideband frequencies spaced on either side of it by the frequency difference corresponding to the audio frequencies of interest.
For example, when amplitude modulating a 6 kHz tone onto a 40 kHz carrier, as shown in
In practice, the first five or six harmonics are enough to give a good approximation of the ideal square rooted wave. However, even when the number of harmonics is limited, the low sideband frequencies still reach down into the audio range and create distortion. As in the foregoing example in
Applying a square root function to the original signal reduces or eliminates the distortion in the demodulated audio but it creates unwanted audible frequencies that are emitted. In the current state of the prior art, the only choice is between high distortion (avoiding the square root function) or a wide bandwidth requirement with less distortion (using a square root function). Further, the square rooted signal for any given ultrasonic frequency is only valid for low level signals. As the ultrasonic power levels are increased to provide significant audio output, the ideal envelope shifts from the square root of the signal to the audio signal itself (or 1 times the signal).
Another problem exhibited by parametric loudspeaker systems is that as the frequency and/or intensity of the ultrasonic sound waves is increased to allow room for lower sidebands and to achieve reasonable conversion levels in the audible range, the air can be driven into saturation. This means that the fundamental ultrasonic frequency is limited as energy is robbed from it to supply the harmonics. The level at which the saturation problem appears is reduced 6 dB for every octave the primary frequency is increased. In other words, the power threshold at which saturation appears, decreases as the frequency increases. Double sideband signal systems used with parametric arrays must always be at least the bandwidth of the signal above any audible frequency (assuming a 20 kHz bandwidth) and even more if the distortion reducing square root function is used which also demands an infinite bandwidth.
A further problem with prior art parametric loudspeakers is that they have a built in high pass filter characteristic such that the amplitude of the secondary signal (audio output) falls at 12 dB per octave for descending frequencies. Because the lower sideband of a double sideband system must be kept from producing output in the audible range, the carrier frequency must be kept at least 20 kHz above the audible upper limit for double sideband (DSB) and at the very least twice that amount with a square rooted DSB. This range forces the carrier frequency up quite high. As a result, the saturation limit is easily reached and the overall efficiency of the system suffers.
These excessive and undesirable types of distortion preclude the practical or commercial use of the uncompensated parametric arrays or even square-rooted compensation schemes in high fidelity applications. Accordingly, it would be an improvement over the state of the art to provide a new method and system for pre-processing the audio signal which would result in lowered distortion with a decreased bandwidth requirement for the ultrasonic parametric array output. It would also be desirable to use lower primary frequencies which are still above the audible range to produce less saturation and attenuation.
It is an object of the present invention to provide a method and apparatus to reduce the primary frequencies of a parametric loudspeaker system to thereby minimize air saturation and increase the conversion efficiency.
It is another object of the present invention to provide a parametric loudspeaker system which corrects distortion without increasing the required bandwidth to reduce the distortion.
It is another object of the present invention to provide a method and system for pre-processing an audio signal that will result in lower distortion and better reproduction of an acoustic audio signal for a parametric array output.
Another object of the present invention is to provide a parametric loudspeaker system that uses a double sideband modulated signal which has a truncated lower sideband.
It is another object of the present invention to provide a parametric loudspeaker system using pre-processed single sideband modulation with reduced bandwidth requirements.
Yet another object of the present invention is to provide a parametric loudspeaker system to eliminate the extended lower sideband of a double sideband modulation scheme used with parametric loudspeakers.
The presently preferred embodiment of the present invention is a signal processor for a parametric loudspeaker system used in air. The signal processor has an audio signal input and a carrier frequency generator to produce a carrier frequency. The audio signal and the carrier frequency are mixed together by a modulator to produce a modulated signal with sideband frequencies which are divergent from the carrier frequency by the frequency value of the audio signal. An error correction circuit is included to compensate for the inherent squaring function distortion by modifying the modulated signal substantially within said modulated signal's bandwidth to approximate the ideal envelope signal. The error correction circuit compares the modulated signal envelope to a calculated ideal square rooted audio signal and generates an inverted error difference which is then added back into the modulated signal to correct for parametric loudspeaker distortion. In one embodiment, an error correction step adds new errors but at a greatly reduced level. This comparison and adding back of the error difference to the original signal can be recursively implemented to decrease the error to a desired level. Each level of recursive error correction tends to reduce the error by more than one half and enough levels of recursive correction should be used to correct the distortion without adding so many levels that more distortion is added. In alternative embodiments of the present invention, the modulated signal can use forms which include but are not limited to a double sideband signal, a truncated double sideband signal or a single sideband signal.
These and other objects, features, advantages and alternative aspects of the present invention will become apparent to those skilled in the art from a consideration of the following detailed description taken in combination with the accompanying drawings.
Reference will now be made to the drawings in which the various elements of the present invention will be given numerical designations and in which the invention will be discussed so as to enable one skilled in the art to make and use the invention. It is to be understood that the following description is only exemplary of certain embodiments of the present invention, and should not be viewed as narrowing the claims which follow.
This invention is a signal processing apparatus and method, implemented either digitally or in analog, which significantly reduces the audible distortion of a parametric array in air. Within this invention, multiple signal processing steps are performed. The input side of the processor(s) accepts a line-level signal from an audio source such as a CD player. In the digital implementation, an analog audio signal will first be digitized or a direct digital input may be received. The first step in the invention multiplies the incoming audio signal by a higher ultrasonic carrier frequency to create a modulated signal. In other words, the carrier frequency is modulated by the incoming audio signal to generate a conventional single sideband (SSB) or double sideband (DSB) signal. The carrier signal is generated by a local oscillator set at the desired frequency. Note that in a multi-channel system (stereo, for example) only one oscillator is preferably used so that all channels have exactly the same carrier frequency. This modulation may produce either a single-sideband (upper sidebands only) (SSB) multiplied with a carrier signal, or a double sideband (DSB) multiplied with a carrier signal. A truncated double sideband (TDSB) signal may also be produced in the invention, where the lower sidebands of a double sideband (DSB) signal are sharply truncated by a filter so nearly all of the frequencies passed are above the carrier.
Next, the calculated envelope of the modulated signal is compared to the calculated “ideal” audio signal with the square root applied. This comparison uses the modulated carrier envelope to compare against the ideal audio signal with the square root applied. The ideal signal is the unmodulated audio signal after it has been offset by a positive DC (direct current) voltage equal in magnitude but opposite to its maximum negative peak value and then square rooted. As mentioned, this is because the audio signal that demodulates in a parametric speaker is proportional to the square of the modulation envelope. Therefore, an envelope that is proportional to the square root of the incoming audio will be converted back to the original audio signal upon demodulation in the medium.
The frequency response of the ultrasonic transducer to be used is also taken into account in the comparison. In other words, a correction is also added which accounts for the distortion created by the transducer (i.e. speaker) when it emits the ultrasonic signals. Before the envelopes are compared, the modulated signal's bandwidth or spectrum is multiplied by the actual frequency response curve of the transducer/amplifier combination. This ensures that the comparison between the ideal envelope and the modulated signal envelope is valid because the modulated signal envelope will be altered by the transducer/amplifier when it is emitted. An embodiment using truncated double side band (TDSB) may be partially truncated by the transducer's high-pass frequency response, or the modulation scheme itself may also truncate the TDSB before it reaches the transducer. This makes it possible to use a simple DSB multiplier unit to generate a conventional DSB signal and a filter and the transducer to convert the DSB signal into a TDSB signal.
The modulated signal envelope is then compared or subtracted from the ideal square rooted signal. This gives a new signal that represents the error. This new signal is then inverted (in phase or in sign) and summed with the original incoming audio signal just ahead of the modulation step. This serves to alter the resulting envelope so that it is a closer match to the ideal envelope. A significant feature of the present invention is the error terms that are calculated and then added back into the audio signal are always within the audio bandwidth of the original audio signal and no extra bandwidth is required. In another embodiment of the invention, the primary distortion correction occurs within the audio signal but some of the distortion correction terms may be outside of the audio signal if the added terms do not produce significant distortion.
Adding the calculated error correction does not correct the envelope in one step, because the envelope's frequency spectrum is not proportional to the incoming audio frequencies only. The envelope is proportional to the square root of the sum of the squares of the modulation spectrum and the modulation spectrum shifted by 90 degrees. In other words, each introduced correction frequency produces other smaller error frequencies that must also be corrected. Accordingly, the error correction is preferably done recursively a number of times until the SSB, DSB or TDSB envelope error versus the ideal signal is within a desired small amount. The number of recursive steps will depend on the desired amount of distortion reduction and on the practical limits of the processor. The modulated signal is then output to an amplifier and ultimately to the ultrasonic transducer where it is emitted into the air or some other medium. The ultrasonic waves then demodulate into the original audio signal according to Berktay's solution.
Each recursive step reduces the total harmonic distortion (THD) error percentage by at least one-half, with the actual amount depending on the incoming spectrum and the modulation method chosen. The number of recursive steps is dependent upon the processing power available and the desired level of correction. Generally, a half-dozen iterations or less of the recursion process produces the desired distortion correction. The processing power required for this level of correction in real-time is low and could be implemented on an inexpensive DSP chip, or equivalent hardware. As previously described, a carrier modulated by a square rooted audio signal has infinite bandwidth and cannot be emitted accurately by any known means. Using this method makes it possible to approximate the ideal envelope without requiring the substantially increased bandwidth that is otherwise required. It should be recognized that error correction could be performed with only one level of error correction if desired. Analog circuitry could also be used instead of a digital or software implementation of the invention.
In a digital embodiment of the invention, the modulated signal which is an ultrasonic frequency would usually be converted back into analog form before amplification. A high sampling rate is needed for a faithful digital to analog conversion in the output stage. For example, if the SSB carrier frequency was 35 kHz, and the input audio bandwidth was 20 kHz (the nominal value), the output signal would have a spectrum from 35 kHz to 55 kHz. A sampling rate of 96 kHz or higher would be a good choice. The standard 44.1 kHz tends to be insufficient for wideband audio. In contrast, certain applications for speech could use lower sampling rates. Further, the output signal for the digital implementation is at line level. This signal would be input to an ultrasonic amplifier which would in turn drive the transducer. Again, the demodulated signal is proportional to the square of the modulation envelope. At higher ultrasonic amplitudes where saturation comes into play, the demodulated audio begins to be proportional to the envelope itself, not its square. This can be taken into account in the error correction compensator if the final drive level is known. For example, if the amplifier and the signal processor were integrated, the error correction scheme could vary with the power output in relation to the amplifier settings. Varying the error correction with the power output is described in more detail later. For simpler systems, the square of the envelope can be used as the demodulation model with good results.
By using a SSB or a TDSB system, the carrier frequency and modulated signal frequencies can be lowered without worrying about the lower sidebands which would otherwise be emitted in the audible range (i.e. audible distortion). The carrier frequency and modulated signal frequencies can be lowered so they are close to the upper limit of the audible range. In this invention, close is defined, as close to the upper limit of the audible range as possible without producing significant distortion and where the carrier signal and sidebands are inaudible.
A lower carrier frequency allows for better conversion efficiency in three ways. First, the attenuation rate of the ultrasound is lower so the effective ultrasonic beam length is longer, and the available energy isn't absorbed by the medium quite so quickly. Second, the shock formation (saturation) length is increased for a given sound pressure level (SPL), so a higher SPL can be used. The higher the SPL used, the greater the conversion efficiency (between ultrasonic and audio). In fact, the amplitude of the audio signal generated is proportional to the square of the ultrasonic SPL. In other words, the gain of the system increases with increasing drive levels, until the saturation limit is reached. The saturation limit is increased by lowering the carrier frequency. Third, a lower carrier frequency increases the volume velocity available to the system and therefore increases the available output in the audible range.
For example, the single sideband (SSB) method is used to specifically decrease the carrier frequency as far as possible which maximizes the efficiency of the ultrasonic-to-audio conversion. With a lower frequency saturation carrier, higher saturation levels can be achieved because the acoustic saturation limit is higher with longer acoustic wavelengths. The ideal envelope can be created using only the upper sidebands of a carrier modulated by an audio signal.
There are several additional advantages to using single sideband (SSB) amplitude modulation. These benefits include: eliminating the need to apply the square root function to the audio, reducing the transducer bandwidth requirements, and greater ultrasonic conversion efficiency because lower carrier frequencies are used. In order to make the ideal envelope to create a single audio tone, SSB without a square root applied gives exactly the same envelope as offsetting, applying the square root, re-offsetting, and using double sideband (DSB) AM. To create a 6 kHz tone when using SSB the following spectra are needed as shown in
Of course as the complexity of the audio signal increases, the SSB method becomes less of a perfect substitute for the full square root method. However, by artificially adding extra upper sideband components within the signal bandwidth, SSB can be made to match the ideal envelope very closely.
Using a SSB or TDSB scheme is advantageous because it more ideally matches the amplitude output of a typical ultrasonic transducer above and below its resonant frequency. For example, the carrier in an SSB or TDSB arrangement would be placed at the fundamental resonant frequency of the transducer for maximum speaker output levels, and the upper sideband frequencies would fall on the upper side of the resonant peak where the transducer operates efficiently. Many transducers work well above the resonance frequency, and poorly below this peak frequency.
Now a more detailed embodiment of the invention which uses a recursive error correction scheme will be discussed and block diagrams of the invention will be described. Although the preferred TDSB method is discussed, SSB or DSB are also thoroughly described. In the invention, a distortion compensator is positioned after the modulator to cancel first-order distortion products. A first order base-band compensator is used which can also be recursively extended to an Nth order distortion compensator. The base-band compensators pre-distort the audio signal prior to modulation. When the first order distortion correction is applied it creates smaller distortion terms which are then corrected in the next level of recursion. Significant distortion improvements have been shown using the Nth order compensator with various modulation schemes.
The first component of the invention models the non-linear demodulation which occurs in the air column of a parametric speaker. This relationship must be modeled to provide a proper approximation of the distortion which is needed to produce the correct acoustic sound wave. The second derivative function in Berktay's solution (Equation 1) presents a linear distortion that may be compensated for by passing the audio signal through a double integrator prior to subsequent processing and modulation. Since the focus here is to control the non-linear distortion component, the derivative which can be handled by simple equalization techniques will be dropped from this discussion.
In an alternative embodiment of the invention, the squaring function in the non-linear demodulator uses an exponent which decreases as the intensity of the ultrasonic signal increases. The demodulation exponent of this invention can increase from ½ to 1 in a smooth curved fashion or it can be linearly interpolated from ½ to 1. Increasing the exponent, models the air saturation that takes place as the power of the ultrasonic signal increases.
An SSB channel model 60 will now be described which models an uncompensated parametric array system that uses a SSB modulator 70. Referring now to
The SSB modulator 70 is expanded in
To summarize the SSB method, the distortion of a SSB modulator with discrete tone input signals can be reduced by this invention. The distortion products have frequencies that are equal to the differences of the primary input signals. Additionally, the distortion tones have a lower amplitude than the primary input tones if the modulation index is less than one (amplitude of the carrier signal is greater than the peak modulated signal amplitude). So, if additional input tones are injected at the distortion frequencies it completely cancels these “first-order” distortion products. The result is that “second-order” distortion products are introduced at the additional tone difference frequencies. However, the amplitude of the secondary distortion products is significantly less than the original distortion amplitude, resulting in an overall improvement of distortion figures. Application of additional canceling tones in a recursive manner further improves output distortion.
Injecting weak tones at the distortion frequencies improves the overall distortion. Distortion-tone injection works by observing the amplitude of the distortion and injecting a tone with the same amplitude and opposite phase. This works because the SSB channel model passes input tones without significant amplitude or phase modification, and superposition (summation) applies at the acoustic output facilitating the cancellation. This assumes a unity gain transducer model.
In the preferred embodiment of this invention compensating for the distortion of broadband signals, not just tones, is desired and the distortion components of a general, wide-band input signal must be estimated. Estimating the distortion in the wide-band modulated signal will now be described.
This invention uses a modulation-side distortion compensator, shown in
This compensator also works for the case the h(t) is approximately unity. The system may be modified to handle an arbitrary transducer response by including a transducer inverse model. This is not detailed here because the base-band distortion compensator discussed below is the most preferred embodiment.
Now, base-band distortion compensators will be addressed. Another method of distortion abatement is to subtract the distortion products from the main modulator input as detailed in
In this embodiment of the system, the SSB channel model 110 is used to derive an estimate of the first order distortion products dist(t). The distortion is estimated by using the SSB Channel model 110 to estimate the distortion 114, and then the original audio input 112 is subtracted from the estimated distorted signal 114 leaving the distortion dist(t), 118. This distortion is scaled by the parameter c, 120 and subtracted 122 from the original audio input 112, resulting in the first-order pre-distorted audio signal, x1(t) at 124. The cancellation parameter, c controls the percentage of the first-order distortion that is canceled.
Since the SSB channel model produces distortion products with frequencies equal to differences of the inputs, no frequency expansion occurs at any node in the system. Thus, if the input bandwidth is limited to 20 kHz, then the bandwidth of the distortion, dist(t), and pre-distorted signal, x1(t) are also limited to 20 kHz. The single sideband modulator simply right shifts (translates) the spectrum of x1(t) and adds a carrier. Therefore, the bandwidth of mod(t) is also limited to 20 kHz (although the center frequency is high). The main implication of this is that the actual transducer bandwidth need only be 20 kHz wide and the inverse filter, h−1(t) need only perform inversion over the 20 kHz band of interest. One of the benefits of this system is that difficult transducer responses may be dealt with easier.
The first-order compensator of
The Nth order distortion compensator may be also viewed as the cascade of distortion models subtracted from the audio input as shown in
0, 1, 2, . . . , N−1 (Equation 2)
where M(·) is the channel model and x0(t) is defined as the input; x0(t)=x(t). Next, define the distortion generator system, D(·) as the difference between the channel model output and its input,
The SSB channel model may simplified which creates a more efficient implementation for the distortion compensators.
Since the SSB channel model is used as part of the distortion controller, an efficient implementation is desirable. The SSB channel model (excluding the transducer response) is expanded in the top 150 of
The basic principle of the Nth order recursive distortion compensator also works with an amplitude modulator. The channel model must be redefined to include the AM modulator as shown in
The ultrasonic transducer will typically cut off or attenuate a portion of the lower sideband of the AM frequency spectrum. For this reason, the filter g(t), is required in the AM channel model to simulate this attenuation. Minimum requirements for this filter is that it be linear phase filter and have a bandpass characteristic similar to the actual transducer used in the system. The filter should be modeled as the cascade of a compensation filter and the transducer filter, that is
There are two alternative approaches to designing the compensation filter. The first option is to choose hcomp(t) as the approximate inverse of the transducer response h(t). This choice will flatten out the amplitude response of the cascade g(t), and linearize the phase. In this case, g(t) is a model of the cascade of the transducer inverse and the transducer filters as in the bottom portion of
The second option is to compensate only for the phase of the transducer model with hcomp(t). Gain variations with frequency will be present in the cascade g(t). In this case, for example, a pair of equal amplitude tones may emerge at the output with different amplitudes. This amplitude error will be treated as distortion. The effect of the Nth order compensator will equalize the amplitude difference between the two tones and improve the distortion. However, performance suffers when compared to using phase and amplitude compensation.
For example, if a transducer with a 40 dB roll-off from 40 kHz to 50 kHz is used, and two equal amplitude tones, 1 kHz and 9 kHz, are input to an uncompensated system, resulting in a ˜35 dB amplitude mismatch. A 6th order compensator will reduce the amplitude mismatch to only 3 dB. Using both phase and amplitude compensation gives better overall results with only a second order compensator.
Considerable simplification of the AM channel model may be performed if the transducer response is unity over the complete AM modulation spectrum, or a unity response over both upper and lower sideband frequencies, (a 40 kHz bandwidth). A unity response is generally not the case because wide-band transducers are difficult to build.
Another useful simplification is to lower the carrier frequency of the AM modulator in the AM channel model and shift down the frequency response of the filter g(t), so that it is in the correct position relative to the carrier. The final modulator remains at the desired carrier frequency. Only the carrier frequencies of modulators in the AM channel models are reduced. These changes preserve the input/output relationship of the AM channel model, but lower the maximum signal frequency to twice the system bandwidth (e.g. maximum frequency of 40 kHz for a 20 kHz system). This simplifies a DSP based implementation by reducing the sampling rate.
It is to be understood that the above-described arrangements are only illustrative of the application of the principles of the present invention. Numerous modifications and alternative arrangements may be devised by those skilled in the art without departing from the spirit and scope of the present invention. The appended claims are intended to cover such modifications and arrangements.
This application is a continuation of copending U.S. patent application Ser. No. 10/393,893, filed Mar. 21, 2003, which is a continuation of U.S. patent application Ser. No. 09/384,084, filed Aug. 26, 1999, each of which is hereby incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | 10393893 | Mar 2003 | US |
Child | 11651944 | Jan 2007 | US |
Parent | 09384084 | Aug 1999 | US |
Child | 10393893 | Mar 2003 | US |