Parametric audio reproduction systems produce sound through the heterodyning of two acoustic signals in a non-linear process that occurs in a medium such as air. The acoustic signals are typically in the ultrasound frequency range. The non-linearity of the medium results in acoustic signals produced by the medium that are the sum and difference of the acoustic signals. Thus, two ultrasound signals that are separated in frequency can result in a difference tone that is within the 60 Hz to 20,000 Hz hearing range of humans.
Additional non-linearities in parametric audio reproduction systems occur at the transducers used to generate the ultrasound signals. Further distortion can be caused by various harmonics and other audio artifacts that are inherent in parametric reproduction. The inherent distortion in parametric reproduction processes caused by the various non-linearities and distortions has been problematic.
Features and advantages of the invention will be apparent from the detailed description which follows, taken in conjunction with the accompanying drawings, which together illustrate, by way of example, features of the invention; and, wherein:
Reference will now be made to the exemplary embodiments illustrated, and specific language will be used herein to describe the same. It will nevertheless be understood that no limitation of the scope of the invention is thereby intended.
It has been realized that reduction of distortion in a non-linear system such as an audio reproduction system, including that comprising a parametric array, can be improved using a function that closely approximates system response including inherent distortion. Once this function is understood and defined, an inverse function can be derived and used for preprocessing. This inverse function in effect conditions the input signal so that when it is fed into the non-linear system the original input signal (before conditioning) is recovered in the output with much less distortion than would otherwise be the case. This alteration of the input signal is a dynamic process and itself changes with the input signal, as the function used in preprocessing will be dependent on the input signal just as the system response upon which it was based changes with the input signal.
The system output is in effect linearized with respect to the input, or more properly, at each instant the signal in the output is recovered to a value that is directly proportional to the input value. It will be understood that the response of the underlying system itself is not linearized. This is an important distinction to make in some applications. For example parametric reproduction in a real sense requires non-linearity to produce audio output. This process allows the non-linear response to occur, and yet recover the original signal in the output.
In one example this is done using at least some part of an infinite series expression of a non-linear system transfer function, expressed mathematically, which defines the expected distortion imparted by the system. The solved inverse of this nonlinear system transfer function is the function used in preprocessing. As will be more fully appreciated from the discussion below, “solved inverse” in this context implies solving for the expression which is the true inverse, not just switching the variables (e.g. x for y in a graphical analogy and vice-versa) nor by simply compensating for signal gain, or the like.
In one example we first examine the system and ascertain its properties. Specifically, the distortion expected in the non-linear system output due to at least one cause or factor is defined. Or put another way, a function defining what happens to the input when it passes through the system is determined and represented by a mathematical equation. The total system response can be considered in this equation in one example. In another example a deviation from a linear response can be mathematically represented. In another example system response due to one of a plurality of factors affecting total system response can be represented mathematically.
This definition of system response can be established by mathematical modeling. In another example it can be established by means of empirical investigation, and then determining a function expression that fits the data. Infinite series theorems and other conventional numerical methods can be used to define the function. In any case, the function which closely approximates the non-linear system response that leads to distortion of interest is expressed mathematically.
The mathematically expressed or expressible function defining distortion of the input expected in the output (the “non-linear system transfer function”) is then manipulated to find its inverse, solving in terms of one or more variables of interest with respect to the other variables of interest (not just exchanging variables) as exemplified by the equations set out below. This is done with a view towards being able to recover the original input in the output (i.e. obtain an output value directly proportional to the input value).
Since a mathematical expression of a non-linear system transfer function in most cases will not be in closed form, it can be expressed—at least in part—as an infinite series. This series expression will as a practical matter likely be defined as a polynomial of “N” terms. The non-linear system transfer function is then manipulated to find its inverse, as mentioned. In any event a “preprocessing transfer function” can be created relating the input signal (or more properly one or more parameters of the input signal) to a change or “pre-distortion” of the input signal. This preprocessing transfer function, when applied in a preprocessing scheme, determines the conditioned or “pre-distorted” signal to be fed into the system with a view to having it output from the system with distortion mitigated. The approximation may yield error, but if the error causes less distortion than the system otherwise produces it is a net improvement. It has been found that the method can reduce distortion from non-linear response in a parametric array dramatically to have distortion levels that are comparable to conventional acoustic reproduction systems.
Parameters of interest in defining and using the non-linear system transfer function and preprocessing transfer function may include (but are not limited to): input signal magnitude, frequency, signal magnitude envelope; rate of change of the signal magnitude, rate of change of the frequency, and rate of change of the signal magnitude envelope; and/or some combination or derivative of these parameters. The parameters that are significant depend upon the audio system under consideration. For parametric audio systems the behavior of the envelope of the audio input signal as well as the magnitude of the signal are very relevant. The parameters relevant to these are therefore quite important. Other parameters can be quite relevant in other kinds of audio systems.
Depending on how the correction system is set up, the pre-distortion is determined by the preprocessing transfer function (which can be dynamically changing with the input signal) and, in various example implementations, is either: a) determined as a value to be added to the input at each time instant; b) determined as a scalar value, and the input is multiplied by the scalar value at each time instant; or c) determined so that the output of the preprocessing transfer function is itself the corrected (pre-distorted) input signal for the non-linear system. Precisely how the input signal is altered depends upon the audio system. In any case one or more parameters of the input signal is/are sampled and fed into the preprocessing transfer function, and based on the result, one or more parameters of the output of the preprocessing transfer function are altered. Whether a correction is determined and applied to create the pre-distortion or the pre-distorted signal itself is the output of the preprocessing transfer function, the result desired is this pre-distorted signal. In digital implementations this will be done by sampling the input at a selected clock interval. In analog implementations the values of the input and output of the preprocessing transfer function are continuously changing and are coordinated with each other in time. A variety of ways of implementing the concept will be apparent to one skilled in the art.
In one example set out in detail below the parameters of the input signal used are the magnitude of the input (i.e. voltage) and the behavior of the input signal envelope (specifically that of a defined value “u” which is dependant on the trend of the envelope more than its magnitude at any instant). The pre-distortion created by the preprocessing changes the input signal, altering the magnitude (voltage) at each time instant in the examples. This will be more fully explained below.
In one example using parametric reproduction technology available at the time of this disclosure, the sound pressure level (SPL) of the output is not great enough to be fully in saturation all the time. Rather, it is essentially constantly moving in the transition between lower-power non-saturated (or weak saturation) in character, and higher-power higher-SPL output strongly saturated in character, where compressibility effects (or rather shock effects) are more dominant, and back again: varying with input signal program material level, for example. Using an approximation of expected distortion based on the actual level within this transition to pre-process the input signal has not been effectively implemented to the inventors' knowledge prior to this time. The method disclosed herein enables this.
In one example dual side band (DSB) modulation is used and a non-linear system transfer function comprising a selected number N of terms in a Taylor series is used. The selection of N is predicated on a balance of desired precision and convenience, as a larger number of terms gives rise to additional labor in solving for the inverse function. After a few terms of the series are defined, each new terms adds incrementally less accuracy and upper extension of valid results in view of extending the preprocessing transfer function farther into the transition between the situations mentioned (that of no compressibility affecting the output, and of full saturation in the output). Moreover, as SPL increases, the net change to the input made by the preprocessing transfer function decreases. If SPL is high enough (saturation predominates) the system becomes essentially linear in response, and then simple amplitude modulation (AM) with no preprocessing yields an essentially correct output. Therefore as just alluded to, at some level the correction scheme can potentially simply be replaced by AM without correction of the kind under discussion, as they achieve the same result in saturation (high) output levels. Where the Taylor series preprocessing transfer function is burdensome, particularly where DSB is being used, at saturation levels it can be beneficial to switch over to simple AM.
There may be a gap in signal input level ranges between where an N term preprocessing function yields accurate results and essentially full saturation where AM can be used. To bridge a gap between the output SPL (i.e. input signal voltage) where a series expression with N selected terms ceases to yield valid results and a voltage (output SPL) and where pure AM can be used with essentially no harmonic distortion (full saturation) as will be further discussed herein, a polynomial expression can be used to approximate the output distortion and can be similarly treated to obtain a preprocessing transfer function over this output level region in this example.
In this disclosure, for consistency, when a number of terms is specified for a transfer function, it is the number of terms in the non-linear system transfer function that is meant. The preprocessing transfer function expression may, and likely will, have more terms than the number of terms in the series expression it was based on. However, for simplicity, when referring to a preprocessing transfer function as an “‘N’-term” series, or an “‘N’-order” series (or function in either case), it is the number of terms (N) of the non-linear system transfer function it is based on that is meant.
In another example a modified two-term Taylor series is used over the entire range up to where pure AM can be used if desired. This will be discussed further below. And in another example a low-order Taylor series expansion is used where it is valid and a further polynomial expression defining an approximation of response is applied over the invalid range mentioned in another variation on implementation. Simple AM is used in signal level ranges where saturation predominates.
It will be appreciated from the discussion below that when enough terms are used in the series, or the preprocessing transfer function otherwise closely follows theoretical predictions, the result of the correction scheme can be a parametric system output with distortion that is very low in comparison to prior systems known in the art. Depending on the modulation scheme and the corresponding mathematical model of the system response (the non-linear system transfer function), the series expression of the preprocessing transfer function can look different in each case, but will function essentially as described herein.
In the examples, a DSB modulation scheme can be used. It has been found that using the information set forth below, the output will match the predictions and distortion will be greatly reduced. The scheme can also be applied to other modulation schemes such as single sideband (SSB) modulation, unbalanced sidebands (e.g. truncated dual sideband (DSB)), and so forth, as long as the non-linear system transfer function can be defined for the system and the implementation takes into consideration the particular modulation scheme used.
A DSB modulation scheme can be advantageous in implementations where a switch over to simple AM (no preprocessing) at high SPL is contemplated, as will be apparent. This is especially true where many terms of the series are used. For example, in some instances more than 5 terms may be used. In other instances, 50 or more terms may be used for sufficient accuracy. In some situations, microprocessor limitations may make real time calculation difficult. Developing a look-up table can potentially be used to simplify part or all of the implementation in real signal processor hardware and software. On the other hand, if the algorithm used requires a look-up table with a plurality of arguments for the system under consideration, then the number of values in the table can become large, and again hardware limitations may impinge on implementation. One solution to this dilemma is to use a series polynomial of few terms, and further correct as necessary with other approximating polynomials.
Another example implementation is to pre-record program material using powerful processors not readily adaptable to individual system implementations. As will be appreciated, a large N is less of a problem when a large computer and a large amount of time for processing are available. Once the pre-processed program material is prepared, individual pre-distorted copies of the program can be made. These pre-distorted copies can be used to play the pre-distorted material through a user's end system, enabling the original non-preprocessed signal to be substantially reproduced in the parametric array output.
With this description of the distortion correction in general terms in mind, examination will now be made of specific examples of mathematical expressions usable as preprocessing transfer functions.
Case 1: Nonlinear System Distortion Correction Generally
Let x and y be the input and output of a nonlinear system, respectively. Then a general nonlinear system can be described as
y=f(x) (1)
Apply a Taylor series expansion, and Equation (1) in general can be written as a Taylor Series:
y=C1x+C2x2+C3x3+C4x4+ . . . (2)
Note that Equation (2) reduces to a linear system when Cn=0, n=2, 3, 4, . . . , or in other words when only C1 is non-zero.
For a second order nonlinear system transfer function, Equation (2) reduces to y=C1 x+C2 x2, where C1≠0 and C2≠0. For a third order nonlinear system transfer function, Equation (2) reduces to y=C1 x+C2 x2+C3 x3, where among the three coefficient, at least C1≠0 and C3≠0, while the coefficient C2 can be zero or non-zero. For a forth order nonlinear system transfer function, Equation (2a) reduces to y=C1 x+C2 x2+C3 x3+C4 x4, where among the three coefficients, at least C1≠0 and C4≠0, while the coefficients C2 and C3 can be zero, or non-zero; and so on, the pattern continuing for higher-order system transfer functions.
Physically, if the input is a sinusoidal wave x=sin(2πf0t), then a linear system preserves the frequency component f0. A second order nonlinear system will produce both a fundamental f0, and a second harmonic 2f0. A third order nonlinear system will produce a fundamental f0, a second harmonic 2f0, and a third harmonic 3f0. A fourth order nonlinear system will produce a fundamental f0, a second harmonic 2f0, a third harmonic 3f0, and a fourth harmonic 4f0, and so on.
In order to recover the frequency component of the input signal, a signal preprocessing scheme, such as was described above, is needed. As shown in FIG. 1, the input signal x is preprocessed for a particular nonlinear system using a signal preprocessing algorithm 102. The output of the preprocessing algorithm, x′ (now “pre-distorted”) is input into the non-linear system 104. At the output of the system, the signal y is proportional to the input signal x. Note that the system itself remains nonlinear. This scheme linearizes the output with respect to the input by circumventing the non-linearity, not altering its essential character.
Now let's look at what preprocessing algorithms are needed for higher order systems, e.g. those that can be defined by second, third, and fourth order nonlinear system transfer functions.
For a second order nonlinear system transfer function,
y=C1x′+C2x′2=f(x′)=x (3)
Solving for x′ in equation (3), we obtain:
In equation (4), the preprocessing transfer function can serve as the basis for a preprocessing algorithm for a second order nonlinear system. Whether a + or − sign will be chosen depends on the real system being modeled.
For a third order nonlinear system transfer function the equation,
y=C1x′+C2x′2+C3x′3=x (5)
can be used. Solving Equation (5), we get
where
Equation (6) can serve as the preprocessing transfer function basis of the preprocessing algorithm for the third order nonlinear system. Whether x′1, x′2 or x′3 will be chosen to use as the basis of it depends on a real system parameter being modeled and corrected by the preprocessing scheme. Equation (6) looks formidable. However, for a practical nonlinear system, these expressions forming the bases of algorithms can likely be greatly simplified, once the coefficients C1, C2 and C3 particular to a real world system are known.
Similarly the corresponding preprocessing algorithms for a fourth order or higher order nonlinear system transfer function can be derived (as will be done below). It will be appreciated that the algorithms will become more complicated when the order of non-linearity is higher. However, these multiple term preprocessing schemes can serve as the general guiding principles for reducing the distortion of the original input signal as it appears when recovered in the output of nonlinear audio systems.
In order to determine the actual values, It is first necessary to resolve what the nonlinear system transfer function—Equation (1)—would be for a real system. This is then used to solve for an inverse function as the above examples illustrate. This inverse is the basis for a preprocessing transfer function, which in turn is the basis for the preprocessing to be applied.
Case 2: Parametric Acoustic Reproduction Systems
While the invention is not strictly limited to parametric audio reproduction, the use of preprocessing is very helpful there, and parametric audio reproduction examples shall be used. Audio signals generated by a parametric array in its farfield are expressed as a function of the audio envelope. The audio distortion correction algorithms are proposed for low level, moderate and higher sound pressure levels (SPL) approaching saturation ultrasound levels.
The parametric generation of sound was observed by Bellin and Beyer in 1962 and a theoretical analysis was developed by Westervelt in 1963. An experiment that opened the possibility of the parametric array in air was done by Bennett and Blackstock in the mid-1970's. Generation of a broadband audible signal through a parametric array was reported by Yoneyama et al. in 1983, who introduced the term ‘audio spotlight’ for audio application of a parametric array. In their experiment, an array of piezoelectric transducers was used to project amplitude-modulated ultrasound at 120 dB SPL and higher. It was shown that the parametric array in air could generate a broadband audio signal at reasonable loudness, although it suffered from limited frequency response and harmonic distortion. These two issues—poor frequency response and harmonic distortion—are inevitable consequences of the nonlinear interaction processes in parametric sound reproduction.
In 1965, Berktay derived a simple expression that is widely used to try to predict the farfield response of a parametric array. Briefly, and in general terms, Berktay's expression predicts that the resulting low frequency waveform along the axis of the beam of the parametric array is proportional to the second time derivative of the square of the amplitude modulation envelope. This explains a 12 dB per octave roll-off in frequency response and harmonic distortion in the demodulated signal. There have been algorithms proposed with a view to improving the frequency response and lessening the harmonic distortion. It has been suggested that integrating twice and taking the square root of the audio envelope reduces the harmonic distortion and improves the frequency response. It has been suggested that harmonic distortion can be improved by square-rooting the audio signal. A square rooting scheme can be implemented real-time but in theory requires infinite transducer bandwidth.
Spencer and Croft et al. propose the first examples of the present approach in the parent case to this disclosure, which is set forth in U.S. Pat. Nos. 6,584,205 B1 and 7,162,042 B2, which disclosure is hereby incorporated herein by reference for the teachings consistent herewith, and to the extent inconsistent, this document shall control for the purposes of this disclosure. In any case the reader's attention is directed to the prior disclosure. As an aside, in addition to the polynomial approximation approach disclosed, a recursive scheme was disclosed. The recursive algorithm suggested in the prior disclosure requires only limited transducer bandwidth but needs larger memory to store the computation data and processing time to accomplish the necessary repetitive calculations. Both the AM modulation with square-rooting algorithm and the SSB Modulation with a recursive algorithm are applied to the Berktay's solution which implies better results at low ultrasound levels with increasing difficulties as SPL rises.
As mentioned, a polynomial approximation of distortion and a needed correction or compensation based thereon is suggested in the prior disclosure wherein a preprocessing transfer function (be it expressed as a polynomial having a plurality of terms, or an exponential function, or other form) is developed based on an expression which models distortion. The approximation is applied to the incoming audio signal before it passes through the distortion-inducing parametric system in order to pre-distort it so as to reduce the distortion actually obtained. This disclosure sets forth improved methods for doing this.
Returning to consideration of Berktay's solution, a parametric output should be distortion free for a single audio frequency tone produced by two ultrasonic waves, i.e., with SSB modulation. However, in practice it is found that high distortion can be present while testing a single audio tone. THD (Total Harmonic Distortion) ranges between a few percentage points to as high as 70-80% depending on factors such as emitter size, resonant frequency of the emitter(s), input ultrasound level, and the testing distance, among other factors. The distortion mentioned should not be thought to be completely generated within the emitter itself.
Much of the distortion is due to nonlinear interaction of the ultrasound waves in the medium, typically air, and particularly at higher SPLs. Empirically it has been found that distortion is small close to the emitter, and increases with distance. Distortion stops increasing with distance and levels out at the transition between the near-field and far-field regions. This also suggests that the distortion peculiar to parametric systems is primarily not produced directly by the emitter nor other potential distortion sources prior to the impartation of the signal to the air medium at the emitter.
Berktay's solution suggests using square rooting of the input signal and this has been found to work for low SPL output. However, when SPL is increased distortion again begins to be a problem.
Conversely, at high SPLs where saturation predominates, Merklinger's solution (as referenced below) holds. As will be apparent, this implies that in full saturation there is essentially no distortion using pure AM (DSB) modulation of the carrier by the input audio signal. That these two conditions exist at the low end and high end of a range of levels was known, but how to cover the range of SPL between them was the problem the parent disclosure addressed and is the problem that is further addressed in the present disclosure.
The following discussion will characterize the far-field solution/equation/model of the output of the parametric array and discuss the relevant distortion correction schemes, particularly in the range of output SPL between that where Berktay's solution holds and that where Merklinger's solution holds. A Taylor series expansion defining a non-linear system transfer function in a general solution based on theoretical prediction is used to express the output of the parametric array regardless whether the primaries (carrier waves) are weak or strong. This solution will also lead to Berktay's solution for weak non-linearity at low SPLs; and to Merklinger's solution when the primary waves are in shock throughout the parametric array, i.e., essentially fully in saturation. The distortion correction schemes with AM modulation for both the general and saturation cases will also be addressed. The general solution of the parametric array will be briefly summarized in its farfield regardless of the amplitude of the primary waves. Characteristics of the two limiting cases (Berktay and Merklinger) are summarized as well.
General Solution for a Parametric Acoustic Array
The mathematical derivation of a solution for a parametric array in its farfield is presented by Merklinger (H. M. Merklinger, “Improved efficiency in the parametric transmitting array,” J. Acoust. Soc. Am., Vol. 58, No. 4, pp. 784-787, 1975). A Taylor series expansion of the solution is employed to write the solution in terms of the magnitude (power) of the envelope function squared. This solution will serve as the basis for distortion correction algorithms regardless of the ultrasound amplitude. A few key steps of the derivation are set forth, leading to equation (10) below.
Taking the primary wave (ultrasonic carrier) as a collimated plane wave having cross section area S and traveling along the x axis, the waveform at the transducer is expressed (as an amplitude modulated carrier) as follows:
pp(t)=P0E(t)sin ω0t (7)
where P0 is the primary wave pressure amplitude, E(t) is the envelope function, ω0=2πf0 is the primary angular frequency and f0 is the primary frequency. As this wave travels along the x axis, it suffers attenuation due, for example, to thermoviscous absorptions and to nonlinear absorptions. Merklinger suggested that a primary wave at a distance x is of the form
where α is the thermoviscous attenuation coefficient, ρ0 is the air density, c0 is the sound velocity, k0=ω0/c0 is the wave number of the carrier and β is the nonlinearity parameter in air. E′ (x,t−x/c0) contains the range and modulation dependent terms describing the primary waves.
The virtual source density as given by Westervelt (P. J. Westervelt, “Parametric Acoustic Array,” J. Acoust. Soc. Am., Vol. 35, No. 4, pp. 535-537, 1963) is
The time-dependent part of Equation (8), when squared, leads to two terms. One contributes to the lower frequency (secondary) wave and the other to the second harmonic of the primary wave. Neglecting the second harmonic of the primary waves, we obtain:
These sources give rise to an on-axis farfield sound pressure at range r given by:
If r is in the farfield, there will be negligible error introduced if we let the upper limit of integration go to infinity. Substituting the complete expression for E′ (x,t−r/c0) and integrating, we obtain:
Recall that the plane wave shock formation distance is defined as
and the parametric array length limited by thermoviscous absorption is defined as,
la=1/α. (14)
Equation (12) can then be rewritten as:
where τ=t−r/c0 is the retarded time,
is the Goldberg number (or the acoustic Reynold number), and
which is defined as the ultrasound pressure level index in this disclosure. By expanding the arctangent part in the form of the Taylor series, we have:
Generally speaking, it requires an infinite number of terms in Equation (16) to accurately express the secondary wave output sound pressure ps(r,t). However, only the first few terms will generally be used in reality for practical reasons. How many terms are required to approximate ps(r,t) depends on the primary frequency f0 and its sound pressure level P0. If the SPL is higher, more terms are required, generally speaking. (See
As will be apparent with reference to
In practice, solving for the inverse expressions becomes harder and harder as the incremental value of having the additional number of terms in the series becomes less and less compelling. It is well to contemplate the two limiting cases of the Berktay solution at low SPL and Merklinger at high SPL, as well as how to handle the transition between them. Whether it is all the way across with a Taylor Series-based preprocessing transfer function, or such a series for part of the way across, and another polynomial expression of some kind for the last part of the transition where working with the series becomes burdensome, some scheme can be decided upon.
Reasonably low distortion over this range can be obtained using a polynomial, exponential, or some other type or hybrid-type function approximation, rather than the Taylor series. For example, when the burden of solving for the form of equation and calculating the additional terms values becomes large, and the incremental benefit of additional terms becomes small, a different polynomial approximation, or some other approximation makes practical sense. As mentioned this need only be used over part of the range of SPL up until pure AM gives valid results.
The Weakly Nonlinear Solution (Berktay's Solution)
In a very weakly nonlinear parametric array output where u<<1, the array length is limited by the thermoviscous absorption, modeled by the equation:
tan−1[uE(τ)]≈uE(τ). (17)
So we recover the Berktay's solution as follows:
The most evident characteristics of the Berktay's solution are summarized as follows:
To validate the condition u<<1, we can apply ultrasound at a pressure level between 120 and 125 dB for the frequency range of 30 kHz to 80 kHz, which would generate a very low audio sound pressure level in general. The parametric emitter sensitivity in an example emitter used to verify this (available from American Technology Corporation, San Diego Calif.) is around 126 dB per 10 V rms input at the resonant frequency where the primary (carrier) frequency signal is placed. With an emitter driven to as high as 50 V rms, which corresponds to the ultrasound level of 130 dB, the ultrasound level index u is between 1 to 2 for the frequency range of 40 kHz to 80 kHz. Thus Berktay's solution is no longer applicable at this SPL. We have passed out of the output level range where the Berktay solution gives valid results and into the transition zone between the two limiting cases.
Saturation Solution (Merklinger's Solution)
In a very strong nonlinear parametric array where u>>1, the array length is limited by the nonlinear absorption (saturation) based on the equation:
We recover the Merklinger's solution as follows:
It should be pointed out that Willette and Moffett (J. G. Willette and M. B. Moffett, “Harmonics of the difference frequency in saturation-limited parametric sources,” J. Acoust. Soc. Am., Vol. 62, No. 6, pp. 1377-1381, 1977) derived an equation for the secondary sound pressure in saturation-limited parametric sources. Experimentally, Willette and Moffett and Aoki et al. reported that the secondary wave harmonic distortion decreases while the primary wave pressure increases if an AM modulation schemes is employed. The most evident characteristics of the Merklinger solution are summarized as follows:
Distortion Correction Algorithms
As set forth above, algorithms were developed for distortion correction based on the Berktay's solution Eq(18). We now take up distortion correction algorithms for the general solution Eq(16) and the saturation solution Eq(20).
A. Distortion Correction Algorithms—General Solution
As mentioned earlier, only the first few terms of the mathematical expression are needed to approximate the secondary wave pressure for moderate ultrasound levels. This section is devoted to describing the distortion correction for the solutions with two, three, four and five terms, respectively. The same scheme can be applied when more terms are used, as needed.
B. Two Terms Solution
A two terms solution is expressed as follows:
Equation (21) shows the secondary wave is proportional to:
As described, a preprocessing scheme for a two terms solution for the audio signal E(t) can be implemented in order to recover the original audio signal. One embodiment of a preprocessing scheme for a two terms solution is illustrated in
Using the scheme illustrated in
Solving Equation (23) and applying a Taylor series expansion, we get:
For a 2 terms solution to be valid, u2/3<<1. We can choose the first 3 terms to represent y(t)2, hence the audio preprocessing algorithms for 2 terms solution will be shown in
C. Three Terms Solution
A three terms solution is expressed as follows:
Equation (26) shows the secondary wave is proportional to:
A preprocessing scheme for a three terms solution for the audio signal E(t) can be implemented in order to recover the original audio signal, as shown in
Here again we assume the transducer has infinite flat bandwidth for simplicity. In order for the parametric array output to recover the original audio signal, we have:
Solving Equation (28), we get:
For a three terms solution, we can choose the first four terms to represent y(t)2. Hence, the audio preprocessing algorithms for the three terms solution, as shown in
D. Four Terms Solution
A four terms solution is expressed as follows
Equation (31) shows the secondary wave is proportional to:
Again, a preprocessing scheme for the four terms solution for the audio signal E(t) can be implemented in order to recover the original audio signal, as illustrated in the exemplary embodiment shown in
We assume the transducer has infinite flat bandwidth for simplicity. In order for the parametric array output to recover the original audio signal, we have:
Solving Equation (33), we get:
For a four terms solution, we can choose the first five terms to represent y(t)2, hence the audio preprocessing algorithms for the four terms solution, as illustrated in
E. Five Terms Solution
A five terms solution is expressed as follows:
Equation (36) shows the secondary wave is proportional to:
Again, a preprocessing scheme for the five terms solution for the audio signal E(t) can be implemented in order to recover the original audio signal, as illustrated in the exemplary embodiment shown in
Here we assume the transducer has infinite flat bandwidth for simplicity. In order for the parametric array output to completely recover the original audio signal, we have
Solving Equation (38), we get
For a five terms solution, as shown in
While the resulting mathematical expressions for more than the five term solution can be implemented well enough, deriving them can be difficult as mentioned above. That greater difficulty attends higher order series correction expression derivation, as will be more apparent, having reviewed the examples. An expert system may be programmed to derive these terms out to higher orders. But as mentioned above, in practice using another more simple approximation instead of the more accurate but difficult to derive high-order series solution can be an advantageous practical implementation.
Also, in one example of an implementation, a look-up table can be used to avoid calculation in real time. In one example the table can contain “pre-distorted” signal values to be substituted or in another example correction values (scalars or addition/subtraction values) to be applied at each time instant to input signal values. These are tabulated in terms of two arguments, namely input signal value E and ultrasound pressure index value u at each time instant. Since such a “multi-dimensional”table can be large, processing overhead is traded for memory overhead. Also, prerecording material to be reproduced in the system can be done to move the preprocessing to a separate step carried out outside the parametric system. Large look-up tables can be less burdensome in such an implementation.
In any case, reducing the processing overhead can be desirable in implementing the preprocessing scheme in commercial parametric system devices. Another simplified approach will now be discussed.
Modified Two Term Approach
The two term solution can be modified to
by eliminating the cubic term. With reference to
The distortion reduction actually begins to improve above about 140 dB. The transition can be made directly to simple AM without pre-correction, if desired, but the algorithm is so simple to implement that as a practical matter, the savings in processing resources is typically not sufficient to switch over, which is another potential advantage of this modification.
In another example embodiment a higher-order series expression (e.g. the fifth order function 1004 shown in
Returning to the plot of the modified two term preprocessing transfer function 1002 illustrated in
Distortion Correction Algorithm for the Saturation Solution
In the discussion above it has been mentioned that at sufficiently high SPL (e.g. 160 dB) the Merklinger solution applies; and simple AM can be used without additional preprocessing. As is known from the work of Willette and Moffett, for a single secondary tone, there is no distortion with simple amplitude modulation if the modulation index is 1. We have also found that under certain conditions—including essentially full saturation in the output—AM also works for a single secondary tone and multiple secondary tones if the modulation index is less than 1, as well as 1. Therefore as discussed, in one example when the level is above a selected value (which may be frequency dependant or otherwise vary), in a DSB implementation the calculation burden of the preprocessing transfer function need not be taken on, especially if many terms are implicated. Confirmation of this is set out below.
A. Single Secondary Tone
The ultrasound wave will be given by:
a0 cos ω0t+a1 cos(ω0−ω1)t+a1 cos(ω0+ω1)t=(a0+2a1 cos ω1t)cos ω0t (42)
So the envelope function is given by:
E(t)=a0+2a1 cos ω1t, (43)
by choosing the carrier amplitude as:
a0≧2a1. (44)
Since |cos ωt|≦1, we have:
|E(t)|=E(t), (45)
and the parametric array output will be:
So under the condition of Equation (46), we completely recover the original audio tones without any distortion. Equation (46) is equivalent to the AM modulation index equal to 1 (full modulation) or less (under-modulation). The only difference is that the non-distorted secondary pressure ps is maximized when the ultrasound is fully modulated (a0=2a1).
B. Multiple Secondary Tones
Suppose we have multiple audio tones at the frequencies of ω1, ω2, . . . , ωi, which have the amplitude a1, a2, . . . , ai, respectively. In a similar manner, we show that if the carrier amplitude satisfies the following condition:
a0≧2a1+2a2+ . . . +2ai, (47)
then the secondary output will have the form:
ps∝−2a1ω12 cos ω1t−2a2ω22 cos ω2t+ . . . 2aiωi2 cos ωit (48)
Again we can completely recover the audio signal if the AM modulation index is 1 (full modulation) or less (under modulation). The difference is that the non-distorted secondary pressure ps is maximized when the ultrasound is fully modulated (a0=2a1+2a2+ . . . +2ai). From the discussion of the single tone and the multiple tone cases it will be appreciated that distortion does not occur when both: 1) AM is used; and, 2) the output is in saturation. Under these conditions, a modulation index of less than one represents both an inefficiency (as some output power is not used to create audible output of the secondary audio signal) and a margin of safety. A modulation index of somewhat less than 1 (say between 0.80 to 0.95) can be used to prevent distortion due to inadvertent over-modulation and otherwise provide for the unexpected.
Dynamic Carrier Level
A significant advantage of distortion reduction in parametric arrays as set out above is that it enables increasing the efficiency of the system. A low modulation index has typically been used to keep distortion lower. A low modulation index for keeping distortion low is not needed when the distortion is already low due to the methods discussed above. Hand in hand with this concept is the idea that efficiency of reproduction can be enhanced by varying the primary signal level to match the envelope of the audio signal. In this way only enough carrier as may be needed to provide for the reproduction of the program material is used. The average system output power requirement, heat generated in the emitter, and unneeded ultrasound energy emitted can be greatly reduced. Situations where there is high carrier SPL and low audio SPL, which occur in a constant carrier level system, are minimized to the extent possible.
With a constant carrier, the envelope function is given by:
E(t)=1+ma(t), (49)
where 1 is for the constant carrier, m is the modulation index, and a(t) is the audio input.
With a dynamic carrier, the envelope function becomes:
E(t)=e(t)+ma(t), (50)
where e(t) is the audio envelope which determines the carrier amplitude, m is the modulation index, and a(t) is the audio input.
A dynamic carrier can cause an effect on the demodulated audio output level due to the carrier amplitude level. Without preprocessing, the demodulated audio output is given by Equation (9), namely, the output is proportional to:
E(t)tan−1[uE(t)], (51)
when the input amplitude is E(t). The output amplitude is not proportional to corresponding input amplitude, therefore, a gain compensation may be required. For example, with a Berktay model, the output is proportional to
We see that the fundamental component amplitude of demodulated audio will be 2me(t)a(t), which is not proportional to the audio a(t). It also depends on the carrier level e(t). Therefore, a gain compensation is required. Note that this gain compensation will depend on the carrier level. It can also be seen that the demodulated audio also introduces the second harmonic term a2(t).
A general block diagram with audio preprocessing and air demodulation is illustrated in
E(t)=e(t)+ma(t). (53)
We see that audio output is always proportional to its input a(t) as long as the modulation index m is kept constant.
In reality a dynamic carrier may not fully track the audio envelop e(t). In other words, a dynamic carrier may be implemented with various attack/release time limitations, which implies the modulation index m may not be kept as a constant during the dynamic carrier processing. Nevertheless, this has been found to be a minor issue generally speaking where the non-linear system transfer function is dependent on the signal envelope (as is the case in the examples given above).
Moreover, in the examples given above the audio preprocessing for distortion correction can be though of as a process to compress or expand (as may be needed, at each time instant) the input audio signal before it is expanded or compressed (as it may be at each time instant) parametrically in the air. Thus even with a dynamic carrier, no gain compensation is needed using a methodology such as that disclosed by example herein. In the examples given above, the u value can be used directly to modulate carrier level. This does include the caveat that rate of change of u is monitored and is limited as needed to prevent over modulation, as just alluded to. Some look-ahead can be used to monitor the input signal envelope through the u value to insure the rise and fall and inflection of the carrier level is within the prescribed rates. Additionally, u value can be monitored to ensure that the peaks of the program material can be met by sufficient primary signal power, and that the system takes as much advantage of the troughs as is possible; all while at the same time limiting the carrier envelope rise and fall rates to those which would give rise to audible artifacts.
The advantages of a dynamic carrier will be apparent. The particular synergy between distortion reduction, as described herein, and using a dynamic carrier is that a much higher modulation index can be maintained. Thus, the power efficiency advantages of using a dynamic carrier are more fully realized. As mentioned, the modulation index changes somewhat as primary signal level changes, but overall a high modulation index can be maintained and carrier (primary signal) power minimized without a distortion penalty. This provides a substantial improvement over prior systems, wherein a low modulation index was considered needful to keep distortion within reasonable limits.
In one example, the system can be configured to depend on a known modulation index, rather than a known primary signal amplitude. The modulation index can be arbitrarily chosen, for example set at 0.85 or 0.90. Moreover, it can be made variable (but again in a known way) according to a selected parameter. For example, it might be reduced as in the modified two term example given, over a specified input level range. Recall from the above discussion that when attack and release rates of carrier change are controlled the modulation index can change within a limited range, and this is found to not be problematic. This is due, at least in part, to the fact that modulation index drops below, rather than rises above, a selected value when this mismatch between carrier envelope and audio envelope occurs. If anything, fidelity improves with this mismatch. Since it represents an inefficiency however, it is not desirable per se. In most cases this is another synergy that can be used as an advantage in an example implementation in a parametric audio reproduction system.
In another embodiment, a method 1300 of distortion reduction in a non-linear audio system is disclosed, as illustrated in
The method 1300 further includes the operation of creating 1320 a preprocessing transfer function which is the solved inverse of the non-linear system transfer function. Another operation provides applying 1330 the preprocessing transfer function to the system electronic input signal to form a pre-processed electronic input signal. The pre-processed electronic input signal is configured to provide a modified system acoustic output having a reduced level of harmonic distortion relative to the system acoustic output.
In yet another embodiment, a method 1400 for reducing distortion in a non-linear audio system is disclosed. The method includes the operation of defining 1410 a non-linear system transfer function which has at least one parameter of an audio input electronic signal as the domain and at least one parameter of a system acoustic output as the range. The non-linear system transfer function enables prediction of the acoustic output based on the electronic input signal. The function is definable by at least a portion of a numerical series. For example, numerical series may be a Taylor series.
The method 1400 includes the additional operation of solving 1420 for the inverse of the non-linear system transfer function. An additional operation includes creating 1430 a preprocessing transfer function using the solved inverse of the non-linear system transfer function. The transfer function is expressible as a series expansion defining the non-linear system transfer function in a general solution. The series expansion is limited to a predetermined number of terms. In one embodiment, the series expansion can be a Taylor series. Alternatively, it may be a modified Taylor series wherein one or more of the series components have been removed. For example, a Taylor series of cubic form may be used, with the cubic term removed.
The method 1400 further includes the operation of preprocessing 1440 the audio input electronic signal using the preprocessing transfer function to form a preprocesses electronic signal operable to provide a modified system acoustic output having a reduced level of harmonic distortion relative to the system acoustic output.
As will be appreciated after study of the forgoing, the invention provides a number of advantages and can improve fidelity in audio reproduction. While certain example embodiments have been described, it is not intended that the invention be limited to those examples. Variations and additional features and advantages that would occur to one skilled in the art exercising that skill and the knowledge imputed to one so skilled without exercise of inventive faculty are intended to be included. Accordingly, it is not intended that the invention be limited, except as by the claims set forth below.
Priority is claimed to copending U.S. Provisional Patent Application Ser. No. 60/919,838, filed Mar. 22, 2007, which is hereby incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
1643791 | Slepian | Apr 1924 | A |
1616639 | Sprague | Feb 1927 | A |
1764008 | Crozier | Jun 1930 | A |
1799053 | Mache | Mar 1931 | A |
1809754 | Steedle | Jun 1931 | A |
1951669 | Ramsey | Mar 1934 | A |
1983377 | Kellogg | Dec 1934 | A |
2461344 | Olson | Feb 1949 | A |
2855467 | Curry | Oct 1958 | A |
2872532 | Buchmann et al. | Feb 1959 | A |
2935575 | Bobb | May 1960 | A |
2975243 | Katella | Mar 1961 | A |
2975307 | Schroeder et al. | Mar 1961 | A |
3008013 | Williamson et al. | Nov 1961 | A |
3012107 | Hanlet | Dec 1961 | A |
3012222 | Hagemann | Dec 1961 | A |
3136867 | Brettell | Jun 1964 | A |
3345469 | Rod | Oct 1967 | A |
3373251 | Seeler | Mar 1968 | A |
3389226 | Peabody | Jun 1968 | A |
3398810 | Clark, III | Aug 1968 | A |
3461421 | Stover | Aug 1969 | A |
3544733 | Reylek | Dec 1970 | A |
3612211 | Clark, III | Oct 1971 | A |
3613069 | Cary, Jr. | Oct 1971 | A |
3641421 | Graf et al. | Feb 1972 | A |
3654403 | Bobb | Apr 1972 | A |
3674946 | Winey | Jul 1972 | A |
3710332 | Tischner et al. | Jan 1973 | A |
3723957 | Damon | Mar 1973 | A |
3742433 | Kay et al. | Jun 1973 | A |
3787642 | Young, Jr. | Jan 1974 | A |
3821490 | Bobb | Jun 1974 | A |
2825834 | Stuart et al. | Jul 1974 | A |
3825834 | Stuart et al. | Jul 1974 | A |
3829623 | Willis et al. | Aug 1974 | A |
3833771 | Collinson | Sep 1974 | A |
3836951 | Geren et al. | Sep 1974 | A |
3892927 | Lindenberg | Jul 1975 | A |
3908098 | Kawakami et al. | Sep 1975 | A |
3919499 | Winey | Nov 1975 | A |
3941946 | Kawakami et al. | Mar 1976 | A |
3961291 | Whitehouse et al. | Jun 1976 | A |
3997739 | Kishikawa et al. | Dec 1976 | A |
4005278 | Gorike | Jan 1977 | A |
4015089 | Ishii et al. | Mar 1977 | A |
4056742 | Tibbetts | Nov 1977 | A |
4064375 | Russell et al. | Dec 1977 | A |
4149031 | Cooper | Apr 1979 | A |
4160882 | Driver | Jul 1979 | A |
4166197 | Moog et al. | Aug 1979 | A |
4207571 | Passey | Jun 1980 | A |
4210786 | Winey | Jul 1980 | A |
4242541 | Ando | Dec 1980 | A |
4245136 | Krauel, Jr. | Jan 1981 | A |
4256922 | Gorike | Mar 1981 | A |
4265122 | Cook et al. | May 1981 | A |
4284921 | Lemonon et al. | Aug 1981 | A |
4289936 | Civitello | Sep 1981 | A |
4295214 | Thompson | Oct 1981 | A |
4322877 | Taylor | Apr 1982 | A |
4378596 | Clark | Mar 1983 | A |
4385210 | Marquiss | May 1983 | A |
4418248 | Mathis | Nov 1983 | A |
4418404 | Gordon et al. | Nov 1983 | A |
4419545 | Kuindersma | Dec 1983 | A |
4429193 | Busch-Vishniac et al. | Jan 1984 | A |
4429194 | Kamon et al. | Jan 1984 | A |
4432079 | Mackelburg et al. | Feb 1984 | A |
4433750 | Neese | Feb 1984 | A |
4434327 | Busch-Vishniac et al. | Feb 1984 | A |
4439642 | Reynard | Mar 1984 | A |
4471172 | Winey | Sep 1984 | A |
4480155 | Winey | Oct 1984 | A |
4503553 | Davis | Mar 1985 | A |
4550228 | Walker et al. | Oct 1985 | A |
4558184 | Busch-Vishniac et al. | Dec 1985 | A |
4593160 | Nakamura | Jun 1986 | A |
4593567 | Isselstein et al. | Jun 1986 | A |
4600891 | Taylor, Jr. et al. | Jul 1986 | A |
4672591 | Breimesser et al. | Jun 1987 | A |
4695986 | Hossack | Sep 1987 | A |
4703462 | Woodsum | Oct 1987 | A |
4751419 | Takahata | Jun 1988 | A |
4784915 | Sakagami et al. | Nov 1988 | A |
4803733 | Carver et al. | Feb 1989 | A |
4809355 | Drefahl | Feb 1989 | A |
4823908 | Tanaka et al. | Apr 1989 | A |
4837838 | Thigpen et al. | Jun 1989 | A |
4885781 | Seidel | Dec 1989 | A |
4887246 | Hossack et al. | Dec 1989 | A |
4888086 | Hossack et al. | Dec 1989 | A |
4903703 | Igarashi et al. | Feb 1990 | A |
4908805 | Sprenkels et al. | Mar 1990 | A |
4939784 | Bruney | Jul 1990 | A |
4991148 | Gilchrist | Feb 1991 | A |
4991687 | Oyaba et al. | Feb 1991 | A |
5054081 | West | Oct 1991 | A |
5079751 | Woodward | Jan 1992 | A |
5095509 | Volk | Mar 1992 | A |
5109416 | Croft | Apr 1992 | A |
5115672 | McShane et al. | May 1992 | A |
5142511 | Kanai et al. | Aug 1992 | A |
5153859 | Chatigny et al. | Oct 1992 | A |
5181301 | Wheeler | Jan 1993 | A |
5276669 | Oohashi et al. | Jan 1994 | A |
5287331 | Schindel et al. | Feb 1994 | A |
5317543 | Grosch | May 1994 | A |
5357578 | Taniishi | Oct 1994 | A |
5392358 | Driver | Feb 1995 | A |
5406634 | Anderson et al. | Apr 1995 | A |
5430805 | Stevenson et al. | Jul 1995 | A |
5487114 | Dinh | Jan 1996 | A |
5539705 | Akerman et al. | Jul 1996 | A |
5572201 | Graham et al. | Nov 1996 | A |
5582176 | Swerling et al. | Dec 1996 | A |
5619383 | Ngai | Apr 1997 | A |
5638456 | Conley et al. | Jun 1997 | A |
5649019 | Thomasson | Jul 1997 | A |
5662190 | Abe | Sep 1997 | A |
5666327 | Medeiros et al. | Sep 1997 | A |
5700359 | Bauer | Dec 1997 | A |
5745582 | Shimpuku et al. | Apr 1998 | A |
5748758 | Menasco, Jr. et al. | May 1998 | A |
5758177 | Gulick et al. | May 1998 | A |
5767609 | Suganuma | Jun 1998 | A |
5809400 | Abramsky et al. | Sep 1998 | A |
5832438 | Bauer | Nov 1998 | A |
5844998 | Nageno | Dec 1998 | A |
5859915 | Norris | Jan 1999 | A |
5885129 | Norris | Mar 1999 | A |
5889870 | Norris | Mar 1999 | A |
5892315 | Gipson et al. | Apr 1999 | A |
5919134 | Diab | Jul 1999 | A |
5982805 | Kaneda | Nov 1999 | A |
6011855 | Selfridge et al. | Jan 2000 | A |
6044160 | Norris | Mar 2000 | A |
6052336 | Lowrey, III | Apr 2000 | A |
6064259 | Takita | May 2000 | A |
6104825 | Thigpen | Aug 2000 | A |
6108427 | Norris et al. | Aug 2000 | A |
6108433 | Norris | Aug 2000 | A |
6151398 | Norris | Nov 2000 | A |
6188772 | Norris et al. | Feb 2001 | B1 |
6205104 | Nagashima et al. | Mar 2001 | B1 |
6229899 | Norris et al. | May 2001 | B1 |
6232833 | Pullen | May 2001 | B1 |
6304662 | Norris et al. | Oct 2001 | B1 |
6356872 | Leung et al. | Mar 2002 | B1 |
6359990 | Norris | Mar 2002 | B1 |
6378010 | Burks | Apr 2002 | B1 |
6445804 | Hirayanagi | Sep 2002 | B1 |
6556687 | Manabe | Apr 2003 | B1 |
6577738 | Norris et al. | Jun 2003 | B2 |
6584205 | Croft et al. | Jun 2003 | B1 |
6606389 | Selfridge et al. | Aug 2003 | B1 |
6666374 | Green et al. | Dec 2003 | B1 |
6678381 | Manabe | Jan 2004 | B1 |
6768376 | Hoyt et al. | Jul 2004 | B2 |
6771785 | Pompei | Aug 2004 | B2 |
6850623 | Norris et al. | Feb 2005 | B1 |
6859096 | Tanaka et al. | Feb 2005 | B2 |
7162042 | Spencer et al. | Jan 2007 | B2 |
20010007591 | Pompei | Jul 2001 | A1 |
20020101360 | Schrage | Aug 2002 | A1 |
20020191808 | Croft, III et al. | Dec 2002 | A1 |
20050207587 | Pompei | Sep 2005 | A1 |
Number | Date | Country |
---|---|---|
0 599 250 | Jun 1994 | EP |
0 973 152 | Jan 2000 | EP |
360021695 | Feb 1985 | JP |
2265397 | Oct 1990 | JP |
H2-265400 | Oct 1990 | JP |
WO 9802978 | Jan 1998 | WO |
WO 9849868 | Apr 1998 | WO |
WO 9935881 | Jul 1999 | WO |
WO 9935884 | Jul 1999 | WO |
WO 0108449 | Feb 2001 | WO |
WO 0133902 | May 2001 | WO |
WO 0152437 | Jul 2001 | WO |
Number | Date | Country | |
---|---|---|---|
60919838 | Mar 2007 | US |