Probabilistic Method of Loudspeaker Detection

BACKGROUND OF THE INVENTION

1. Field of the Invention

Embodiments of the present invention generally relate to a method and apparatus for loudspeaker cutoff detection.

2. Background of the Invention

For applications such as room equalization, loudspeaker equalization and bass management, it is sometimes necessary to measure the frequency response of the loudspeakers. If the low-frequency cutoff of the loudspeakers can be determined, this information can be used to effectively apply bass management, i.e. remove poorly reproduced frequencies and route them to a better loudspeaker such as a subwoofer. However the measured spectrum of a loudspeaker usually contains irregularities caused by reflections and noise, making cutoff detection difficult.

Therefore, there is a need for an improved loudspeaker cutoff detection method and apparatus.

SUMMARY OF THE INVENTION

Embodiments of the present invention relate to a method and apparatus for enhancing cutoff detection of a loudspeaker. The method comprising retrieving a loudspeaker model cutoff and model error, generating a probability distribution of the cutoff frequency based on the retrieved models, and utilizing the generated probability distribution to enhance the detection of the cutoff of the loudspeaker.

BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the above recited features of the present invention can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments. In this application, a computer readable processor is any medium accessible by a computer for saving, writing, archiving, executing and/or accessing data. Furthermore, the method described herein may be coupled to a processing unit, wherein said processing unit is capable of performing the method.

FIG. 1 is an embodiment of a bass management filter design;

FIG. 2 is an embodiment of a front left loudspeaker measurement;

FIG. 3 is an embodiment of an effect of M as a shape parameter for critical u=0.6436, maxiflat u=1.5538 and huge u=1000.0 cases;

FIG. 4 is an embodiment of a normalized probability of cutoff frequency ω_cwith u from 0.5 to 500.0 and ω_cfrom 1.0 Hz to 748.5 Hz (thatched region is unrealizable);

FIG. 5 is an embodiment of an audio based method;

FIG. 6 is a flow diagram depicting an embodiment of a method for enhancing a loudspeaker cutoff detection; and

FIG. 7 is a flow diagram depicting an embodiment of a normalization method to make a probability distribution.

DETAILED DESCRIPTION

Bass management refers to routing the low frequency part of the signal to the most effective transducer, typically a subwoofer. Thus, the upper cutoff frequency of the subwoofer and lower cutoff frequencies of the other loudspeakers are usually known. If a subwoofer is not available, a technique, such as bass-boost (creates the sensation of more bass) may be applied. Such technique may be utilized when the loudspeaker cutoff is known to be too high. For these and other applications, it is useful to be able to estimate the lower cutoff frequency of regular loudspeakers. FIG. 1 shows how a loudspeaker measurement is taken with a microphone, analyzed for cutoff frequency, which is then used to design or choose appropriate bass management filters.

The measurement may be the same as loudspeaker equalization. Loudspeaker equalization refers to filters applied to a signal which are designed to compensate for the loudspeaker response. Generally, a known test signal is applied to the loudspeaker. The output is picked up by a microphone with a known frequency response. The unknown system, such as, amplifier, loud-speaker, environment, may be tested by applying a known test signal and recording the output. The frequency response may be derived using standard techniques. This measured frequency response, used primarily to design equalization filters, in principle may be used for several addition purposes including distance detection, polarity detection and cutoff detection. However, the spectrum of the measured system is typically not smooth, as shown in FIG. 2. FIG. 2 is an embodiment of a measurement for the front left speaker of a 5 speaker plus woofer system. The irregularity in the spectrum makes accurate cutoff estimation difficult.

The basic approach of this method is to generate a probability distribution of the cutoff frequency based on a model of loudspeaker cutoff and a Gaussian model of error. The error is the difference between the model and measurement. The error is caused by several factors, such as, background noise, measurement error, and room and speaker reflections. Such error may effect choosing the wrong model function.

The background noise and measurement error are likely to be approximately Gaussian. However, assuming the loudspeaker model is accurate, the largest source of error is usually the room and speaker reflections, which are generally non-Gaussian. Using a Gaussian error model may lead to relatively straight forward mathematical formulations.

After the loudspeaker model and error model are set, a probability distribution for the cutoff frequency remains, which may also require utilizing cutoff frequency as one of the parameters, applying Bayes' Theorem and eliminating the other “nuisance” parameters. Finally, this distribution can be analyzed and action taken based on the result.

A closed-box loudspeaker system model is

$\begin{matrix} G (s) = \frac{s^{2} T_{C}^{2}}{s^{2} T_{C}^{2} + {sT}_{C} / Q_{TC} + 1} & (1) \end{matrix}$

where Q_TCis the total Q of the system at f_C, with f_Cbeing the resonance frequency of closed-box system, and T_Cis the time constant 1/2πf_C. The frequency response of this model is

$\begin{matrix} G (j ω) = \frac{- ω^{2} T_{C}^{2}}{- ω^{2} T_{C}^{2} + j ω T_{C} / Q_{TC} + 1} & (2) \end{matrix}$

and the magnitude response is

$\begin{matrix} {\langle G (j ω) \rangle}^{2} = G (jω) \overline{G (j ω)} = \frac{ω^{4} T_{C}^{4}}{ω^{' 1} T_{C}^{4} - 2 ω^{2} T_{C}^{2} + {ω^{2} (T_{C} / Q_{TC})}^{2} + 1} . & (3) \end{matrix}$

Now the cutoff frequency ω_cis defined as the point at which

$\begin{matrix} {\langle G (j ω) \rangle}^{2} = \frac{1}{2} . & (4) \end{matrix}$

Using this constraint and solving for T_C/Q_TCwe have

$\begin{matrix} ω_{c}^{4} T_{C}^{4} = \frac{1}{2} (ω_{c}^{4} T_{C}^{4} - 2 ω_{c}^{2} T_{C}^{2} + {ω_{c}^{2} (T_{C} / Q_{TC})}^{2} + 1) & (5) \\ 2 ω_{c}^{4} T_{C}^{4} - ω_{c}^{4} T_{C}^{4} + 2 ω_{c}^{2} T_{C}^{2} - 1 = {ω_{c}^{2} (T_{C} / Q_{TC})}^{2} & (6) \\ T_{C} / Q_{TC} = \sqrt{ω_{c}^{2} T_{C}^{4} + 2 T_{C}^{2} - \frac{1}{ω_{c}^{2}}} & (7) \end{matrix}$

Substituting (7) into (3) gives

$\begin{matrix} {\langle G (j ω) \rangle}^{2} = \frac{ω^{4} T_{C}^{4}}{ω^{4} T_{C}^{4} - 2 ω^{2} T_{C}^{2} + ω^{2} (ω_{c}^{2} T_{C}^{4} + 2 T_{C}^{2} - \frac{1}{ω_{c}^{2}}) + 1} = \frac{T_{C}^{4} ω^{4}}{1 - \frac{ω^{2}}{ω_{c}^{2}} + T_{C}^{4} ω_{c}^{2} ω^{2} + T_{C}^{4} ω^{4}} & (8) \end{matrix}$

eliminating the Q_TCparameter and introducing the cutoff ω_cas a new parameter.

Equation (8) takes two parameters, T_Cand ω_c, and one variable ω which represents a frequency. Since the data is taken at discrete bin frequencies, we will usually index this variable with k as ω_kto mean the frequency at the k^thbin which can be interpreted in Hz depending on sampling rate and FFT size. To remain neutral during calculation frequency is measured in bins, i.e. ω_k=k. However in this paper the sampling rate is always 48 kHz and the FFT size is always 32,768 giving a conversion factor of ≈1.46475 Hz per bin. It is also convenient to write the cutoff frequency ω_con the same scale so that ω_c=ω_kwhen c=k. However c need not be restricted to be an integer.

The T_Cparameter determines the shape of the model frequency response once ω_cis fixed. However the effect of T_Cdepends on ω_c. For instance a given value of T_Cmay make the frequency response peaky for some an and very flat for other ω_c. This is due to the fact that scaling ω_cand ω by the same amount α in (8) gives

$\begin{matrix} \frac{α^{4} T_{C}^{4} ω^{4}}{1 - \frac{ω^{2}}{ω_{c}^{2}} + α^{4} T_{C}^{4} ω_{c}^{4} ω^{2} + α^{4} T_{C}^{4} ω^{4}} \neq \frac{T_{C}^{4} ω^{4}}{1 - \frac{ω^{2}}{ω_{c}^{2}} + T_{C}^{4} ω_{c}^{2} ω^{2} + T_{C}^{4} ω^{4}} . & (9) \end{matrix}$

However making the substitution u−ω_cT_Cin (8) gives

$\begin{matrix} {\langle G (j ω) \rangle}^{2} = \frac{\frac{u^{′1}}{ω_{c}^{′1}} ω^{4}}{1 - \frac{ω^{2}}{ω_{c}^{2}} + \frac{u^{4}}{ω_{c}^{2}} ω^{2} + \frac{u^{4}}{ω_{c}^{4}} ω^{4}} = \frac{u^{4} ω^{4}}{ω_{c}^{4} - ω^{2} ω_{c}^{2} + u^{4} ω^{2} ω_{c}^{2} + u^{4} ω^{4}} & (10) \end{matrix}$

so that after scaling ω and ω_cby α it becomes the case that

$\begin{matrix} \frac{α^{4} u^{4} ω^{4}}{\begin{matrix} α^{4} ω_{c}^{4} - α^{4} ω^{2} ω_{c}^{2} + \\ α^{4} u^{4} ω^{2} ω_{c}^{2} + α^{4} u^{4} ω^{4} \end{matrix}} = \frac{u^{4} ω^{4}}{ω_{c}^{4} - ω^{2} ω_{c}^{2} + u^{4} ω^{2} ω_{c}^{2} + u^{4} ω^{4}} & (11) \end{matrix}$

and the shape is persevered.

Note that u should be constrained to physically realizable values derived from the constraint T_C/Q_TC≧0 from (7). We also have from (7) that

$\begin{matrix} ω_{c}^{2} T_{C}^{4} + 2 T_{C}^{2} - \frac{1}{ω_{c}^{2}} \geq 0 or & (12) \\ ω_{c}^{4} T_{C}^{4} + 2 ω_{c}^{2} T_{C}^{2} \geq 1 & (13) \end{matrix}$

so completing the square we have

w
_c
⁴
T
_C
⁴+2ω_c²T_C²+1≧1 (14)

and

(w^c²T_C²+1)²≧2. (15)

Thus

ω_c²T_C²+1≧√{square root over (2)} (16)

and finally

u=ω
_c
T
_C
≧√{square root over (√{square root over (2)}−1)}≈0.643594252905582742. (17)

Thus, u cannot be less than the critical value ≈0.6436.

Another important value of u is that which makes the frequency response maximally flat. A flat response is often a goal in loudspeaker design, so the value of u that achieves this will likely be a good value for a loudspeaker model.

The maxiflat value of u can be found by plugging the denominator of (1) into the quadratic formula and making the discriminant 0 as follows:

$\begin{matrix} 0 = \sqrt{{(\frac{T_{C}}{Q_{TC}})}^{2} - 4 T_{C}^{2}} \Rightarrow \frac{T_{C}^{2}}{Q_{TC}^{2}} = 4 T_{C}^{2} and since & (18) \\ 4 T_{C}^{2} = \frac{T_{C}^{2}}{Q_{TC}^{2}} = ω_{c}^{2} T_{C}^{4} + 2 T_{C}^{2} - \frac{1}{ω_{c}^{2}} we have & (19) \\ ω_{c}^{2} T_{C}^{4} - 2 T_{C}^{2} - \frac{1}{ω_{c}^{2}} = 0 so that & (20) \\ T_{C}^{2} = \frac{1 + \sqrt{2}}{ω_{c}^{2}} . and & (21) \\ u = T_{C} ω_{c} = \sqrt{1 + \sqrt{2}} \approx 1.5537739740300374 . & (22) \end{matrix}$

Equation (10) may need to be scaled by an amplitude A in order to best fit the data. This is important since generally the amplitude of the data is unknown. Thus we can define our basic model to be

$\begin{matrix} {Am}_{k} (ω_{c}, u) = A {\langle G ({jω}_{k}) \rangle}^{2} = A \frac{u^{4} ω_{k}^{4}}{ω_{c}^{4} - ω_{k}^{2} ω_{c}^{2} + u^{4} ω_{k}^{2} ω_{c}^{2} + u^{4} ω_{k}^{4}} . & (23) \end{matrix}$

Substituting (22) into (23) gives

$\begin{matrix} {Am}_{k} (ω_{c}) = A \frac{(3 + 2 \sqrt{2}) ω_{k}^{4}}{ω_{c}^{4} + (2 + 2 \sqrt{2}) ω_{k}^{2} ω_{c}^{2} + (3 + 2 \sqrt{2}) ω_{k}^{4}} . & (24) \end{matrix}$

as a maxiflat loudspeaker model depending only on parameters of amplitude A and cutoff frequency ω_c.

FIG. 3 illustrates the effect of u on the frequency response for the basic model (23) with ω_cset at 100 Hz and A=1. Shown are frequency responses with the critical value of (17) where u=0.6436 and maxiflat value of (22) where u=1.5538 and a “huge” value of u=1000.0. In the critical value case, the resonance peak heads toward ∝ while the maxiflat case is close to the huge case, but below the cutoff frequency drops off more rapidly.

By error we mean the difference between the model and the measured value. For this error, a gaussian model is assumed. Letting D represent our data, which is the squared magnitude of a measured loudspeaker spectrum X, letting d_krepresent the data at frequency bin index k, i.e. d_k=|X[k]|², letting m_k, A, u and ω_crepresent the model and parameters used in (23) and letting I represent our models for loudspeakers and error, the likelihood for a particular set of parameters can be expressed as

$\begin{matrix} P (D  A, u, ω_{c}, I) = \prod_{k} \frac{1}{σ_{k} \sqrt{2 π}} \exp (- \frac{1}{2 σ_{k}^{2}} {(d_{k} - {Am}_{k} (ω_{c}, u))}^{2}) & (25) \end{matrix}$

where σ_kis the standard deviation of the noise at index k. Here the “noise” is really the error at each frequency bin which can be frequency dependent. These σ_kcan be treated as a set of additional parameters, but for now we will assume these are known since doing so doesn't affect the rest of the derivations. The σ_kcan be thought of as a weighting on the frequency, a smaller σ_kvalue indicates more certainty about the d_kvalue and thus the error at that frequency counts more. As a frequency weighting, these σ_kcan also be modified to force the algorithm to weigh some frequencies more than others. Conversely, if there is no reason to emphasize the contribution at any frequency, all σ_kcan be set to the same value.

Equation (25) is called the likelihood of the parameters, since the data is fixed and the parameters can vary. It can be interpreted as saying that the probability of the data given the loudspeaker model, gaussian noise model, and a set of model parameters is just the product of the independent probability densities that gaussian noise makes up the difference between the model with those parameters and the data. The parameter values which maximize the probablity of the data are those that minimize the sum of the squared differences with the data, and are known as the least squares solution.

Bayes' Theorem follows directly from the definition of conditional probability as follows:

$\begin{matrix} \begin{matrix} P (AB) = P (A  B) P (B) \\ = P (B  A) P (A) \Rightarrow P (B  A) \\ = \frac{P (A  B) P (B)}{P (Λ)} \end{matrix} & (26) \end{matrix}$

where A and B can be basically any statements for which conditional probability makes sense. Applying (26) to (25) gives

$\begin{matrix} P (A, u, ω_{c}  D, I) = \frac{P (D  A, u, ω_{c}, I) P (A, u, ω_{c}  I)}{P (D  I)} . & (27) \end{matrix}$

Thus, in addition to the likelihood P(D|A, u, ω_c, I) given by (25), we need a prior probability P(A, u, ω_c|I) and a normalizing term P(D|I) in order to get our posterior probability P(A, u, ω_c|D, I). However another step is then to eliminate the “nuisance” parameters A and u to give the posterior probability of the cutoff frequency P(w_c|D, I). The elimination of A as a “nuisance” parameter can be achieved by exact marginalization.

Let {θ} be a set of parameters, A be a scale (amplitude) parameter, d_kbe the k^thdata value and m_kbe the model value at index k with parameters {θ}. Then using the gaussian error model we have

$P (D  {θ}, A, I) = \prod_{k} \frac{1}{σ_{k} \sqrt{2 π}} \exp (- \frac{1}{2 σ_{k}^{2}} {(d_{k} - {Am}_{k})}^{2}) .$

Note that parameter A appears as a scale term outside of the model itself, which only takes parameters {θ}. Then we have

$\begin{matrix} P (D  {θ}, A, I) = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} {(d_{k} - {Am}_{k})}^{2}) \\ = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \\ \exp (\sum_{k} \frac{1}{2 σ_{k}^{2}} (d_{k}^{2} 2 Λ d_{k} m_{k}  Λ^{2} m_{k}^{2})) \\ = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \\ \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} (- 2 {Ad}_{k} m_{k} + A^{2} m_{k}^{2})) \\ = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \\ \exp (- A^{2} (\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}) + A (\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})) . \end{matrix}$

From Bayes' Theorem we have

$\begin{matrix} P ({θ}, A  D, I) = \frac{P ({θ}, A  I) P (D  {θ}, A, I)}{P (D  I)} \\ = \frac{P ({θ}  I) P (A  I) P (D  {θ}, A, I)}{\int \int P (D, {θ}, A  I) \partial {θ} \partial A} \\ = \frac{P ({θ} I) P (D  {θ}, A, I) P (A  I)}{\int \partial {θ} \int \partial AP ({θ}  I) P (D  {θ}, A, I) P (AI)} \end{matrix}$

with the integration ranges and prior probabilities appropriately chosen for the parameters. We would like to marginalize A.

$\begin{matrix} P ({θ}  D, I) = \int \partial AP ({θ}, A  D, I) \\ = \frac{P ({θ}  I) \int \partial AP (D  {θ}, A, I) P (A  I)}{\int \partial {θ} P ({θ}  I) \int \partial AP (D  {θ}, A, I) P (A  I)} \end{matrix}$

$\int \partial A \exp (- C_{0} A^{2} + C_{1} A) = \frac{\sqrt{π} crf (\frac{2 A C_{0} - C_{1}}{2 \sqrt{C_{0}}})}{2 \sqrt{C_{0}} \exp (- \frac{C_{1}^{2}}{4 C_{0}})} + constant$

So, if we choose a flat prior for P(A|I) and a range of (−∞, ∞) we have

$\int_{- \infty}^{\infty} \partial A \exp (- C_{0} A^{2} + C_{1} A) = \frac{\sqrt{π} \exp (\frac{C_{1}^{2}}{4 C_{0}})}{\sqrt{C_{0}}}$

$and  with$

$C_{0} = (\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2})$

$C_{1} = (\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})$

$we  have$

$\begin{matrix} \int_{- \infty}^{\infty} \partial AP (D  {θ}, A, I) P (A  I) = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \int_{- \infty}^{\infty} \partial A \exp (- A^{2} (\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}) + A (\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})) P (A  I) \\ = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \frac{\sqrt{π}}{\sqrt{\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}}} \exp (\frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}}) \end{matrix}$

$so  that$

$\begin{matrix} P ({θ}  D, I) = \frac{P ({θ}  I) (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \frac{\sqrt{π}}{\sqrt{\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}}} \exp (\frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}})}{\int \partial {θ} P ({θ}  I) (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \frac{\sqrt{π}}{\sqrt{\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}}} \exp (\frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}})} \\ = \frac{P ({θ}  I) (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \frac{\sqrt{π}}{\sqrt{\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}}} \exp (\frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}})}{P (D  I)} . \end{matrix}$

Thus, the marginalization leaves a new equation for the likelihood of the parameters as follows:

$P (D  u, ω_{c}, I) = (\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \frac{\sqrt{π}}{\sqrt{\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}}} \exp (\frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}})$

where m_kis short for the model m_k(u, ω_c).

Since there are only two remaining parameters, this can be shown in a 2-dimensional graph. For the spectrum shown in FIG. 2, a gray-scale plot of (28) is shown in FIG. 4 with black indicating the highest likelihood at each u level. The thatched region at the bottom lies below the critical u value from (17) and is physically unrealizable. The isolated horizontal black line above the critical region indicates the maxiflat u value from (22). When the u value is above the maxiflat value, the peak likelihood region for ω_cis very stable, since the shape of the model frequency response changes only slightly as u increases. However when u goes below the maxiflat value the model frequency response quickly becomes peaky, and the most probable region for ω_cbecomes less stable, changing value and jumping to a higher region before jumping to the lowest frequency as u nears its critical value.

Exact marginalization over u looks very difficult and numerically integrating over u also seems computationally expensive. However u doesn't affect the shape of the speaker rolloff very much beyond some low values which cause a large resonance in the model spectrum, as indicated by FIG. 3. Since loudspeaker designers try to avoid such resonances, it is unlikely such values for u will explain the data well. Instead we hold the value constant at a reasonable value such the maxiflat value u=1.5538, and evaluate the probability of ω_con this basis.

In this example we have

$\begin{matrix} {P (ω_{c}  D, I) = \frac{(\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}}) \exp (- \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}) \frac{\sqrt{π}}{\sqrt{\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}}} \exp (\frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}}) P (ω_{c}  I)}{P (D  I)} \rangle}_{u = 1.5538} . & (29) \end{matrix}$

The prior P(ω_c|I) can be thought of as a weighting based on our belief about what likely values of ω_cshould be. This can be flat over a reasonable range, or have some proprietary shape based on many loudspeaker evaluations. It is also useful to make the prior P(ω_c|I) discrete with the same set of frequencies bins used for the data. Thus the prior can state P(ω_c|I)=0 if c≠k for all frequency bin indexes k, effectively sampling the continuous probability density at some subset of frequency bins. Since the same prior is built into the normalizing denominator, this is a way to move from a continuous distribution to a discrete one defined only at bin frequencies.

A high level implementation of this method is shown in FIG. 5. The loudspeaker spectrum is assumed available from other processes, but if not, these can be calculated by taking the FFT of a test signal recording.

For implementation it is useful to take the log of (29) which gives

$\begin{matrix} \log (P (ω_{c}  D, I)) = \log ((\prod_{k} \frac{1}{σ_{k} \sqrt{2 π}})) \sum_{k} \frac{1}{2 σ_{k}^{2}} d_{k}^{2}  \frac{1}{2} \log (π) - \frac{1}{2} \log (\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}) + \frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}} + \log (P (ω_{c}  I)) - \log (P (D  I)) . & (30) \end{matrix}$

which can be considered as

$\begin{matrix} \log (P (ω_{c}  D, I)) = - \frac{1}{2} \log (\sum_{k} \frac{1}{2 σ_{k}^{2}} m_{k}^{2}) + \frac{{(\sum_{k} \frac{1}{σ_{k}^{2}} d_{k} m_{k})}^{2}}{2 \sum_{k} \frac{1}{σ_{k}^{2}} m_{k}^{2}} + \log (P (ω_{c}  I)) + constant . & (31) \end{matrix}$

Since log(x) is a monotonically increasing function of x and the constant term doesn't affect the location of the maximum value, one approach is just to find the ω_cwhich maximizes (31), ignoring the constant term. If a uniform probability is assumed for the prior probability P(ω_c|I), then this term can be left out as well.

A block diagram of this implementation is given in FIG. 6. Here u is set to 1.5538 as an example. The data[k] is the squared magnitude of the spectrum, which can be calculated beforehand or directly when needed from the k^thvalue of the spectrum times its conjugate. The maximum value val determines the best cutoff best_ω_c. Optionally the values at each ω_ccan be stored in an array for further processing.

If the results are stored for further processing, it is often desirable to convert the values to a probability distribution summing to 1. A way of doing this is shown in FIG. 7, which uses the max value found in FIG. 6.

Although any loudspeaker model function can be used in principle, an implementation of the loudspeaker model is given by (23).

While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.

Probabilistic Method of Loudspeaker Detection

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

US Classifications

International Classifications

Abstract

Description

Claims