METHOD AND APPARATUS FOR CANONICAL NONLINEAR ANALYSIS OF AUDIO SIGNALS

BACKGROUND OF THE INVENTION

1. Statement of the Technical Field

The present application relates generally to the perception and recognition of an audio signal input and, more particularly, to a signal processing method and apparatus for providing a nonlinear frequency analysis of structured audio signals which mimics the operation of the human ear.

2. Description of the Related Art

In general, there are many well-known signal processing techniques that are utilized in signal processing applications for extracting spectral features, separating signals from background sounds, and finding periodicities at the time scale of music and speech rhythms. Generally, features are extracted and used to generate reference patterns (models) for certain identifiable sound structures. For example, these sound structures can include phonemes, musical pitches, or rhythmic meters.

Referring now to FIG. 1, a general signal processing system in accordance with the prior art is shown. The processing system will be described relative to acoustic signal processing, but it should be understood that the same concepts can be applied to processing of other types of signals. The processing system 100 receives an input signal 101. The input signal can be any type of structured signal such as music, speech or sonar returns.

Typically, an acoustic front end (not shown) includes a microphone or some other similar device to convert acoustic signals into analog electric signals having a voltage that varies over time in correspondence to the variation in air pressure caused by the input sounds. The acoustic front end also includes an analog-to-digital (ND) converter for digitizing the analog signal by sampling the voltage of the analog waveform at a desired sampling rate and converting the sampled voltage to a corresponding digital value. The sampling rate is typically selected to be twice the highest frequency component in the input signal.

In processing system 100, spectral features can be extracted in a transform module 102 by computing a wavelet transform of the acoustic signal. Alternatively, a sliding window Fourier transform may be used for providing a time-frequency analysis of the acoustic signals. Following the initial frequency analysis performed by transform module 102, one or more analytic transforms may be applied in an analytic transform module 103. For example, a “squashing” function (such as square root and sigmoid functions) may be applied to modify the amplitude of the result. Alternatively, a synchro-squeeze transform may be applied to improve the frequency resolution of the output. Transforms of this type are described in U.S. Pat. No. 6,253,175 to Basu et al. Next, a cepstrum may be applied in a cepstral analysis module 104 to recover or enhance structural features (such as pitch) that may not be present or resolvable in the input signal. Finally, a feature extraction module 105 extracts from the fully transformed signal those features that are relevant to the structure(s) to be identified. The output of this system may then be passed to a recognition system that identifies specific structures (e.g. phonemes) given the features thus extracted from the input signal. Processes for the implementation of each of the aforementioned modules are well-known in the art of signal processing.

The primarily linear foregoing audio processing techniques have proven useful in many applications. However, they have not addressed some important problems. For example, as is now known in the art, the ear and brain process sound in a nonlinear manner utilizing nonlinear oscillation. Inputs are received at the cochlea, dorsal cochlear nucleus, inferior colliculus and other brain areas where they are processed as a function of excitatory and inhibitory processes in interaction with each other to produce nonlinear neural oscillations to provide outputs to be processed by still other brain areas. The prior art suffers from the shortcoming that it utilizes a linear oscillation model to mimic the nonlinear processing of sound required to mimic the brain's processing of complex signals. As a result, these conventional approaches are not always effective for determining the structure of a time varying input signal because they do not effectively recover components that are not present or fully resolvable in the input signal. Therefore, the full range of audio responses cannot be mimicked.

To overcome these shortcomings, it is known from U.S. Pat. No. 7,376,562 (Large) to process audio signals using networks of nonlinear oscillators. This is conceptually similar to signal processing by a bank of linear oscillators, with the important difference that the processing units are nonlinear and can resonate nonlinearly. Nonlinear resonance provides a wide variety of behaviors that are not observed in linear resonance (e.g., neural oscillations). Moreover, oscillators can be connected into complex networks. FIG. 2a shows a typical architecture used to process acoustic signals. It consists of one-dimensional arrays of nonlinear oscillators, called gradient-frequency nonlinear oscillator networks (GFNNs). In FIG. 2a, GFNNs are arranged into processing layers to simulate auditory processing by the cochlea, dorsal cochlear nucleus (DCN), and inferior colliculus (ICC). From a physiological point of view, nonlinear resonance models outer hair cell nonlinearities in the cochlea, and phase-locked neural responses on the DCN and ICC (see FIG. 2b). From a signal processing point of view, processing by multiple GFNN layers is not redundant; information is added at every layer due to nonlinearities.

As seen from FIG. 2a, the oscillators are coupled together, both across a simple linear array 200 and between adjacent layers of linear arrays 200, 202, 204 of nonlinear oscillators. The connections between nonlinear oscillator pairs determines the processing of the input audio signal s(t).

A common signal processing operation is frequency decomposition of a complex input signal, for example by a Fourier transform. Often this operation is accomplished via a bank of linear bandpass filters processing an input signal, s(t). For example, a widely used model of the cochlea is a gammatone filter bank (Patterson, et al., 1992). For comparison with the Large model, it can be written as a differential equation

ż=z(α+iω)s(t) (1)

where the overdot denotes differentiation with respect to time (for example, dz/dt), z is a complex-valued state variable (function of time), ω, is radian frequency (ω=2πf, f in Hz), α, for which α<0 in the prior art model is a linear damping parameter. The term, s(t), denotes linear forcing by a time-varying external signal. For simplicity, in the above and following equations, we write z for the i^thfilter or oscillator z_i. Because z is a complex number at every time, t, it can be rewritten in polar coordinates revealing system behavior in terms of amplitude, r, and phase, φ. Resonance in a linear system means that the system oscillates at the frequency of stimulation, with amplitude and phase determined by system parameters. As stimulus frequency, ω₀, approaches the oscillator frequency, ω, oscillator amplitude, r, increases, providing band-pass filtering behavior.

Recently, nonlinear models of the cochlea have been proposed to simulate the nonlinear responses of outer hair cells. It is important to note that outer hair cells are thought to be responsible for the cochlea's extreme sensitivity to soft sounds, excellent frequency selectivity and amplitude compression (e.g., Eguiluz, Ospeck, Choe, Hudspeth, & Magnasco, 2000). Models of nonlinear resonance that explain these properties have been based on the Hopf normal form for nonlinear oscillation, and are generic. Normal form (truncated) models have the form and as known from Large may be expressed as

ż=z(α+iω+β|z|²)+s(t)+h.o.t. (2)

Note the surface similarities between this form and the linear oscillator of Equation 1. Again, z is the state of an oscillator represented by the real and imaginary parts of z at a point of time within a cycle, ω is radian frequency, and α is again a linear damping parameter. However in this nonlinear formulation, α becomes a bifurcation parameter which can assume both positive and negative values, as well as α=0. The value α=0 is termed a bifurcation point. β<0 is a nonlinear damping parameter, which prevents amplitude from blowing up when α>0. Again, s(t) denotes linear forcing by an external signal. The term h.o.t. denotes higher-order terms of the nonlinear expansion that are truncated (i.e., ignored) in normal form models. Like linear oscillators, nonlinear oscillators come to resonate with the frequency of an auditory stimulus; consequently, they offer a sort of filtering behavior in that they respond maximally to stimuli near their own frequency. However, there are important differences in that nonlinear models address behaviors that linear ones do not, such as extreme sensitivity to weak signals, amplitude compression and high frequency selectivity. The compressive gammachirp filterbank exhibits nonlinear behaviors similar to Equation 2, but is formulated within a signal processing framework (Irino & Patterson, 2006).

Although the application of nonlinear oscillators and nonlinear modeling lends itself to mimic and produce outputs which represent very complex behaviors, previously unobtainable with linear models, the Large system suffers from the disadvantage that it too did not adequately process the entire frequency spectrum. The high order terms were not fully expanded. Rather, it was required that the characteristics of the wave form be known in advance, particularly the frequencies, so that only the most significant higher order terms are processed while the less significant terms are ignored even if their values do not go to 0. Therefore, a system for processing nonlinear oscillators to take advantage of and mimic substantially the entire complexity of an audio sound input is desired.

SUMMARY OF THE INVENTION

The present invention is directed to systems and methods designed to ascertain the structure of acoustic signals. The approach involves an alternative transform of an acoustic input signal, utilizing a network of nonlinear oscillators in which each oscillator is tuned to a distinct frequency; referred to as the natural or intrinsic frequency. Each oscillator receives input and interacts with the other oscillators in the network, yielding nonlinear resonances that are used to identify structure in an acoustic input signal. The output of the nonlinear frequency transform can be used as input to a system that will provide further analysis of the signal. According to one embodiment, the nonlinear responses are defined as a network of n expanded canonical oscillators z_i, with an input, for each oscillator as a function of an external stimulus. In this way, the response of oscillators to inputs that are not close to its natural frequency are accounted for.

BRIEF DESCRIPTION OF THE DRAWINGS

Other objects, features and advantages of the present invention will be apparent from the written description and the drawings in which:

FIG. 1 is a block diagram which illustrates the way in which linear frequency analysis is used in a variety of signal processing systems, in accordance with the prior art;

FIG. 2
a is a diagram illustrating the basic structure of a nonlinear neural network showing an input signal;

FIG. 2
b shows the graphical representation of an individual oscillator in a nonlinear oscillator network;

FIG. 3
a and FIG. 3b are a graphic comparison of the approximation and the generalized resonant terms as a function of time with ε=1;

FIG. 4 is a graphical representation of the amplitude as a function of frequency for the approximation and the generalized resonant terms with ε=1; and

FIG. 5 is a block drawing of a system for processing a nonlinear signal in accordance with the invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

In the current invention a canonical model is utilized to solve for and account for all of the frequencies for the higher order terms. In this way, in order to model the response of the nonlinear neural network, it is not required to know anything about the wave form because, rather than in the nonlinear operation of Large which selects only the consequential significant high order terms, the present method solves for all of the high order terms.

This enables efficient computation of gradient frequency networks of nonlinear oscillators, representing a radical improvement to the technology. The canonical model (Equation 3, below) is related to the normal form (Equation 2; see e.g., Hoppensteadt & Izhikevich, 1997; Murdock, 2003), but it has properties beyond those of Hopf normal form models because the underlying, more realistic oscillator model is fully expanded, rather than truncated. The complete expansion of higher-order terms produces a model of the form

$\begin{matrix} {\dot{z}}_{i} = z_{i} (α_{i} +  ω_{i} + (β_{1 i} +  δ_{1 i}) {\langle z_{i} \rangle}^{2} + \frac{(β_{2 i} +  δ_{2 i}) ε {\langle z_{i} \rangle}^{4}}{1 - ε {\langle z_{i} \rangle}^{2}}) + RT . & (3) \end{matrix}$

Equation 3 describes a network of n nonlinear oscillators, and as will be discussed, solves for the response of each oscillator, i.e., the response at each frequency of the system. Equation 3 oscillatory dynamics follow well known cases such as Andronov-Hopf and generalized Andronov-Hopf (Bautin) bifurcations (Guckenheimer & Holmes, 1983; Guckenheimer & Kuznetsov, 2007; Wiggins, 1990; Murdock, 2003).

There are surface similarities with the models of Equations 2 and 3. The parameters, ω, α and β₁correspond to the parameters of the truncated model of Equation 2. However, β₂is an additional amplitude compression parameter. Two frequency detuning parameters δ₁and δ₂are new in this formulation, and make oscillator frequency dependent upon amplitude to better mimic real world behavior of the hair cell inputs found in the ear. The parameter ε controls the amount of nonlinearity in the system.

RT (resonant terms) represents a general expression mainly consisting of nonlinear (resonant) monomials. These nonlinearities are critical for pattern recognition and auditory scene analysis capabilities. In general, the canonical model given by Equation 3 is more general than the Hopf normal form and encompasses a wide variety of behaviors that are observed neither in the Large use of Hopf normal form, nor in linear oscillators (filters).

Higher order terms of the normal form are necessary to capture the response of an oscillator to input that is not close to its natural frequency. In Large, coupling terms were written as sums of higher order terms based on normal form theory, which is known in the art. The present invention employs the linear relationship, or resonance, given by Equation 4 in terms of the system's eigenvalues. The behavior of the system is a function of the intrinsic frequency of each oscillator in the system; this method automatically accounts for those values which go to zero, and those which remain with significant resonance. Note that near an Andronov-Hopf bifurcation, the absolute values of the eigenvalues of a canonical oscillator system are the same as their natural frequencies {ω₁, . . . , ω_n} (Hoppensteadt & Izhikevich, 1996, 1997). In this case, the resonance relationship satisfies:

ω_r=m₁ω₁+ . . . +m_nω_n

n∈ custom-character ; m₁. . . , m_n∈; ω_r, ω₁, . . . ω_n∈ (4)

Wherein custom-character =set of all integers, =set of all positive integers, and =set of all real numbers.

The number ω_ris known as the resonant frequency and is typically restricted to be positive.

These considerations lead to an expanded canonical oscillator model (e.g., Equation 3) for a nonlinear neural oscillator z under the influence of input x(t). In the expanded model, the resonant terms RT include all monomials obtained (as described above) satisfying Equation 4. Including all resonant monomials in RT allows the model to respond appropriately to external stimuli, regardless of frequency, because only the monomials that are resonant with the stimulus will have a significant effect on oscillator dynamics in the long term.

We can now define a network of n expanded canonical oscillators z_i, with external input x(t). From now on, to avoid notational complexity and depending on the context, it is assumed that x represents a function of time t, that is, x=x(t). In most applications, either x=an input signal s(t) or x is a signal originating from other oscillators. In more general cases, x may represent a set of parameters and functions of time.

As a first case, we consider an expansion of RT for a sinusoidal external stimulus of unknown frequency, x(t)=Fe^2πift+φ; F,f,φ∈ custom-character .

Wherein F is the force (amplitude) of the signal, f is the frequency of the signal, and φ is the phase.

$\begin{matrix} \begin{matrix} RT = x + \sqrt{ε} x \overline{z} + ε x {\overline{z}}^{2} + ε \sqrt{ε} x {\overline{z}}^{3} + \dots + \\ \sqrt{ε} x^{2} + ε x^{2} \overline{z} + ε \sqrt{ε} x^{2} {\overline{z}}^{2} + ε^{2} x^{2} {\overline{z}}^{3} + \dots + \\ ε x^{3} + ε \sqrt{ε} x^{3} \overline{z} + ε^{2} x^{3} {\overline{z}}^{2} + ε^{2} \sqrt{ε} x^{3} {\overline{z}}^{3} + \dots + \\ ε \sqrt{ε} x^{4} + ε^{2} x^{4} \overline{z} + ε^{2} \sqrt{ε} x^{4} {\overline{z}}^{2} + ε^{3} x^{4} {\overline{z}}^{3} + \dots \\ = (x + \sqrt{ε} x^{2} + ε x^{3} + ε \sqrt{ε} x^{4} + \dots) \cdot \\ (1 + \sqrt{ε} \overline{z} + ε {\overline{z}}^{2} + ε \sqrt{ε} {\overline{z}}^{3} + \dots) . \end{matrix} & (5) \end{matrix}$

Equation 5 contains infinite geometric series that converge (see Equation 6) when |z|<1/√{square root over (ε)} and |x|<1/√{square root over (ε)}. Thus, the choice of ε constrains both the magnitude of the input and the magnitude of the oscillation.

The series converge as follows,

$\begin{matrix} RT = x \sum_{k = 0}^{\infty} {(\sqrt{ε} x)}^{k} \sum_{k = 0}^{\infty} {(\sqrt{ε} \overline{z})}^{k} = \frac{x}{1 - \sqrt{ε} x} \cdot \frac{1}{1 - \sqrt{ε} \overline{z}} when \langle x \rangle < \sqrt{1 / ε}, \langle z \rangle < \sqrt{1 / ε} & (6) \end{matrix}$

Consider the relation between Equation 3 and the result shown in Equation 6 derived in the prior Large art. Equation 6 suggests, here presented as new art, a generalization for RT defined as a product of a coupling factor c and two functions; one a passive factor custom-character (ε, x) and the other an active factor (ε, z). We can write Equation 6 as

$\begin{matrix} RT = c P (ε, x) A (ε, z) where P (ε, x) = \frac{x}{1 - \sqrt{ε} x}, A (ε, z) = \frac{1}{1 - \sqrt{ε} z}, and c = 1 in this nonlimiting example . & (7) \end{matrix}$

In the above case, x represents a single component frequency (sinusoidal) signal. In this new art we generalize RT. In the general case, x can represent an external input (e.g., a sound) of any complexity, or x can represent a coupling matrix, A, times a vector of oscillators, z. In the latter case,

x=Σα_jz_j

where α_jranges over a row of the matrix A (i.e., α_jis a row vector) and z_jis the j^thoscillator in a column vector representing the network state. Note that in both cases, x is a complex input signal to an oscillator. Also, in both cases x(t) can be written as a sum of frequency components

$x = \sum_{j} x_{j}$

where x_jrepresents a frequency component of the input signal defined as

x
_j(t)=F_je^2πif^j^t+φj;F_j,f_j,φ_j,t∈ custom-character .

Here, F_jrepresents the forcing amplitude, f_jthe components frequency, φ_jthe phase, and t is time. Given the general definition of x and x_jabove, custom-character (ε, x) can be formulated as a function consisting of (resonant) monomials from a set .

custom-character ={ε^(−1+Σ^j^(p^j^+q^j^))/2x₁^p¹. . . x_n^pⁿx₁^q¹. . . x_n^qⁿ|p_i,q_i∈,n∈,ε∈}

where the coefficient ε^(−1+Σ^j^(p^j^+qⁱ^))/2specifies the contribution of each term (see, e.g., Hoppensteadt & Izhikevich, 1997).

The formulation of the passive factor custom-character (ε, x) in Equation 7 can be generalized to include other components as follows.

The generalized form of the passive nonlinearity custom-character (ε, x) consists of a sum of expressions formed from elements of the set above. More specifically, (ε, x) consists of the sum of all monomials which correspond to positive frequencies ω_rin the resonance relation Equation 4. It is expressed as:

custom-character (ε,x)=Σε^(−1+Σ^j^(p^j^+q^j^))/2x₁^p¹. . . x_n^pⁿx₁^q¹. . . x_n^qⁿ (8)

To clarify, a monomial from the set custom-character is included in the sum of Equation 8 if the following four conditions are satisfied. 1) n is the number of (frequency) components of a signal or of oscillators, etc. 2) The p's and q's are positive integers or 0, at least one of the p's is not zero. 3) The total number of nonzero p's and q's is less than or equal to n. 4) The resonance relation Equation 4 is satisfied with a positive resonant frequency, i.e.,

ω_r=p₁ω₁+ . . . +p_nω_n−(q₁ω₁+ . . . +q_nω_n)>0

and by rewriting we get

ω_r=(p₁−q₁)ω₁+ . . . +(p_n−q_n)ω_n>0

where the coefficients m₁, . . . , m_nof Equation 4 become

m
₁=(p₁−q₁), . . . , m_n=(p_n−q_n)

Using this form of the passive part custom-character (ε, x) provides a very general form of RT where RT=c(ε, x)(ε, z).

A more explicit way of expressing this form of the passive nonlinearity custom-character (ε, x) follows.

Let n=Number of oscillators in a network or frequency components of a signal and let:

custom-character ={1, 2, 3, . . . , n}

{ω₁, . . . , ω_n}=The set of the natural frequencies of the oscillators or components.

N( custom-character )=Power Set of \{{ }, {1}, . . . , {n}}=Set of all subsets of minus the empty set and singleton sets.

Recall that a partition of a set S is a set of nonempty subsets of S such that every element x in S is in exactly one of these subsets. Whereas, a k-partition of a set S is a partition of S of cardinality k. Also let:

P( custom-character )=A partition of

P( custom-character ,k)=A k-partition of ,1≦k≦n

Now we can write the passive part as:

$\begin{matrix} P (ε, x_{1}, \dots, x_{n}) = \frac{1}{\sqrt{ε}} (- 1 + \prod_{k \neq i}^{n} \frac{1}{1 - \sqrt{ε} x_{k}} + \sum_{I} (S 1 + S 2)) & (9) \end{matrix}$

where l is an index set and

$I = {P_{1}, P_{2}} \in P (S \in N (ℤ_{n}^{+}), 2)$

$\prod_{k \neq i}^{n} \frac{1}{1 - \sqrt{ε} x_{k}} = \prod_{k \neq i}^{n} \sum_{p = 0}^{\infty} {(\sqrt{ε} x_{k})}^{p} S 1 = \sum_{\underset{k 1 \in P_{1}}{p_{k 1} = 1}}^{\infty} \sum_{\underset{k 2 \in P_{2}}{q_{k 2} = 1}}^{\infty} H 1 \cdot (\prod_{k 1 \in P_{1}} {(\sqrt{ε} x_{k 1})}^{p_{k 1}}) (\prod_{k 2 \in P_{2}} {(\sqrt{ε} {\overline{x}}_{k 2})}^{q_{k 2}})$

$S 2 = \overset{\infty}{\sum_{\underset{k 1 \in P_{2}}{p_{k 1} = 1}}} \sum_{\underset{k 2 \in P_{1}}{q_{k 2} = 1}}^{\infty} H 2 \cdot (\prod_{k 1 \in P_{2}} {(\sqrt{ε} x_{k 1})}^{p_{k 1}}) (\prod_{k 2 \in P_{1}} {(\sqrt{ε} {\overline{x}}_{k 2})}^{q_{k 2}})$

$H 1 = (1 + \frac{h 1 + h 2}{\langle h 1 + h 2 \rangle}) / 2, H 2 = (1 + \frac{h 3 + h 4}{\langle h 3 + h 4 \rangle}) / 2 h 1 = \sum_{k 1 \in P_{1}} p_{k 1} ω_{k 1}, h 2 = - \sum_{k 2 \in P_{2}} q_{k 2} ω_{k 2} h 3 = \sum_{k 1 \in P_{2}} p_{k 1} ω_{k 1}, h 4 = - \sum_{k 2 \in P_{1}} q_{k 2} ω_{k 2}$

h1 and h2 are frequency correcting factors.

Equation 9 provides a method for computing coupling within and/or between gradient frequency oscillator networks. The expression

$\frac{1}{\sqrt{ε}} (- 1 + \prod_{k \neq i}^{n} \frac{1}{1 - \sqrt{ε} x_{k}})$

contained in Equation 9 represents the complete set of harmonics present in a stimulus to which oscillators, e.g., in a GFNN, can resonate. Similarly, S1 and S2 represent a complete set of combination and difference frequencies. Thus, all higher order resonances are accounted for in this formulation.

There is another form of custom-character (ε, x) similar to the one above (Equation 9) which simplifies further and reduces to a real valued expression because S1 and S2 are complex conjugates. For this case, the frequency correcting factors H1 and H2 are not used.

Since the geometric series converge, S1 and S2 simplify further to produce:

$\begin{matrix}  (ε, x_{1}, \dots, x_{n}) = \frac{1}{\sqrt{ε}} (- 1 + \prod_{k \neq i}^{n} \frac{1}{1 - \sqrt{ε} x_{k}} + \sum_{I} (U 1 + U 2)) & (10) \end{matrix}$

where

$I = {P_{1}, P_{2}} \in P (S \in N (ℤ_{n}^{+}), 2)$

$U 1 = (\prod_{k 1 \in P_{1}} \frac{x_{k 1}}{1 - \sqrt{ε} x_{k 1}}) (\prod_{k 2 \in P_{2}} \frac{{\overline{x}}_{k 2}}{1 - \sqrt{ε} {\overline{x}}_{k 2}})$

$U 2 = (\prod_{k 1 \in P_{2}} \frac{x_{k 1}}{1 - \sqrt{ε} x_{k 1}}) (\prod_{k 2 \in P_{1}} \frac{{\overline{x}}_{k 2}}{1 - \sqrt{ε} {\overline{x}}_{k 2}})$

Equation 10 provides a method for computing coupling within and/or between gradient frequency oscillator networks when there is no frequency correction on the resonant monomials. In this case custom-character (ε, x) consists of finite expressions and is a real valued signal.

The above are complicated expressions for the passive part of RT. They contain infinite sums as described above or large numbers of partitions to sum over for large n's. In practice these forms of RT may be difficult to use. The precise form of these expressions depends upon the frequencies present in the stimulus or frequencies of oscillators. To compute with the above expressions, one would have to obtain the frequency components of an input signal by Fourier analysis or some other technique. Moreover, because the computation is expensive in both space and time, one would have to limit the number of components and truncate the expansion of resonant monomials in Equation 9. This leads us to seek suitable approximations. One approximation is given by:

$\begin{matrix}  (ε, x) \approx (\frac{x}{1 - \sqrt{ε} x}) (\frac{1}{1 - \sqrt{ε} \overline{x}}) & (11) \end{matrix}$

where x=Σx_ior an input signal x=s(t).

Equation 11 provides a method for computing coupling within and/or between gradient frequency oscillator networks. It has the advantage that it can be applied to 1) external input comprised of any number of unknown frequency components 2) input from other oscillators within the same GFNN, or 3) input from oscillators in another GFNN. It is also far more efficient to compute than Equations 9 and 10, and it approximates Equation 9 quite closely.

An example comparing this approximation (gray curves) and the generalized RT (black dashed curves) is shown in FIGS. 3a, 3b and 4. The generalized RT was truncated to monomials of degree 4 (per variable). There are 3 components (n=3) with respective natural frequencies f₁=200, f₂=300, f₃=400 Hz and corresponding input x₁, x₂, and x₃with amplitude=0.1, i.e.,

x₁=0.1e^2πi200t,x₂=0.1e^2πi300t,x₃=0.1e^2πi400t

From FIG. 3, we can see that both the generalized RT and the approximation have maximum response at their natural frequencies. Harmonics and sub-harmonics are also captured. Also, the generalized RT and the approximation overlap increasingly better as the amplitude of the stimulus is decreased.

Finally, we write RT in a general abstract form covering the entire class of scenarios including separate coupling terms for inputs from different sources. This includes internal couplings, external input and input from other networks as illustrated in FIG. 2. The general formulation is as follows:

$\begin{matrix} RT = \sum_{k \in I} _{k} where _{k} = c_{k} _{k} (t, x_{k}) A_{k} (ε, z) & (12) \end{matrix}$

custom-character (t, x_k) is the k^thpassive part, (ε, z) is the k^thactive part, c_kcorresponds to the strength of coupling, and l is some index set. As an example employing this generalized RT, Equation 3 can be restated to include network layers and external input signals as in FIG. 2. The equation for the complex valued state variable of the i^thoscillator can be written as:

where ω is the oscillator frequency in radians, α is a linear damping parameter, β is a nonlinear damping parameter, δ is the nature in which the oscillator frequency is dependent upon amplitude.

Each R_khas a unique passive nonlinearity corresponding to the internal, external, afferent, and efferent couplings respectively. The active nonlinearities are as in Equation 7.

Reference is now made to FIG. 5 wherein a system constructed in accordance with the invention for processing the signals is provided. A system 700 includes an audio input 702 such as a microphone, which provides an input to an oscillator network 704 as a time varying electrical signal. Network 704 is made up of a plurality of nonlinear oscillators for receiving the input audio signal s(t). Each oscillator of network of oscillators 704 has a different natural frequency of oscillation and obeys the dynamical equation of the form.

${\dot{z}}_{i} = z_{i} (α_{i} +  ω_{i} + (β_{1 i} +  δ_{1 i}) {\langle z_{i} \rangle}^{2} + \frac{(β_{2 i} +  δ_{2 i}) ε {\langle z_{i} \rangle}^{4}}{1 - ε {\langle z_{i} \rangle}^{2}}) + RT .$

The oscillators may be in the form of a computer which generates at least one frequency output useful for describing the time bearing structure of the input signal s(t) oscillator network 704. A transmitter 706 receives the signal and transmits it to an audio or visual display output. The computing device can be any computing device capable of analyzing a mathematical representation of a sound signal such as a computer processing unit (CPU), a field programmable gate array (FPGA) or an ASIC chip.

As can be seen from the above, it is possible to analyze complex wave signals utilizing an array of nonlinear oscillators in a manner which takes into account much more of the signal. By accounting for resonant terms and analyzing the acoustic signal in a nonlinear manner, the analysis may more closely mimic the manner in which the brain and auditory system actually operates on signals so that more of the full range of audio responses can be mimicked. It is understood that modifications can be made to the described preferred embodiments of the invention by those skilled in the art. Therefore, it is intended that all matters in the foregoing description and shown in the accompanied drawings, be interpreted as illustrative and not in a limiting sense. Thus, the scope of the invention is determined by the appended claims.

METHOD AND APPARATUS FOR CANONICAL NONLINEAR ANALYSIS OF AUDIO SIGNALS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

US Classifications

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATION

Government Interests

Provisional Applications (1)