This application claims the benefit, under 35 U.S.C. § 119, of German patent application DE 10 2017 200 320.0, filed Jan. 11, 2017; the prior application is herewith incorporated by reference in its entirety.
The invention relates to a method for the frequency distortion of an audio signal, wherein at least one division frequency is selected. The audio signal is divided at the at least one division frequency into a low-frequency and a high-frequency band. An output signal is generated through respectively different distortions of frequencies for the high-frequency band and for the low-frequency band.
The control of an acoustic feedback often plays a central role for the operation of acoustic systems with which sound signals in the broadest sense from the environment are reproduced in electrically amplified form, including for example hearing aids. The acoustic feedback can occur in these cases when an output sound signal generated by the acoustic system is partially coupled into an input transducer of the acoustic system which is provided to pick up the sound signal from the environment and for the corresponding generation of an electrical input signal. In this case, signal bands of the output sound signal can be electrically amplified again by the acoustic system, so that as a result interfering noise is formed in the output sound signal which can fully overlay possible useful signals in the sound signal of the environment, to the extent that they are entirely unable to be heard. A suppression or compensation of an acoustic feedback can therefore be provided in the electrical signal path of the acoustic system. Compensation of this sort is often implemented by an adaptive filter to which the finished, amplified output signal, from which the output sound signal is generated, is supplied as an input magnitude, and from this a compensation signal is generated which is fed back to the as yet unamplified input signal for compensation of the feedback. The adaptive filter is usually controlled here by an error signal which is formed from the difference between the input signal and the compensation signal.
If the sound signal of the environment that is to be electrically amplified by the acoustic system now consists of a pure sinusoidal tone with a fixed frequency, then the compensation signal generated by the adaptive filter on the basis of the amplified output signal is also a sinusoidal signal of the same frequency as the sound signal of the environment, and is thus the same as the input signal. With a subtraction in the correct phase, the compensation signal which is actually provided to suppress an acoustic feedback therefore fully cancels out the input signal. This consideration shows that, in general, for sound signals with a high proportion of tonal signals, cancellations or artifacts in the output signal can be generated by the adaptive filter signal, and these should preferably be avoided.
For this purpose, the output signal in the acoustic system is often subjected after amplification to a frequency distortion, whereby the output signal is decorrelated from the input signal, so that the occurrence of the signal cancellation described above can largely be avoided. Depending on the nature of the sound signal of the environment, the frequency distortion is usually only applied to a specific frequency range of the amplified signal, for which purpose the latter is filtered at a given division frequency into a signal band that is to be distorted and a signal band that is not to be distorted. Due to the finite edge gradient of the filter applied for this purpose, overlaps between frequency-distorted signal bands and non-frequency-distorted signal bands can arise in the output signal in the region of the division frequency which can be experienced as unwanted or unpleasant in the output sound signal generated by the acoustic system. For sound signals with a high tonal proportion in particular, i.e. precisely in the case for which it is preferred to apply a frequency distortion for effective suppression of an acoustic feedback without artifacts, these kinds of overlaps can have particularly negative effects on the auditory experience of the output signal, in particular when one of the tonal components of the sound signal coincides with the division frequency.
The invention is therefore based on the object of providing a method for frequency distortion of an audio signal which minimizes to the greatest possible extent unpleasant effects from the overlap of frequency-distorted signal bands with non-frequency-distorted signal bands.
The object is achieved according to the invention by a method for frequency distortion of an audio signal wherein the audio signal is divided at at least one division frequency into a low-frequency band and a high-frequency band. A frequency-distorted signal is generated through respectively different distortions of frequencies for the high-frequency band and for the low-frequency band, and the division frequency is selected such that it is located between two neighboring tones of a given tonal system. Advantageous embodiments, to some extent inventive in themselves, are objects of the subsidiary claims and the following description.
In particular here, either only the high-frequency band is frequency-distorted, without changing frequencies of the low-frequency band in the process, or only the low-frequency band is frequency-distorted, without changing frequencies of the high-frequency band in the process. In both cases, the resulting frequency-distorted signal then contains both frequency-distorted signal bands and signal bands whose frequencies have not been distorted. Preferably, moreover, the division frequency is selected so as to maintain a specified minimum distance, in absolute values or in terms of the frequency ratio, from each of the frequencies of the two neighboring tones of the given tonal system. A frequency distortion contains, in particular, a frequency shift, wherein the value of the shift may in appropriate cases depend on the frequency of the audio signal concerned, or, however, can remain constant over the individual frequencies to which the shift is applied.
The problems that can arise for a further processing of frequency-distorted audio signals at a division frequency between regions of different frequency distortion, and therefore in particular for an auditory experience of an output signal whose processing is complete, depend to a significant extent on the proportion of tonal components in the audio signal whose frequency is to be distorted. As a consequence of the finite edge gradient of the filter which is to be used to provide the signal bands at the division frequency, and of the finite overlap of the signal bands resulting from that, to which the respectively different frequency distortions are to be applied, the frequency distortion of a clearly defined tonal component that is the same as the division frequency, or which lies in the overlap region described, can lead to an audible overlap of the differently frequency-distorted signal bands in the output signal which in the end originate from the same tonal components.
In usual applications of frequency distortions, the individual frequencies of the audio signal are only changed by a relatively small amount in relation to the respective output frequency by the distortion, so that an output signal generated from the frequency-distorted audio signal still permits an auditory experience of the acoustic information of the original audio signal that is as true as possible to the original. In the case of the overlaps described above of the differently frequency-distorted signal bands of the same tonal components, this now leads to overlaps with a relatively small frequency spacing, which in the case of pure frequency shifts for frequency distortion leads to beat frequencies with varying amplitudes, which otherwise in the case of non-trivial frequency-dependent distortions can be expressed in clattering or rattling interfering noises.
In an advantageous manner, the circumstance that the tonal components, i.e. local spectrum maxima of the concentration of signal energy, often do not occur randomly, is now exploited by the invention. Whereas, for example, the tonal components of spoken language are usually of relatively short duration, and in addition do not necessarily exhibit regularly recurring frequency patterns, recurring tonal components with stable frequencies are mostly associated with a musical sound. Music is in this respect usually characterized in that the majority of sound events are composed of tonal sound signals which, in comparison with other sound sources such as for example speech, exhibit a stationary or quasi-stationary behavior, in which the frequencies of the tones can be found from a clearly defined frequency pattern, that of the tonal system on which the music is based. With the knowledge of the tonal systems usual for music, it is now possible, in order to avoid the problems with the tonal sound signals of music described above, for a division frequency to be selected so as to lie between two frequencies that are adjacent in a given tonal system, and preferably here to be sufficiently distant from the frequencies concerned that the overlap at the division frequency has no effect on the subsequent frequency distortion of the individual tonal components that correspond to tones in the tonal system.
For a determination of the distance of the division frequency from the two neighboring frequencies in the tonal system, the edge gradient of the filter to be used for the division of the audio signal and/or a spectral range that is to be expected of the tonal components of the tonal system, and/or a possible deviation of the concrete implementation of the tonal system that is to be expected from the exact frequencies are preferably to be employed, for example through systematically shifting the tonal system when tuning.
Favorably the tonal system is given by a division of an octave, based on a predetermined reference tone, into twelve tonal steps, each with the same frequency ratio
This corresponds to the equal tempering of the octave. As a result of the psychoacoustic perception of an octave—i.e. two tones with a frequency ratio of 2:1—as being the “same” or at least “similar” tones, a tonal system for the whole audible frequency spectrum is given in this way. Preferably the concert pitch with a1 at 440 Hz is to be chosen here as the reference tone, although an equally tempered tuning with a different reference tone is also possible, e.g. a specification of a1=430.539 Hz (which corresponds to choosing c1 to be 256 Hz). In particular, the possibility of alternative reference tones (a1=442 Hz, for example) with the minimum spacing to be maintained between the division frequency and the frequencies of the tones of the tonal system—defined in absolute values or according to the frequency ratio—can be taken into account. In addition, this minimum distance to be maintained can also be determined depending on deviations from the exact frequencies of the tonal system which result from the use of pure or perfect fifth (so-called “Pythagorean”) intervals. Individual musical instruments, such as brass instruments which are particularly used in orchestral music and in jazz, and which therefore contribute significantly to the character of the sound image of such music, generate tone sequences of pure intervals above particular basic tones, containing, for example, pure fourths or also pure thirds. Other musical instruments, in particular string instruments such as the bowed strings that characterize the sound of orchestral music, or the guitars that characterize the sound of modern rock music are tuned in series of fifths. Both tuning by series of fifths with respect to a base tone and the use of pure intervals above the base tone lead to deviations from the equally tempered frequency ratio.
Preferably, therefore, within the framework of the equally tempered tonal system which divides the octave into 12 equal semitone steps with a frequency ratio
a frequency corridor is specified between the individual tones, from which the division frequency can be selected, wherein the frequency corridor takes into account different tunings such as, for example, a selection of the reference tone as a1=442 Hz, as well as the sequence of pure and perfect fifth intonation from certain musical instruments.
Favorably only the frequencies of the high-frequency band or only those of the low-frequency band are shifted by a constant amount for the distortion. A frequency shift of this sort of only one of the two frequency bands at the division frequency can, on the one hand, be implemented particularly easily and, on the other hand, has the consequence that, not least as a result of the unchanged reproduction of one of the two frequency bands, the auditory experience of an output signal derived from the frequency-distorted signal comes particularly close to a non-frequency-distorted signal, with the exception of possible problems at the division frequency. The proposed method now contributes to overcoming these problems, and to clearing the auditory experience of audible effects of the frequency distortion, even in the immediate vicinity of the division frequency.
It is further found to be advantageous if the division frequency is selected from a frequency interval that is located between the frequencies of two neighboring tones of the tonal system in such a way that the lowest frequency and the highest frequency of the frequency interval are equidistant, or logarithmically equidistant, from the frequencies of the two neighboring tones. Equidistant positioning of the lowest frequency and the highest frequency of the frequency interval between the frequencies of the two neighboring tones is to be understood here to mean that each adjacent pair of the four said frequencies—i.e. the two neighboring tones of the tonal system and the two limits of the frequency interval—have the same distance from one another. A logarithmic equidistance is accordingly to be understood to mean that the logarithms of each neighboring pair of the four frequencies have the same distance from one another, and thus the two neighboring frequencies exhibit the same frequency ratio. A selection of the division frequency of this sort supplies a frequency corridor which in particular takes adequate account of real implementations of the tonal system that deviate from the theoretical ideal.
Favorably here the division frequency is selected as the geometric mean value of the frequencies of the two neighboring tones. As a result, the frequency ratio of the division frequency to the frequencies of the two neighboring tones—in the ascending direction—is the same, and thus also the distance in the tonal system, which makes the behavior of the frequency-distorted signal at the division frequency particularly robust against non-ideal implementations, for example detunings, of the tonal system.
In one favorable embodiment, a frequency profile of the audio signal is determined, wherein the division frequency is selected such that the audio signal exhibits a lowest possible signal energy at the division frequency. One possible criterion for the signal energy can then, for example, be a local minimum of the signal energy, or can be defined as an attenuation with respect to the total maximum value of the signal energy, for example, an upper limit of 10% of the maximum value of the signal energy over the entire audible spectrum. With respect to the signal energy, a range can, for example, be determined, from which the division frequency is advantageously to be selected, wherein the selection is associated with the additional boundary conditions that are given, in the manner described above, by the tonal system.
It is furthermore found to be advantageous if a value for a tonality of the audio signal is determined, and the division frequency is only then selected such that it is located between two neighboring tones of the given tonal system if the value for the tonality exceeds a predetermined limit value. This procedure makes it possible that for audio signals that do not exhibit a significant tonal signal proportion, the division frequency is specified directly, without further restrictions from the tonal system, in a manner as required by a high-level specification—for example, an optimum suppression of a feedback in an acoustic system. The definition usual in psychoacoustics can be employed here in particular to find the value for tonality, and/or a stationarity of the audio signal —for example making use of a temporal mean value—can also be taken into account.
The invention furthermore discloses a method for the suppression of an acoustic feedback in an acoustic system. An input transducer of the acoustic system generates an input signal from a sound signal of the environment and an intermediate signal is generated by a signal processing unit on the basis of the input signal. An output signal is generated from a frequency-distorted signal which is converted into an output sound signal by an output transducer of the acoustic system and on the basis of the frequency-distorted signal, an acoustic feedback occurring in the acoustic system through coupling the output sound signal into the input transducer is suppressed. The method for frequency distortion described above is applied to the intermediate signal and thereby the frequency-distorted signal is generated. Preferably the suppression of the acoustic feedback is achieved through a signal-technology feedback loop in the acoustic system which receives the frequency-distorted signal, amongst other things, as the input variable and outputs a compensation signal for the input signal as its output variable. A hearing aid in particular, as well as systems for recording, amplifying and replaying sound signals from studio and/or stage technology are included here as acoustic systems.
Generally speaking, an input transducer contains an acoustic-electrical converter that is configured to convert the sound signal of the environment into a corresponding electrical or electromagnetic signal, i.e. for example a microphone. An output transducer generally contains an electro-acoustic converter that is configured to generate an output sound signal from an electrical and/or electromagnetic signal, i.e. for example a loudspeaker or a sound source for bone-conducted sound. Signal processing here particularly refers to a processing of the input signal or of a signal derived from the input signal, i.e. in particular a frequency-band-dependent amplification and/or noise suppression.
A generation of the intermediate signal on the basis of the input signal here refers in particular to the fact that the signal processing unit directly receives the input signal as the input variable, and generates the intermediate signal from this, or that the signal processing unit receives a signal directly depending on the input signal and generates the intermediate signal from this, i.e. for example the input signal which is corrected by a compensation signal to compensate for an acoustic feedback.
The advantages quoted for the method for frequency distortion of an audio signal and its developments can analogously be transferred to the method for the suppression of an acoustic feedback in an acoustic system.
It is here found to be further advantageous if a provisional division frequency is selected, wherein an estimation of a transfer function of the acoustic system for the high-frequency band in the region of the provisional division frequency is made, wherein, when the estimated transfer function exceeds a permissible total amplification, the at least one division frequency is selected below the provisional division frequency. The frequency-distorted signal is generated through a distortion of frequencies of only the high-frequency band, and the provisional division frequency is selected such that it lies between two neighboring tones of the given tonal system. In particular here only the high-frequency band of the frequency-distorted signal is used for a compensation signal that is added to the input signal for suppression of the acoustic feedback, so that the suppression of the acoustic feedback only takes place in the region of the high-frequency band. In this way only a restricted frequency range is exposed to a possible auditory impairment through the suppression of the feedback, which is determined according to the conditions resulting from the tonal system, in order to avoid as far as possible impairments to the sound quality at the transition to this range.
The invention furthermore discloses a hearing aid containing an input transducer for the generation of an input signal from a sound signal of the environment, a signal processing unit for the generation of an audio signal on the basis of the input signal, and a frequency distorter, which is configured to carry out the above-described method for the frequency distortion of an audio signal. The advantages described for the method and for its development can here be analogously transferred to the hearing aid. In particular, the signal processing unit and the frequency distorter are each parts of a common control unit; in this case, the audio signal is an intermediate signal in the control unit.
Other features which are considered as characteristic for the invention are set forth in the appended claims.
Although the invention is illustrated and described herein as embodied in a method for the frequency distortion of an audio signal, it is nevertheless not intended to be limited to the details shown, since various modifications and structural changes may be made therein without departing from the spirit of the invention and within the scope and range of equivalents of the claims.
The construction and method of operation of the invention, however, together with additional objects and advantages thereof will be best understood from the following description of specific embodiments when read in connection with the accompanying drawings.
Parts and values that correspond to one another are each given the same reference signs in all the figures.
Referring now to the figures of the drawings in detail and first, particularly to
On the other hand the output signal 22 is diverted to the electrical feedback loop 12, where it is fed to an adaptive filter 28 which also receives the error signal 14 as a further input value, and from these generates the compensation signal 10 for the suppression of the acoustic feedback g. Through the frequency distortion 20, the output signal 22 is decorrelated from the input signal 8, and thus also from the error signal 14, so that through the renewed input of the error signal 14 into the adaptive filter 28, the latter is not fully adapted to the tonal signal components of the output signal 22. In this way the formation of artifacts in the output signal 22, and thus in the output sound signal 26, can be avoided. The suppression of the acoustic feedback g by the compensation signal 10 can here in particular remain restricted to specific frequency ranges, i.e. the compensation signal 10 in this case only contains significant signal components for the frequency bands.
In
When, following iteration if necessary, the examination S3 indicates a suitability of the provisional division frequency tf0 for the suppression of the acoustic feedback g, the provisional division frequency tf0 is output as the division frequency tf, and the audio signal 18 is divided in a step S4 at the division frequency tf into a high-frequency band HF and a low-frequency band NF. In a step S5, the high-frequency band HF is now shifted in frequency by a constant amount, while the low-frequency band NF is retained. The frequency-distorted signal 21, which forms the output signal 22 in the hearing aid, now results from this. A tonality in the audio signal 18 can optionally be determined in a step ST, and the execution of steps S2 and S3, i.e. the adaptation of the division frequency tf between two neighboring tones of the tonal system T, takes place depending on the tonality of the audio signal 18 determined in step ST.
In
A diagram of the frequency profile of the filter according to
Although the invention has been more closely illustrated and described in more detail through the preferred exemplary embodiment, the invention is not restricted by this exemplary embodiment. Other variations can be derived from this by the expert without leaving the scope of protection of the invention.
The following is a summary list of reference numerals and the corresponding structure used in the above description of the invention:
Number | Date | Country | Kind |
---|---|---|---|
10 2017 200 320 | Jan 2017 | DE | national |
Number | Name | Date | Kind |
---|---|---|---|
4000676 | Love | Jan 1977 | A |
5184219 | Cho | Feb 1993 | A |
7907678 | Mizuta | Mar 2011 | B2 |
8326638 | Tammi | Dec 2012 | B2 |
8411885 | Kornagel | Apr 2013 | B2 |
8538053 | Puder | Sep 2013 | B2 |
20020044597 | Shively | Apr 2002 | A1 |
20030022650 | Tsuji | Jan 2003 | A1 |
20060262880 | Mizuta | Nov 2006 | A1 |
20100272289 | Kornagel | Oct 2010 | A1 |
20100290631 | Petrausch | Nov 2010 | A1 |
20110194714 | Puder | Aug 2011 | A1 |
20120250918 | Pape et al. | Oct 2012 | A1 |
20120306631 | Hughes | Dec 2012 | A1 |
Number | Date | Country |
---|---|---|
102009018812 | Nov 2010 | DE |
102010006154 | Aug 2011 | DE |
102011006515 | Oct 2012 | DE |
2016143028 | Aug 2016 | JP |
Number | Date | Country | |
---|---|---|---|
20180199141 A1 | Jul 2018 | US |