This application is based upon and claims the benefit of priority under 35 U.S.C. § 119 from Japanese Patent Application No. 2021-087381 filed on May 25, 2021, the entire contents of which are incorporated herein by reference.
The present disclosure relates to an audio processing device, an audio processing method, and an audio processing program.
In an environment where background noise is present, it is desirable to clarify voice to be amplified in order to make it easier to hear the voice. Japanese Patent No. 6772889 discloses clarifying voice by emphasizing formants. When voice picked up by a microphone is amplified by a public address system and emitted from a loudspeaker, the microphone's picking up of the voice emitted from the loudspeaker and the loudspeaker's output of the voice picked up by the microphone and amplified are repeated, which may cause howling. Japanese Unexamined Patent Application Publication No. 3-263999 discloses an example of a technique for preventing howling.
In a public address system or any other audio processing devices, it is required to clarify voice to be processed and suppress howling.
A first aspect of one or more embodiments provides an audio processing device including: an input terminal to input an audio signal; and a peak shift filter configured to increase sound pressure with a first center frequency in the audio signal as a first peak, and to shift the first center frequency between a preset lowest frequency and a preset highest frequency.
A second aspect of one or more embodiments provides an audio processing method including: increasing sound pressure with a first center frequency in an input audio signal as a first peak; and shifting the first center frequency between a preset lowest frequency and a preset highest frequency.
A third aspect of one or more embodiments provides an audio processing program stored in a non-transitory storage medium causing a computer to execute: increasing sound pressure with a first center frequency in an input audio signal as a first peak; and shifting the first center frequency between a preset lowest frequency and a preset highest frequency.
Hereinafter, an audio processing device, an audio processing method, and an audio processing program according to each embodiment will be described with reference to the accompanying drawings.
In
The operational amplifier 3 calculates the difference between the in-phase input signal and the reverse-phase input signal, and supplies the difference as an unbalanced signal to the A/D converter 4. In this way, the operational amplifier 3 supplies the analog audio signal obtained by converting the input balanced signal into an unbalanced signal to the A/D converter 4.
The A/D converter 4 converts the input analog audio signal into a digital audio signal, and supplies it to the digital signal processor (hereinafter, DSP) 5. The DSP 5 includes an equalizer 51, switches 52, 54, and 56, a howling suppressor 53, a peak shift filter 55, and a volume regulator 7.
In the audio processing device 101 shown in
The time-domain digital audio signal or the discrete Fourier transformed frequency-domain digital audio signal is input to the equalizer 51.
The equalizer 51 corrects the sound quality of the digital audio signal by increasing or decreasing sound pressure at a predetermined one or more frequencies in the input digital audio signal. The howling suppressor 53 includes a plurality of filters for suppressing howling, and filters the input digital audio signal by means of the plurality of filters so as to suppress howling. The howling suppressor 53 may reduce the sound pressure at a specific frequency at which sound pressure is increased by howling.
The equalizer 51 and the howling suppressor 53 may have existing configurations. The user can turn off the sound quality correction function performed by the equalizer 51 by operating the operation unit 6 to switch the switch 52 from the terminal Ta to the terminal Tb. The user can turn off the howling suppression function performed by the howling suppressor 53 by operating the operation unit 6 to switch the switch 54 from the terminal Ta to the terminal Tb.
As shown in
In
The peak shift filter 55 is configured to shift the center frequency f0, which determines the frequency of the peak Pk1, within a predetermined frequency range. The center frequency (here, 2 kHz) serves as a reference in the state where the center frequency f0 is not shifted, and is referred to as a reference center frequency f0r (see
The lowest frequency and the highest frequency when shifting the center frequency f0 may be f0r/2 and 2×f0r, respectively. In this case, the peak shift filter 55 shifts the center frequency f0 in the range of 1 kHz to 4 kHz as another example of the frequency range. It may be configured so that the user can operate the control unit 6 to select the frequency range in which the center frequency f0 is to be shifted.
The peak shift filter 55 shifts the center frequency f0 from the lowest frequency to the highest frequency over a predetermined first time, and from the highest frequency to the lowest frequency over a predetermined second time. As the first and second times, arbitrary times such as 10 ms, 50 ms, 125 ms, 250 ms, 500 ms, or is are set, and the first time and the second time may be different. It may be configured so that the user can select the first and second times by operating the operation unit 6.
In this way, if the peak shift filter 55 shifts the center frequency f0 within a predetermined frequency range, the clarity of voice can be improved and howling can be suppressed. As described above, since howling is generated by repeating the microphone's picking up of the voice emitted from the loudspeaker and the loudspeaker's output of the voice picked up by the microphone and amplified, howling increases over time. When the center frequency f0 is shifted, howling is prevented from increasing, and as a result, howling can be suppressed.
In the audio processing device 101 shown in
The digital audio signal output from the switch 56 is supplied to the volume regulator 7. The volume regulator 7 adjusts the volume of the input digital audio signal, and supplies it to the D/A converter 8. The user can operate the operation unit 6 to adjust the volume of the digital audio signal produced by the volume regulator 7. The D/A converter 8 converts the input digital audio signal into an analog audio signal, and supplies it to an unillustrated output terminal or a loudspeaker.
By means of the peak shift filter 55, the frequency at which sound pressure is increased is not limited to a single location, but may be increased at a plurality of locations. Specifically, the peak shift filter 55 may increase sound pressure with reference center frequencies f0r of around 450 Hz and 800 Hz, which correspond to the frequencies of the first and second formants of the vowel, respectively, in addition to around 2 kHz, which corresponds to the frequency of the consonant. The peak shift filter 55 shifts the center frequencies f0 within a predetermined frequency range even when the three frequencies are set to the reference center frequency f0r.
The peak shift filter 55 provided in the audio processing device 102 according to a second embodiment will be described. The matters common to the peak shift filter 55 in a second embodiment and the peak shift filter 55 in a first embodiment may be omitted.
As shown in
In the example shown in
The peak shift filter 55 is configured to shift the center frequencies f0, f(−1), and f(+1), which determine the frequency of each of the peaks Pk1 to Pk3, within a predetermined frequency range while maintaining their relationship with each other. The reference center frequencies in the state where the center frequencies f0, f(−1), and f(+1) are not shifted are referred to as the reference center frequencies f0r, f(−1)r, and f(+1)r, respectively (see
The preset lowest and highest frequencies when shifting the center frequency f0 may be the same as those in a first embodiment. The peak shift filter 55 shifts the center frequencies f(−1) and f(+1) in the same manner in conjunction with shifting the center frequency f0. The first time for shifting the center frequency f0 from the lowest frequency to the highest frequency and the second time for shifting the center frequency f0 from the highest frequency to the lowest frequency may be the same as those in a first embodiment.
According to a second embodiment, howling that cannot be suppressed in a first embodiment can be suppressed. That is, a second embodiment has a better howling suppressing effect than a first embodiment. The howling that cannot be suppressed in a first embodiment is howling where the frequency at which howling occurs is different from the center frequency f0 and still occurs even if sound pressure is increased by the peak shift filter 55, or howling that occurs faster than the speed at which the peak shift filter 55 shifts the center frequency f0, for example. In a second embodiment, the peak shift filter 55 decreases sound pressure in a valley shape at the center frequencies f(−1) and f(+1) so that such howling can be suppressed.
The peak shift filter 55 should shift the center frequencies f(−1) and f(+1) so that the range of frequencies that decrease in a valley shape at the center frequencies f(−1) and f(+1) overlaps with the frequency at which howling occurs.
Also in a second embodiment, the peak shift filter 55 may increase sound pressure while shifting the three center frequencies f0 with reference center frequencies f0r of around 2 kHz, which corresponds to the frequency of the consonant, and around 450 Hz and 800 Hz, which correspond to the frequencies of the first and second formants of the vowel, respectively. The peak shift filter 55 decreases sound pressure in a valley shape at the center frequencies f(−1) and f(+1) so as to sandwich each center frequency f0.
In
As shown in
The center frequency fc of the phase inversion filter 57 shifts to the low frequency side if the center frequency f0 of the peak shift filter 55 shifts to the lower frequency side of the reference center frequency f0r, and shifts to the higher frequency side if the center frequency f0 of the peak shift filter 55 shifts to the higher frequency side of the reference center frequency f0r. In the example shown in
According to a third embodiment, since the peak shift filter 55 and the phase inversion filter 57 are provided, even if howling cannot be completely suppressed by the peak shift filter 55, the original voice and the voice inverted by the phase inversion filter 57 cancel out howling, and howling that cannot be suppressed in a first embodiment can be suppressed. When the center frequency fc and the center frequency f0 are matched, a higher howling suppression effect can be obtained than when they are not matched.
Note that the user can turn off the phase inversion function performed by the phase inversion filter 57 by operating the operation unit 6 to switch the switch 58 from the terminal Ta to the terminal Tb.
As shown in
According to a fourth embodiment, even if howling cannot be completely suppressed by the peak shift filter 55, since the original voice and the voice inverted by the phase inversion filter 57 cancel out howling, howling that cannot be suppressed in a second embodiment can be suppressed.
In
A first peak setting calculator 551 is a calculator for setting the center frequency f0 of the peak Pk1 in the peak shift filter 55. A second peak setting calculator 552 is a calculator for setting the center frequency f(−1) of the peak Pk2 in the peak shift filter 55. A third peak setting calculator 553 is a calculator for setting the center frequency f(+1) of the peak Pk3 in the peak shift filter 55. An inverting phase calculator 570 is a calculator for setting the center frequency fc for inverting the phase of the digital audio signal in the phase inverting filter 57.
The first and second clocks generated by the frequency dividers 502 and 503 are commonly supplied to the first peak setting calculator 551, the second peak setting calculator 552, the third peak setting calculator 553, and the inverting phase calculator 570. Therefore, the first peak setting calculator 551, the second peak setting calculator 552, the third peak setting calculator 553, and the inverting phase calculator 570 always operate in synchronization with each other, since they are operated according to the common first and second clocks.
As described above, howling does not occur immediately when an analog audio signal is input, but increases over time. Therefore, it may be better to delay the phase inversion performed by the phase inversion filter 57 by a predetermined time and activate the function of suppressing howling by the phase inversion filter 57 after the howling has increased to some extent.
Therefore, as shown in
The configuration corresponding to the units from the equalizer 51 to the volume regulator 7 included in the DSP 5 shown in
The audio processing program according to a first embodiment causes a computer (including DSP 5) to execute a process of increasing sound pressure with the center frequency f0 of the input digital audio signal as the peak Pk1. In addition, the audio processing program according to a first embodiment causes the computer to execute a process of shifting the center frequency f0 between the preset lowest frequency and the preset highest frequency.
The audio processing program according to a second embodiment causes a computer to execute a process of decreasing sound pressure with the center frequency f(−1) on the lower frequency side of the center frequency f0 as the peak Pk2, and with the center frequency f(+1) on the higher frequency side of the center frequency f0 as the peak Pk3. In addition, the audio processing program according to a second embodiment causes the computer to execute a process of shifting the center frequencies f0, f(−1), and f(+1) while maintaining their relationship with each other.
The audio processing program according to a third embodiment causes a computer to execute a process of inverting the phase of the digital audio signal at the center frequency fc in the digital audio signal, and shifting the center frequency fc in synchronization with the shift of the center frequency f0. The audio processing program according to a third embodiment causes the computer to execute a process of shifting the center frequencies f0, f(−1), and f(+1) while maintaining their relationship with each other, and a process of inverting the phase of the digital audio signal at the center frequency fc in the digital audio signal, and shifting the center frequency fc in synchronization with the shift of the center frequency f0.
Incidentally, an audio signal that does not cause howling, such as a synthetic audio signal, may be input to the DSP 5 of
The present invention is not limited to first to fourth embodiments described above, and various modifications can be made without departing from the scope of the present invention. In first to fourth embodiments, a digital audio signal is processed by the DSP 5, but howling can be suppressed by processing an analog audio signal using an audio signal processing circuit other than the DSP, for example.
The present disclosure includes matters that contribute to the realization of the SDGs' “Sustainable Cities and Communities” and contribute to the safety and security of public facilities.
Number | Date | Country | Kind |
---|---|---|---|
2021-087381 | May 2021 | JP | national |