This application claims priority under 35 U.S.C. § 119 from Japanese Patent Application No. 2020-105371 filed on Jun. 18, 2020. The entire subject matter of the application is incorporated herein by reference.
The present disclosures relate to a signal processing device, a signal processing method, and a non-transitory computer-readable recording medium for the signal processing device.
Conventionally, there has been known a signal processing technique using an IIR (Infinite Impulse Response) filter to compensate for a frequency characteristic of a sound signal.
It is noted, however, the IIR filter has a relatively low frequency resolution, and it is difficult to accurately compensate for the frequency characteristic of the sound signal with the IIR filter. In this regard, it is considered to use other digital filters to compensate for the frequency characteristic of the sound signals. For example, with use of an FIR (Finite Impulse Response), it is possible to accurately compensate for the frequency characteristic of the sound signal because of its high frequency resolution.
However, it has also been known that, if the FIR filter is used for compensating the frequency characteristics of the sound signal, a so-called pre-echo is generated.
According to aspects of the present disclosures, there is provided a signal processing device, comprising a measuring section configured to measure an impulse response between each of a plurality of speakers and a predetermined listening position from a signal of each of sounds respectively output from the plurality of speakers at timings at which the sounds do not interfere with each other at the predetermined listening position and collected at the listening position, a Fourier transformer configured to obtain a frequency spectrum corresponding to each of the plurality of speakers by applying a Fourier transform to the impulse responses corresponding to each of the plurality of speakers, respectively, a phase adjustment amount calculator configured to calculate, based on the frequency spectrum corresponding to each speaker, a phase adjustment amount for each frequency of a sound signal input to a target speaker subjected to control of a phase of the sound signal, a band detector configured to detect a leading phase band in which a phase is a leading phase based on the phase adjustment amount for each frequency calculated by the phase adjustment amount calculator, a phase converter configured to convert a phase of the leading phase band detected by the band detector to a lagging phase, and a filter coefficient generator configured to generate a filter coefficient corresponding to the target speaker based on the phase adjustment amount after conversion by the phase converter.
According to aspects of the present disclosures, the band detector is configured to detect a band including a frequency at which the phase adjustment amount is equal to or larger than a predetermined threshold as the leading phase band.
According to aspects of the present disclosures, the band detector is configured to allocate the phase adjustment amount for each frequency calculated by the phase adjustment amount calculator to a positive phase adjustment amount or a negative phase adjustment amount, convert the negative phase adjustment amount to an absolute value, apply synthesis to the positive phase adjustment amount and the phase adjustment amount converted to the absolute values, and detect the leading phase band based on the phase adjustment amount for each frequency after the synthesis.
According to aspects of the present disclosures, the band detector is configured to apply smoothing to the phase adjustment amount for each frequency after the synthesizing in a frequency domain, and detect the leading phase band based on the phase adjustment amount for each frequency after the smoothing.
According to aspects of the present disclosures, the phase converter is configured to generate first lagging phase data in which a phase is shifted in a negative side every time a starting frequency of the leading phase band appears in the frequency domain, and generate second lagging phase data in which a phase is shifted in a negative side every time an ending frequency of the leading phase band appears in the frequency domain. The filter coefficient generator generates the filter coefficient based on the first lagging phase data and the second lagging phase data.
According to aspects of the present disclosures, the phase converter is configured to apply smoothing to the first lagging phase data and the second lagging phase data with respect to the frequency axis.
According to aspects of the present disclosures, the filter coefficient generator is configured to convert each of the first lagging phase data and the second lagging phase data to an impulse response, and obtain the filter coefficient by convoluting the impulse response obtained by converting the first lagging phase data and the impulse response obtained by converting the second lagging phase data, the convoluted impulse response being the filter coefficient.
According to aspects of the present disclosures, the signal processing device further comprises an FIR filter configured to convolute the filter coefficient generated by the filter coefficient generator into a sound signal to be input to the target speaker.
According to aspects of the present disclosures, there is provided a non-transitory computer-readable recording medium for a signal processing device. The recording medium containing computer-executable instructions which cause, when executed, the signal processing device to perform measuring an impulse response between each of a plurality of speakers and a predetermined listening position from a signal of each of sounds respectively output from the plurality of speakers at timings at which the sounds do not interfere with each other at the predetermined listening position and collected at the listening position, obtaining a frequency spectrum corresponding to each of the plurality of speakers by applying a Fourier transform to the impulse responses corresponding to each of the plurality of speakers, respectively, calculating, based on the frequency spectrum corresponding to each speaker, a phase adjustment amount for each frequency of a sound signal input to a target speaker subjected to control of a phase of the sound signal, detecting a leading phase band in which a phase is a leading phase based on the phase adjustment amount for each frequency, converting a phase of the leading phase band detected to a lagging phase, and generating a filter coefficient corresponding to the target speaker based on the phase adjustment amount after the converting.
According to aspects of the present disclosures, there is provided a signal processing method performed in a signal processing device including measuring an impulse response between each of a plurality of speakers and a predetermined listening position from a signal of each of sounds respectively output from the plurality of speakers at timings at which the sounds do not interfere with each other at the predetermined listening position and collected at the listening position, obtaining a frequency spectrum corresponding to each of the plurality of speakers by applying a Fourier transform to the impulse responses corresponding to each of the plurality of speakers, respectively, calculating, based on the frequency spectrum corresponding to each speaker, a phase adjustment amount for each frequency of a sound signal input to a target speaker subjected to control of a phase of the sound signal, detecting a leading phase band in which a phase is a leading phase based on the phase adjustment amount for each frequency, converting a phase of the leading phase band detected to a lagging phase, and generating a filter coefficient corresponding to the target speaker based on the phase adjustment amount after the converting.
Hereinafter, an acoustic system 1 according to an embodiment of the present disclosures will be described with reference to the drawings.
As shown in
The signal processing device 10 has an FIR filter to compensate for a frequency characteristic of a sound signal. The signal processing device 10 according to the present embodiment is configured such that an occurrence of a pre-echo is reduced while being configured to compensate for the frequency characteristic of the sound signal using the FIR filter.
It is noted that various processes in the signal processing device 10 are performed by cooperation of software and hardware provided in the signal processing device 10. At least an operating system (OS), which is a part of the software in the signal processing device 10, is provided as an embedded system, but other parts, such as a software module for performing a filter coefficient generation process for the FIR filter, may be provided as application software that can be distributed over a network or stored on a storage medium such as a memory card. In other words, a filter coefficient generation function according to the present embodiment may be a function that is built into the signal processing device 10 in advance (e.g., before shipment), or a function that can be added to the signal processing device 10 via a network or recording medium.
As shown in
As shown in
When the filter coefficient generation process shown in
The measuring signal is passed through the controller 100 and the FIR filter 110 (i.e., a through output is performed), and is sequentially output to each of the speakers SPFR and SPFL through the amplifier 112 (S102). As a result, predetermined measuring sounds are sequentially output from the speakers SPFR and SPFL with a predetermined time interval.
The microphone MIC is installed in a position at which the pre-echo is to be reduced. In this embodiment, the microphone MIC is arranged at a driver's seat so that a listener sitting in the driver's seat does not perceive the pre-echo. The driver's seat where the listener is sitting will be hereinafter referred to as a “listening position.”
The microphone MIC collects the sounds for measurement sequentially output from the speakers SPFR and SPFL at timings at which the sounds do not interfere with each other at the driver's seat (i.e., at the listening position). The signals (i.e., measured signals) representing the measuring sounds captured by the microphone MIC are stored in the signal recorder 114 and are input from the signal recorder 114 to the calculator 116 (S103). When the calculator 116 has a function to store the measured signals, the measured signals output from the microphone MIC are directly input to the calculator 116 without the signal recorder 114.
Each of the measurement sections 116A and 116B is configured to measure an impulse response (S104).
Specifically, the measurement section 116A obtains a cross-correlation function between the measured signal of the sound for measurement from the speaker SPFL (hereinafter referred to as a “measured signal L”) and a reference measuring signal input from the controller 100, and calculates the impulse response of the measured signal L (in other words, an impulse response between the speaker SPFL and the listening position; hereinafter referred to as an “impulse response Li”). The reference measuring signal is the same as the measuring signal generated by the measuring signal generator 106 and is time-synchronized with the measuring signal.
Similarly, the measurement section 116B calculates the cross-correlation function between the measured signal of the sound for measurement from the speaker SPFR (hereinafter referred to as a “measured signal R”) and the reference measuring signal input from the controller 100, and calculates the impulse response of the measured signal R (in other words, the impulse response between the speaker SPFR and the listening position; hereinafter referred to as an “impulse response Ri”).
Thus, the measurement sections 116A and 116B operate as measurement sections to measure the impulse responses Li and Ri between each of the plurality of speakers and the listening position based on the signals (i.e., the measured signals L and R) representing respective sounds that are output from respective speakers (speakers SPFR and SPFL in this embodiment) at timings in which the sounds do not interfere with each other at the listening position (i.e., the driver's seat in this embodiment) and are collected at the listening position.
As shown in
The Fourier transform section 116C is configured to apply the Fourier transform to the impulse response Li input from the measurement section 116A to obtain frequency spectrums of the impulse response Li (the frequency characteristic of amplitude and the frequency characteristic of phase; hereinafter referred to as frequency spectrums Lf) (S105). The Fourier transform section 116D is configured to apply the Fourier transform to the impulse response Ri input from the measurement section 116B to obtain the frequency spectrums of the impulse response Ri (the frequency characteristic of amplitude and the frequency characteristics of phase; hereinafter referred to as frequency spectrums Rf) (S105).
As above, the Fourier transform sections 116C and 116D are configured to obtain the frequency spectrums corresponding to respective speakers by applying the Fourier transform to the impulse responses corresponding to respective speakers.
In the examples of
As shown in
The phase adjustment amount calculator 116E is configured to calculate a phase adjustment amount for each frequency (in this embodiment, for each frequency point) of the sound signal input to a target speaker for which the phase of the sound signal is controlled based on the frequency spectrums Rf and Lf respectively corresponding to the speakers SPFR and SPFL. It is noted that filter coefficients generated by the filter coefficient generation process shown in
Specifically, the phase adjustment amount calculator 116E shifts (changes) the phase of the frequency spectrum Rf corresponding to the target speaker SPFR sequentially in the range of −180 to +180 degrees in predetermined angular increments (e.g., 1-degree increments), and synthesizes the frequency spectrum Rf after the phase shift with the frequency spectrum Lf (i.e., the frequency spectrum Lf as it is without phase shift) is synthesized at every phase shift (S106). This synthesis of the frequency spectrums means a synthesis of complex spectrums each containing amplitude and phase information. This synthesis process is performed not for all frequency points (2,097 points), but for a total of 88 frequency points included in the frequency range from 50 Hz to 1 kHz, for example, to reduce the processing load.
For example, a case where the amplitude and the phase of the frequency spectrum Lf at the frequency point of 200 Hz are 1 and 0 degrees, respectively, and the amplitude and the phase of the frequency spectrum Rf at the frequency point of 200 Hz is 1 and 180 degrees, respectively, will be considered. When there is no phase shift of the frequency spectrum Rf, the amplitude of the synthesized spectrum will be zero because the spectrums cancel each other due to their opposite phases. When the phase of the frequency spectrum Rf is shifted by +180 degrees, the amplitude of the synthesized spectrums will be 2 since the phase spectrums are in-phase (i.e., the phase is 0 degrees in the frequency spectrum Lf and 360 degrees in the frequency spectrum Rf; 360-degree phase shift means one rotation). As described above, the synthesized value varies depending on the shift angle of the phase of the frequency spectrum Rf.
The synthesis process at the frequency point of 250 Hz is a process of shifting the frequency spectrum Rf in the range of −180 to +180 degrees in predetermined angular increments, and synthesizing the frequency spectrum Rf at the frequency point of 250 Hz after the phase shift with the frequency spectrum Lf at the frequency point of 250 Hz at every execution of the phase shift. By connecting the synthesized values each obtained when the phase of the frequency spectrum Rf is shifted with a smooth curve (for example, an approximate curve by the least-square method or polynomial equation), a result shown by the solid line in
The synthesis process at the frequency point of 500 Hz is a process of shifting the frequency spectrum Rf in the range of −180 to +180 degrees in predetermined angular increments, and synthesizing the frequency spectrum Rf at the frequency point of 500 Hz after the phase shift with the frequency spectrum Lf at the frequency point of 500 Hz at every execution of the phase shift. By connecting the synthesized values each obtained when the phase of the frequency spectrum Rf is shifted with a smooth curve (for example, an approximate curve by the least-square method or polynomial equation), a result shown by the single-dotted line in
In
In the case of the frequency point of 250 Hz, when the phase of the sound signal input to the speaker SPFR is shifted by +135 degrees, the phase of the sound from the speaker SPFR and the phase of the sound from the speaker SPFL become substantially in-phase at the listening position, while when the phase of the sound signal input to the speaker SPFR is shifted by −45 degrees, the phase of the sound from the speaker SPFR and the phase of the sound from the speaker SPFL become substantially opposite at the listening position.
In the case of the frequency point of 500 Hz, when the phase of the sound signal input to the speaker SPFR is shifted by +160 degrees, the phase of the sound from the speaker SPFR and the phase of the sound from the speaker SPFL become substantially in-phase at the listening position, while when the phase of the sound signal input to the speaker SPFR is shifted by −20 degrees, the phase of the sound from the speaker SPFR and the phase of the sound from the speaker SPFL become substantially opposite at the listening position.
The phase adjustment amount calculator 116E calculates the value of the phase shift (hereinafter, referred to as a “phase adjustment amount”) of the frequency spectrum Rf at each frequency point to make the phase of the sound from the speaker SPFR and the phase of the sound from the speaker SPFL substantially in-phase at the listening position (S107).
In the present embodiment, in order to increase the sound pressure at the listening position, the phase adjustment amount of the frequency spectrum Rf at each frequency point necessary for making the phase of the sound from the speaker SPFR and the phase of the sound from the speaker SPFL substantially in-phase at the listening position is obtained.
It is noted that the phase adjustment amounts differ greatly depending on the frequency due to the difference in propagation delay time at each frequency, which is caused by reflection, shield, interference, etc. of sounds in the vehicle interior, as shown in
As shown in
Specifically, the band detection section 116F allocates the phase adjustment amounts (see
The band detection section 116F converts the phase adjustment amounts allocated to be negative in step S108 to absolute values, i.e., converts the negative phase adjustment amounts to positive phase adjustment amounts (S109). According to the present embodiment, in order to facilitate a threshold judgment in step S112 described below, the negative phase adjustment amounts are converted into the positive phase adjustment amounts (S109). That is, in order to easily perform a threshold determination in S112 (described later) (specifically, for example, in order to determine that the phase adjustment amount is less than the threshold value when the phase adjustment amount is greater than −90 degrees and less than +90 degrees, and determine that the phase adjustment amount is greater than the threshold value when the phase adjustment amount is less than −90 degrees or greater than +90 degrees), the negative phase adjustment amounts are converted into the positive phase adjustment amounts (S109).
The band detection section 116F synthesizes the positive phase adjustment amounts allocated in S108 and the phase adjustment amounts converted from negative to positive (i.e., the phase adjustment amounts converted to absolute value) in S109 (S110).
The band detection section 116F performs smoothing of the phase adjustment amounts of respective frequency points after synthesis in step S110 along the frequency axis (S111). The smoothing is, for example, a process to remove noise and singularities from a graph. In this embodiments, the smoothing is performed using, for example, a low-pass filter with an FIR of 8 taps.
The band detection section 116F detects bands having leading phases, or in other words, the bands that cause pre-echoes, using a predetermined threshold (S112). Specifically, the band detection section 116F detects the bands each including the frequency point at which the phase adjustment amount is +90 degrees or more among the frequency points after smoothing in step S111 as the bands that has the leading phases. The bands detected in step S112 will be hereinafter referred to as the “leading phase bands.”
In step S112, since first and second lagging phase data described below are data that control the phase at an interval of 180 degrees, the threshold is set to +90 degrees, which is half the value of 180 degrees. It is noted, however, this threshold (i.e., +90 degrees) is only one example, and may be another value, such as +45 degrees or +135 degrees, for example.
As shown in
Specifically, the phase converter 116G generates data in which a value (i.e., amplitude) of 1 is set to the leading phase bands detected in S112, and a value of 0 is set to the other frequency bands (i.e., the frequency bands in each of which the phase adjustment amount is less than +90 degrees) (S113).
In the present embodiment, two leading phase bands are detected. Of the two leading phase bands shown in
In the setting result shown in
As shown in
In the present embodiment, since the two leading phase bands are detected in step S112, there are two areas, along the frequency axis, where the value changes from 0 to 1 (see frequencies f1a and f1b in
It is considered herein a case where no smoothing is performed in step S111. In such a case, the band detection section 116F detects the leading phase bands based on the phase adjustment amount at each frequency point after synthesis in S110. In this case, there are a total of four leading phase bands to be detected. Therefore, the first and second lagging phase data generated in S114 will be data having a lagging phase of at most 720 degrees. The larger the lagging phase of the lagging phase data is, the better the effect of reducing the pre-echo is. However, too much delay in the phase deteriorates the accuracy of the compensation of the frequency characteristics of the sound signals using the FIR filter and reduces the effect of improving the sound pressure and/or sound quality. In addition, the larger the lagging phase is, the steeper the phase change along the frequency axis becomes, and the more likely it is that abnormal sounds will be generated. Therefore, according to the present embodiment, the smoothing is performed in S111 to reduce the number of the leading phase bands to be detected in S112 so that the first and second lagging phase data do not have an excessive lagging phase.
In this embodiment, a phase shift of −180 degrees is applied in one step (e.g., so as to continuously and smoothly change from 0 to −180 degrees) as shown in
The above-described shift angle (i.e., −180 degrees) is only one example. The shift angle may be a different angle, such as −45 degrees or −90 degrees. The shift angles respectively corresponding to the first lagging phase data and the second lagging phase data may be different angles.
When there is a point, along the frequency axis, where the phase change is too steep, an abnormal noise may be generated easily. Therefore, the phase converter 116G is configured to apply the smoothing to the first and second lagging phase data along the frequency axis (S115). The smoothing is performed by, for example, a low-pass filter using the FIR with the tap number of 16.
As shown in
Specifically, the filter coefficient generator 116H converts the first lagging phase data, which represents signals in the frequency domain, into an impulse response, which represents signals in the time domain, and converts the second lagging phase data, which represents signals in the frequency domain, into an impulse response, which represents signals in the time domain, by applying the inverse Fourier transform. Next, the filter coefficient generator 116H convolutes the impulse response obtained by converting the first lagging phase data and the impulse response obtained by converting the second lagging phase data to obtain the convolved impulse responses as the filter coefficients corresponding to the speaker SPFR (S116). In other words, the filter coefficient generator 116H generates the filter coefficients corresponding to the speaker SPFR by convolving the two impulse responses obtained by the inverse Fourier transform. The filter coefficients are hereinafter referred to as the “filter coefficients FC.”
Next, an operation of playing back the sound signal input from the sound source using the filter coefficients FC generated by the calculator 116.
The recording medium playback unit 108 plays back sound signals SR and SL input from a sound source such as a CD or a DVD (hereinafter, also referred to as “audio signals SR and SL”). The controller 100 outputs the audio signals SL and SR played back by the recording medium playback unit 108 to the FIR filter 110.
The FIR filter 110 compensates the frequency characteristics of the phases of the sound signals by convolving the filter coefficients FC generated by the calculator 116 into the audio signals to be input to the target speaker (in this embodiment, the audio signal SR to be input to the speaker SPFR). Since the data, which is obtained by converting the phases of the leading phase bands into the lagging phases (i.e., the first and second lagging phase data) and further converting into the impulse response, is convolved into the audio signal SR as the filter coefficients, the sound pressure and sound quality (in the present embodiment, the sound pressure only) can be improved while reducing the pre-echo.
It is noted that the FIR filter 110 is configured to output the audio signals to be input to speakers that are not the target speakers (hereinafter, referred to as “non-target speakers”) without compensating the frequency characteristic of the phase by a through output. The audio signals SR and SL output from the FIR filter 110 are output to the vehicle interior via the amplifier 112 and then the speakers SPFR and SPFL, respectively. By compensating the frequency characteristics of the phase using the FIR filter 110, a music piece or the like of which sound pressure and sound quality are improved while the pre-echo being reduced is played back in the vehicle interior.
In each of
In each of
In the examples of
In the examples of
Since the front passenger's seat is relatively close to the driver's seat, as shown in
It is noted that aspects of the present disclosures should not be limited to the configuration of the above-described embodiments, but various modifications are possible within aspects of the technical concept of the present disclosures. For example, an appropriate combination of configurations explicitly or inexplicitly disclosed or suggested in the above description may be fallen within aspects of the present disclosures.
In the above embodiment, a case where the impulse response is measured at the driver's seat is described, but the same process may be performed for each seat. In such a case, the controller 100 may retain the filter coefficients FC generated when the impulse responses are measured at respective seats as preset data. The listener may arbitrarily switch the filter coefficients FC for reducing the pre-echo by operating the operation panel 104 to select the preset data.
The above embodiment describes the process when two front speakers are arranged in the vehicle interior. Aspects of the present disclosure should not be limited to such a configuration, and the same process can be used to generate filter coefficients FC for reducing pre-echo when more speakers are arranged in the vehicle interior.
As an example, it is considered a case where two rear speakers are further arranged in the vehicle interior in addition to the two front speakers (i.e., a total of four speakers). In such a case, the frequency spectrum of the impulse response between each speaker and the listening position (i.e., four impulse responses) is obtained in steps S101 to S105, the phase adjustment amount (e.g., the frequency spectrum Rf corresponding to the target speaker SPFR) is calculated to make the phase of the sound from each of the four speakers substantially in-phase at the listening position in steps S106 to S107, the amount of the phase adjustment to make the phase of the sound from each of the four speakers substantially in-phase at the listening position (e.g., the phase adjustment amount for each frequency point of the frequency spectrum Rf corresponding to the target speaker SPFR) is obtained in steps S106 to S107, the leading phase band is detected in steps S108 to S112, and the filter coefficients are generated after converting the phase of the leading phase band to the lagging phase in steps S113 to S116. As a result, the filter coefficients FC for reducing the pre-echo in an acoustic system with four speakers are generated.
In another embodiment, both the speakers SPFR and SPFL may be the target speakers to be controlled. In such a case, in S106 to S107, the phase adjustment amount to make the phase of the sound from each speaker substantially in-phase at the listening position is obtained for both the frequency spectrum Rf and the frequency spectrum Lf, and in S108 to S116, the filter coefficients corresponding to the speakers SPFR and SPFL, respectively, are generated. In this way, a plurality of speakers including the speaker closest to the listening position may be the target speakers to be controlled.
In the above embodiment, in order to increase the sound pressure at the listening position, the leading phase band is detected based on the phase adjustment amount at each frequency point to make the phase of the sound from each speaker substantially in-phase at the listening position, and the filter coefficients are generated after converting the phase of this leading phase band to the lagging phase. However, the present disclosures are not necessarily be limited to such a configuration. For example, to improve the sound quality at the listening position, the leading phase band may be detected based on the phase adjustment amount for each frequency point, which is suitable for reducing peaks and dips in the frequency domain at the listening position, and the filter coefficients may be generated after converting the phase of this leading phase band to a lagging phase.
Number | Date | Country | Kind |
---|---|---|---|
JP2020-105371 | Jun 2020 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
8116465 | Asada | Feb 2012 | B2 |
20050244012 | Asada | Nov 2005 | A1 |
20080033726 | Kudoh | Feb 2008 | A1 |
20090304213 | Cook | Dec 2009 | A1 |
20190342662 | Fukue | Nov 2019 | A1 |
Number | Date | Country |
---|---|---|
2001-117595 | Apr 2001 | JP |
2004-325284 | Nov 2004 | JP |
2005-341534 | Dec 2005 | JP |
2012-195791 | Oct 2012 | JP |
2020-137040 | Aug 2020 | JP |
2020-137041 | Aug 2020 | JP |
Number | Date | Country | |
---|---|---|---|
20210400387 A1 | Dec 2021 | US |