The present disclosure relates to a technology of loudly producing a predetermined reproduction sound to a vicinity and enabling a person in the vicinity and a remote utterer to communicate with each other.
For instance, Patent Literature 1 describes an acoustic echo canceler device with a loudly producing function. The device includes a sound detector that detects a level of a reception signal from a far-end and outputs a signal for suspending a modification of a coefficient to a coefficient modifier when the level of the reception signal indicates a given level or lower. That is to say, the modifier modifies the coefficient of an echo canceller only when receiving an input of a reception signal having the given level or higher.
Moreover, for instance, Patent Literature 2 describes a sound communication device having operability of resetting an acoustic echo cancelation function (adaptive filter) through a remote manipulation. The device thus enables confirmation as to whether sound data sent from its own terminal is appropriately reproduced at a destination terminal in sound communication via a communication network.
However, each conventional art makes it difficult for a remote utterer to confirm whether a reproduction sound is loudly produced, and thus still needs further improvement.
Patent Literature 1: Japanese Unexamined Patent Publication No. SHO 63-79449
Patent Literature 2: Japanese Patent Publication No. 5245884
The present disclosure has been accomplished to overcome the drawbacks described above, and has an object of providing a technology that achieves cancelation of an acoustic echo attributed to loud producing of a communication sound of a remote utterer and enables the remote utterer to confirm whether the reproduction sound is loudly produced.
A communication device according to the present disclosure is a communication device that is configured to loudly produce a predetermined reproduction sound to a vicinity and enables a person in the vicinity and a remote utterer to communicate with each other. The communication device includes: a first adder that adds the reproduction sound to a communication sound of the remote utterer and outputs the communication sound additionally including the reproduction sound to a speaker; and an acoustic echo canceler that cancels, on the basis of the communication sound to serve as a reference signal, an echo component of the communication sound contained in an input signal acquired by a microphone configured to collect a sound in the vicinity.
The present disclosure achieves cancelation of an acoustic echo attributed to loud producing of a communication sound of a remote utterer, and enables remote utterer to confirm whether a reproduction sound is loudly produced.
A conventional transportation system makes a vehicle transport an article, the vehicle being configured to autonomous travel and to travel under remote control. In the transportation system, a remote supervisor remotely controls a movement of the vehicle, and the remote supervisor and a recipient to receive the article are required to communicate with each other. The vehicle includes a speaker, a microphone, and an acoustic echo canceler, and the recipient of the article communicates with the remote supervisor through loud sound production. The speaker loudly produces a communication sound of the remote supervisor, and the microphone collects an uttering sound of the recipient. The acoustic echo canceler cancels an echo component of the communication sound of the remote supervisor contained in an input signal acquired by the microphone.
The vehicle further loudly reproduces a reproduction sound saying, for example, “the vehicle leaves” or “the vehicle moves backward”, to draw attention of a person in a vicinity. The remote supervisor starts to move the vehicle after giving an instruction of the loud producing of the reproduction sound. The remote supervisor is thus required to confirm that the reproduction sound is loudly produced from the vehicle and that the loud producing of the reproduction sound is finished.
In this regard, in Patent Literature 1, a reproduction sound coming from the speaker to the microphone is removed as an acoustic echo as well as a reception signal coming from the speaker to the microphone in a case where the speaker loudly produces the reproduction sound together with the reception signal at a far-end. Hence, the reproduction sound is inaudible to the remote supervisor, and the remote supervisor faces difficulty in confirming whether the reproduction sound is appropriately loudly produced.
Further, in Patent Literature 2, a filter coefficient is reset to allow one's own terminal to confirm loud producing of a sound at a destination terminal without removing an echo coming from a speaker to a microphone by an cancelation way. However, the echo coming from the speaker to the microphone is removed in progress of an update of the filter coefficient by the echo canceling way. This may lead to a failure at confirming a finish of loud producing of a reproduction sound depending on a length of the reproduction sound.
Additionally, in Patent Literature 2, a sound signal and the reproduction sound of the one's own terminal are transmitted from the destination terminal. This may make the reproduction sound be superimposed on the sound signal of the one's own terminal and make the reproduction sound inaudible, resulting in a failure at confirming whether the reproduction sound is loudly produced in an appropriate manner.
The following technologies will be described to overcome the drawbacks.
In this configuration, the acoustic echo canceler receives the communication sound of the remote utterer to serve as a reference signal, and the acoustic echo canceler avoids receiving the reproduction sound as the reference signal. Accordingly, an echo component of the communication sound is canceled by the acoustic echo canceler from the communication sound additionally including the reproduction sound coming from the speaker to the microphone, but an echo component of the reproduction sound is output toward the remote utterer without being canceled.
The configuration hence achieves cancelation of an acoustic echo attributed to loud producing of the communication sound of the remote utterer, and enables the remote utterer to confirm whether the reproduction sound is loudly produced.
This configuration avoids the update of the filter coefficient of the adaptive filter when the reproduction sound is detected, and thus enables accurate cancelation of only the echo component of the communication sound of the remote utterer.
This configuration in which the speaker outputs the reproduction sound having the adjusted sound volume and the communication sound allows the reproduction sound to be reliably audible to a person in the vicinity of the speaker.
This configuration achieves automatic adjustment of the sound volume indicated by the output signal from the acoustic echo canceler. The configuration thus allows a sound volume of an echo of the reproduction sound and a sound volume of an uttering sound of a near-end utterer in the vicinity of the microphone to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
This configuration includes setting the first lowering gain to be multiplied by an output signal from the acoustic echo canceler so that a sound volume of an echo of the reproduction sound which is audible to the remote utterer and a sound volume of an uttering sound of a near-end utterer are equal to each other when the reproduction sound is detected. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and the sound volume of the uttering sound of the near-end utterer in the vicinity of the microphone to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In this configuration, a sound volume of the reproduction sound output from the speaker differs from a sound volume of the reproduction sound input into the acoustic echo canceler to serve as the reference signal. Hence, an echo component in a part of the reproduction sound is output without being canceled by the acoustic echo canceler. The configuration accordingly enables the remote utterer to confirm whether the reproduction sound is loudly produced.
This configuration avoids the update of the filter coefficient of the adaptive filter when the reproduction sound is detected, and thus enables accurate cancelation of only the echo component of the communication sound of the remote utterer.
This configuration includes setting the first lowering gain to be multiplied by an output signal from the acoustic echo canceler, depending on a situation of removal of an echo by the acoustic echo canceler, so that a sound volume of an echo of the reproduction sound and a sound volume of an uttering sound of a near-end utterer which are audible to the remote utterer are equal to each other when the reproduction sound is detected. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and the sound volume of the uttering sound of the near-end utterer in the vicinity of the microphone to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In this configuration, when the filter coefficient is not updated and the filter coefficient has a small power value, the echo cancelation amount reduces and the sound volume of the echo of the reproduction sound output from the acoustic echo canceler increases. Hence, the echo component of the reproduction sound is suppressed or lowered with the first lowering gain until the power value of the filter coefficient increases in progress of the update of the filter coefficient. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and a sound volume of an uttering sound of a near-end utterer in the vicinity of the microphone to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In this configuration, the echo cancelation amount shows, for example, a ratio between the level of the input signal input into the acoustic echo canceler from the microphone and the level of the output signal from the acoustic echo canceler. When the acoustic echo canceler does not cancel the echo, the echo cancelation amount reaches “1” and the sound volume of the echo of the reproduction sound output from the acoustic echo canceler increases. Hence, the echo component of the reproduction sound is suppressed or lowered with the first lowering gain until the acoustic echo canceler satisfactorily cancels the echo. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and a sound volume of an uttering sound of a near-end utterer in the vicinity of the microphone to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In this configuration, the acoustic echo canceler removes the echo component of the reproduction sound coming from the speaker to the microphone. Then, the reproduction sound is added to the output signal from the acoustic echo canceler excluding the echo component of the reproduction sound, and an output signal additionally including the reproduction sound having not passed through an acoustic echo route is output toward the remote utterer. This enables the remote utterer to confirm a clear reproduction sound without an acoustic echo.
This configuration achieves adjustment of the sound volume of the reproduction sound to be added to the output signal from the acoustic echo canceler. Consequently, the configuration allows the sound volume of the reproduction sound and a sound volume of an uttering sound of a near-end utterer in the vicinity of the microphone to agree with each other in such a manner that both the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In this configuration, when the control signal indicates returning of the communication sound to the remote location, the switch switches to the OFF-mode of avoiding the inputting of the communication sound as the reference signal into the acoustic echo canceler. The configuration thus keeps the acoustic echo canceler from removing the communication sound and enables the remote utterer to confirm that the communication sound of the remote utterer is loudly produced at a destination.
This configuration achieves adjustment of the communication sound to a sound volume which is audible to the remote utterer when the control signal indicates returning of the communication sound to the remote location. The configuration in which the communication sound having the sound volume adjusted to be audible to the remote utterer by the third sound volume adjuster is added to the output signal from the acoustic echo canceler enables the remote utterer to confirm that the communication sound of the remote utterer is loudly produced at a destination.
In this configuration, when the control signal indicates returning of the communication sound to the remote location, the switch switches to the OFF-mode of avoiding the inputting of the output signal from the first adder as the reference signal into the acoustic echo canceler. The configuration thus keeps the acoustic echo canceler from removing the communication sound and enables the remote utterer to confirm that the communication sound of the remote utterer is loudly produced at a destination.
This configuration achieves adjustment of the communication sound to a sound volume which is audible to the remote utterer when the control signal indicates returning of the communication sound to the remote location. The configuration in which the communication sound having the sound volume adjusted to be audible to the remote utterer by the third sound volume adjuster is added to the output signal from the second adder enables the remote utterer to confirm that the communication sound of the remote utterer is loudly produced at a destination.
The configuration enables reading-out of a reproduction sound stored by the communication device, and allows the speaker to output the read out reproduction sound.
The configuration enables receiving of the reproduction sound from the remote communication device and outputting of the received reproduction sound from the speaker.
Moreover, the disclosure can be realized as: a communication device including the above-described distinctive configuration; and a communication method executing distinctive ways each corresponding to the distinctive configuration of the communication device. Additionally, the disclosure can be realized by a computer program causing a computer to execute the distinctive ways included in the communication method. From these perspectives, the same advantageous effects as those of the communication device are achievable in the following other aspects.
Embodiments of this disclosure will be described with reference to the accompanying drawings. Each of the embodiments which will be described below represents a specific example of the disclosure. Numeric values, shapes, constituent elements, steps, and the order of the steps described below in each embodiment are mere examples, and thus should not be construed to delimit the disclosure. Moreover, constituent elements which are not recited in the independent claims each showing the broadest concept among the constituent elements in the embodiments are described as selectable constituent elements. The respective contents are combinable with each other in all the embodiments.
The communication system is mountable to a vehicle. The vehicle is, for example, an electric robot, an electric vehicle, an electric truck, or an electric drone. For instance, the vehicle moves within a predetermined region to transport an article of a user. The vehicle is supervised by a remote supervisor. The vehicle is configured to autonomously travel and to travel under remote control.
The communication system shown in
The communication device 100 is configured to loudly produce a predetermined reproduction sound to a vicinity and enables a person in the vicinity and a remote supervisor to communicate with each other. The remote supervisor serves as an example remote utterer. The reproduction sound includes a sound or voice saying, for example, “the vehicle leaves” or “the vehicle moves backward”, to draw attention of the person in the vicinity. For instance, the reproduction sound is loudly produced when the vehicle starts to move.
The communication sound input part 21 receives a remote communication sound of the remote supervisor from a remote supervising device (not shown) at a receiver, and outputs the remote communication sound to the communication device 100.
The reproduction sound input part 22 outputs the reproduction sound to the communication device 100.
The speaker 1 loudly produces the remote communication sound of the remote supervisor and loudly produces the reproduction sound. Here, when the microphone 2 collects the sound output from the speaker 1, a speaker at the receiver reproduces a sound uttered by an utterer at the receiver in delay. As a result, a so-called acoustic echo occurs. In this regard, an acoustic echo canceler 3 to be described later suppresses an acoustic echo signal contained in an input signal output from the microphone 2.
The microphone 2 is configured to collect a sound or voice in a vicinity. The microphone 2 collects a sound or voice of a near-end utterer (sender). The microphone 2 outputs an input signal indicating the collected sound to the communication device 100.
The communication device 100 includes the acoustic echo canceler 3, a first amplifier 4, a first adder 5, and a reproduction sound detector 6. The communication device 100 may include at least one of the speaker 1 and the microphone 2.
The acoustic echo canceler 3 cancels, on the basis of the remote communication sound of the remote supervisor to serve as a reference signal, an echo component of the remote communication sound contained in the input signal acquired by the microphone 2. The acoustic echo canceler 3 receives an input of a remote communication sound xf(j) to serve as the reference signal and an input of an input signal y(j) from the microphone 2. The acoustic echo canceler 3 removes an acoustic echo attributed to the remote communication sound loudly produced by the speaker 1 and coming to the microphone 2. The acoustic echo canceler 3 outputs an echo removal signal e(j) excluding the echo component of the remote communication sound xf(j) from the input signal y(j) acquired by the microphone 2. A configuration of the acoustic echo canceler 3 will be described in detail later.
The first amplifier 4 controls a sound volume of the reproduction sound in such a manner that the reproduction sound is audible in a loud sound required area. The first amplifier 4 serves as an example first sound volume adjuster. The first amplifier 4 adjusts the sound volume of the reproduction sound and outputs the reproduction sound having the adjusted sound volume to the first adder 5.
The first adder 5 adds the reproduction sound output from the first amplifier 4 to the remote communication sound output from the communication sound input part 21. The first adder 5 adds the reproduction sound to the remote communication sound of the remote utterer, and outputs the communication sound additionally including the reproduction sound to the speaker 1.
The remote communication sound xf(j) serves as a reference signal for the acoustic echo canceler 3. The first amplifier 4 multiplies a reproduction sound xr(j) by a predetermined gain G1. The gain G1 is determined in advance depending on an environment for use of the communication device 100. The gain G1 has a value larger than “0”. The first adder 5 adds “G1·xr(j)” to the remote communication sound xf(j). The speaker 1 loudly produces “xf(j)+G1·xr(j)”. The acoustic echo canceler 3 deduces an impulse response H to reach the microphone 2 from the speaker 1.
The reproduction sound detector 6 detects presence or absence of the reproduction sound xr(j). The reproduction sound detector 6 calculates an amplitude level xr˜(j) of the reproduction sound xr(j) by short-time averaging processing for the amplitude as expressed by the following Equation (1). The reproduction sound detector 6 determines whether the amplitude level xr˜(j) is higher than a threshold.
In Equation (1), the sign “|xr(j)|” denotes an absolute value of the reproduction sound xr(j), the sign “xr˜(j)” denotes a short-time averaging value of absolute values of the reproduction sound xr(j), the sign “j” denotes a time, and the sign “B” denotes an update gain (0<B<1).
The reproduction sound detector 6 determines presence of a reproduction sound when the amplitude level is higher than the threshold. In contrast, the reproduction sound detector 6 determines absence of the reproduction sound when the amplitude level is equal to or lower than the threshold.
The acoustic echo canceler 3 deduces an impulse response to reach the microphone 2 from the speaker 1 when the reproduction sound detector 6 does not detect a reproduction sound. The acoustic echo canceler 3 suspends the deduction of the impulse response to reach the microphone 2 from the speaker 1 when the reproduction sound detector 6 detects a reproduction sound. In other words, the acoustic echo canceler 3 suspends an update of a filter coefficient of an adaptive filter when the reproduction sound detector 6 detects a reproduction sound.
The communication sound output part 23 outputs the echo removal signal e(j) output from the acoustic echo canceler 3 to serve as a near-end communication sound.
The communication sound input part 21, the reproduction sound input part 22, and the communication sound output part 23 are connected to a communicator (not shown). The communicator is configured to send the near-end communication sound to the remote supervising device (not shown) at the receiver via a network and receive a remote communication sound from the remote supervising device (not shown) at the receiver via the network. The communicator is further configured to receive a reproduction sound from the remote supervising device (not shown) at the receiver via the network. The network includes, for example, the internet.
The acoustic echo canceler 3 includes an adaptive filter 31, a subtractor 32, and a filter coefficient updater 33.
The adaptive filter 31 generates a deduced echo yhat(j) through convolution of a filter coefficient HHAT(j) and a reference signal series Xf(j).
The subtractor 32 subtracts the deduced echo yhat(j) from the input signal y(j) from the microphone 2 to remove an echo component of the remote communication sound xj(j) contained in the input signal y(j). The subtractor 32 outputs the echo removal signal e(j) excluding the echo component from the input signal y(j).
The filter coefficient updater 33 updates the filter coefficient HHAT(j) by utilzing the echo removal signal e(j) output from the subtractor 32 and the reference signal series Xf(j).
The input signal y(j) output from the microphone 2 is expressed with the following Equation (2) including the input signal series Xf(j)+G1·Xr(j) of the speaker 1 and an impulse response H in an acoustic echo route from the speaker 1 to the microphone 2.
In Equation (2), the values of “Xf(j)”, “Xr(j)”, and “H” are respectively expressed with the following Equation (3), Equation (4), and Equation (5). In Equation (2), the sign “*” denotes a convolution operation.
In Equation (3), Equation (4), and Equation (5), the sign “N” denotes a filter tap length.
Here, the number of taps for the impulse response H is defined as the same as the number of taps for the filter coefficient HHAT(j) of the acoustic echo canceler 3. Actually, the impulse response attenuates as a time elapses. In this respect, when the acoustic echo canceler 3 enables satisfactory removal of the acoustic echo, the number of taps for the impulse response H can be defined as the number of taps for the filter coefficient HHAT(j) without any practical problem.
The deduced echo yhat(j) generated by the adaptive filter 31 is expressed with the following Equation (6).
In Equation (6), the filter coefficient “HHAT(j)” is expressed with the following Equation (7).
The echo removal signal e(j) output from the subtractor 32 is expressed with the following Equation (8).
When the reproduction sound detector 6 does not detect the reproduction sound, the filter coefficient HHAT(j) updated by the filter coefficient updater 33 is expressed with the following Equation (9).
Here, the value of “Δ” in Equation (9) is expressed with the following Equation (10).
Further, the sign “∥Xf(j)∥” in Equation (10) denotes a norm of “Xf(j)” and the value thereof is expressed with the following Equation (11). The sign “α” denotes a gain (0≤α≤1).
Equation (9) employs a normalized mean least squares algorithm as an algorithm for the update of the coefficient, but may employ another algorithm as the algorithm for the update of the coefficient.
When the reproduction sound detector 6 detects a reproduction sound, the filter coefficient HHAT(j) whose update by the filter coefficient updater 33 is suspended is expressed with the following Equation (12).
A whole area is processed at once concerning Equation (2) to Equation (12), but a plurality of bands resulting from dividing the whole area may be processed. In each of Equation (2) to Equation (12), the processing is based on a time domain, but may be based on a frequency domain.
Correct deduction of the impulse response H in the actual acoustic echo route with the filter coefficient HHAT(j) of the acoustic echo canceler 3 provides “HHAT(j)≈H”. The reference signal for the acoustic echo canceler 3 reaches “Xf(j)” in the case of the reproduction sound series Xr(j)=0. In Equation (8), in the case of “HHAT(j)≈H” and “Xr(j)=0”, the echo removal signal e(j) output from the acoustic echo canceler 3 results in “e(j)≈0”. This consequently achieves removal of the echo.
Subsequently, an operation by the communication device 100 according to the first embodiment of the disclosure will be described.
First, in step S1, each of the acoustic echo canceler 3 and the first adder 5 acquires a remote communication sound output from the communication sound input part 21. The communication sound input part 21 outputs the remote communication sound received from a remote supervising device (not shown) at a receiver to each of the acoustic echo canceler 3 and the first adder 5. The acoustic echo canceler 3 acquires only the remote communication sound as a reference signal.
Next, in step S2, the first amplifier 4 acquires a reproduction sound output from the reproduction sound input part 22. The reproduction sound input part 22 outputs the reproduction sound received from the remote supervising device (not shown) at the receiver to the first amplifier 4.
Then, in step S3, the first amplifier 4 adjusts a sound volume of the reproduction sound. The first amplifier 4 multiplies the reproduction sound by a predetermined gain G1. The first amplifier 4 outputs the reproduction sound having the adjusted sound volume to the first adder 5.
Thereafter, in step S4, the first adder 5 adds the reproduction sound having the sound volume adjusted by the first amplifier 4 to the remote communication sound.
Further, in step S5, the first adder 5 outputs the remote communication sound additionally including the reproduction sound to the speaker 1. The speaker 1 loudly produces the remote communication sound additionally including the reproduction sound.
Next, in step S6, the acoustic echo canceler 3 acquires an input signal from the microphone 2. At this time, the microphone 2 collects a sound in a vicinity and outputs the input signal to the acoustic echo canceler 3. The input signal includes a sound uttered by a near-end utterer and the remote communication sound additionally including the reproduction sound coming from the speaker 1 to the microphone 2.
Then, in step S7, the reproduction sound detector 6 detects presence or absence of the reproduction sound.
Thereafter, in step S8, the reproduction sound detector 6 determines whether the reproduction sound is detected. When it is determined that the reproduction sound is not detected (NO in step S8), the process proceeds to step S10. The reproduction sound detector 6 outputs, to the acoustic echo canceler 3, a detection result signal det indicating whether the reproduction sound is detected.
In contrast, when it is determined that the reproduction sound is detected (YES in step S8), the acoustic echo canceler 3 suspends an update of a filter coefficient of the adaptive filter 31 in step S9.
Next, in step S10, the acoustic echo canceler 3 cancels, on the basis of the remote communication sound to serve as the reference signal, an echo component of the remote communication sound contained in the input signal acquired by the microphone 2.
Subsequently, in step S11, the acoustic echo canceler 3 outputs an echo removal signal excluding the echo component of the remote communication sound from the input signal. The acoustic echo canceler 3 outputs the echo removal signal to the communication sound output part 23. The communication sound output part 23 sends, to the remote supervising device (not shown) at the receiver, the echo removal signal output from the acoustic echo canceler 3 to serve as a near-end communication sound.
Conclusively, the acoustic echo canceler 3 receives the communication sound of a remote utterer to serve as a reference signal, and the acoustic echo canceler 3 avoids receiving the reproduction sound as the reference signal. Accordingly, an echo component of the communication sound is canceled by the acoustic echo canceler 3 from the communication sound additionally including the reproduction sound coming from the speaker 1 to the microphone 2, but an echo component of the reproduction sound is output toward the remote utterer without being canceled. The configuration hence achieves cancelation of an acoustic echo attributed to loud producing of the communication sound of the remote utterer, and enables the remote utterer to confirm whether the reproduction sound is loudly produced.
In the configuration in
A reproduction sound xr(j) having a sound volume adjusted by the first amplifier 4 is added to the remote communication sound xf(j) by the first adder 5. The reproduction sound xr(j) added to the remote communication sound xf(j) is loudly produced by the speaker 1 and collected by the microphone 2. The reproduction sound xr(j) is not included in the reference signal series Xf(j) for the acoustic echo canceler 3. Thus, the deduced echo yhat(j) does not contain an echo component H*G1·Xr(j) of the reproduction sound. The echo component of the reproduction sound xr(j) is sent to the remote supervising device without being removed by the acoustic echo canceler 3. In other words, the echo component of the remote communication sound xf(j) is removed, but the echo component of the reproduction sound xr(j) is not removed. This allows the acoustic echo of the reproduction sound xr(j) to be audible to the remote supervisor.
In the first embodiment, the communication device 100 may exclude the reproduction sound detector 6. In the first embodiment, the communication device 100 may further exclude the first amplifier 4.
For example, a sound volume of a reproduction sound to encourage a person in a vicinity of a vehicle to pay attention is set to a large value in such a manner that the reproduction sound is reliably audible to the person in the vicinity. In this example, a sound volume of a reproduction sound collected by the microphone 2 may be larger than a sound volume of an uttering sound of a near-end utterer in the vicinity. Besides, when a distance between the speaker 1 and the microphone 2 is short, the sound volume of the reproduction sound collected by the microphone 2 is much larger. In the communication device 100 in the first embodiment, a sound volume of a reproduction sound may be larger than a sound volume of an uttering sound of a near-end utterer in the vicinity. Such a difference between the sound volumes of the two sounds may make it difficult to allow an echo of the reproduction sound and the uttering sound to be audible. In this regard, a communication device according to the second embodiment performs automatic adjustment of a gain so that a sound volume indicated by an echo removal signal output from an acoustic echo canceler 3 does not exceed a threshold.
The communication system shown in
The communication device 101 includes the acoustic echo canceler 3, a first amplifier 4, a first adder 5, a reproduction sound detector 6, and an automatic gain controller 7. The communication device 101 shown in
The acoustic echo canceler 3 outputs an echo removal signal to the automatic gain controller 7.
The automatic gain controller 7 automatically adjusts a sound volume indicated by an output signal from the acoustic echo canceler 3. The automatic gain controller 7 automatically adjusts a gain so that a level of a signal having been input to fall within a predetermined range. When a sound volume indicated by the input signal exceeds the predetermined range, the automatic gain controller 7 reduces an amplitude of the input signal in such a manner that a sound volume of an echo of a reproduction sound and a sound volume of an uttering sound of a near-end utterer agree with each other. This allows both the echo of the reproduction sound and the uttering sound of the near-end utterer to be easily audible at a remote location.
Subsequently, an operation by the communication device 101 according to the second embodiment of the disclosure will be described.
Step S21 to step S30 in
Next, in step S31, the automatic gain controller 7 automatically adjusts a sound volume indicated by an echo removal signal output from the acoustic echo canceler 3. The automatic gain controller 7 automatically adjusts a gain so that the sound volume indicated by the echo removal signal falls within a predetermined range.
Subsequently, in step S32, the automatic gain controller 7 outputs the echo removal signal indicating the adjusted sound volume. The automatic gain controller 7 outputs the echo removal signal indicating the adjusted sound volume to the communication sound output part 23. The communication sound output part 23 sends, to a remote supervising device (not shown) at a receiver, the echo removal signal output from the automatic gain controller 7 to serve as a near-end communication sound.
This configuration achieves automatic adjustment of the sound volume indicated by the output signal from the acoustic echo canceler 3. Conclusively, the configuration allows a sound volume of an echo of a reproduction sound and a sound volume of an uttering sound of a near-end utterer in the vicinity of the microphone 2 to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In the second embodiment, the communication device 101 may exclude the reproduction sound detector 6. In the second embodiment, the communication device 101 may further exclude the first amplifier 4.
The third embodiment aims at reducing a difference between a sound volume of an echo of a reproduction sound and a sound volume of an uttering sound of a near-end utterer at a remote location in the same manner as the second embodiment.
The communication system shown in
The communication device 102 includes an acoustic echo canceler 3, a first amplifier 4, a first adder 5, a reproduction sound detector 6, a lowering gain setter 8, and a variable amplifier 9. The communication device 102 shown in
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 8 sets a first lowering gain for lowering a sound volume indicated by an output signal from the acoustic echo canceler 3. When the reproduction sound detector 6 does not detect the reproduction sound, the lowering gain setter 8 sets a second lowering gain for avoiding the lowering of the sound volume indicated by the output signal from the acoustic echo canceler 3.
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 8 sets a lowering gain Gsup(j) for the variable amplifier 9 to the first lowering gain Gsup0 for suppressing a signal. When the reproduction sound detector 6 does not detect the reproduction sound, the lowering gain setter 8 sets the lowering gain Gsup(j) for the variable amplifier 9 to the second lowering gain Gsup1 for avoiding the suppressing of the signal. The first lowering gain Gsup0 has a predetermined value to fall within a range “0<Gsup0<1”. The second lowering gain Gsup1 is “1”. The value of the first lowering gain Gsup0 is preset so that a sound volume of an echo of the reproduction sound which is audible at the remote location and a sound volume of the uttering sound of the near-end utterer are equal to each other.
The variable amplifier 9 multiplies an output signal from the acoustic echo canceler 3 by the first lowering gain or the second lowering gain set by the lowering gain setter 8.
Subsequently, an operation by the communication device 102 according to the third embodiment of the disclosure will be described.
Step S41 to step S49 in
Next, in step S50, the lowering gain setter 8 sets the first lowering gain Gsup0 for lowering a sound volume indicated by an output signal from the acoustic echo canceler 3 for the variable amplifier 9.
When it is determined that a reproduction sound is not detected (NO in step S48), the lowering gain setter 8 sets the second lowering gain Gsup1 for avoiding the lowering of the sound volume indicated by the output signal from the acoustic echo canceler 3 for the variable amplifier 9 in step S51. The second lowering gain Gsup1 is “1”.
Step S52 in
Next, in step S53, the variable amplifier 9 multiplies an echo removal signal output from the acoustic echo canceler 3 by the first lowering gain or the second lowering gain set by the lowering gain setter 8.
Subsequently, in step S54, the variable amplifier 9 outputs the echo removal signal resulting from multiplying the first lowering gain or the second lowering gain. The variable amplifier 9 outputs the echo removal signal to the communication sound output part 23. The communication sound output part 23 sends, to a remote supervising device (not shown) at a receiver, the echo removal signal output from the variable amplifier 9 to serve as a near-end communication sound.
Conclusively, this configuration includes setting the first lowering gain to be multiplied by an output signal from the acoustic echo canceler 3 so that a sound volume of an echo of a reproduction sound which is audible to a remote utterer and a sound volume of an uttering sound of a near-end utterer are equal to each other when the reproduction sound is detected. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and the sound volume of the uttering sound of the near-end utterer in a vicinity of the microphone 2 to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
In the third embodiment, the communication device 102 may exclude the first amplifier 4.
The fourth embodiment aims at reducing a difference between a sound volume of an echo of a reproduction sound and a sound volume of an uttering sound of a near-end utterer at a remote location in the same manner as the second embodiment.
The communication system shown in
The communication device 103 includes an acoustic echo canceler 3A, a first amplifier 4, a first adder 5, a reproduction sound detector 6, a second amplifier 10, and a second adder 11. The communication device 103 shown in
The second amplifier 10 lowers a sound volume of a reproduction sound. The second amplifier 10 serves as an example sound volume lowering part. The second amplifier 10 multiplies an output signal G1·xr(j) from the first amplifier 4 by a predetermined gain G2 (0<G2<1). The second amplifier 10 controls an amplitude of the output signal from the first amplifier 4.
The second adder 11 adds the reproduction sound having the sound volume lowered by the second amplifier 10 to a remote communication sound. The second adder 11 adds an output signal G1·G2·xr(j) from the second amplifier 10 to a remote communication sound xf(j). The second adder 11 generates a reference signal xf(j)+G1·G2·xr(j) for the acoustic echo canceler 3A.
The acoustic echo canceler 3A cancels, on the basis of an output signal from the second adder 11 to be a reference signal, an echo component of each of the remote communication sound and the reproduction sound having the lowered sound volume contained in an input signal acquired by the microphone 2.
The second adder 11 adds the output signal G1·G2·xr(j) from the second amplifier 10 to the remote communication sound xf(j) when the reproduction sound xf(j) is reproduced in a state where an echo of the remote communication sound xf(j) is removed in progress of an update of a filter coefficient HHAT(j) of the acoustic echo canceler 3A and accurate deduction of an impulse response H in an acoustic echo route from the speaker 1 to the microphone 2.
An input signal y(j) output from the microphone 2 is expressed with the aforementioned Equation (2) including an input signal series Xf(j)+G1·Xr(j) of the speaker 1 and an impulse response H in an acoustic echo route from the speaker 1 to the microphone 2.
The acoustic echo canceler 3A generates a deduced echo yhat(j) through convolution of the deduced filter coefficient HHAT(j) on the output signal series Xf(j)+G1·G2·Xr(j) of the second adder 11 as expressed with the following Equation (13).
The echo removal signal e(j) output from the acoustic echo canceler 3A is expressed with the following Equation (14).
The echo removal signal e(j) output from the acoustic echo canceler 3A in the case of HHAT(j)=H in Equation (14) is expressed with the following Equation (15).
As shown in Equation (14) and Equation (15), the echo by “G1·G2” of each of the remote communication sound xf(j) and the reproduction sound xr(j) is removed by the acoustic echo canceler 3A, but the echo component G1·(1−G2)·H*Xr(j) by “G1·(1−G2)” is output from the acoustic echo canceler 3A without being removed.
In presence of a reproduction sound, a signal for loud producing from the speaker 1 indicates “xf(j)+G1·xr(j)”, and a reference signal indicates “xf(j)+G1·G2·xr(j)”. The signal in the loud producing from the speaker 1 differs from the reference signal. Updating the filter coefficient HHAT(j) in this situation results in an incorrect update. To avoid the incorrect update, when the reproduction sound detector 6 detects the reproduction sound xr(j), the acoustic echo canceler 3A suspends the update of the filter coefficient HHAT(j) in the same manner as in the first embodiment. In contrast, when the reproduction sound xr(j) is not detected, the acoustic echo canceler 3A updates the filter coefficient.
Adjustment of the gain G2 for the second amplifier 10 leads to achievement in the control of the sound volume of the echo component G1·(1−G2)·H*Xr(j) of the reproduction sound contained in the echo removal signal e(j) output from the acoustic echo canceler 3A to be equivalent to the sound volume of the uttering sound of the near-end utterer. This results in allowing both the uttering sound of the near-end utterer and the reproduction sound to be easily audible to the remote supervising device.
Subsequently, an operation by the communication device 103 according to the fourth embodiment of the disclosure will be described.
Step S61 to step S65 in
Next, in step S66, the second amplifier 10 multiplies an output signal (G1·xr(j)) from the first amplifier 4 by a predetermined gain G2. The second amplifier 10 outputs a reproduction sound (G1·G2·xr(j)) resulting from multiplying the gains G1 and G2 to the second adder 11.
Then, in step S67, the second adder 11 adds an output signal from the second amplifier 10 to a remote communication sound. The second adder 11 outputs, to the acoustic echo canceler 3A, the remote communication sound ((xf(j)+G1·G2·xr(j)), to which an output signal from the second amplifier 10 is added, to serve as a reference signal.
Step S68 to step S71 in
Next, in step S72, the acoustic echo canceler 3A cancels, on the basis of the remote communication sound additionally including the reproduction sound having a lowered sound volume with the gains G1 and G2 to serve as a reference signal, an echo component of each of the remote communication sound and the reproduction sound resulting from multiplying the gains G1 and G2 contained in an input signal acquired by the microphone 2.
Subsequently, in step S73, the acoustic echo canceler 3A outputs an echo removal signal excluding, from the input signal, the echo component of each of the remote communication sound and the reproduction sound resulting from multiplying the gains G1 and G2. The acoustic echo canceler 3A outputs the echo removal signal to the communication sound output part 23. The communication sound output part 23 sends, to a remote supervising device (not shown) at a receiver, the echo removal signal output from the acoustic echo canceler 3A to serve as a near-end communication sound.
Conclusively, a sound volume of the reproduction sound output from the speaker 1 differs from a sound volume of the reproduction sound input into the acoustic echo canceler 3A to serve as a reference signal. Hence, an echo component in a part of the reproduction sound is output without being canceled by the acoustic echo canceler 3A. The configuration accordingly enables the remote utterer to confirm whether the reproduction sound is loudly produced.
In the second embodiment and the third embodiment, a lowering gain changes depending on presence or absence of a reproduction sound. Such a change in the lowering gain may cause a fluctuation in a background noise in the second embodiment and the third embodiment. Further, in a case where a reproduction sound is reproduced concurrently with inputting of an uttering sound of a near-end utterer, the change in the lowering gain may lead to lowering of the uttering sound of the near-end utterer in the second embodiment and the third embodiment. In contrast, in such case where a reproduction sound is reproduced concurrently with inputting of an uttering sound of a near-end utterer, the fourth embodiment has an advantage of avoiding the lowering of the uttering sound of the near-end utterer without suppression of an output from the acoustic echo canceler 3A and a fluctuation in a background noise.
The fourth embodiment achieves appropriate control of a sound volume of a reproduction sound included in an output from the acoustic echo canceler 3A in a state where the acoustic echo canceler 3A satisfactorily removes an echo. In contrast, the fifth embodiment aims at appropriately controlling a sound volume of a reproduction sound included in an output from an acoustic echo canceler 3A even in a state where the acoustic echo canceler 3A does not satisfactorily remove an echo.
The communication system shown in
The communication device 104 includes the acoustic echo canceler 3A, a first amplifier 4, a first adder 5, a reproduction sound detector 6, a variable amplifier 9, a second amplifier 10, a second adder 11, and a lowering gain setter 81. The communication device 104 shown in
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 81 estimates an echo cancelation amount of the acoustic echo canceler 3A. The lowering gain setter 81 calculates a power value of a filter coefficient and estimates the echo cancelation amount from the calculated power value of the filter coefficient. The lowering gain setter 81 calculates a first lowering gain for lowering a sound volume indicated by an output signal from the acoustic echo canceler 3A in association with the estimated echo cancelation amount, and sets the calculated first lowering gain.
When the reproduction sound detector 6 does not detect the reproduction sound, the lowering gain setter 81 sets a second lowering gain for avoiding the lowering of the sound volume indicated by the output signal from the acoustic echo canceler 3A.
The variable amplifier 9 multiplies the output signal from the acoustic echo canceler 3A by the first lowering gain or the second lowering gain set by the lowering gain setter 81.
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 81 calculates an estimated value (∥HHAT(j)∥)1/2/Pwrh0 of the echo cancelation amount of the acoustic echo canceler 3A by utilizing a filter coefficient HHAT(j) of the acoustic echo canceler 3A. The sign “∥HHAT(j)∥” denotes a norm of the coefficient “HHAT(j)” and is expressed with the following Equation (16). The sign “Pwrh0” denotes a square route of a total power value of impulse responses in an acoustic echo route ((∥H∥)1/2). The lowering gain setter 81 then sets a lowering gain Gsup(j) for the variable amplifier 9 to a first lowering gain Gsup0 on the basis of the estimated value of the echo cancelation amount.
When the reproduction sound detector 6 does not detect a reproduction sound, the lowering gain setter 81 sets the lowering gain Gsup(j) for the variable amplifier 9 to the second lowering gain Gsup1 for avoiding suppressing of a signal. The second lowering gain Gsup1 is “1”.
The variable amplifier 9 suppresses the output signal from the acoustic echo canceler 3A with the first lowering gain Gsup0 or the second lowering gain Gsup1 set by the lowering gain setter 81.
Specifically, when the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 81 sets the first lowering gain Gsup0 in accordance with the following Equation (17).
In Equation (17), the sign “min(a, b)” means selection of a smaller value from “a” and “b”. The sign “G1” denotes a gain (0<G1) for the first amplifier 4, the sign “G2” denotes a gain (0<G2≤1) for the second amplifier 10, and the sign “j” denotes a time.
The first lowering gain Gsup0 approaches “1” as the norm ∥HHAT(j)∥ gradually increases in progress of an update of the filter coefficient HHAT(j) of the acoustic echo canceler 3A. In contrast, in a state of no update of the filter coefficient HHAT(j) by the acoustic echo canceler 3A, that is, in the case of “∥HHAT(j)∥=0”, the first lowering gain Gsup0 reaches “1−G2” and the echo component of the reproduction sound is suppressed or lowered in the variable amplifier 9. In this case, a sound volume of an echo of the reproduction sound at a remote location is equal to a sound volume of an echo of the reproduction sound resulting from satisfactorily removing the echo by the acoustic echo canceler 3A. When the acoustic echo canceler 3A satisfactorily cancels the echo, the first lowering gain Gsup0 reaches “1”. The result is the same as the result in the fourth embodiment. The fifth embodiment enables control of a lowering gain for a reproduction sound depending on a situation of removal of an echo by the acoustic echo canceler 3A, and thus allows a sound volume of the reproduction sound and a sound volume of an uttering sound of a near-end utterer at the remote location to be equal to each other without relying on an echo cancelation amount of the acoustic echo canceler 3A.
Subsequently, an operation by the communication device 104 according to the fifth embodiment of the disclosure will be described.
Step S81 to step S91 in
Next, in step S92, the lowering gain setter 81 calculates a power value of a filter coefficient of the acoustic echo canceler 3A and estimates an echo cancelation amount of the acoustic echo canceler 3A from the calculated power value of the filter coefficient.
Subsequently, in step S93, the lowering gain setter 81 calculates the first lowering gain Gsu0 for lowering a sound volume indicated by an output signal from the acoustic echo canceler 3A in association with the estimated echo cancelation amount, and sets the calculated first lowering gain Gsup0 for the variable amplifier 9.
When it is determined that a reproduction sound is not detected (NO in step S90), the lowering gain setter 81 sets the second lowering gain Gsup 1 for avoiding the lowering of the sound volume indicated by the output signal from the acoustic echo canceler 3A for the variable amplifier 9 in step S94. The second lowering gain Gsup1 is “1”.
Step S95 in
Conclusively, this configuration includes setting the first lowering gain to be multiplied by an output signal from the acoustic echo canceler 3A, depending on a situation of removal of an echo by the acoustic echo canceler 3A, so that a sound volume of the echo of the reproduction sound and a sound volume of an uttering sound of a near-end utterer which are audible to a remote utterer are equal to each other when the reproduction sound is detected. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and the sound volume of the uttering sound of the near-end utterer in the vicinity of the microphone 2 to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
Besides, when the filter coefficient is not updated and the filter coefficient has a small power value, the echo cancelation amount reduces and the sound volume of the echo of the reproduction sound output from the acoustic echo canceler 3A increases. Hence, the echo component of the reproduction sound is suppressed or lowered with the first lowering gain until the power value of the filter coefficient increases in progress of the update of the filter coefficient. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and the sound volume of the uttering sound of the near-end utterer in the vicinity of the microphone 2 to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to the remote utterer.
The lowering gain setter 81 in the fifth embodiment estimates an echo cancelation amount from a power value of a filter coefficient of the acoustic echo canceler 3A. In contrast, a lowering gain setter in the sixth embodiment estimates an echo cancelation amount on the basis of a level of an input signal input into an acoustic echo canceler 3A from a microphone 2 and a level of an output signal from the acoustic echo canceler 3A.
The communication system shown in
The communication device 105 includes the acoustic echo canceler 3A, a first amplifier 4, a first adder 5, a reproduction sound detector 6, a variable amplifier 9, a second amplifier 10, a second adder 11, and a lowering gain setter 82. The communication device 105 shown in
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 82 estimates an echo cancelation amount of the acoustic echo canceler 3A. The lowering gain setter 82 estimates the echo cancelation amount on the basis of a level of an input signal input into the acoustic echo canceler 3A from the microphone 2 and a level of an output signal from the acoustic echo canceler 3A. The lowering gain setter 82 calculates a first lowering gain for lowering a sound volume indicated by the output signal from the acoustic echo canceler 3A in association with the estimated echo cancelation amount, and sets the calculated first lowering gain.
When the reproduction sound detector 6 does not detect the reproduction sound, the lowering gain setter 82 sets a second lowering gain for avoiding the lowering of the sound volume indicated by the output signal from the acoustic echo canceler 3A.
When the reproduction sound detector 6 does not detect a reproduction sound, the lowering gain setter 82 estimates an echo cancelation amount can(j) from a ratio between a level of an input signal into the acoustic echo canceler 3A and a level of an output signal from the acoustic echo canceler 3A. When the reproduction sound detector 6 does not detect the reproduction sound, the lowering gain setter 82 calculates the echo cancelation amount can(j) in accordance with the following Equation (18).
In Equation (18), the sign “e(j)” denotes a short-time average value of absolute values of the echo removal signal e(j), the sign “y(j)” denotes a short-time average value of absolute values of an input signal y(j), and the sign “min(a, b)” means selection of a smaller value from “a” and “b”, and the sign “j” denotes a time. The short-time average value y(j) of the input signal y(j) and the short-time average value e(j) of the output signal e(j) concerning the acoustic echo canceler 3A are calculated in the same manner as the level xr(j) in Equation (1).
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 82 holds a previously estimated echo cancelation amount. When the reproduction sound detector 6 detects the reproduction sound, the lowering gain setter 82 calculates the echo cancelation amount can(j) in accordance with the following Equation (19).
When the reproduction sound detector 6 detects a reproduction sound, the lowering gain setter 82 calculates the first lowering gain Gsup0 in accordance with the following Equation (20), and sets a lowering gain Gsup(j) for the variable amplifier 9 to the calculated first lowering gain Gsup0.
In Equation (20), the sign “G1” denotes a gain (0<G1) for the first amplifier 4, the sign “G2” denotes a gain (0<G2≤1) for the second amplifier 10, and the sign “j” denotes a time.
When the reproduction sound detector 6 does not detect the reproduction sound, the lowering gain setter 82 sets the lowering gain Gsup(j) for the variable amplifier 9 to the second lowering gain Gsup1 for avoiding suppressing of a signal. The second lowering gain Gsup1 is “1”.
When the reproduction sound detector 6 does not detect the reproduction sound, suppression or lowering of an echo is not executed. Hence, the lowering gain Gsup(j) for the variable amplifier 9 is “1”, that is, a second lowering gain Gsup1.
When the reproduction sound detector 6 detects a reproduction sound and the acoustic echo canceler 3A does not cancel an echo at all, the echo cancelation amount can(j) reaches “1” and the first lowering gain Gsup0 becomes minimum. At this time, the first lowering gain Gsup0 reaches “1−G2”.
In contrast, when the reproduction sound detector 6 detects a reproduction sound and the acoustic echo canceler 3A satisfactorily cancels the echo, the echo cancelation amount can (j) reduces. In a case of “can(j)≈0”, the first lowering gain Gsup0 reaches “1”.
The process in the sixth embodiment is the same as the process in the fifth embodiment except for calculation of a lowering gain that is different from the calculation in the fifth embodiment. From this perspective, description about the operation of the communication device 105 in the sixth embodiment will be omitted.
In the sixth embodiment, the echo cancelation amount shows, for example, a ratio between a level of an input signal input into the acoustic echo canceler 3A from the microphone 2 and a level of an output signal from the acoustic echo canceler 3A. When the acoustic echo canceler 3A does not cancel the echo, the echo cancelation amount reaches “1” and the sound volume of the echo of the reproduction sound output from the acoustic echo canceler 3A increases. Hence, the echo component of the reproduction sound is suppressed or lowered with the first lowering gain until the acoustic echo canceler 3A satisfactorily cancels the echo. Consequently, the configuration allows the sound volume of the echo of the reproduction sound and a sound volume of an uttering sound of a near-end utterer in the vicinity of the microphone 2 to agree with each other in such a manner that both the echo of the reproduction sound and the uttering sound of the near-end utterer are easily audible to a remote utterer.
In the first embodiment, only a remote communication sound is input into the acoustic echo canceler 3 to serve as a reference signal. In contrast, in the seventh embodiment, a remote communication sound additionally including a reproduction sound is input into an acoustic echo canceler to serve as a reference signal, and furthermore, the reproduction sound is added to an output signal from the acoustic echo canceler.
The communication system shown in
The communication device 106 includes an acoustic echo canceler 3B, a first amplifier 4, a first adder 5, a third amplifier 12, and a third adder 13. The communication device 106 may include at least one of the speaker 1 and the microphone 2.
The first adder 5 adds a remote communication sound and an output signal G1·xr(j) from the first amplifier. The first adder 5 adds a reproduction sound to the remote communication sound of a remote utterer, and outputs the communication sound additionally including the reproduction sound to each of the speaker 1 and the acoustic echo canceler 3B. An output signal xf(j)+G1·xr(j) from the first adder 5 serves as a reference signal for the acoustic echo canceler 3B. The output signal from the first adder 5 is in loud producing from the speaker 1 as well.
The acoustic echo canceler 3B cancels, on the basis of the output signal from the first adder 5 to be the reference signal, an echo component of each of the communication sound and the reproduction sound contained in an input signal acquired by the microphone 2. The acoustic echo canceler 3B removes an echo component of the reproduction sound as well as an echo component of the remote communication sound. The acoustic echo canceler 3B receives a remote communication sound xf(j), which additionally includes the reproduction sound G1·xr(j) having an adjusted sound volume, to serve as the reference signal, and receives an input signal y(j) from the microphone 2. The acoustic echo canceler 3B removes an acoustic echo loudly produced from the speaker 1 and coming to the microphone 2.
The third amplifier 12 multiplies the reproduction sound by a predetermined gain G3. The third amplifier 12 adjusts a sound volume of the reproduction sound and outputs the reproduction sound having the adjusted sound volume to the third adder 13. The third amplifier 12 serves as an example second sound volume adjuster. The gain G3 for the third amplifier 12 is defined so that an uttering sound of a near-end utterer and a reproduction sound become equivalent to each other at a remote location.
The third adder 13 adds an output signal from the third amplifier 12 and an output signal from the acoustic echo canceler 3B. The third adder 13 adds the reproduction sound to the output signal from the acoustic echo canceler 3B. The third adder 13 serves as an example second adder. The third adder 13 adds the reproduction sound having the sound volume adjusted by the third amplifier 12 to the output signal from the acoustic echo canceler 3B.
The remote communication sound xf(j) is loudly produced from the speaker 1 and collected as an echo component by the microphone 2 in the same manner as in the first embodiment, but the echo component is removed by the acoustic echo canceler 3B. An echo component of the reproduction sound xr(j) is also removed by the acoustic echo canceler 3B unlike the corresponding echo component in the first embodiment.
The reproduction sound G1·G3·xr(j) having the sound volume adjusted by the third amplifier 12 is added to the echo removal signal e(j) output from the acoustic echo canceler 3B by the third adder 13, and a signal “e(j)+G1·G3·xr(j)” is sent to the remote location. This enables confirmation of the reproduction sound at the remote location.
In the first embodiment, an echo of a reproduction sound loudly produced from the speaker 1 and collected by the microphone 2 is sent to a remote location. In contrast, in the seventh embodiment, the reproduction sound G1·G3·xr(j) resulting from multiplying the reproduction sound G1·xr(j) input into the speaker 1 by the gain G3 for the third amplifier 12 is sent to the remote location. This provides an advantageous effect of successful confirmation of a clear reproduction sound without an acoustic echo at the remote location by avoiding passing through an acoustic echo route.
Subsequently, an operation by the communication device 106 according to the seventh embodiment of the disclosure will be described.
Steps S101 to S102 in
Next, in step S103, the first amplifier 4 adjusts a sound volume of a reproduction sound. The first amplifier 4 multiplies the reproduction sound by a predetermined gain G1. The gain G1 is determined in advance depending on an environment for use of the communication device 106. The first amplifier 4 outputs the reproduction sound having the adjusted sound volume to each of the first adder 5 and the third amplifier 12.
Step S104 in
Next, in step S105, the first adder 5 outputs a remote communication sound additionally including the reproduction sound to each of the speaker 1 and the acoustic echo canceler 3B. The speaker 1 loudly produces the remote communication sound additionally including the reproduction sound. The acoustic echo canceler 3B utilizes the remote communication sound additionally including the reproduction sound as a reference signal.
Step S106 in
Next, in step S107, the acoustic echo canceler 3B cancels, on the basis of the remote communication sound additionally including the reproduction sound to serve as the reference signal, an echo component of each of the remote communication sound and the reproduction sound contained in the input signal acquired by the microphone 2.
Subsequently, in step S108, the third amplifier 12 adjusts a sound volume of the reproduction sound resulting from multiplying the gain G1 output from the first amplifier 4. The third amplifier 12 multiplies an output signal from the first amplifier 4 by a predetermined gain G3. The gain G3 is determined in advance so that an uttering sound of a near-end utterer and the reproduction sound become equivalent to each other at a remote location. The third amplifier 12 outputs the reproduction sound having the adjusted sound volume to the third adder 13.
Then, in step S109, the third adder 13 adds the reproduction sound output from the third amplifier 12 to an echo removal signal output from the acoustic echo canceler 3B. The third adder 13 adds a reproduction sound G1·G3·xr(j) having the sound volume adjusted by the first amplifier 4 and the third amplifier 12 to an echo removal signal e(j) from the acoustic echo canceler 3B.
Thereafter, in step S110, the third adder 13 outputs the echo removal signal additionally including the reproduction sound to the communication sound output part 23. The communication sound output part 23 sends, to a remote supervising device (not shown) at a receiver, the echo removal signal output from the third adder 13 and additionally including the reproduction sound to serve as a near-end communication sound.
Conclusively, the acoustic echo canceler 3B removes the echo component of the reproduction sound coming from the speaker 1 to the microphone 2. Then, the reproduction sound is added to the output signal from the acoustic echo canceler 3B excluding the echo component of the reproduction sound, and an output signal additionally including the reproduction sound having not passed through an acoustic echo route is output toward a remote utterer. This enables the remote utterer to confirm a clear reproduction sound without an acoustic echo.
In the seventh embodiment, the communication device 106 may exclude the first amplifier 4 and the third amplifier 12.
A remote utterer who plans to hold a remote meeting, a remote lecture, or other event has a demand for confirming that a communication sound of the remote utterer is loudly produced at a destination. Taking this into consideration, a communication device according to the eighth embodiment is configured to loudly produce a remote communication sound from a speaker 1 and return the remote communication sound collected by a microphone 2 to a remote communication device at a receiver.
The communication system shown in
The control signal input part 24 outputs, to the communication device 107A, a control signal indicating whether to return a remote communication sound to a remote location. The control signal is sent from a remote communication device (not shown) at a receiver. The control signal input part 24 is connected to a communicator (not shown). The communicator is configured to receive a control signal from the remote supervising device at the receiver via a network. The remote supervising device receives an input from the remote utterer as to whether to return the remote communication sound to the remote location.
The communication device 107A includes an acoustic echo canceler 3, a first amplifier 4, a first adder 5, a reproduction sound detector 6, and a reference change switch 14. The communication device 107A shown in
The reference change switch 14 switches, on the basis of the control signal indicating whether to return the remote communication sound to the remote location, between an ON-mode of inputting the remote communication sound as a reference signal into the acoustic echo canceler 3 and an OFF-mode of avoiding the inputting of the remote communication sound as the reference signal into the acoustic echo canceler 3. The ON/OFF-mode of the reference change switch 14 is changed in response to the control signal input from the control signal input part 24. Normally, the reference change switch 14 is set to the ON-mode.
The reference change switch 14 switches to the OFF-mode when the control signal indicates returning of the remote communication sound to the remote location. Specifically, when the remote utterer confirms that the remote communication sound is loudly produced at a destination, the control signal input part 24 acquires a control signal for setting the reference change switch 14 to the OFF-mode. In this manner, the reference change switch 14 is set to the OFF-mode.
When the reference change switch 14 is set to the ON-mode, the acoustic echo canceler 3 removes the remote communication sound. In this case, the remote utterer cannot hear the remote communication sound loudly reproduced at the destination. In contrast, when the reference change switch 14 is set to the OFF-mode, the acoustic echo canceler 3 avoids the removing of the remote communication sound. In this case, the remote utterer can hear the remote communication sound loudly produced at the destination.
When the reference change switch 14 is set to the OFF-mode, the acoustic echo canceler 3 receives an input of “0” to serve as the reference signal therefor in place of the remote communication sound xf(j). This consequently suspends an update of a coefficient of an adaptive filter of the acoustic echo canceler 3.
Conclusively, the reference change switch 14 switches to the OFF-mode of avoiding the inputting of a remote communication sound into the acoustic echo canceler as a reference signal when a control signal indicates returning of the remote communication sound to a remote location. The configuration thus keeps the acoustic echo canceler 3 from removing the remote communication sound and enables the remote utterer to confirm that the communication sound of the remote utterer is loudly produced at a destination.
A communication device according to the ninth embodiment is configured to loudly produce a remote communication sound from a speaker 1 and return the remote communication sound collected by a microphone 2 to a remote communication device at a receiver in the same manner as the communication device according to the eighth embodiment.
The communication system shown in
The control signal input part 24 outputs, to the communication device 107B, a control signal indicating whether to return a remote communication sound to a remote utterer. The control signal is sent from a remote communication device (not shown) at a receiver. The control signal input part 24 is connected to a communicator (not shown). The communicator is configured to receive a control signal from the remote supervising device at the receiver via a network. The remote supervising device receives an input from the remote utterer as to whether to return the remote communication sound to the remote location.
The communication device 107B includes an acoustic echo canceler 3, a first amplifier 4, a first adder 5, a reproduction sound detector 6, a fourth amplifier 15, and a fourth adder 16. The communication device 107B shown in
The fourth amplifier 15 adjusts, on the basis of the control signal indicating whether to return the remote communication sound to the remote location, a sound volume of the remote communication sound to be input into the fourth adder 16. The fourth amplifier 15 serves as an example third sound volume adjuster. The fourth amplifier 15 adjusts, on the basis of the control signal from the control signal input part 24, the sound volume of the remote communication sound from the communication sound input part 21. The fourth amplifier 15 adjusts the remote communication sound to a sound volume which is audible to the remote utterer when the control signal indicates returning of the remote communication sound to the remote location. The fourth amplifier 15 adjusts the remote communication sound to a sound volume which is inaudible to the remote utterer when the control signal indicates avoiding the returning of the remote communication sound to the remote location.
The fourth amplifier 15 changes a gain in response to the control signal input from the control signal input part 24. The fourth amplifier 15 multiplies the remote communication sound from the communication sound input part 21 by a predetermined gain G4 when the control signal indicates returning of the remote communication sound to the remote location. The gain G4 is determined in advance so that an uttering sound of a near-end utterer and a remote communication sound become equivalent to each other at a remote location. When the control signal does not indicate the returning of the remote communication sound to the remote location, the fourth amplifier 15 multiplies the remote communication sound from the communication sound input part 21 by “0” to mute the remote communication sound.
The fourth adder 16 adds an output signal from the fourth amplifier 15 and an output signal from the acoustic echo canceler 3. The fourth adder 16 serves as an example third adder. When the control signal indicates the returning of the remote communication sound to the remote location, the fourth adder 16 adds the remote communication sound having a sound volume amplified by the fourth amplifier 15 to an output signal from the acoustic echo canceler 3. When the control signal indicates avoiding the returning of the remote communication sound to the remote location, the fourth adder 16 avoids adding the remote communication sound to the output signal from the acoustic echo canceler 3.
Conclusively, this configuration achieves adjustment of a remote communication sound to a sound volume which is audible to a remote utterer when a control signal indicates returning of the communication sound to a remote location. The configuration in which the communication sound having the sound volume adjusted to be audible to the remote utterer by the fourth amplifier 15 is added to an output signal from the acoustic echo canceler 3 enables the remote utterer to confirm that the remote communication sound of the remote utterer is loudly produced at a destination.
A communication device according to the tenth embodiment is configured to loudly produce a remote communication sound from a speaker 1 and returns the remote communication sound collected by a microphone 2 to a remote communication device at a receiver in the same manner as the communication device according to each of the eighth embodiment and the ninth embodiment.
The communication system shown in
The control signal input part 24 outputs, to the communication device 108A, a control signal indicating whether to return a remote communication sound to a remote location. The control signal is sent from a remote communication device (not shown) at a receiver. The control signal input part 24 is connected to a communicator (not shown). The communicator is configured to receive a control signal from the remote supervising device at the receiver via a network. The remote supervising device receives an input from the remote utterer as to whether to return the remote communication sound to the remote location.
The communication device 108A includes an acoustic echo canceler 3B, a first amplifier 4, a first adder 5, a third amplifier 12, a third adder 13, and a reference change switch 14. The communication device 108A shown in
The reference change switch 14 switches, on the basis of the control signal indicating whether to return the remote communication sound to the remote location, between an ON-mode of inputting an output signal from the first adder 5 as a reference signal into the acoustic echo canceler 3B and an OFF-mode of avoiding the inputting of the output signal from the first adder 5 as the reference signal into the acoustic echo canceler 3B. The ON/OFF-mode of the reference change switch 14 is changed in response to the control signal input from the control signal input part 24. Normally, the reference change switch 14 is set to the ON-mode.
The reference change switch 14 switches to the OFF-mode when the control signal indicates returning of the remote communication sound to the remote location. Specifically, when the remote utterer confirms that the remote communication sound is loudly produced at the destination, the control signal input part 24 acquires a control signal for setting the reference change switch 14 to the OFF-mode. In this manner, the reference change switch 14 is set to the OFF-mode.
When the reference change switch 14 is set to the ON-mode, the acoustic echo canceler 3B removes an output signal xf(j)+G1·xr(j) from the first adder 5. In this case, the remote utterer cannot hear the remote communication sound loudly reproduced at the destination. In contrast, when the reference change switch 14 is set to the OFF-mode, the acoustic echo canceler 3B avoids removing the output signal xf(j)+G1·xr(j) from the first adder 5. In this case, the remote utterer can hear the remote communication sound loudly produced at the destination.
When the reference change switch 14 is set to the OFF-mode, the acoustic echo canceler 3B receives an input of “0” to serve as the reference signal therefor in place of the output signal xf(j)+G1·xr(j) from the first adder 5. This consequently suspends an update of a coefficient of an adaptive filter of the acoustic echo canceler 3B.
Conclusively, the reference change switch 14 switches to the OFF-mode of avoiding inputting of an output signal from the first adder 5 into the acoustic echo canceler 3B as a reference signal when a control signal indicates returning of the remote communication sound to a remote location. The configuration thus keeps the acoustic echo canceler 3B from removing the remote communication sound and enables the remote utterer to confirm that the communication sound of the remote utterer is loudly produced at a destination.
A communication device according to the eleventh embodiment is configured to loudly produce a remote communication sound from a speaker 1 and return the remote communication sound collected by a microphone 2 to a remote communication device at a receiver in the same manner as the communication device according to each of the eighth embodiment to the tenth embodiment.
The communication system shown in
The control signal input part 24 outputs, to the communication device 108B, a control signal indicating whether to return a remote communication sound to a remote location. The control signal is sent from a remote communication device (not shown) at a receiver. The control signal input part 24 is connected to a communicator (not shown). The communicator is configured to receive a control signal from the remote supervising device at the receiver via a network. The remote supervising device receives an input from the remote utterer as to whether to return the remote communication sound to the remote location.
The communication device 108B includes an acoustic echo canceler 3B, a first amplifier 4, a first adder 5, a third amplifier 12, a third adder 13, a fourth amplifier 15, and a fourth adder 16. The communication device 108B shown in
The fourth amplifier 15 adjusts, on the basis of the control signal indicating whether to return the remote communication sound to the remote location, a sound volume of the remote communication sound to be input into the fourth adder 16. The fourth amplifier 15 serves as an example third sound volume adjuster. The fourth amplifier 15 adjusts, on the basis of the control signal from the control signal input part 24, the sound volume of the remote communication sound from the communication sound input part 21. The fourth amplifier 15 adjusts the communication sound to a sound volume which is audible to a remote utterer when the control signal indicates returning of the remote communication sound to the remote location. The fourth amplifier 15 adjusts the communication sound to a sound volume which is inaudible to the remote utterer when the control signal indicates avoiding the returning of the remote communication sound to the remote location.
The fourth amplifier 15 changes a gain in response to the control signal input from the control signal input part 24. The fourth amplifier 15 multiplies the remote communication sound from the communication sound input part 21 by a predetermined gain G4 when the control signal indicates returning of the remote communication sound to the remote location. The gain G4 is determined in advance so that an uttering sound of a near-end utterer and a remote communication sound become equivalent to each other at a remote location. When the control signal does not indicate the returning of the remote communication sound to the remote location, the fourth amplifier 15 multiplies the remote communication sound from the communication sound input part 21 by “0” to mute the remote communication sound.
The fourth adder 16 adds an output signal from the fourth amplifier 15 and an output signal e(j)+G1·G3·xr(j) from the third adder 13. The fourth adder 16 serves as an example third adder. When the control signal indicates returning of the remote communication sound to the remote location, the fourth adder 16 adds the remote communication sound having a sound volume amplified by the fourth amplifier 15 to the output signal from the third adder 13. When the control signal indicates avoiding the returning of the remote communication sound to the remote location, the fourth adder 16 avoids adding the remote communication sound to the output signal from the third adder 13.
Conclusively, this configuration achieves adjustment of a remote communication sound to a sound volume which is audible to a remote utterer when a control signal indicates returning of the remote communication sound to a remote location. The configuration in which the remote communication sound having the sound volume adjusted by the fourth amplifier 15 to be audible to the remote utterer is added to an output signal from the third adder 13 hence enables the remote utterer to confirm that the remote communication sound of the remote utterer is loudly produced at a destination.
Next, a configuration of: a vehicle including the communication device 100 according to the first embodiment; and a remote supervising device will be described.
The vehicle 201 includes a speaker 1, a microphone 2, the communication device 100, and a communicator 110. In
The communicator 110 includes a communication sound input part 21, a reproduction sound input part 22, and a communication sound output part 23.
The communicator 110 receives a remote communication sound of a remote supervisor sent from the remote supervising device 202. The communicator 110 outputs the received remote communication sound to the communication device 100. The communicator 110 sends, to the remote supervising device 202, a near-end communication sound output from the communication device 100. The near-end communication sound serves as a sound signal including an echo of each of the communication sound of a near-end utterer and the reproduction sound, and excluding an echo component of the remote communication sound. The communicator 110 receives the reproduction sound sent from the remote supervising device 202. The communicator 110 outputs the received reproduction sound to the communication device 100. The communication device 100 may include the communicator 110.
The vehicle 201 and the remote supervising device 202 are communicably connected to each other via a network 203. The network 203 includes, for example, the internet or a mobile phone network.
The remote supervising device 202 includes a microphone 41, a manipulation part 42, a speaker 43, a storage part 210, a reader 211, and a communicator 212. The remote supervising device 202 serves as an example remote communication device.
The microphone 41 collects a sound of a remote supervisor. The microphone 41 outputs a remote communication sound representing the collected sound to the communicator 212.
Examples of the manipulation part 42 include a keyboard, a mouse, and a touch screen to receive an input manipulation from the remote supervisor. The manipulation part 42 receives selection of a reproduction sound by the remote supervisor to be output to the outside of the vehicle 201 from among a plurality of reproduction sounds.
The manipulation part 42 may further receive an input of control information about a remote control of the vehicle 201. The remote supervisor uses the manipulation part 42 to move the vehicle 201.
The storage part 210 stores the plurality of reproduction sounds in advance.
The reader 211 reads out the reproduction sound selected via the manipulation part 42 from among the reproduction sounds stored in the storage part 210.
The communicator 212 sends, to the vehicle 201, the remote communication sound output from the microphone 41. The communicator 212 receives the near-end communication sound sent from the vehicle 201, and outputs the near-end communication sound to the speaker 43. The communicator 212 sends the reproduction sound read out by the reader 211 to the vehicle 201.
The speaker 43 loudly produces the near-end communication sound sent from the vehicle 201.
As illustrated in
The vehicle 201A includes a speaker 1, a microphone 2, the communication device 100, a storage part 111, a communicator 112, and a reader 113. In
The storage part 111 stores a plurality of reproduction sounds in advance.
The communicator 112 includes a communication sound input part 21 and a communication sound output part 23.
The communicator 112 receives a remote communication sound of a remote supervisor sent from the remote supervising device 202A. The communicator 112 outputs the received remote communication sound to the communication device 100. The communicator 112 sends, to the remote supervising device 202A, a near-end communication sound output from the communication device 100. The near-end communication sound serves as a sound signal including an echo of each of the communication sound of a near-end utterer and the reproduction sound, and excluding an echo component of the remote communication sound. The communicator 112 receives, from the remote supervising device 202A, reproduction information designating a reproduction sound to be output among the reproduction sounds.
The reader 113 reads out, from the storage part 111, the reproduction sound designated by the reproduction information received by the communicator 112. The reader 113 outputs the read-out reproduction sound to the communication device 100. The reader 113 includes a reproduction sound input part 22.
The communication device 100 may include the storage part 111, the communicator 112, and the reader 113.
The vehicle 201A and the remote supervising device 202A are communicably connected to each other via a network 203. The network 203 includes, for example, the internet.
The remote supervising device 202A includes a microphone 41, a manipulation part 42, a speaker 43, and a communicator 213. The remote supervising device 202A serves as an example remote communication device. The microphone 41 and the speaker 43 shown in
Examples of the manipulation part 42 include a keyboard, a mouse, and a touch screen to receive an input manipulation from the remote supervisor. The manipulation part 42 receives selection of a reproduction sound by the remote supervisor to be output to the outside of the vehicle 201A from among the reproduction sounds.
The communicator 213 sends, to the vehicle 201A, a remote communication sound output from the microphone 41. The communicator 213 receives the near-end communication sound sent from the vehicle 201A, and outputs the near-end communication sound to the speaker 43. The communicator 213 sends, to the vehicle 201A, reproduction information designating a reproduction sound selected, from among the reproduction sounds, by the manipulation part 42 to be output from the vehicle 201A.
As illustrated in
In the embodiments, each constituent element may be realized with dedicated hardware or by executing a software program suitable for the constituent element. Each constituent element may be realized by a program execution unit, such as a CPU or a processor, reading out and executing a software program recorded on a recording medium, such as a hard disk or a semiconductor memory. Other independent computer system may establish a program by recording the program in a recording medium to be transferred, or transferring the program via a network.
A part of or a whole of the functions of the device according to each embodiment of the disclosure is typically realized as a large scale integration (LSI), which is an integrated circuit. These functions may be formed as separate chips, or some of or a whole of the functions may be included in one chip. The circuit integration is not limited to the LSI, and may be realized with a dedicated circuit or a general-purpose processor. A field programmable gate array (FPGA) that is programmable after manufacturing of an LSI or a reconfigurable processor in which connections and settings of circuit cells within the LSI are reconfigurable may be used.
A part of or a whole of the functions of the device according to each embodiment of the present disclosure may be implemented by a processor, such as a CPU executing a program.
Numerical values used above are merely illustrative to be used to specifically describe the present disclosure, and thus the present disclosure is not limited to the illustrative numerical values.
Order in which steps shown in the flowcharts are executed is merely illustrative to be used to specifically describe the present disclosure, and thus steps may be executed in order other than the above order as long as similar effects are obtained. Some of the steps may be executed simultaneously with (in parallel to) other steps.
The technology according to the present disclosure achieves cancelation of an acoustic echo attributed to loud producing of a communication sound of a remote utterer, and enables the remote utterer to confirm whether the reproduction sound is loudly produced. The technology is thus useful as a technology of loudly producing a predetermined reproduction sound to a vicinity and enabling a person in the vicinity and a remote utterer to communicate with each other.
Number | Date | Country | Kind |
---|---|---|---|
2022-131967 | Aug 2022 | JP | national |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2023/026716 | Jul 2023 | WO |
Child | 19056110 | US |