The invention relates to a communication system enabling several users to be connected in conference mode, i.e. each user is able to speak and at the same time hear all the other users. The invention relates more particularly to a device for improving the intelligibility of speech when the users are speaking in a noisy environment, for example a sporting event in a stadium.
In a multi-user communication system suitable for a noisy environment, it is desired to transcribe speech in intelligible manner and to attenuate the ambient noise, in particular during mute phases. Indeed, if each terminal were permanently transmitting the ambient noise, each terminal would receive the sum of the noises picked up by all the other terminals, this problem being aggravated when the number of users increases.
A well-known solution for eliminating noise during mute phases is that used by walkie-talkies, i.e. the user switches his terminal between a transmission-only mode and a reception-only mode by means of a button. However, this solution is unadapted when the number of users who are liable to speak is more than three, it is constraining as it monopolizes one of the user's hands, and it does not enable a user who is speaking to hear an important message which may be transmitted by another user.
Recourse is therefore being had to communication systems operating in conference mode wherein each terminal is capable of detecting the user's speech and of removing the ambient noise from the signal. Patent application EP 1843326 describes such a system.
The DSP is programmed to perform the envisaged signal processing operations, in particular improving the intelligibility of speech. An example of processing is described in the above-mentioned patent application. It involves detection of the speech signature and calculation of parameters of a filter which enables the ambient noise to be removed from the signal while at the same time preserving the speech signal.
The signal output from the DSP is conveyed to an antenna 18 trough a RF transmission module 20, which performs the required processing operations to convert the digital signal provided by the DSP into a signal transmissible to the antenna, according to the standard used by all the terminals.
Antenna 18 also receives the signals emitted by other terminals, which RF module 20 converts and transmits to the DSP. These received signals are processed by the DSP and sent to a loudspeaker 22 via a shaping circuit 24 which performs digital-to-analog conversion, filtering, and amplification.
A terminal of the type of
The use for example of a lapel mike, with which the speech measurement conditions will vary over time according to the user's movements and to the orientation of his/her head, is therefore excluded.
If the user wishes to change microphone, the gain of amplifier 12 has to be adjustable, for example by means of a potentiometer. This is not compatible with equipment that needs to be ready to operate at any time.
A need therefore exists for a terminal designed for use in a noisy environment, enabling a great freedom of placing of the microphone. A need also exists to enable the use of several types of microphone without the user having to perform adjustments.
To satisfy at least one of these needs, a device is provided for improving the intelligibility of a signal arising from a source subjected to a noisy environment, said source marking the signal with a specific signature, said device comprising a processing circuit receiving the signal and means for analyzing the signal and for parameterizing the processing circuit according to characteristics of the signature present in the signal. The device comprises a first channel with low distortion conveying the signal from the source to the means for analyzing, and a second channel susceptible to introduce a distortion, conveying the signal from the source to the processing circuit.
Other advantages and features will become more clearly apparent from the following description of particular embodiments of the invention, given for non-restrictive example purposes only and represented in the appended drawings in which:
In a situation where it is desired to match the signal processing channel with a source which may have a large range, it is commonplace to use a variable gain input amplifier in a negative feedback loop adjusting the gain according for example to the envelope of the signal.
Nevertheless, in situations where the ambient noise can undergo sharp amplitude variations, for example in a stadium during a sporting event, the amplifier gain adjustment does not react fast enough to prevent saturation of the channel (the reaction time of the loop is moreover deliberately slow to reduce the distortion under nominal conditions of use).
This solution with a variable gain amplifier is proved unsuitable in a terminal of the type of
To prevent saturation in such situations, recourse is often had to a dynamic range compressor, which is an amplifier having a non-linear gain curve flattened asymptotically towards the saturation limit.
Nevertheless, a dynamic range compressor introduces such a distortion that speech signature detection is seriously disturbed in the event of saturation.
The signal from microphone 10 is transmitted to the DSP by a first channel incorporating amplifier 12, filter 14 and convertor 16 described in relation with
In other words, it is desired for the first channel to present a low distortion over the whole input signal range. In this case, even if the signals are of low amplitude, the DSP will be able to detect the speech signature.
This first channel is analyzed by a process 26 of the DSP which detects the speech signature and calculates the filter parameters according to the characteristics of the signature. These calculations can be similar to those described in patent application EP 1843326.
Furthermore, the signal from microphone 10 is transmitted to a second process 28 of the DSP by a second channel comprising an amplifier 30 with gain K, a filter 32 attenuating the frequencies outside the speech band, and an analog-to-digital converter 34.
Gain K of amplifier 30 is chosen such as to produce a speech signal that is audible under most conditions. It is of little importance if the channel saturates on ambient noise peaks, as this channel is not used for speech detection. Preferably, as represented, gain K is variable and is controlled by process 26 so as to adjust the amplitude of the signal as best as possible to the dynamic range of the second channel. The gain is determined for example according to the envelope of the signal conveyed in the first channel.
As the distortions introduced by the second channel do not affect the reliability of signature detection, a dynamic range compressor, for example incorporated in amplifier 30, can also be inserted therein. A dynamic range compressor will introduce a greater distortion in situations where the channel would not be saturated, but it has the advantage of producing a more intelligible signal in saturation situations.
Process 28 implemented by the DSP on the second channel performs noise filtering using the parameters calculated by process 26. This filtering can, as in patent application EP 1843326, consist in removing the ambient noise from the signal, thereby preserving the speech signal.
Removal of the ambient noise during mute phases generally does not manage to totally eliminate the signal, so that the terminals continue to emit a certain noise level during these phases. The sum of these noises can become non-negligible in the presence of a large number of terminals. This drawback can be avoided as will be seen in relation with the following figure.
To improve the noise level situation during mute phases, it is provided to disable the output of the second channel during the phases where process 26 does not detect a speech signature. This functionality is symbolized by a gate 36 located in second channel 30, 32, 34 upstream of filtering process 28.
As stipulated in the foregoing, a trade-off has to be made in the choice of gain k of amplifier 12 of the first channel so as to obtain a sufficient signal amplitude to detect a speech signature in the case where the microphone is far from the mouth, and not to saturate the channel too much in the case where the microphone is close to the mouth.
This trade-off is not difficult to achieve when the terminal is scheduled to use a single type of microphone. Users may however want to use different types of microphone, which differ in particular by their sensitivity. In this case, a trade-off for gain k is more difficult to find. The gain of amplifier 12 will preferably be adjusted to the sensitivity of the microphone. This can naturally be achieved by providing a manual gain adjustment, such as a selector switch, but this goes against this type of terminal which has to be ready to operate under all circumstances.
It will therefore be preferred to equip the terminal with automatic detection of the type of microphone. Professional-quality microphones which are used with terminals of this type are not equipped with connectors, so that the manufacturer of the terminals can equip them with the connectors of their choice. It is provided here to equip the microphones with a connector comprising an identification system.
In
Inside the terminal, identification terminal 42 is connected to a supply voltage Vdc by a current source 44. The voltage drop at the terminals of resistor 40, which is proportional to the value of the resistor, is converted into digital form by a converter 46 and analyzed by process 26.
According to the type of microphone identified by resistor 40, process 26 adjusts gain k of amplifier 12 and possibly other parameters, such as the bias current necessary for electret microphones. The bias current is supplied for example by a current source 48 connected between voltage Vdc and a dedicated terminal of connector 38.
As represented, analysis process 26 also receives the signal coming from the second channel. This enables finer signature detection and filter parameter determination algorithms to be implemented in analysis process 26, if required.
Although the emphasis in the foregoing description has been placed on eliminating the audible ambient noise, it can be understood that the system is just as efficient to eliminate noise of a different nature, in particular the noise generated by the electronic circuits themselves, provided that the noise is not consistent with the signature that is to be detected. Such noise, which may prove to be inconvenient, is the burst noise induced by the antenna in the audio acquisition channel. Burst noise is the noise of audible frequency generated by the envelope of the RF signals which alternate between transmission and receipt.
Speech signature detection has so far been considered. The system described here can however also apply to detection of other signatures. In a use of the system by referees of a sporting event, it may prove useful to detect whistle blows to trigger stopping and starting of a stopwatch. Process 26 can thus be provided to also detect the signature of a whistle. In this case, the purpose of signature detection is to trigger a signal which can be sent to a particular terminal which will make the desired use of the signal.
Furthermore, having a first audio acquisition channel which remains linear means that an echo suppression function can be further provided in the terminal, and that an “open” loudspeaker (a loudspeaker whereof the sound is able to be picked up by the microphone) can therefore be used.
Numerous variants and modifications of the system described here will be apparent to the person skilled in the art. The system has been described in relation with wireless terminals designed to transmit the human voice. It is however not excluded to use these principles in a wired system to process signal sources other than a voice picked up by a microphone.
Number | Date | Country | Kind |
---|---|---|---|
0903038 | Jun 2009 | FR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FR10/00457 | 6/22/2010 | WO | 00 | 12/20/2011 |