The invention relates to a method and a device for improving voice quality on transparent telecommunications paths.
With transparent voice transmission channels, such as e.g. used in mobile telecommunication systems via GSM or UMTS standards, the digital voice signal is sent with no extra Radio Link Protocol (RLP). Radio Link Protocol is a radio transmission route protocol that is transmitted in addition to the useable signal and also ensures quality of the data during transmission.
When using transparent transmission, the voice quality is influenced on the spokesperson's end by:
Speaking too loudly only slightly affects the listener's ability to understand what is being said but can be uncomfortable.
There are already new known technical means of improving voice quality in telecommunications transmission.
According to DE 198 52 091 C1, an equalizer is routed into the audio path preferably between BSC and MSC. The equalizer compensates for the influences in audio quality between the two participating end devices.
DE 198 52 805 A 1 processes signals from at least two microphones with a voice processing algorithm, whereby background interference noise is compensated by 180° phase angle rotation.
Both of these methods are associated with extra technical requirements in the end devices or in the communication network systems.
The task of the following invention is therefore to recommend a process and a system of suppressing environmental noise outside of the voice spectrum and to improve voice quality based on the audio spectrum and the individual volume of the spokesperson.
This task will be handled by means of the process of adjusting the audio transmission channel in defined time intervals to the corresponding audio spectrum of the spokesperson. Solutions for this are included in a means of analyzing the voice signals according to the used band width and amplitude as well as a means of influencing the band width Δf and amplitude v of the audio transmission channel based on the used band width and amplitude that are determined. These means can be installed in the communication transmission devices and in the communication network itself.
The audio spectrum of voice signals is mainly determined through:
Based on a preferred best-case instance of the invention, the voice signals are then analyzed in an analyzer for usable band width Δf, mid-range frequency fO and amplitude v.
Control signals with voice signal information on the band width Δf, mid-range frequency fO and amplitude v from the determined frequency/amplitude analysis parameters will then preferably be generated in a microprocessor.
Based on a preferred best-case instance of the invention, control signals are used on the determined mid-range frequency fO and band width Δf of the voice signal for controlling a band pass filter whereas the transmission characteristics are defined in accordance with the determined parameters.
The control signal is also fed into a controllable amplifier along with the determined amplitude. The amplification is defined depending on the determined amplitude of the voice signal. To achieve optimal voice pattern recognition (optimally understandable), the amplifier has integrated logarithmic transmission characteristics.
Since the control signals are regenerated in timed intervals τ by the microprocessor, a quasi-continuous adjustment of the audio transmission channel to the voice spectrum of the spokesperson is achieved.
An example of an application in graphic representation
The device will preferably be installed in a telecommunications transmitting device between the microphone output or pre-amplifier and the following components (voice encoder in digital mobile telephones). Basically, it could also be installed in a communication network, e.g. in a network node.
The voice signal 7 (input signal) from the microphone is amplified by a first linear amplifier 1 to such a level that the signal can be analyzed according to frequency and amplitude in an analyzer 2 in the next step. Broken down, the analyzer 2 will determine the signal band width Δf, mid-range frequency fO and the amplitude v of the voice signal 7.
Three control signals 8-10 containing the information on the voice signal consisting of band width Δf, mid-range frequency fO and amplitude v are generated from the determined parameters from the frequency/amplitude analysis in a control unit, e.g. a microprocessor 3.
These three control signals 8-10 are regenerated in time intervals τ (e.g. τ=5 sec.) by the microprocessor. A clock generator 4 generating at τ causes a reset of the microprocessor 3 after which the control signal is determined again. The interval τ will preferably be able to be defined and can be set and optimized to the spokesperson.
The control signals 8, 9 are used with the determined mid-range frequency fO and the band width of the voice signal for controlling a band pass filter 5 with transmission characteristics defined based on the determined parameters.
Just as an example, the useable transmission band width of the NF channel of a communication transmitting device is 300 to 3400 Hz.
The analyzer determines an actual band width for a given spokesperson of e.g. 350 to 2500 Hz at a mid-range frequency fO of 1450 Hz however. The band pass filter 5 is adjusted to the determined values whereas interference signals which are outside of the defined cutoff frequencies of 350 Hz or 2550 Hz are effectively suppressed.
Using the device based on the invention, especially the adaptive band pass filter 5 which controls the mid-range and cutoff frequencies preferably via digital signals, the NF transmission characteristics are adjusted almost continuously in time intervals τ to the audio spectrum of the corresponding spokesperson, i.e. adapted to the voice, vocal register and intonation of the respective spokesperson.
The voice signal is regulated for optimal understanding (generally boosted) using information 10 on the amplitude of the voice signal in a controlled amplifier 6 via preferably logarithmic response curves. The logarithmic response curve of this amplifier 6 puts the emphasis on the formants of a spokesperson whereby, especially for muffled intonation and vocally toned languages, the ability to understand the voice is greatly improved on the listening end.
Number | Date | Country | Kind |
---|---|---|---|
101 10 983 | Mar 2001 | DE | national |
101 36 491 | Jul 2001 | DE | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/DE02/00790 | 3/6/2002 | WO | 00 | 1/9/2003 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO02/071731 | 9/12/2002 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4000369 | Paul et al. | Dec 1976 | A |
4460952 | Risinger | Jul 1984 | A |
4639932 | Schiff | Jan 1987 | A |
4700362 | Todd et al. | Oct 1987 | A |
4950999 | Agnello et al. | Aug 1990 | A |
5136267 | Cabot | Aug 1992 | A |
5139020 | Koestner et al. | Aug 1992 | A |
5280525 | Wesel | Jan 1994 | A |
5471143 | Doany | Nov 1995 | A |
5646940 | Hotto | Jul 1997 | A |
5950153 | Ohmori et al. | Sep 1999 | A |
6226608 | Fielder et al. | May 2001 | B1 |
6510313 | Rapeli | Jan 2003 | B1 |
Number | Date | Country |
---|---|---|
28 40 243 | May 1988 | DE |
43 43 366 | Jun 1995 | DE |
44 05 723 | Aug 1995 | DE |
198 52 091 | May 2000 | DE |
198 52 805 | May 2000 | DE |
0 539 017 | Apr 1993 | EP |
0 705 016 | Apr 1996 | EP |
WO 9732430 | Sep 1997 | WO |
Number | Date | Country | |
---|---|---|---|
20040037427 A1 | Feb 2004 | US |