1. Field of the Invention
The invention relates to a system and a method for providing sound to at least one user, wherein audio signals from an audio signal source, such as a microphone for capturing a speaker's voice, are transmitted via a wireless link to a receiver unit, such as an audio receiver for a hearing aid, from where the audio signals are supplied to means for stimulating the hearing of the user, such as a hearing aid loudspeaker.
2. Description of Related Art
Typically, wireless microphones are used by teachers teaching hearing impaired persons in a classroom (wherein the audio signals captured by the wireless microphone of the teacher are transmitted to a plurality of receiver units worn by the hearing impaired persons listening to the teacher) or in cases where several persons are speaking to a hearing impaired person (for example, in a professional meeting, wherein each speaker is provided with a wireless microphone and with the receiver units of the hearing impaired person receiving audio signals from all wireless microphones). Another example is audio tour guiding, wherein the guide uses a wireless microphone.
Another typical application of wireless audio systems is the case in which the transmission unit is designed as an assistive listening device. In this case, the transmission unit may include a wireless microphone for capturing ambient sound, in particular from a speaker close to the user, and/or a gateway to an external audio device, such as a mobile phone; here the transmission unit usually only serves to supply wireless audio signals to the receiver unit(s) worn by the user.
The wireless audio link often is an FM (frequency modulation) radio link operating in the 200 MHz frequency band. Examples for analog wireless FM systems, particularly suited for school applications, are described in European Patent Application EP 1 864 320 A1 and corresponding International Patent Application Nos. WO 2006/104634 A2 and WO 2008/138365 A1.
In recent systems the analog FM transmission technology is replaced by employing digital modulation techniques for audio signal transmission, most of them working on other frequency bands than the former 200 MHz band.
U.S. Pat. No. 8,019,386 B2 relates to a hearing assistance system comprising a plurality of wireless microphones worn by different speakers and a receiver unit worn at a loop around a listener's neck, with the sound being generated by a headphone connected to the receiver unit, wherein the audio signals are transmitted from the microphones to the receiver unit by using spread spectrum digital signals. The receiver unit controls the transmission of data, and it also controls the pre-amplification gain level applied in each transmission unit by sending respective control signals via the wireless link.
International Patent Application WO 2008/098590 A1 relates to a hearing assistance system comprising a transmission unit having at least two spaced apart microphones, wherein a separate audio signal channel is dedicated to each microphone, and wherein at least one of the two receiver units worn by the user at the two ears is able to receive both channels and to perform audio signal processing at ear level, such as acoustic beam forming, by taking into account both channels.
International Patent Application WO 2010/078435 A1 relates to a communication system comprising a plurality of transmission units comprising a microphone for capturing the respective speaker's voice and transmitting audio signal data packets to a receiver unit which may be connected to an earphone or to a hearing aid via a plug jack. The transmission units and the receiver unit form a wireless network using a pseudo random sequence frequency hopping scheme and having a master-slave architecture, wherein certain slots in each frame are individually attributed to each of the transmission units, so that each transmission unit is allowed to transmit audio signals in its dedicated slots and receive audio signals transmitted in the remaining slots. Synchronization information data may be transmitted by the master in a certain slot of the frame. Each audio data packet is redundantly transmitted three times in three dedicated slots, with the receiver unit only listening until a correct copy of the audio data packet has been received, so that, when already the first copy is correctly received, the receiver unit would not listen to the redundant copies. Audio signals are encoded by using sub-band ADPCM (Adaptive Differential Pulse Code Modulation), and the packets may be compressed from 16 bits to 4 bits using a G.722 encoder.
International Patent Application WO 99/16050 A1 relates to a scalable and embedded audio codec to be used for internet multimedia applications, wherein a single audio stream is provided for a plurality of devices which may have different sampling rates and/or bit rates. Lower bit rate output bit streams are embedded in higher bit rate bit streams in a manner that low quality audio devices may decode only part of the bit stream, while high quality audio devices may decode the full bit stream. The audio information corresponding to the lowest bit rate application may be inserted in a first priority packet, while secondary information may be inserted in second and third priority packets, so that devices operating only at the lowest bit rate can automatically separate the first priority packets from the remainder of the bit stream and use only these packets for signal reconstruction.
U.S. Pat. No. 5,570,363 relates to a personal computer based conferencing system using a scalable audio codec which provides for a single output audio stream which can be decoded by audio devices having different bandwidths and bit rates. Different data packets are produced for different devices, wherein the packets for higher quality audio devices include additional parts including the surplus of audio information.
U.S. Pat. No. 7,272,556 B1 relates to an audio codec providing compatibility over a range of communication devices operating at different sampling frequencies or bit rates, wherein the input signal is divided in different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal, and wherein separate information about other portions of the signal is encoded in an embedded manner, so that a smooth transition can be achieved from low bit rate to high bit rate applications. Thereby communication devices operating at different sampling rates or bit rates can extract corresponding information from the output bit stream. A similar audio codec is described in US 2008/0052068 A1.
European Patent Application EP 2 129 170 A1 relates to a system for wireless audio signal transmission from a TV-set to a hearing aid, wherein a G.722 audio codec is used.
Receiver devices for high fidelity audio reception, which support high sampling rates and thus offer large audio bandwidths as well as high resolution, typically require a relatively large power source (battery), so that the achievable degree of miniaturization is limited. On the other hand, receiver devices for speech quality audio reception, which support moderate sampling rates and thus offer a reduced audio bandwidth as well as lower resolution, can be designed for relatively low power consumption, so that a relatively high degree of miniaturization can be achieved.
In order to communicate with such different types of receiver devices, the transmission devices have to adapt their encoding scheme to the specific requirements of the receiver devices. Such adaptation of the audio quality to the requirements of a receiver device can be achieved, for example, by employing a sub-band ADPCM codec, such as the G.722 standard. This codec is particularly suited for low complexity, battery powered devices, since the computational requirements for encoding and decoding are reasonable. In addition, the delay introduced by this codec is low, which is particularly interesting for applications like wireless microphones, where lip synchronicity has to be guaranteed, as well as TEM (In-Ear-Monitoring) systems.
It is an object of the invention to provide for a wireless sound transmission system, wherein receiver units of different audio quality can be utilized while minimizing power requirements of the receiver units. It is also an object of the invention to provide for a corresponding wireless sound transmission method.
According to the invention, these objects are achieved by a system and a method as described herein.
The invention is beneficial in that a single audio stream is sufficient for supplying different types of receiver units, while the low quality receiver units do not suffer from increasing decoding complexity (which would be necessary for decoding the high quality audio signal) and increased power consumption (as a consequence of the increased decoding complexity). By requiring a single transmitted audio stream only, power consumption of the transmission unit can be kept low (since the transmission of several audio streams encoded at different quality in parallel can be avoided) and inefficient usage of the available transmission bandwidths due to redundancy of transmitted information can be avoided. These benefits result from encoding the audio signals in such a manner that each audio data block is distributed onto at least two audio data packets in such a manner that one of the packets is a low quality packet including an encoded low quality version of the audio signal and one of the packets is a high quality packet including the surplus of an encoded high quality version of the audio signal, wherein the low quality packets are transmitted in dedicated slots of a multiple access protocol frame and the high quality packets are transmitted in other dedicated slots of the multiple access protocol frame, and wherein each receiver unit is either adapted to receive and decode both the low quality packets and the high quality packets or is adapted to receive and decode the low quality packets only, while sleeping during the slots dedicated to the transmission of the high quality packets.
Preferably, an ADPCM codec is used. The multiple access protocol preferably is a TDMA protocol; however, also other multiple access protocols, such as FDMA and CDMA, may be used.
Hereinafter, examples of the invention will be illustrated by reference to the accompanying drawings.
As shown in
The system may include a plurality of devices on the transmission side and a plurality of devices on the receiver side, for implementing a network architecture, usually in a master-slave topology.
The transmission unit typically comprises or is connected to a microphone for capturing audio signals, which is typically worn by a user, with the voice of the user being transmitted via the wireless audio link to the receiver unit.
The receiver unit typically is connected to a hearing aid via an audio shoe or is integrated within a hearing aid.
In addition to the audio signals, control data is transmitted bi-directionally between the transmission unit and the receiver unit. Such control data may include, for example, volume control or a query regarding the status of the receiver unit or the device connected to the receiver unit (for example, battery state and parameter settings).
In
Another typical use case is shown in
A modification of the use case of
According to a variant of the embodiments shown in
The transmission units 10, 110 may comprise an audio input for a connection to an audio device, such as a mobile phone, a FM radio, a music player, a telephone or a TV device, as an external audio signal source.
In each of such use cases, the transmission unit 10 usually comprises an audio signal processing unit (not shown in
An example of a transmission unit 10 is shown in
The VAD 24 uses the audio signals from the microphone arrangement 17 as an input in order to determine the times when the person 11 using the respective transmission unit 10 is speaking. The VAD 24 may provide a corresponding control output signal to the microcontroller 26 in order to have, for example, the transmitter 28 sleep during times when no voice is detected and to wake up the transmitter 28 during times when voice activity is detected. In addition, a control command corresponding to the output signal of the VAD 24 may be generated and transmitted via the wireless link 12 in order to mute the receiver units 14 or saving power when the user 11 of the transmission unit 10 does not speak. To this end, a unit 32 is provided which serves to generate a digital signal comprising the audio signals from the processing unit 20 and the control data generated by the VAD 24, which digital signal is supplied to the transmitter 28. The unit 32 acts to replace audio data by control data blocks. In addition to the VAD 24, the transmission unit 10 may comprise an ambient noise estimation unit (not shown in
According to one embodiment, the transmission units 10 may be adapted to be worn by the respective speaker 11 below the speaker's neck, for example as a lapel microphone or as a shirt collar microphone.
An example of a digital receiver unit 14 is shown in
Rather than supplying the audio signals amplified by the variable gain amplifier 62 to the audio input of a hearing aid 64, the receiver unit 14 may include a power amplifier 78 which may be controlled by a manual volume control 80 and which supplies power amplified audio signals to a loudspeaker 82 which may be an ear-worn element integrated within or connected to the receiver unit 14. Volume control also could be done remotely from the transmission unit 10 by transmitting corresponding control commands to the receiver unit 14.
Another alternative implementation of the receiver maybe a neck-worn device having a transmitter 84 for transmitting the received signals via with an magnetic induction link 86 (analog or digital) to the hearing aid 64 (as indicated by dotted lines in
In general, the role of the microcontroller 24 could also be taken over by the DSP 22. Also, signal transmission could be limited to a pure audio signal, without adding control and command data.
Some details of an example of the protocol of the digital link 12 will be discussed by reference to
Data transmission may occur in the form of TDMA (Time Division Multiple Access) frames comprising a plurality (for example 10) of time slots, wherein in each slot one data packet may be transmitted. In
Preferably, a slow frequency hopping scheme is used, wherein each slot is transmitted at a different frequency according to a frequency hopping sequence calculated by a given algorithm in the same manner by the transmitter unit 10 and the receiver units 14, wherein the frequency sequence is a pseudo-random sequence depending on the number of the present TDMA frame (sequence number), a constant odd number defining the hopping sequence (hopping sequence ID) and the frequency of the last slot of the previous frame.
The first slot of each TDMA frame (slot 0 in
The second slot (slot 1 in
Rather than allocating separate slots to the beacon packet and the response of the slaves, the beacon packet and the response data may be multiplexed on the same slot, for example, slot 0.
The audio data is compressed in the transmission unit 10 prior to being transmitted.
Each audio data packet comprises a start frame delimiter (SFD), audio data and a frame check sequence, such as CRC (Cyclic Redundancy Check) bits. Preferably, the start frame delimiter is a 5 bytes code built from the 4 byte unique ID of the network master. This 5 byte code is called the network address, being unique for each network.
In order to save power, the receivers 61 in the receiver unit 14 are operated in a duty cycling mode, wherein each receiver wakes up shortly before the expected arrival of an audio packet. If the receiver is able to verify (by using the CRC at the end of the data packet), the receiver goes to sleep until shortly before the expected arrival of a new audio data packet (the receiver sleeps during the repetitions of the same audio data packet), which, in the example of
In order to further reduce power consumption of the receiver, the receiver goes to sleep already shortly after the expected end of the SFD, if the receiver determines, from the missing SFD, that the packet is missing or has been lost. The receiver then will wake up again shortly before the expected arrival of the next audio data packet (i.e., the copy/repetition of the missing packet).
According to the present invention, a codec, which typically is a sub-band ADPCM codec, is used, wherein the audio signal is encoded in such a manner that each audio data block is distributed onto at least two audio data packets in such a manner that one of the packets is a low quality packet representing an encoded low quality version of the audio signal and one of the packets is a high quality packet representing the surplus of an encoded high quality version of the audio signal with regard to the low quality version, so that by decoding of the low quality packets only a low quality version of the audio signal is retrievable, whereas by decoding of both the low quality packets and the high quality packets a high quality version of the audio signal is retrievable.
Preferably, the audio signal is split into at least two spectral sub-bands prior to encoding, with each sub-band being encoded by a separate encoder. Preferably, the low quality packets include only part of the sub-bands (hereinafter “basic sub-bands”), i.e., not all sub-bands, with the remaining sub-bands being included in the high quality packets, and with the low quality packets preferably including only the lowest sub-band(s).
Preferably, the low quality packets include only the most significant bits of the basic sub-bands, with the remaining, i.e., least significant, bits of the basic sub-bands being included in the high quality packets. Preferably, the audio signal in each of the basic sub-bands is encoded by a two-stage encoder comprising a first stage for generating the most significant bits included in the low quality packets, a unit for computing the residual quantization error of the first stage, and a second stage for encoding the computed residual quantization error of the first stage in order to generate the remaining, i.e., least significant, bits included in the high quality packets. The most significant bits of the basic sub-bands retrieved by decoding of the low quality packets are added to the least significant bits of the basic sub-bands retrieved by decoding the high quality packets in order to reconstruct the audio signal in the basic sub-bands.
Preferably, the low quality packets include only two sub-bands, while the high quality packets include one or two additional sub-bands.
Typically, the audio signal reconstructed by decoding both the low quality packets and the high quality packets has an increased bandwidth and/or an increased quantization resolution compared to the audio signal retrieved by decoding the low quality packets only. Preferably, the audio signal reconstructed by decoding both the low quality packets and the high quality packets has a higher quantization resolution in lower frequency sub-bands compared to higher frequency sub-bands.
In
The units 120, 122, 124, 126, 128 and 130 may be functionally implemented as part of the signal processing unit 20 of
According to the example illustrated in
As shown in
The high quality packet (which is labeled “HQF1” in
Consequently, by receiving and decoding both the low quality packet and the high quality packet, the high quality version of the audio signal can be retrieved, while by receiving and decoding only the low quality packet a low quality version of the audio signal can be retrieved.
The example shown of
For the LQ codec a 16 kHz signal is divided into 2 sub-bands. Every 2 samples of 16 bits are converted to 8 bits with 6 bits for the 0-4 kHz band (LQ0) and 2 bits for the 4-8 kHz band (LQ1). This results in a data rate of 8*16000/2=64 kb/s. For the transmission via the digital radio link a packet of 32*8 or 16*16 bits is sent every 4 ms. This packet is defined as the low quality frame (LQF).
For the HQ codec a 32 kHz signal is divided into 4 sub-bands. Every 2 samples of 16 bits are converted to 16 bits with 8 bits for the 0-4 kHz band (HQ0), 4 bits for the 4-8 kHz band (HQ1), 2 bits for the 8-12 kHz band (HQ2) and 2 bits for the 12-16 kHz band (HQ3). This results in a data rate of 16*32000/4=128 kb/s. For the transmission over the digital radio link 2 packets of 16*16 bits are sent every 4 ms. These packets are defined as the high quality frames (HQF0 and HQF1).
To maintain compatibility with the LQ codec, the most significant bits of HQband i should be equal to LQband i:
HQ
band i
[m−1; m−n]=LQband i[n−1; 0],
where m and n are the number of bits of HQ respectively LQ quantizer for sub-band i.
To simplify access to the LQ data, the first frame (HQF0) contains only the most significant bits of HQ0 and HQ1 and, thus, corresponds exactly to the low quality frame (LQF). The second frame (HQF1) contains the least significant bits of HQ0 and HQ1 as well as HQ2 and HQ3. This way, the signal can be decoded by either a LQ decoder using only HQF0 or by a HQ decoder using both HQF0 and HQF1. Additionally, a HQ decoder should also be able to decode a LQF by putting “silence” encoded frames for the HQF1.
In
In addition, the output of the LQ unit 132 is used by the residual error extraction unit 134 for extracting the residual quantization error resulting from such 6 bit quantization of the signal in the first sub-band. The output of the residual error extraction unit 134 is supplied to the HQ unit 136 which generates the two least significant bits of the signal in the first sub-band, which are supplied to the unit 128 for being included in the high quality packets (the output of the HQ unit is labeled “HQ transmission bits” in
As already mentioned above, the encoder 126B used for encoding the second sub-band may have the same structure as the encoder 126A used for the first sub-band, with the LQ unit 132 then generating the two most significant bits of the second sub-band to be included in the low quality packets and the HQ unit 136 generating the two least significant bits of the second sub-band to be included in the high quality packets.
The encoders 126C and 126D used for encoding the third and fourth sub-band, respectively, do not need to have the two-stage structure of the encoders 126A and 126B; rather, the encoders 126C and 126D may consist of a unit like the HQ unit 136 only in order to supply the two bits encoding the third and fourth sub-band, respectively, to be included in the high quality packets.
An example of the four sub-band decoder structure to be implemented in the unit 74 of the receiver unit 14 is illustrated in
Accordingly, the LQ bit combiner unit 140 generates a first output corresponding to the six most significant bits of the first sub-band and a second output corresponding to the two most significant bits of the second sub-band, which outputs are supplied to a 6 bit ADPCM decoder 142A and a 2 bit ADPCM decoder 142B. The respective decoded signals are supplied to a first QMF 144 which generates an 8 kHz low quality version of the audio signal.
The HQ bit combiner unit 138 generates a first output corresponding to the two least significant bits of the first sub-band, a second output corresponding to the two least significant bits of a second sub-band, a third output corresponding to the two bits of the third sub-band and a fourth output corresponding to the two bits of the fourth sub-band. These outputs are supplied to ADPCM decoders 142C, 142D, 142E and 142F, respectively.
The output signals provided by the HQ-decoders 142C to 142F are recombined by using a two-stage QMF arrangement. The output signals of the decoders 142E, 142F are supplied to a second QMF 148 and the output signals of the decoders 142C, 142D are supplied to a third QMF 146, with the output of the decoder 142A being added to the output of the decoder 142C and the output of the decoder 142B being added to the output of the decoder 142D, in order to completely reconstruct the high quality version of the first and second sub-band. The output of the QMF 146 and the QMF 148 are supplied to a fourth QMF 150 in order to reconstruct the 16 kHz high quality version of the audio signal.
While any high quality device seeking to retrieve the high quality version of the audio signal would have to employ the decoder structure shown in
In
The benefit of such three sub-band embodiments is that one QMF computation and one sub-band encoder and decoder can be omitted, thereby simplifying the system. Further, if the allocated number of bits per sub-band is the same (for example two bits for the sub-band from 8 to 12 kHz and two bits for the sub-band from 12 to 16 kHz), then the achieved result is quite similar to the one obtained by allocating the same number of bits to the frequency band containing these sub-bands (for example, two bits for a sub-band from 8 to 16 kHz).
In order to keep the overall power consumption as low as possible, the low quality devices listen only to the transmission of the low quality packets, while sleeping during transmission of the high quality packets. For example, in the TDMA frame structure shown in
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2012/050847 | 1/20/2012 | WO | 00 | 7/18/2014 |