The present application claims priority from Australian Provisional Patent Application No 2006906326 filed on 13 Nov. 2006, the content of which is incorporated herein by reference.
The present, invention relates to headsets and the like for telephony or audio applications, and in particular relates to the provision of off-board signal processing capabilities for such a headset.
A recent trend in head mounted devices such as headsets and earpieces for telephony, communications, and audio applications is towards a small battery-operated headset device which, via a wireless data connection, operates in conjunction with a nearby device such as a mobile or cellular telephone, a personal digital assistant (PDA), a personal computer (PC), a portable media player such as an iPod™, or the like. This arrangement is illustrated in
Because the headset is small and designed to be worn on the user's head it needs to be light with a minimum size battery. It also needs to have digital signal processing (DSP) capability to make up for the non-optimal acoustic properties implied by the small size, such as the relatively long distance between the microphone and the wearer's mouth which gives reduced (worse) signal-to-noise ratio compared to the more conventional headset with a boom microphone. The signal picked up by the microphone usually needs to be “cleaned” by processing with a suitable noise reduction algorithm. A further consequence of the small form factor of the headset is the proximity of the acoustic output speaker to the microphone. This gives rise to an acoustic echo when the output signal from the speaker is picked up by the microphone and re-transmitted back to the remote telephone user. Therefore suitable echo-cancellation signal processing should generally also be applied.
In conventional DSP enabled headsets, all such processing is done by a DSP chip in the headset itself. That is, all such processing is performed “on-board”.
However, the sophistication of digital signal processing and the number of different signal processing algorithms which can be carried out on-board is limited by the small battery size and the need for low power consumption. The low power consumption requirement also limits the number of microphones which can be used, generally preventing use of multi-microphone DSP techniques such as multi-microphone noise cancellation. Also, the small size of the headset imposes limits on the physical separation between the microphone and the speaker of the headset. These limitations make it difficult to implement sophisticated signal processing, especially noise reduction and feedback cancellation, amongst other DSP techniques.
Any discussion of documents, acts, materials, devices, articles or the like which has been included in the present specification is solely for the purpose of providing a context for the present invention. It is not to be taken as an admission that any or all of these matters form part of the prior art base or were common general knowledge in the field relevant to the present invention as it existed before the priority date of each claim of this application.
Throughout this specification the word “comprise”, or variations such as “comprises” or “comprising”, will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, but not the exclusion of any other element, integer or step, or group of elements, integers or steps.
According to a first aspect the present invention provides a system for distributed signal processing for a headset, the system comprising:
According to a second aspect the present invention provides a headset enabled for distributed processing, the headset comprising:
According to a third aspect the present invention provides a base device for providing distributed signal processing for a headset; the base device comprising:
According to a fourth aspect the present invention provides a method for providing distributed signal processing for a headset, the method comprising:
The base device may be a desktop telephone, a mobile or cellular telephone, a personal digital assistant (PDA), a personal computer (PC), or a portable media player such as an iPod™ or MP3 player.
Embodiments of the present invention thus provide distributed processing for a telephony or audio headset, in which the headset communicates (probably wirelessly) with a nearby ‘base’ device. The base device could be provided with the capability to implement greater processing abilities to improve performance of the system comprising the headset and base, so that performance of the system is controlled by “distributed” processing. That is, the system operation is controlled in such embodiments by processing which is distributed between the headset and the base device. Alternatively, the base device might take on the burden of required signal processing functions, so that the headset DSP is not faced with the burden of such functions. For example, the headset might process the microphone signal by passing it through one or more filters, with the filter settings being determined by the base device which instructs the headset via the wireless communications link to change the filter settings at suitable times. In this arrangement, the DSP burden of determining appropriate filter settings is “off-board”, being in the nearby base device and not in the headset itself. Embodiments of the invention may provide such distributed processing without control signals, such that functionality is provided to the headset by processing carried out in the base. Alternative implementations may utilise unidirectional control signals whether from the base to the headset or from the headset to the base, or may utilise bidirectional control signals between the base and headset.
In preferred embodiments, the on-board headset processing is limited to aspects that need to be performed with short latency (such as side tone and acoustic echo cancellation) while the nearby base device provides other types of processing functions which are not so time-critical and may require greater processing capacity (such as transmit signal noise reduction, line echo cancellation, multi-band automatic gain and volume control). The nearby base device can also carry a microphone of its own for the particular purpose of measuring general background noise and providing noise cancellation, which will be much more effective because the “background noise” microphone is spaced substantially further away from the headset user's voice than is possible for microphones mounted on the headset itself.
In some embodiments of the invention, a single base device may provide distributed processing for a multiple-headset communications environment. For example, the base device may communicate wirelessly with multiple headsets for use in a conference call, or with multiple headsets used in the same acoustic environment, such as an open plan call centre. In such embodiments, the base device may exploit the presence of multiple microphones in order to improve environmental noise detection and cancellation, without the need for the base device itself to possess such a multiplicity of microphones. Furthermore, the base device is preferably operable to control one or more aspects of digital signal processing of each headset based on signal characteristics of one or more of the other headsets.
The headset may comprise any suitable sound piece, such as an external headset suitable for mounting on the pinna or an earpiece for placement partially or wholly within the auditory canal.
According to a further aspect the present invention provides a base device for providing distributed signal processing for a headset; the base device comprising:
An example of the invention will now be described with reference to the accompanying drawings, in which:
a and 2b illustrate example signal processing capabilities provided by a headset, of which one or more of such capabilities may be moved off-board in accordance with an embodiment of the invention;
a and 4b illustrate the signal processing elements of each of the base device and headset in accordance with one embodiment of the present invention;
a and 2b illustrate a number of desirable DSP functions for an example state-of-the-art headset application. Receive signal processing includes frequency equalization 210 to compensate for output transducer characteristics, automatic gain control 212 to compensate for variability in the received signal level, automatic volume control 212 to compensate for variable ambient noise in the listener's environment, noise reduction 214 or noise cancellation to improve sound quality of the received signal, and line echo cancellation. Transmit signal processing includes frequency equalization 222 to compensate for microphone characteristics, automatic gain control 212 to compensate for variability in speaking level and variability in the microphone position and alignment with the speaker's mouth, noise reduction 226 to remove ambient noise from the transmitted signal, acoustic echo cancellation 224, multiple-microphone noise reduction, and side-tone 228 with howling suppression (acoustic feedback cancellation).
Preferably, in accordance with the invention the portion of the processing carried out in the headset is minimized, so as to provide maximum battery life and minimum headset size. Nevertheless, some elements of the DSP processing generally should be located in the headset itself because of the short time latency required to make the processing effective. Examples of such on-board processing include side-tone with howling suppression 228, and two-microphone noise reduction using a beam-former directional microphone approach. However, much of the remaining types of processing could be done off-board in the nearby base device, which usually has a larger battery and/or mains power supply and a greater processing capacity than the headset DSP chip.
The base device preferably controls the operation of the DSP of the headset by way of a control data stream established over the wireless communications link. The control data stream may be interleaved with the audio data stream in the wireless link. The headset may also control aspects of the digital signal processing in the base device and/or communicate operational parameters to the base device using the same bidirectional wireless link using data interleaved with the audio stream. An advantage of interleaving such control data with audio signals is that it maintains timing of the control data relative to the audio signals despite potentially unknown or variable delays inherent in the wireless link.
The system could include more than one headset, for example for conference calls. The headsets could all share the processing in the single nearby base device. Each headset thus contributes an additional microphone signal for the noise reduction in the transmitted signal, with the noise reduction processing performed by the base device. This will add to the effectiveness of the processing, with additional cost and power savings.
Thus, this invention splits DSP processing between the headset and the nearby base device to achieve a “distributed processing” solution to the problem.
Advantages of distributed processing for wireless headsets include:
A particular example of such distributed processing is now described in relation to a Bluetooth headset and base, with reference to
In addition to the issues set out above in relation to
When becoming paired with a headset by establishment of a Bluetooth link, the base device interrogates the headset to find out what model it is. The base device then ensures that processing it carries out on behalf of that headset can be adapted to meet that headset's specific configurations, performance characteristics, and the like. The base device is thus preferably equipped with, or connected to, a library of device-specific information for a plurality of devices with which it is anticipated the base device may become paired.
The headset often has acoustic echo, that is, some of the sound output from the speaker is picked up by the microphone. An acoustic echo canceller 332 is used to remove this echo. The total delay in an echo path and the level of the echo signal contribute to how it is perceived by the user. For a long delay a low level of echo will be more evident to the user than with a shorter delay but the same level. A general design goal is to keep the system delay as low as possible.
The typical software functions of the system 300 are shown in
NLP=Non-linear processing (removes residual echo)
The ADRO technique set out in U.S. Pat. No. 6,731,767, the content of which is incorporated herein by reference, may be used in place of the EQ, AVC and AGC on receive and in place of basic EQ in transmit. NLP, ANR, AGC, AVC, EQ and LIM on receive are implemented as “off-line processing” using a single on-line adaptive filter which is controlled by appropriate off-line processing to effect these functions.
EQ, AES and ANR on transmit are often implemented as “off-line processing” using a single adaptive filter. Many of the functions rely on having knowledge of the receiver and microphone response (a calibration) to function correctly. Therefore, if they are performed in the base it is important that the base knows what model headset is connected.
One configuration, shown in
b further shows two adaptive filters in the headset, one in each of the receive path and transmit path. Filter tap settings for the filters of the headset may be computed by the base device of
At times when the headset is paired with a mobile phone, all processing is transferred back to the headset. That is, in this embodiment the processing distribution is re-configured as the system configuration changes.
Headset distributed processing thus provides a means for simultaneously optimizing sound quality and minimizing power consumption in a headset or similar listening device.
A further embodiment relating to an ultra-small headset for mobile telephony is now described with reference to
In the embodiment of
Outputs of the system of
Sound sources in the acoustic environment of the headset wearer include the user's voice, noise whether environmental, music, other voices, wind, etc, and the output signal as played through the headset speaker including side tone and received signal. The signal processing goals are to optimize the headset user's voice and minimize other sound sources so as to maximise quality of the transmitted telephone signal 516, to optimize intelligibility of the received signal 510 when played out as signal 518, and to output a signal to cancel low-frequency environmental sounds for enhanced received intelligibility and comfort in noise. This will be more important for headsets that offer minimal occlusion.
In the embodiment of
The Bluetooth link is used to convey control signals between the headset and mobile phone in addition to the bidirectional transmitted and received voice signals. The control link from the headset to the phone can be used to identify the type of headset and provide information about the microphone and speaker characteristics of the headset for the equalization, AGC, and AVC processing in the mobile phone handset. The control link from the base to the headset could provide control of analog amplification in the headset and other functions such as power off at the end of a call.
More sophisticated applications could include speech synthesis and recognition in the base device to produce a completely hands-free voice activated system which may be impossible with the more limited processing capacity available in the miniature headset processor. In this case, the use of the microphones and signal processing in the headset would provide a cleaner voice signal than is available from the microphone on the mobile handset, enhancing the overall performance of the speech recognition.
A further embodiment relating to stereo headphones for music is now described with reference to
In the embodiment of
The embodiment of
The system of
The advantages of the distributed processing over a wireless system with a single processor in the headset are the possibility of a smaller form factor and longer battery life. The advantages of the distributed processing over a wireless system with a single processor in the base are higher sound quality because of the active noise cancellation which requires low delay, more accurate sound level monitoring and more effective hearing protection because of the proximity of the microphone to the ear.
A further embodiment relating to teleconferencing system with multiple headsets is now described. This embodiment provides for a single mains powered base device that communicates via a wireless link with two or more headsets. The base device has access to the signals picked up by microphones on all the headsets. As for the examples discussed in the preceding, headset size can be smaller, power consumption can be lower, and the cost of production can be lower for the individual headsets because a large part of the processing is done in the base device. Using multiple headsets, each with its own processing, can improve sound quality for the individual headset users by providing more effective echo cancellation, side tone, active noise cancellation, and control of the level of the received signal. Using multiple headsets also increases sound quality for the remote listeners through improved signal pick-up from the microphones on the headsets and improved noise reduction in the base device using all of the available information from the individual headset microphones.
Some portions of this detailed description are presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent sequence of steps leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
As such, it will be understood that such acts and operations, which are at times referred to as being computer-executed, include the manipulation by the processing unit of the computer of electrical signals representing data in a structured form. This manipulation transforms the data or maintains it at locations in the memory system of the computer, which reconfigures or otherwise alters the operation of the computer in a manner well understood by those skilled in the art. The data structures where data is maintained are physical locations of the memory that have particular properties defined by the format of the data. However, while the invention is described in the foregoing context, it is not meant to be limiting as those of skill in the art will appreciate that various of the acts and operations described may also be implemented in hardware.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the description, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.
The present invention also relates to apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various general purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method steps. The required structure for a variety of these systems will appear from the description. In addition, the present invention is not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the invention as described herein.
A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer). For example, a machine-readable medium includes read only memory (“ROM”); random access memory (“RAM”); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.); etc.
Turning to
In
The hard disk drive 27, magnetic disk drive 28, and optical disk drive 30 are connected to the system bus 23 by a hard disk drive interface 32, a magnetic disk drive interface 33, and an optical disk drive interface 34, respectively. The drives and their associated computer-readable media provide nonvolatile storage of computer readable instructions, data structures, program modules and other data for the personal computer 20. Although the exemplary environment shown employs a hard disk 60, a removable magnetic disk 29, and a removable optical disk 31, it will be appreciated by those skilled in the art that other types of computer readable media which can store data that is accessible by a computer, such as magnetic cassettes, flash memory cards, digital video disks, Bernoulli cartridges, random access memories, read only memories, storage area networks, and the like may also be used in the exemplary operating environment.
A number of program modules may be stored on the hard disk 60, magnetic disk 29, optical disk 31, ROM 24 or RAM 25, including an operating system 35, one or more applications programs 36, other program modules 37, and program data 38. A user may enter commands and information into the personal computer 20 through input devices such as a keyboard 40 and a pointing device 42. Other input devices (not shown) may include a microphone, joystick, game pad, satellite dish, scanner, or the like. These and other input devices are often connected to the processing unit 21 through a serial port interface 46 that is coupled to the system bus, but may be connected by other interfaces, such as a parallel port, game port or a universal serial bus (USB) or a network interface card. A monitor 47 or other type of display device is also connected to the system bus 23 via an interface, such as a video adapter 48. In addition to the monitor, personal computers typically include other peripheral output devices, not shown, such as microphones, speakers and printers.
The personal computer 20 may operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 49. The remote computer 49 may be another personal computer, a server, a router, a network PC, a peer device or other common network node, and typically includes many or all of the elements described above relative to the personal computer 20, although only a memory storage device 50 has been illustrated. The logical connections depicted include a local area network (LAN) 51 and a wide area network (WAN) 52. Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets and, inter alia, the Internet.
When used in a LAN networking environment, the personal computer 20 is connected to the local network 51 through a network interface or adapter 53. When used in a WAN networking environment, the personal computer 20 typically includes a modem 54 or other means for establishing communications over the WAN 52. The modem 54, which may be internal or external, is connected to the system bus 23 via the serial port interface 46. In a networked environment, program modules depicted relative to the personal computer 20, or portions thereof, may be stored in the remote memory storage device. It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers may be used.
It will be appreciated by persons skilled in the art that numerous variations and/or modifications may be made to the invention as shown in the specific embodiments without departing from the scope of the invention as broadly described. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive.
Number | Date | Country | Kind |
---|---|---|---|
2006906326 | Nov 2006 | AU | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/AU2007/001743 | 11/13/2007 | WO | 00 | 8/31/2009 |