The present disclosure relates to implantable neurostimulator devices and systems, for example, cochlear stimulation systems, and to sound processing strategies employed in conjunction with such systems.
The characteristics of a cochlear implant's front end play an important role in the sound quality (and hence speech recognition or music appreciation) experienced by the cochlear implant (CI) user. These characteristics are governed by the components of the front-end including a microphone and an A/D converter in addition to the acoustical effects resulting from the placement of the CI microphone on the user's head. The acoustic characteristics are unique to the CI user's anatomy and the placement of the CI microphone on his or her ear. Specifically, the unique shaping of the user's ears and head geometry can result in substantial shaping of the acoustic waveform picked up by the microphone. Because this shaping is unique to the user and his/her microphone placement, it typically cannot be compensated for with a generalized solution.
The component characteristics of the microphone must meet pre-defined standards, and this issue can be even more critical in beamforming applications where signals from two or more microphones are combined to achieve desired directivity. It is critical for the microphones in these applications to have matched responses. Differences in the microphone responses due to placement on the patient's head can make this challenging.
Beamforming is an effective tool for focusing on the desired sound in a noisy environment. The interference of noise and undesirable sound tends to be very disturbing for speech recognition in everyday conditions, especially for hearing-impaired listeners. This is due to reduced hearing ability that lead, for example, to increased masking effects of the target signal speech.
A number of techniques based on single and multiple microphone systems have already been applied to suppress unwanted background noise. Single microphone techniques generally perform poorly when the frequency spectra of the desired and the interfering sounds are similar, and when the spectrum of the interfering sound varies rapidly. By using more than one microphone, sounds can be sampled spatially and the direction of arrival can be used for discriminating desired from undesired signals. In this way it is possible to suppress stationary and non-stationary noise sources independently of their spectra. An application for hearing aids requires a noise reduction approach with a microphone array that is small enough to fit into a Behind The Ear (BTE) device. As BTEs are limited in size and computing power, only directional microphones are currently used to reduce the effects of interfering noise sources.
The methods and systems described herein implement techniques for clarifying sound as perceived through a cochlear implant. More specifically, the methods and apparatus described here implement techniques to implement beamforming in the CI.
In one aspect, a beamforming signal is generated by disposing a first microphone and a second microphone in horizontal coplanar alignment. The first and second microphones are used to detect a known signal to generate a first response and a second response. The first response is processed along a first signal path communicatively linked to the first microphone, and the second response is processed along a second signal path communicatively linked to the second microphone. The first and second responses are matched, and the matched responses are combined, to generate the beamforming signal on a combined signal path.
Implementations can include one or more of the following features. For example, matching the first and second responses can include sampling the first response and the second response at one or more locations along the first and second signal paths. In addition, a first spectrum of the sampled first response, a second spectrum of the sampled second response, and a third spectrum of the known signal can be generated. The first and second spectrums can be compared against the third spectrum, and a first filter and a second filter can be generated based on the comparisons. The first filter can be disposed on the first signal path and a second filter disposed on the second signal path.
In addition, implementations can include one or more of the following features. For example, a third filter can be disposed on the combined signal path to eliminate an undesired spectral transformation of the beamforming signal. The first and second microphones disposed in horizontal coplanar alignment can include a behind-the-ear microphone and an in-the-ear (ITE) microphone. The in-the-ear microphone is located in a concha of a cochlear implant user in horizontal coplanar alignment with the user's pinnae to optimize directivity at a high frequency band. Alternatively, the first and second microphones disposed in horizontal coplanar alignment can include two in-the-ear microphones. The two in-the-ear microphones are disposed in a concha of a cochlear implant user in horizontal coplanar alignment with the user's pinnae to optimize directivity at a high frequency band. The first and second microphones disposed in horizontal coplanar alignment can also include an in-the-ear microphone and a sound port communicatively linked to a behind-the-ear microphone. The sound port is located in horizontal coplanar alignment with the in-the-ear microphone, and the in-the-ear microphone is located in a concha of a cochlear implant user in horizontal coplanar alignment with the user's pinnae to optimize directivity at a high frequency band.
Implementations can further include one or more of the following features. The first and second microphones can be positioned to modulate a spacing between the first microphone and the second microphone to optimize directivity at a low frequency band. The behind-the-ear microphone can also include a second sound port designed to eliminate a resonance generated by the first sound port. The first sound port and the second sound port can be designed to have equal length and diameter in order to eliminate the resonance. Alternatively, a resonance filter can be generated to eliminate a resonance generated by the first sound port. The resonance filter includes a filter that generates a filter response having valleys at frequencies corresponding to locations of peaks of the resonance.
The techniques described in this specification can be implemented to realize one or more of the following advantages. For example, the techniques can be implemented to allow the CI user to use the telephone due to the location of the ITE microphone. Most hearing aids implement microphones located behind the ear, and thus inhibit the CI user from using the telephone. The techniques also can be implemented to take advantage of the naturally beamforming ITE microphone due to its location and the shape of the ear. Further, the techniques can be implemented as an extension of the existing ITE microphone, which eliminates added costs and redesigns of existing CI. Thus, beamforming can be implemented easily to current and future CI users alike.
These general and specific aspects can be implemented using an apparatus, a method, a system, or any combination of apparatuses, methods, and systems. The details of one or more implementations are set forth in the accompanying drawings and the description below. Further features, aspects, and advantages will become apparent from the description, the drawings, and the claims.
Like reference symbols indicate like elements throughout the specification and drawings.
A method and system for implementing a beamforming system are disclosed. A beamforming system combines sound signals received from two or more microphones to achieve directivity of the combined sound signal. Although the following implementations are described with respect to cochlear implants (CI), the method and system can be implemented in various applications where directivity of a sound signal and microphone matching are desired.
Applications of beamforming in CIs can be implemented using two existing microphones, a behind-the-ear (BTE) microphone and an in-the-ear (ITE) microphone. The BTE microphone is placed in the body of a BTE sound processor. Using a flexible wire, the ITE microphone is placed inside the concha near the pinna along the natural sound path. The ITE microphone picks up the natural sound using the natural shape of the ear and provides natural directivity in the high frequency without any added signal processing. This occurs because the pinna is a natural beam former. The natural shape of the pinna allows the pinna to preferentially pick up sound from the front and provides natural high frequency directivity. By placing the ITE microphone in horizontal coplanar alignment with the pinna, beamforming in the high frequencies can be obtained. U.S. Pat. No. 6,775,389 describes an ITE microphone that improves the acoustic response of a BTE Implantable Cochlear Stimulation (ICS) system during telephone use and is incorporated herein as a reference.
For beamforming, the microphones implemented must be aligned in a horizontal plane (coplanar). In addition, the spacing or distance between two microphones can affect directivity and efficiency of beamforming. If the spacing is too large, the directivity at high frequencies can be destroyed or lost. For example, a microphone-to-microphone distance greater than four times the wavelength (λ) cannot create effective beamforming. Also, the closer the distance, the higher the frequency at which beamforming can be created. However, the beamforming signal becomes weaker as the distance between the microphones is reduced since the signals from the two microphones are subtracted from each other. Therefore, the gain in directivity due to the closeness of the distance between the microphones also creates a loss in efficiency. The techniques disclosed herein optimize the tradeoff between directivity and efficiency.
To maximize beamforming, the microphones are positioned horizontally coplanar to each other, which can be accomplished in one of several ways. For example, an ITE microphone can be positioned to be aligned with a BTE microphone, but such alignment would result in a loss of the natural beamforming at higher frequencies since the ITE microphone will no longer be placed near the pinna. Therefore, in one aspect of the techniques, the BTE microphone is positioned to align with the ITE microphone. Since the pinna provides free (without additional processing) and natural high frequency directivity, the BTE microphone can be moved in coplanar alignment with the ITE microphone. Directivity for lower frequencies can be designed by varying the distance between the two microphones.
Microphone System Design Strategies
In some implementations, an ITE microphone 230 is implemented in horizontal coplanar alignment 250 with a sound port 240 as shown in
One undesired effect of the sound port 240 is an introduction of resonance or unwanted peaks in the acoustical signal.
In order to help eliminate the undesired effect, a digital filter can be implemented to compensate for the resonance created. The digital filter can be designed to filter out the peaks of the resonance by generating valleys at frequency locations of the peaks. Alternatively, a smart acoustical port design can be implemented with an anti-resonance acoustical structure. The smart acoustical port design includes a second, complementary sound port 270 configured to create a destructive resonance to cancel out the original resonance. The second sound port 270 is of equal length and diameter as the first sound port 240. However, the shape or position of the tube does not affect the smart acoustical port design. Consequently, the second sound port 270 can be coiled up and hidden away.
In some implementations, as described in
As with the other microphone designs, having alignment with the pinnae provides natural beamforming at the high frequency range, and the distance between the two microphones 430, 440 is adjusted to achieve beamforming at the low frequency range. Similar to the microphone strategy described in
Microphone Matching
In general, microphones used in beamforming applications are matched microphones. These matched microphones are sorted and selected by a microphone manufacturer for matching characteristics or specifications. This is not only time consuming but also increases the cost of the microphones. In addition, even if perfectly matching microphones could be implemented in a CI, the location of the microphones and shape and physiology of the CI user's head introduces uncertainties that create additional mismatches between the microphones.
In one aspect, a signal processing strategy is implemented to match two unmatched microphones by compensating for inherent characteristic differences between the microphones in addition to the uncertainties due to the physiology of the CI user's head. Matching of the two microphones is accomplished by implementing a process for customizing an acoustical front end as disclosed in U.S. Pat. No. 7,864,968. The techniques of this patent can be implemented to compensate for an undesired transformation of the known acoustical signal due to the location of the microphones and the shape of the CI user's head including the ear. The techniques also eliminate the need to implement perfectly matched microphones.
In general, the fitting portion 550 is implemented on a computer system located at an office of an audiologist or other medical personnel and is used to perform an initial fitting or customization of a cochlear implant for a particular user. The sound processing portion 502 is implemented on a behind the ear (BTE) headpiece 100 (
The fitting portion 550 can generate an acoustic signal, which can be picked up and processed by the sound processing portion 502. The processed acoustic signal can be passed to an implantable cochlear stimulator (ICS) 518 through an appropriate communication link 516. The ICS 518 is coupled to an electrode array 520 configured to be inserted within the cochlea of a patient. The implantable cochlear stimulator 518 can apply the processed acoustic signal as a plurality of stimulating inputs to a plurality of electrodes distributed along the electrode array 520. The electrode array 520 may be substantially as shown and described in U.S. Pat. Nos. 4,819,647 and 6,129,753, both patents incorporated herein by reference.
In some implementations, both the fitting portion 550 and the sound processing portion 502 are implemented in the external BTE headpiece 100 (
In some implementations, the sound processing portion 502 is incorporated into an internally located implantable cochlear system (not shown) as shown and described in a co-pending U.S. Patent Pub. No. 2007/0260292.
The implantable cochlear stimulator can send information, such as data and status signals, to the sound processing system 514 over the communication link 516. In order to facilitate bidirectional communication between the sound processing system 514 and the implantable cochlear stimulator 518, the communication link 516 can include more than one channel. Additionally, interference can be reduced by transmitting information on a first channel using an amplitude-modulated carrier and transmitting information on a second channel using a frequency-modulated carrier.
The communication links 556 and 524 are wired links using standard data ports such as Universal Serial Bus interface, IEEE 1394 FireWire, or other suitable serial or parallel port connections.
In some implementations, the communication links 556 and 524 are wireless links such as the Bluetooth protocol. The Bluetooth protocol is a short-range, low-power 1 Mbit/sec wireless network technology operated in the 2.4 GHz band, which is appropriate for use in piconets. A piconet can have a master and up to seven slaves. The master transmits in even time slots, while slaves transmit in odd time slots. The devices in a piconet share a common communication data channel with total capacity of 1 Mbit/sec. Headers and handshaking information are used by Bluetooth devices to strike up a conversation and find each other to connect. Other standard wireless links such as infrared, wireless fidelity (Wi-Fi), or any other suitable wireless connections can be implemented. Wi-Fi refers to any type of IEEE 802.11 protocol including 802.11a/b/g/n. Wi-Fi generally provides wireless connectivity for a device to the Internet or connectivity between devices. Wi-Fi operates in the unlicensed 2.4 GHz radio bands, with an 11 Mbit/sec (802.11b) or 54 Mbit/sec (802.11a) data rate or with products that contain both bands. Infrared refers to light waves of a lower frequency out of range of what a human eye can perceive. Used in most television remote control systems, information is carried between devices via beams of infrared light. The standard infrared system is called infrared data association (IrDA) and is used to connect some computers with peripheral devices in digital mode.
In implementations whereby the implantable cochlear stimulator 518 and the electrode array 520 are implanted within the CI user, and the microphone system 510 and the sound processing system 514 are carried externally (not implanted) by the CI user, the communication link 516 can be realized through use of an antenna coil in the implantable cochlear stimulator and an external antenna coil coupled to the sound processing system 514. The external antenna coil can be positioned to be in alignment with the implantable cochlear stimulator, allowing the coils to be inductively coupled to each other and thereby permitting power and information, e.g., the stimulation signal, to be transmitted from the sound processing system 514 to the implantable cochlear stimulator 518.
In some implementations, the sound processing system 514 and the implantable cochlear stimulator 518 are both implanted within the CI user, and the communication link 516 can be a direct-wired connection or other suitable links as shown in U.S. Pat. No. 6,308,101, incorporated herein by reference.
The computer system 562 executing the fitting software 564 can include a display screen for displaying selection screens, stimulation templates and other information generated by the fitting software. In some implementations, the computer system 562 includes a display device, a storage device, RAM, ROM, input/output (I/O) ports, a keyboard, and a mouse. The display screen can be implemented to display a graphical user interface (GUI) executed as a part of the software 564 including selection screens, stimulation templates and other information generated by the software 564. An audiologist, other medical personnel, or even the CI user can easily view and modify all information necessary to control a fitting process. In some implementations, the fitting portion 550 is included within the sound processing system 514 and can allow the CI user to actively perform cochlear implant front end diagnostics and microphone matching.
In some implementations, the fitting portion 550 is implemented as a stand-alone system located at the office of the audiologist or other medical personnel. The fitting portion 550 allows the audiologist or other medical personnel to customize a sound processing strategy and perform microphone matching for the CI user during an initial fitting process after the implantation of the CI. The CI user can return to the office for subsequent adjustments as needed. Return visits may be required because the CI user may not be fully aware of his/her sound processing needs initially, and the user may need time to learn to discriminate between different sound signals and become more perceptive of the sound quality provided by the sound processing strategy. In addition, the microphone responses may need periodic calibrations and equalizations. The fitting system 554 is implemented to include interfaces using hardware, software, or a combination of both hardware and software. For example, a simple set of hardware buttons, knobs, dials, slides, or similar interfaces can be implemented to select and adjust fitting parameters. The interfaces can also be implemented as a GUI displayed on a screen.
In some implementations, the fitting portion 550 is implemented as a portable system. The portable fitting system can be provided to the CI user as an accessory device for allowing the CI user to adjust the sound processing strategy and recalibrate the microphones as needed. The initial fitting process may be performed by the CI user aided by the audiologist or other medical personnel. After the initial fitting process, the user may perform subsequent adjustments without having to visit the audiologist or other medical personnel. The portable fitting system can be implemented to include simple user interfaces using hardware, software, or a combination of both hardware and software to facilitate the adjustment process as described above for the stand alone system implementation.
The microphone system 510 can be implemented to use any of the three microphone design configurations as described with respect to
Microphone matching is accomplished by compensating for an undesired transformation of the known acoustical signal detected by the microphones 530, 532 due to the inherent characteristic differences in the microphones 530, 532, locations of the microphones 530, 532 and the physiological properties of the CI user's head and ear. A microphone matching process includes sampling the detected signal along the signal paths 512, 515 and matching the responses from the microphones 530, 532.
The sampling locations 531, 541, 537, and 547 in the signal pathways 512 and 515 can be determined by the system 500 to include one or more locations after the A/D converters 538 and 540. For example, the digitized signal can be processed using one or more digital signal processing units (DSPs).
In one implementation, optimization of the sampled signal at 640 is performed via the fitting system 550. Alternatively, in some implementations, the sound processing system 514 is implemented to perform the optimization by disposing a DSP module (not shown) within the sound processing system 514. In other implementations, the existing DSP module 546 can be configured to perform the optimization.
Optimizing the sampled electrical signal can be accomplished through at least three signal processing events. The electrical signal is sampled and a spectrum of the sampled signal is determined at 642. The determined spectrum of the sampled signal is compared to the spectrum of the known acoustical signal to generate a ratio of the two spectrums at 644. The generated ratio represents the undesired transformation of the sampled signal due to the positioning of the microphones, mismatched characteristics of the microphones, and physical anatomy of the user's head and ear. The ratio generated is used as the basis for designing and generating an equalization filter to eliminate the undesired transformation of the sampled signal at 646. The generated equalization filter is disposed at the corresponding sampling locations 531, 541, 537, and 547 to filter the sampled signal at 648. The filtered signal is directed to the next available signal processing unit on the signal pathways 512, 515. The available signal processing unit can vary depending on the signal processing scheme designed for a particular CI user.
The transfer functions and the equalization filter based on the transfer functions generated through optimization at 640 is implemented using Equations 1 through 4.
The acoustic signal or stimulus generated from the sound source 552 is s(t) and has a corresponding Fourier transform S(jω). The signal captured or recorded from the microphone system 510 is r(t) and has a corresponding Fourier transform R(jω). The acoustical transfer function from the source to the microphone, H(jω), can then be characterized by Equation (3) above. If the target frequency response is specified by T(jω), then the equalization filter shape is given by Equation (4) above. This equalization filter is appropriately smoothed and then fit with a realizable equalization filter, which is then stored on the sound processing system 514 at the appropriate location(s). The digital filter can be a finite-impulse-response (FIR) filter or an infinite-impulse-response (IIR) filter. Any one of several standard methods (see, e.g., Discrete Time Signal Processing, Oppenheim and Schafer, Prentice Hall (1989)) can be used to derive the digital filter. The entire sequence of operation just described is performed by the fitting system 554. In some implementations, the processing events 642, 644, 646, and 684 are implemented as a single processing event, combined as two processing events or further subdivided into multiple processing events.
If the decision at 625 is to sample the digital signal, the digital signal is forwarded directly to the AGC 542, 544. Alternatively, the digital signal can be forwarded to the next signal processing unit. For example, a first optional digital signal processing (DSP1) can be presented at 630. At the conclusion of the first optional digital signal processing, another opportunity to sample the digital signal can be presented at 635. A decision to sample the digital signal at 635 instructs the fitting system 554 to perform the signal optimization at 640. The signal processing events 642, 644, 646, 648 are carried out on the digital signal to filter out the undesired transformation and match the microphone responses as described above. The filtered digital signal can then be forwarded to the AGC 542, 544 at 655 to provide protection against an overdriven or underdriven signal and to maintain an adequate demodulation signal amplitude while avoiding occasional noise spikes.
However, if the decision at 650 is not to sample the digital signal, then the digital signal is forwarded directly to the AGCs 542, 544 and processed as described above. The gain controlled digital signal is processed at 655 to allow for yet another sampling opportunity. If the decision at 660 is to sample the gain controlled digital signal, the sampled gain controlled digital signal is processed by the fitting system 554 to perform the optimization at 640. The signal processing events 642, 644, 646, and 648 are carried out on the gain controlled digital signal to filter out the undesired transformation and match microphone responses as described above. The filtered digital signal is forwarded to a beamforming module 528 for combining the signals from each signal path 512, 515.
Beamforming Calculation
Once the microphone matching process has been accomplished, the beamforming mathematical operation is performed on the two individual signals along the two signal paths 512, 515. The beamforming module 528 combines the filtered signals from signal paths 512 and 515 to provide beamforming. Beamforming provides directivity of the acoustical signal, which allows the individual CI user to focus on a desired portion of the acoustical signal. For example, in a noisy environment, the individual CI user can focus on the speech of a certain speaker to facilitate comprehension of such speech over confusing background noise.
Beamforming provides a destructive combination of signals form the two microphones 530, 532. In other words, a first signal from the first microphone 530 is subtracted from a second signal from the second microphone 532. Alternatively, the second signal from the second microphone 532 can be subtracted from the first signal from the first microphone 530. A consequence of such destructive combination can include a spectrum shift in the combined signal. The beamforming signal (the combined signal) has directivity associated with the design parameters. However, a spectrum transformation is also generated, and a computed transformation of the beamforming signal can include a first order high pass filter. At the large wavelength (low frequency), more signal strength is lost than at the small wavelength (high frequency). In contrast, at the small wavelength, the signal strength is slightly larger than at the low frequency. In order to compensate for the spectral modification, a digital filter can be provided to counter the high pass filter response of the beamforming signal. The digital filter to compensate for the spectral modification can be determined by sampling the combined beamforming signal and comparing the sampled beamforming signal against a target signal.
A delay factor, Δ, is applied to the response from the microphone 530, 532 farthest away from the sound source 552 using a delay module 562 along the corresponding microphone signal paths 512, 515. If Δ=the back length between the two microphones 530, 532, then Focus is entirely to the front. A gain factor, α, is applied to the same response using a multiplier 560 located along the corresponding microphone signal paths 512, 515 to provide Strength of the Focus. Varying α from 0 to 1 changes the Strength of the Focus. Therefore, the delay factor, Δ, provides Focus (direction), and the gain factor, α, provides Strength of that Focus. A beamforming signal (BFS) is calculated using Equation (5).
BFS=MIC2−α×(MIC1×Δ) (5)
The resultant beamforming signal is forwarded to an optimization unit 575 along a combined signal path 570. The optimization unit 575 performs signal optimization 700 as described in
Modulation of the delay and gain factors, Δ and α, can be implemented using physical selectors such as a switch or dials located on a wired or wireless control device. Alternatively, a graphical user interface can be implemented to include graphical selectors such as a button, a menu, and a tab to input and vary the delay and gain factors.
In some implementations, the gain and delay factors can be manually or automatically modified based on the perceived noise level. In other implementations, the gain and delay factors can be selectable for on/off modes.
Computer Implementation
In some implementations, the techniques for achieving beamforming as described in
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer, including graphics processors, such as a GPU. Generally, the processor will receive instructions and data from a read only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks. Information carriers suitable for embodying computer program instructions and data include all forms of non volatile memory, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user and a keyboard and a pointing device (e.g., a mouse or a trackball) by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user can be received in any form, including acoustic, speech, or tactile input.
A number of implementations have been disclosed herein. Nevertheless, it will be understood that various modifications may be made without departing from the scope of the claims. Accordingly, other implementations are within the scope of the following claims.
This is a continuation of U.S. patent application Ser. No. 11/534,933, filed Sep. 25, 2006, which is incorporated herein by reference in its entirety and to which priority is claimed.
Number | Date | Country | |
---|---|---|---|
Parent | 11534933 | Sep 2006 | US |
Child | 13172980 | US |