The present invention relates to hearing implant systems, and more specifically, to techniques for producing electrical stimulation signals in such systems.
A normal ear transmits sounds as shown in
Hearing is impaired when there are problems in the ability to transduce external sounds into meaningful action potentials along the neural substrate of the cochlea 104. To improve impaired hearing, hearing prostheses have been developed. For example, when the impairment is related to operation of the middle ear 103, a conventional hearing aid may be used to provide mechanical stimulation to the auditory system in the form of amplified sound. Or when the impairment is associated with the cochlea 104, a cochlear implant with an implanted stimulation electrode can electrically stimulate auditory nerve tissue with small currents delivered by multiple electrode contacts distributed along the electrode.
Typically, the electrode array 110 includes multiple electrode contacts 112 on its surface that provide selective stimulation of the cochlea 104. Depending on context, the electrode contacts 112 are also referred to as electrode channels. In cochlear implants today, a relatively small number of electrode channels are each associated with relatively broad frequency bands, with each electrode contact 112 addressing a group of neurons with an electric stimulation pulse having a charge that is derived from the instantaneous amplitude of the signal envelope within that frequency band.
In some coding strategies, stimulation pulses are applied at a constant rate across all electrode channels, whereas in other coding strategies, stimulation pulses are applied at a channel-specific rate. Various specific signal processing schemes can be implemented to produce the electrical stimulation signals. Signal processing approaches that are well-known in the field of cochlear implants include continuous interleaved sampling (CIS), channel specific sampling sequences (CSSS) (as described in U.S. Pat. No. 6,348,070, incorporated herein by reference), spectral peak (SPEAK), and compressed analog (CA) processing.
The band pass signals B1 to BM (which can also be thought of as frequency channels) are input to a Signal Processor 202 which extracts signal specific stimulation information—e.g., envelope information, phase information, timing of requested stimulation events, etc.—into a set of N stimulation channel signals S1 to SN that represent electrode specific requested stimulation events. For example, channel specific sampling sequences (CSSS) may be used as described in U.S. Pat. No. 6,594,525, which is incorporated herein by reference in its entirety. For example, the envelope extraction may be performed using 12 rectifiers and 12 digital Butterworth low pass filters of 2nd order, IIR-type.
A Pulse Generator 205 includes a Pulse Mapping Module 203 that applies a non-linear mapping function (typically logarithmic) to the amplitude of each band-pass envelope. This mapping function—for example, using instantaneous nonlinear compression of the envelope signal (map law)—typically is adapted to the needs of the individual cochlear implant user during fitting of the implant in order to achieve natural loudness growth. This may be in the specific form of functions that are applied to each requested stimulation event signal S1 to SN that reflect patient-specific perceptual characteristics to produce a set of electrode stimulation signals A1 to AM that provide an optimal electric representation of the acoustic signal. A logarithmic function with a form-factor C typically may be applied as a loudness mapping function, which typically is identical across all the band pass analysis channels. In different systems, different specific loudness mapping functions other than a logarithmic function may be used, with just one identical function is applied to all channels or one individual function for each channel to produce the electrode stimulation signals A1 to AM outputs from the Pulse Mapping Module 203.
The Pulse Generator 205 also includes a Pulse Shaper 204 that develops the set of electrode stimulation signals A1 to AM into a set of output electrode pulses E1 to EM for the electrode contacts in the implanted electrode array which stimulate the adjacent nerve tissue. The electrode stimulation signals A1 to AM may be symmetrical biphasic current pulses with amplitudes that are directly obtained from the compressed envelope signals.
In the specific case of a CIS system, the stimulation pulses are applied in a strictly non-overlapping sequence. Thus, as a typical CIS-feature, only one electrode channel is active at a time and the overall stimulation rate is comparatively high. For example, assuming an overall stimulation rate of 18 kpps and a 12 channel filter bank, the stimulation rate per channel is 1.5 kpps. Such a stimulation rate per channel usually is sufficient for adequate temporal representation of the envelope signal. The maximum overall stimulation rate is limited by the minimum phase duration per pulse. The phase duration cannot be arbitrarily short because, the shorter the pulses, the higher the current amplitudes have to be to elicit action potentials in neurons, and current amplitudes are limited for various practical reasons. For an overall stimulation rate of 18 kpps, the phase duration is 27 μs, which is near the lower limit.
In the CIS strategy, the signal processor only uses the band pass signal envelopes for further processing, i.e., they contain the entire stimulation information. For each electrode channel, the signal envelope is represented as a sequence of biphasic pulses at a constant repetition rate. A characteristic feature of CIS is that the stimulation rate is equal for all electrode channels and there is no relation to the center frequencies of the individual channels. It is intended that the pulse repetition rate is not a temporal cue for the patient (i.e., it should be sufficiently high so that the patient does not perceive tones with a frequency equal to the pulse repetition rate). The pulse repetition rate is usually chosen at greater than twice the bandwidth of the envelope signals (based on the Nyquist theorem).
Another cochlear implant stimulation strategy that does transmit fine time structure information is the Fine Structure Processing (FSP) strategy by Med-El. Zero crossings of the band pass filtered time signals are tracked, and at each negative to positive zero crossing, a Channel Specific Sampling Sequence (CSSS) is started. Typically CSSS sequences are only applied on the first one or two most apical electrode channels, covering the frequency range up to 200 or 330 Hz. The FSP arrangement is described further in Hochmair I, Nopp P, Jolly C, Schmidt M, Schöβer H, Garnham C, Anderson I, MED-EL Cochlear Implants: State of the Art and a Glimpse into the Future, Trends in Amplification, vol. 10, 201-219, 2006, which is incorporated herein by reference.
Many cochlear implant coding strategies use what is referred to as an N-of-M approach where only some number n electrode channels with the greatest amplitude are stimulated in a given sampling time frame. If, for a given time frame, the amplitude of a specific electrode channel remains higher than the amplitudes of other channels, then that channel will be selected for the whole time frame. Subsequently, the number of electrode channels that are available for coding information is reduced by one, which results in a clustering of stimulation pulses. Thus, fewer electrode channels are available for coding important temporal and spectral properties of the sound signal such as speech onset.
One method to reduce the spectral clustering of stimulation per time frame is the MP3000™ coding strategy by Cochlear Ltd, which uses a spectral masking model on the electrode channels. Another method that inherently enhances coding of speech onsets is the ClearVoice™ coding strategy used by Advanced Bionics Corp, which selects electrode channels having a high signal to noise ratio. U.S. Patent Publication 2005/0203589 (which is incorporated herein by reference in its entirety) describes how to organize electrode channels into two or more groups per time frame. The decision which electrode channels to select is based on the amplitude of the signal envelopes.
In addition to the specific processing and coding approaches discussed above, different specific pulse stimulation modes are possible to deliver the stimulation pulses with specific electrodes—i.e. mono-polar, bi-polar, tri-polar, multi-polar, and phased-array stimulation. And there also are different stimulation pulse shapes—i.e. biphasic, symmetric triphasic, asymmetric triphasic pulses, or asymmetric pulse shapes. These various pulse stimulation modes and pulse shapes each provide different benefits; for example, higher tonotopic selectivity, smaller electrical thresholds, higher electric dynamic range, less unwanted side-effects such as facial nerve stimulation, etc. But some stimulation arrangements are quite power consuming, especially when neighboring electrodes are used as current sinks. Up to 10 dB more charge might be required than with simple mono-polar stimulation concepts (if the power-consuming pulse shapes or stimulation modes are used continuously).
It is well-known in the field that electric stimulation at different locations within the cochlea produce different frequency percepts. The underlying mechanism in normal acoustic hearing is referred to as the tonotopic principle. In cochlear implant users, the tonotopic organization of the cochlea has been extensively investigated; for example, see Vermeire et al., Neural tonotopy in cochlear implants: An evaluation in unilateral cochlear implant patients with unilateral deafness and tinnitus, Hear Res, 245(1-2), 2008 Sep. 12 p. 98-106; and Schatzer et al., Electric-acoustic pitch comparisons in single-sided-deaf cochlear implant users: Frequency-place functions and rate pitch, Hear Res, 309, 2014 March, p. 26-35 (both of which are incorporated herein by reference in their entireties).
In a normal hearing ear, one frequency component consecutively stimulates multiple neural populations. This phenomenon was described as the “travelling wave” as shown in
One quality of the travelling wave that is partly reflected in modern cochlear implant systems is that each frequency component reaches a peak amplitude at a specific spot within the cochlea (the tonotopic principle discussed above). These spectro-temporal properties can also be observed in the activity of cat's cochlear nerve fibres shown in
Loeb G., Are cochlear implant patients suffering from perceptual dissonance? Ear Hear, 26, 2005, p. 435-450 (incorporated herein by reference in its entirety) describes that phase-locking occurs over a substantial length of the cochlea. Furthermore, the action potentials exhibit a coherent spatial gradient with the steepest and most rapidly changing gradient of the phase occurring next to the place of the resonant frequency. At this point, the travelling wave starts to significantly slow down and dissipates. The phase gradient is believed to substantially contribute to pitch perception, especially in loud situations where harmonics are not resolved.
Existing coding approaches take into account some of the temporal properties of the acoustic signal. CIS determines frequency-specific envelopes which inherently contain a certain amount of information about individual low frequency components such as the fundamental frequency. More advanced approaches for calculating band specific envelopes also have been described; for example, U.S. Patent Publication 2006/0235486 (which is incorporated herein by reference in its entirety). The latter and CIS both sample the band pass envelopes with fixed rate stimulation pulses to resemble rudimentary properties of the basilar membrane movement. Other advanced systems as described in U.S. Patent Publication 2011/0230934 (which is incorporated herein by reference in its entirety) explicitly extract temporal characteristics of a band pass signal by identifying phase characteristics such as zero crossings. The described system triggers channel-specific sequences of stimulation pulses at each detected zero crossing. Each of the foregoing arrangements attributes certain frequency components to certain stimulation places. U.S. Patent Publication 2011/0230934 also explicitly takes into account the timing of certain frequency components.
Vocoder-based cochlear implant stimulation arrangements such as CIS and N-of-M do not take into account the travelling wave properties of normal acoustic hearing. The acoustic signal is analysed by filter banks or FFT and assigned either to single intracochlear electrodes, or to simultaneous stimulation of multiple adjacent electrodes. While filter banks can mimic the latencies of single frequency components at the place of stimulation, they are not able to mimic other aspects of the travelling wave behaviour such as the spectro-temporal distribution of this component to neighbouring stimulation sites, starting at a more basal site with low amplitude and ending at a more apical stimulation site with a maximum of stimulation at a site in between. An FFT, also used for mimicking the tonotopic principle in a cochlear implant is no better able to replicate the general latency differences between the frequency components (at the place of stimulation) nor does it provide the spectro-temporal behaviour described above.
Embodiments of the present invention are directed to a signal processing arrangement and corresponding method that generates electrode stimulation signals to electrode contacts in an implanted cochlear implant array. An input sound signal is analyzed to determine characteristic frequency components. For each frequency component, one or more stimulation events are requested based on the timing and amplitude of the frequency component. For each requested stimulation event, a frequency-specific stimulation sequence (FSSS) is generated for stimulation of adjacent electrode contacts. The FSSS starts with a stimulation pulse to the highest-frequency, most-basal electrode contact of the adjacent electrode contacts, ends with a stimulation pulse to the lowest-frequency, most-apical electrode contact of the adjacent electrode contacts, and reaches a maximum stimulation amplitude at a frequency-specific location within the cochlea corresponding to a natural traveling wave maximum. The electrode stimulation signals are then generated from the FSSS for delivery by the electrode contacts to adjacent auditory neural tissue.
In further specific embodiments, each stimulation pulse within the FSSS activates either a single electrode contact, or a plurality of adjacent electrode contacts simultaneously and in-phase. Simultaneous stimulation pulses may be amplitude corrected based on Channel Interaction Compensation (CIC). The FSSS may be shorter in time for higher frequency components and longer in time for lower frequency components. The FSSS may be at least partially simultaneous on two or more electrode contacts. For each electrode contact, the FSSS may be a Channel Specific Sampling Sequence (CSSS). The timing of each frequency component may reflect a phase characteristic and/or frequency-specific latency characteristic of the frequency component.
Embodiments of the present invention add to a cochlear implant system an emulation of a normal auditory physiological process which is important for frequency perception in normal hearing individuals, the travelling wave response of the cochlea. The added spectro-temporal features reflect the rise and sharp fall of excitation along the cochlea from the travelling wave, as well as its slowing down. Embodiments of the invention use arrangements that detect a number of relevant (i.e. spectrally spread or psychophysically unmasked) frequency components and translates them into stimulation sequences which can be super-positioned. This approach can be configured for the specific number of individual information channels of a given patient by skipping frequency components while transmitting distinguishable components in a highly natural way.
The details of such an arrangement are set forth in the following discussion.
As in the arrangement discussed above with respect to
For each requested stimulation event, a frequency-specific stimulation sequence (FSSS) output S1 to SN is generated, step 703, for at least partially simultaneous stimulation of adjacent electrode contacts.
For the low frequency component shown in
The pulse generator 205 is configured to convert the requested stimulation events S1 to SN to produce a corresponding sequence of unweighted stimulation signals A1 to AM that provide an optimal electric representation of the acoustic signal, and then apply a linear mapping function (typically logarithmic) and pulse shaping to produce weighted output pulse sequences electrode stimulation signals E1 to EM for delivery by the electrode contacts to adjacent auditory neural tissue, step 704. Simultaneous stimulation pulses may be amplitude corrected based on Channel Interaction Compensation (CIC). The weighted output pulse sequences electrode stimulation signals E1 to EM also are adapted to the needs of the individual implant user based on a post-surgical fitting process that determines patient-specific perceptual characteristics.
The length of the FSSS can vary based on the number of electrode channels and the number of the CSSS per channel. The lengths of the electrode channel CSSS per FSSS may be constant, however, varying CSSS lengths per FSSS also may be possible, such as longer CSSS at more apical channels or longer/shorter CSSS at the maximum level of the FSSS, etc. Some embodiments also may apply a Channel Interaction Compensation (CIC) algorithm (e.g., U.S. Pat. No. 7,917,224; incorporated herein by reference in its entirety) to the amplitudes of simultaneous FSSS to provide a desired loudness level to the user. The onset of the CSSS within a FSSS is controlled by the phase of the travelling wave. Subthreshold stimulation on individual electrode channels may be applied within a single FSSS in order to support and maintain spontaneous action potentials at the stimulation locations.
Frequency specific characteristics of the FSSS such as amplitude shape, spread over electrode positions, and duration (of entire FSSS and channel specific CSSS per FSSS) can be stored as templates in system memory that is accessible to the signal processing module 202.
Temporal overlap of an FSSS can be handled by applying simultaneous stimulation of all necessary electrodes, i.e. superposition. Spectral overlap of two simultaneously requested interleaving FSSS can also be omitted.
The FSSS can be optimized in duration, number of stimulations, and amplitude shape to produce a the most tone-like percept in response to an acoustic presentation of a pure tone. The amplitude shape and timing of the FSSS can reproduce the envelope of the traveling wave by representing portions of the traveling wave at consecutive positions along the cochlea (
Stimulation positions which are intermediate to physical electrode positions can be produced by weighted simultaneous stimulation of one or more adjacent electrodes. Alternatively, a FSSS can also be compiled from a series of focused stimulation modes, e.g. tri- or multipolar stimulation such as phased array stimulation as shown in Bonham et al., Current focusing and steering: modeling, physiology, and psychophysics, Hear Res, 242(1-2), August 2008, pp. 141-153; incorporated herein by reference in its entirety. The focus, amplitude and timing of the stimulation will follow the tempo-spectral shape of the traveling wave envelope.
Embodiments of the invention may be implemented in part in any conventional computer programming language. For example, preferred embodiments may be implemented in a procedural programming language (e.g., “C”) or an object oriented programming language (e.g., “C++”, Python). Alternative embodiments of the invention may be implemented as pre-programmed hardware elements, other related components, or as a combination of hardware and software components.
Embodiments can be implemented in part as a computer program product for use with a computer system. Such implementation may include a series of computer instructions fixed either on a tangible medium, such as a computer readable medium (e.g., a diskette, CD-ROM, ROM, or fixed disk) or transmittable to a computer system, via a modem or other interface device, such as a communications adapter connected to a network over a medium. The medium may be either a tangible medium (e.g., optical or analog communications lines) or a medium implemented with wireless techniques (e.g., microwave, infrared or other transmission techniques). The series of computer instructions embodies all or part of the functionality previously described herein with respect to the system. Those skilled in the art should appreciate that such computer instructions can be written in a number of programming languages for use with many computer architectures or operating systems. Furthermore, such instructions may be stored in any memory device, such as semiconductor, magnetic, optical or other memory devices, and may be transmitted using any communications technology, such as optical, infrared, microwave, or other transmission technologies. It is expected that such a computer program product may be distributed as a removable medium with accompanying printed or electronic documentation (e.g., shrink wrapped software), preloaded with a computer system (e.g., on system ROM or fixed disk), or distributed from a server or electronic bulletin board over the network (e.g., the Internet or World Wide Web). Of course, some embodiments of the invention may be implemented as a combination of both software (e.g., a computer program product) and hardware. Still other embodiments of the invention are implemented as entirely hardware, or entirely software (e.g., a computer program product).
Although various exemplary embodiments of the invention have been disclosed, it should be apparent to those skilled in the art that various changes and modifications can be made which will achieve some of the advantages of the invention without departing from the true scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
102015104614 | Mar 2015 | DE | national |
This application claims priority from German Patent Application DE 102015104614, filed Mar. 26, 2015, from U.S. Provisional Patent Application 62/212,642, filed Sep. 1, 2015, and from U.S. Provisional Patent Application 62/212,643, filed Sep. 1, 2015, all of which are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
62212642 | Sep 2015 | US | |
62212643 | Sep 2015 | US |