Telecommunications can require a user to clearly interpret sounds generated by his or her communications device. For a hearing impaired user, sound interpretation can range from a minor annoyance to a near impossibility, depending on the user's level of impairment. Additionally, speakers whose voices lie outside of a standard frequency range, e.g. adults or children with a high-pitched voice or who speak with a particularly wide frequency range, can be more difficult to interpret. In such cases, both human and automated receivers are prone to difficulty in understanding the audio information.
Accordingly, selective remapping of sound frequencies to a new range, based either on an individual's hearing needs, or compression to a generalized standard vocal range (i.e. for auto attendants, speech recognition software, and the like), can make sound interpretation more accurate.
The system 100 may enhance an audio experience for a hearing impaired user (e.g. a human, a machine, etc.) using existing and standard telecommunications infrastructure and devices. This is accomplished by adjusting a raw audio 150 signal into a remapped audio 160 signal within a hearing range more readily understood by a user. The audio signal before processing is the raw audio 150 signal, and the audio signal after processing is the remapped audio 160 signal. For example, the system 100 may remap a raw audio 150 signal to shift frequencies out of a user's impaired hearing range (examples of hearing impairments include hearing loss, deafness, tinnitus, ringing, etc.). As another example, the system 100 may remap the speech of a user who has a very high voice into a more acceptable frequency range for an auto-attendant system.
In addition, the system 100 may also benefit a non-impaired user operating within an impaired environment. Preset modes may be used to remap raw audio 150 as appropriate to situations where a normal user would have a hard time hearing. For example, during a voice call from within a boisterous crowd at a sporting event, one might personally find lowering the frequency 20% improves perceived clarity. As another example, remapping to a 30% higher frequency range might make an audio signal more intelligible when received in a rumbling machine shop.
As illustrated in
A destination device 130 is a communications device 110 on a communications network 120 to which a communications device 110 may selectively connect. Once a communications device 110 is connected to another device (e.g. destination device 130) through the communications network 120, the communications device 110 may then be used to send and receive communications signals (e.g. audio, video) with the destination device 130. For example, a raw audio 150 signal is a type of communication signal, composed of an audio signal encoded for transmission across the communications network 120. The raw audio 150 signal may be encoded and transmitted as either an analog or a digital signal, as is well known.
A remapping server 140 may be used to transform raw audio 150 signals into remapped audio 160 signals. In many examples, the remapping server 140 is a computing device, including a processor, and storage. In general, a processor (e.g., a microprocessor) receives instructions, e.g., from a memory, a computer-readable medium, etc., and executes these instructions, thereby performing one or more processes, including one or more of the processes described herein. Such instructions may be stored and transmitted using a variety of known computer-readable media.
In some examples, a remapping server 140 may be implemented as computer-readable instructions (e.g., software) on one or more computing devices (e.g., servers, personal computers, etc.).
A computer-readable medium (also referred to as a processor-readable medium) includes any tangible medium that participates in providing data (e.g., instructions) that may be read by a computer (e.g., by a processor of a computer). Such a medium may take many forms, including, but not limited to, non-volatile media, volatile media, and transmission media. Non-volatile media may include, for example, optical or magnetic disks and other persistent memory. Volatile media may include, for example, dynamic random access memory (DRAM), which typically constitutes a main memory. Such instructions may be transmitted by one or more transmission media, including coaxial cables, copper wire and fiber optics, including the wires that comprise a system bus coupled to a processor of a computer. Transmission media may include or convey acoustic waves, light waves, and electromagnetic emissions, such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH-EEPROM, any other memory chip or cartridge, or any other medium from which a computer can read.
In any event, the remapping server 140 may process raw audio 150 signals from communications network 120 into remapped audio 160 signals that may be received by a destination device 130. The remapping server 140 may also process raw audio 150 signals from the destination device 130 into remapped audio 160 signals for use by communications device 110 (a reverse flow not shown in
In various exemplary implementations, the remapping server 140 uses a plot profile 145 to process the audio signal. A plot profile 145 may include at least one identified range of impaired audio frequencies within an audio signal (e.g. due to hearing loss, deafness, tinnitus, ringing, etc.). A plot profile 145 may also include at least one preset frequency offset (e.g. deepen voice 10%, lower than 3500 Hz, increase volume at trained frequencies). The plot profile 145 may thus be used by a remapping server 140 to indicate which audio frequencies within a raw audio 150 signal to map to other frequencies. For each area of impaired frequency response, the sounds within the impaired area may be moved to an area of less impairment (e.g. by being remapped and compressed, by being shifted in frequency without compression, etc.). Remapping of audio signals is discussed in more detail below with regard to
The plot profile 145 may be a predefined standard/industry profile (e.g. senior citizen, noisy shop floor environment), or it may be a custom profile created for or by a particular user (e.g., a profile including a user's specific hearing range and impairments). Additionally, the system 100 may allow a user may create a custom plot profile 145, discussed in more detail below with regard to
A profile server 170 selectively provides plot profiles 145 to a remapping server 140 for use in remapping a raw audio 150 signal. Profile server 170 generally includes a processor and a memory, as well as a computer readable medium such as a disk or the like for storing data, e.g., plot profiles 145, to be provided to remapping server 140. A profile database 180 may be included within profile server 170, or may be part of a separate computing system. In any event, profile server 170 is generally configured to selectively retrieve information from profile database 180 in response to requests for plot profiles 145. Additionally, profile server 170 is configured to store a plot profile 145 to be retrieved later by a user for use in remapping a raw audio 150 signal in conformance with the user's stored plot profile 145.
An attendant front end 190 may provides a user interface for a user of a communications device 110 to select a plot profile 145 from profile server 170 for use by remapping server 140 in the processing of raw audio 150 signal into remapped audio 160 signal. For example, an automatic attendant front end 190 may answer a call, prompt for a numeric code indicating a desired plot profile 145 to be used for the call, inform a profile server 170 to selectively retrieve the plot profile 145, and indicate to a remapping server 140 of the user's plot profile 145 selection. The indicated plot profile 145 may remain in use for the next call only, or may stay associated with a communications line or a user until another plot profile 145 is selected.
An intelligent communications device 210 (e.g. cellular phone, “softphone,” wired handset, etc.) is a communication device configured to perform audio signal remapping within the intelligent communications device 210 itself. An intelligent communications device 210 may operate on a communications network 120 and perform audio signal remapping without regard to whether the communications network 120 includes facilities for remapping raw audio 150 signals.
Intelligent communications device 210 includes a remapping processor 220 to perform the remapping function. The remapping processor 220 processes a raw audio 150 signal into a remapped audio 160 signal, similar to remapping server 140 discussed above with regard to
The remapping processor 220 may be used to process raw audio 150 signals received from a communications network 120 or to process raw audio 150 signals received from a user of intelligent communications device 210. The intelligent communications device 210 may further include at least one plot profile 145 for use by the remapping processor 220, and may optionally include a profile database 180 for the selective storage and retrieval of plot profiles 145.
For example, in a situation where a user has a hearing impairment, audio from network 230 can be an input source to be routed as raw audio 150 into the remapping processor 220. In this case, a plot profile 145 including a user's specific hearing range and impairments may be used by the remapping processor 220 to process raw audio 150 into remapped audio 160. Then, the remapped audio 160 may be routed to an audio reproducer 250, typically included within the intelligent communications device 210, so that the remapped audio 160 may be heard by the user.
In a further example, a microphone 240 may be included in the intelligent communications device 210 and used as a source of a raw audio 150 signal. In a case where a user has a voice of very high or low frequency, a plot profile 145 may be used to process the raw audio 150 into a remapped audio 160 signal of a more acceptable frequency range, e.g. to improve voice recognition for an auto-attendant system indicated as a destination device 130. Thus, remapped audio 160 may be output as audio to network 260 and sent on to communications network 120.
As mentioned above, a plot profile 145 may include at least one area of impaired frequency response. When utilizing a frequency remapping and compression function, for each area of impaired frequency response, the sounds within the impaired area may be compressed in frequency and shifted in frequency to outside of the area of impairment. Additionally, frequencies adjacent to the impaired frequency range may be compressed and shifted in order to allow for the sounds within the impaired range to be moved out of the impaired range without overlap of any unimpaired frequency range.
As illustrated in
As further illustrated in
Note that these regions are only exemplary and other examples with different regions of interest are possible.
An exemplary remapping system (e.g. including remapping processor 220, remapping server 140, etc.) may determine a minimum frequency (Fmin), a maximum frequency (Fmax), and a center frequency (Fcenter) of an impaired frequency range, based on the selected plot profile 145, where:
In other examples, Fmin, Fcenter, and Fmax may be calculated differently. For example, the calculation of Fcenter may be omitted, and all of the frequencies within region F may be shifted downward, or all shifted upward. Alternately, Fcenter may be calculated, not based on a center of the frequency range, but instead based on the content of a raw audio 150 signal itself (e.g. center of distribution of sound energy, logical break in the distribution of sound energy, etc.), based on a preset value, etc.
As illustrated in
The region outside of the ranges of [Fmin−½F to Fmin], [Fmin to Fmax], and [Fmax to Fmax+½F] are represented in
Additionally, regions of [Fmin−½F to Fmin−¼F] and [Fmax+¼F to Fmax+½F] are calculated. These regions are labeled as region B in
Similarly, regions [Fmin−¼F to Fmin] and [Fmax to Fmax+¼F] are calculated, labeled as region C in
No changes are made to the signal in region A of the raw audio 150 signal in the remapped audio 160 signal. Thus, sounds within region A are unaffected by the frequency compression or shifting operations. However, changes are made to the signal within regions B, C, and F.
In the raw audio 150 signal, regions B and C include the audible signal adjacent to the inaudible range F. In the remapped audio 160 signal, the signal as contained in the raw audio in both regions B and C may be compressed (in this example compressed in a ratio of 2:1) into a narrower frequency range (in this example a range of ½ size), and pitch shifted to occupy only range B of the remapped audio 160 signal.
Additionally, inaudible region F may be compressed (in this example compressed in a ratio of 2:1) into a narrower frequency range (in this example a range of ½ size), and pitch shifted to occupy region C. The lower half of region F may be shifted downward to occupy the entire lower region C, and the upper half of region F may be shifted upward to occupy the entire upper region C.
In the remapped audio 160 signal, region F is empty. In effect, this approach spreads the inaudible signal within region F into the user's audible range. Additionally, this approach may be repeated for each area of impaired frequency range within a plot profile 145.
In other examples, only a portion of the audio signal within region F may be shifted to outside of region F. However, shifting the frequency of at least a portion of the impaired audio frequencies to outside of the identified range is required in order to, for example, make an audio signal more intelligible, or to shift a voice into a more acceptable frequency range.
In further examples, instead of or in addition to moving at least a portion of the impaired audio frequencies to outside of the identified range, at least a portion of the impaired audio frequencies may be copied from region F to outside of the impaired frequency range. In these examples, the audio from the impaired audio frequency frequencies may remain in region F and also appear again outside of region F.
When utilizing a frequency remapping function without compression, for each area of impaired frequency response, the sounds within the impaired area may be shifted in frequency to outside of the area of impairment, without being compressed in frequency. Additionally, instead of compressing and shifting frequencies adjacent to the impaired frequency range, frequencies inside the impaired frequency range may be mapped on top of frequencies adjacent to the impaired frequency range.
As illustrated in
As further illustrated in
It is important to note that other remappings are possible, in addition to the exemplary frequency remapping as illustrated by
As illustrated in
In step 510, a request to create a plot profile 145 may be received by a device on a communications network 120, (e.g. attendant front end 190, profile server 170, etc.). Alternately, an intelligent communications device 210 may receive a request to create a plot profile 145 without regard to a communications network 120, for example through use of a user interface of intelligent communications device 210.
Next, in step 520, a ramping tone may be generated. For example, the handset may generate a ramping tone that covers the entire audio spectrum within its limits (i.e. from ˜50 hz to 8 Khz for a standard PCM telephone range, or wider for a more responsive devices such as an MP3 player, etc., with a more extended range up to 20 KHz, the human hearing limit, etc.).
Next, in step 530, the user may be prompted to input upon reduced sensation (i.e. the user cannot hear the tone or hears the tone with decreased response). For example, a function on an intelligent communications device 210 may prompt a user (e.g. by audio, by visual cues on the screen, audio and visual cues combined, etc.) to input when the user experiences reduced sensation by pressing a button on the device. The user may also release the button when again able to hear the signal. In other examples, the user may press a button when hearing the tone and release when experiencing reduced sensation, respond by speaking, press 1 for an audible tone and press 2 for an inaudible tone, and so on.
In still other examples, the user may be presented with an individual tone, and then prompted for a response with regard to the test tone's audibility. This process of presentation of tones and prompting for responses may thus be repeated for various tones or portions of the ramping tone throughout the system or device range.
Next, in step 540, the user input may be translated into a plot profile 145. The user-frequency markings, as collected in responses to the tones in step 530, thus may be translated into a plot profile 145 including the user's hearing impairments.
Next, in step 550, the plot profile 145 may be stored, possibly with a tag providing information on the specific environment at issue such as a factory shop floor. The plot profile 145 may be stored on an intelligent communications device 210 (e.g. in device memory, in a profile database 180 local to the device, etc.), and/or on a communications network (e.g. on a profile server 170, in a profile database 180, etc.). Then, the process 500 ends.
In step 610, speaker training of a user is initiated. For example, speaker training may be initiated automatically, (e.g. upon first use of a device), or by a user request (e.g. through a user interface of an intelligent communications device 210, through a user request to an attendant front end 190 or profile server 170, etc.).
Next, in step 620, the user may speak into a sound capture component of a device (e.g. microphone 240 of an intelligent communications device 210, etc.). The device may be a communications device 110 such as a POTS telephone, VOIP telephone, cellular/mobile telephone, “softphone,” etc., or another device. The device may be an intelligent communications device 210. In this step, the user may speak into the device (e.g., for a period of time, until completing a speech exercise, etc.).
Next, in step 630, the captured audio spoken by the user may be sampled. In this step, the device may sample the spoken audio. In other examples, another device on the communications network 120 (e.g. attendant front end 190, profile server 170, etc.) may perform the sampling of captured spoken audio.
Next, in step 640 the frequency response of the user's voice may be determined. In this step, the device may determine the complete frequency response of the user's voice. In other examples, another device on the communications network 120 (e.g. attendant front end 190, profile server 170, etc.) may perform the comparison or calculations.
Next, in step 650, the frequency markings calculated in step 640 may be converted into a plot profile 145 representing the user's input data plot profile. For example, the device may compare a frequency plot of the user's voice to a predefined standard/industry vocal plot, and may calculate an appropriate delta to remap the spoken input into these standard plots. This delta may be included in a plot profile 145, and the plot profile 145 may be used to remap the user's outbound audio (e.g., raw audio 150), i.e. to shift the audio into conformity with the standard/industry vocal plot.
Next, in step 660, the plot profile created in step 650 may be stored, possibly with a tag providing information on the specific environment at issue such as a factory shop floor. The plot profile 145 may be stored on an intelligent communications device 210 (e.g. in device memory, in a profile database 180 local to the device, etc.), and/or may be stored on a communications network (e.g. on profile server 170, in profile database 180, etc.). Then, the process 600 ends.
In step 710, an initiate signal may be received. For example, a user may signal through a communications device 110 to indicate the initiation of a request to connect to a destination device 130.
Next, in step 720, a server code may be received. For example, a user may dial a specific code (e.g. “*3324”) to connect to a remapping server 140 or an attendant front end 190.
Next, in step 730, a plot profile 145 code may be received. For example, a user may then dial a plot profile code (e.g. “2”) to activate a specific plot profile 145 (stored, e.g., on a profile server 170, in a profile database 180, etc.). In the case of a communications network 120 such as system 200 (i.e., including an intelligent communications device 210), a user may select a plot profile 145 stored on the intelligent communications device 210 or on another device connected to communications network 120 (e.g. profile server 170, profile database 180, etc.).
Next, in step 740, a call request may be reoriginated through a remapping server 140. For example, a dial tone may be reoriginated through a remapping server 140 on a communications network 120.
Next, in step 750, a call request may be received. For example, a user may dial a specific code indicating a destination device 130 (e.g. “555-1234”).
Next, in step 760, a call is completed through the remapping server 140. In this way, a remapping server 140 may map raw audio 150 into remapped audio 160 on a communications network 120 based on a selected plot profile 145. The selected plot profile 145 may remain in effect for the duration of the call, or may be persistent and remain in effect by default for subsequent calls. Then, process 700 ends.
In step 810, a plot profile 145 is loaded. In some examples, a plot profile 145 is automatically associated with a device or system. In other examples, a plot profile 145 may be selected as discussed above with regard to
Next, in step 820, preprocessing of the audio signal may be performed. As mentioned above, a communications network 120 may utilize analog audio signals or digital audio signals. In the case of a communications network 120 utilizing analog signals, a raw audio 150 signal may be translated into a digital audio signal for processing (e.g. via PCM, ADPCM, etc.). Additionally, audio signals may be further processed for more effective remapping (e.g. normalization, dynamic range compression, filtering, frequency cutoffs, etc.).
Next, in step 830, a first remapping range in the active plot profile 145 may be retrieved. As discussed above, a plot profile 145 may contain at least one remapping range.
Next, in step 840, the raw audio 150 signal may be remapped based on the remapping range. The remapping for the remapping range may include frequency remapping and compression as discussed above with regard to
Next, in step 850, it may be determined if the plot profile 145 includes any more remapping ranges. If yes, step 860 is executed next. Otherwise, step 870 is executed.
In step 860, a next remapping range may be retrieved from the plot profile 145, and therefore step 840 is executed next to remap the audio for the next remapping range.
In step 870, post processing is performed on the remapped audio 160 signal. In the case of a communications network 120 utilizing analog signals, the remapped audio 160 signal may be translated back into an analog audio signal for further transmission through the communications network (e.g. POTS, etc.). Additionally, the audio signal may be further processed to remove any artifacts of the remapping process, (e.g. normalization, dynamic range compression, filtering, frequency cutoffs, etc.).
Next, in step 880, the remapped audio 160 signal may be continued to be routed through the communications network 120, as is known. Then, the process 800 ends.
With regard to the processes, systems, methods, heuristics, etc. described herein, it should be understood that, although the steps of such processes, etc. have been described as occurring according to a certain ordered sequence, such processes could be practiced with the described steps performed in an order other than the order described herein. It further should be understood that certain steps could be performed simultaneously, that other steps could be added, or that certain steps described herein could be omitted. In other words, the descriptions of processes herein are provided for the purpose of illustrating certain embodiments, and should in no way be construed so as to limit the claimed invention.
Accordingly, it is to be understood that the above description is intended to be illustrative and not restrictive. Many embodiments and applications other than the examples provided would be apparent to those of skill in the art upon reading the above description. The scope of the invention should be determined, not with reference to the above description, but should instead be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled. It is anticipated and intended that future developments will occur in the arts discussed herein, and that the disclosed systems and methods will be incorporated into such future embodiments. In sum, it should be understood that the invention is capable of modification and variation and is limited only by the following claims.
All terms used in the claims are intended to be given their broadest reasonable constructions and their ordinary meanings as understood by those skilled in the art unless an explicit indication to the contrary in made herein. In particular, use of the singular articles such as “a,” “the,” “said,” etc. should be read to recite one or more of the indicated elements unless a claim recites an explicit limitation to the contrary.
Number | Name | Date | Kind |
---|---|---|---|
5418818 | Marchetto et al. | May 1995 | A |
5659594 | Toda | Aug 1997 | A |
6173062 | Dibachi et al. | Jan 2001 | B1 |
6192341 | Becker et al. | Feb 2001 | B1 |
6842735 | Covell et al. | Jan 2005 | B1 |
6944474 | Rader et al. | Sep 2005 | B2 |
7483831 | Rankovic | Jan 2009 | B2 |
8031892 | Andersen et al. | Oct 2011 | B2 |
20040264721 | Allegro et al. | Dec 2004 | A1 |
20070230729 | Naylor et al. | Oct 2007 | A1 |
20080254753 | Steenstra et al. | Oct 2008 | A1 |
20090226015 | Zeng et al. | Sep 2009 | A1 |
Number | Date | Country | |
---|---|---|---|
20100094619 A1 | Apr 2010 | US |