None.
The present invention generally relates to audio output technology, and more specifically to improved audio output devices configured for use in artificial voice applications.
People have grappled with disabilities throughout recorded history. According to the World Health Organization, “[d]isabilities is an umbrella term, covering impairments, activity limitations, and participation restrictions.” There is often interaction between these elements, with physical impairments or handicaps often being the root of many limitations or restrictions on a person's ability to participate in certain activities, or their ability to complete certain tasks. Much technological advancement has occurred with the aim of directly benefiting persons with disabilities by lessening the impact such impairments may have on their ability to carry out common tasks. For example, common handicaps like vision impairment have long been correctable with eyeglasses, and more recently, with contact lenses and laser vision correction surgery.
Other impairments can be more physically debilitating, such as complete paralysis or akinesia caused by a neurological disorder. These and other acute impairments can drastically limit a person's ability to participate in wide range of activities. In some cases, impairments can even affect a person's ability to communicate effectively with others.
While communication disorders cover a myriad of specific diagnoses, certain of them can limit a person's communicative abilities by severely impairing or completely disabling their ability to speak. Such impairments may result from dysarthria, apraxia of speech, certain voice disorders, akinetic mutism, or other such conditions. Limitations on speech can have a significant impact on a person's life, especially where the root of the impairment can create problems beyond speech limitations, such as dysarthria. Dysarthrias caused by degenerative neurological diseases can significantly impair a person's fine motor control to such a degree that many common tasks, such as eating and speaking, cannot be accomplished without assistance. When faced with such significant impairments, communication can become simultaneously more arduous to accomplish and more critical for quality of life purposes.
In many cases, a person's inability to communicate is due to a physical impairment. All of their cognitive faculties are available, but it is difficult for that person to express all of their ideas, needs and feelings to others. The current state of the art provides for a range of communication devices that a disabled person can interface with that facilitate communication. In cases of complete or nearly complete paralysis, for example, the only parts of the body that can be moved are the eyes. Currently, eye movement and focusing sensors allow persons with this type of impairment to use their eyes to operate computer software that is configurable to help them communicate and operate other devices.
In other cases, voice disorders caused by physical problems of the throat and neck area may affect a person's ability to speak, but do not otherwise affect motor control. The inability to speak brought about by these types of disorders can be supplemented with portable artificial voice generating software and hardware that translates typed messages into sound, or that play preset messages at the direction of the user.
Regardless of the type or level of impairment that a person may experience, artificial voice generation is a vital tool for those persons coping with speech impairments. To implement such a system, current art teaches the use of software and applicable components integrated with commercially available, portable computers. For example, a system for reading eye movement to generate speech might be implemented using eye movement sensors connected with a tablet computer, which is in turn attached to a wheelchair mount. A software application designed to receive input from the eye movement sensors is then installed and executed on the tablet computer. The user then navigates a series of menus to direct the software to generate artificial speech, which is projected via the tablet computer's integrated sound system. Similarly, software that receives input via a touch screen or keyboard—instead of eye movement sensors—can be used to direct speech generation for users that do not have significantly impaired motor control.
While the current state of the art decreases the limitations and restrictions on activity for many persons with speech disabilities, many problems still exist. One limitation on the effectiveness of current artificial voice generating systems is sound quality and volume. Tablet computers, mobile hand held systems, and other small computing devices are used as the primary platform on which artificial voice generating software and its applicable related sensing components are implemented. By using commercially available computing platforms, the cost of artificial voice generating systems are greatly reduced, making them affordable and accessible to a far greater number of disabled persons than there would be otherwise. Additionally, because artificial voice generating systems are used on a near-constant basis, user needs require that they be compact and portable. This requirement largely constrains the designs of these systems to utilize the functionality contained within the computing platforms—i.e., the integrated sound systems.
Thus, current artificial voice generating systems utilize the integrated sound functionality built into commercially available computing hardware. These devices, while robust and compact, are designed largely with personal use as the primary use environment. The normal output level of these integrated sound systems are designed to meet the needs of an operator using the device in personal settings, such as in an office, at home, or while traveling. That is, when such an operator utilizes the onboard speakers, they are often in a relatively quiet setting and are in close proximity to the speakers themselves. When an audio generating device is used in a setting with more ambient noise, headphones are often used to provide low output sound directly to the listener's ear. These systems are not designed to communicate speech to others in everyday settings.
For audio output used in the context of an artificial voice generating system, however, the design requirements are more demanding than those taken into consideration in the design of tablet or laptop computers. Current systems suffer from a lack of high-level sound output. Therefore, it can be quite difficult for listeners to whom the generated speech is directed, to hear and respond to the communication. This is true even when the device is used in a setting with moderate levels of ambient noise. In a crowded room, or at a meeting, for instance, it can be difficult for current systems to produce enough sound volume to be heard over other voices or over ambient noise. This inability to effectively convey the artificial speech to others greatly reduces the effectiveness of the system. Therefore, there is a need in the current art for an artificial voice generating system that has the ability to produce high-decibel sound output without sacrificing mobility and compactness.
A related problem associated with the output level is the tonal quality of the sound produced. Simply increasing the power output to existing integrated sound systems will distort the voice generated by the system, making it difficult for others to decipher. Also, many integrated systems are not designed for higher levels of output, therefore making such an attempt to increase power in existing systems impractical as doing so may damage or destroy the components of the system. Therefore, the problematic sound output levels inherent in the current art require a confluence of solutions that increase sound output levels while maintaining compactness, portability, and commercial value, while simultaneously maintaining or improving the tonal qualities of the output.
Another problem with the state of the art involves the physical location of the sound output device. As current systems utilize the sound system integrated with the computing platform, the sound naturally emanates from the device itself. These devices are most often secured to a wheel chair mount in front of or to the side of the seat, or are carried by the system's user. Those interacting with the user of an artificial voice generating system, however, initially expect the user's voice to emanate from the general area of the user's head. Upon first contact, the user of an artificial voice generating system often must repeat their initial speech because the person to whom they are speaking is caught off guard. Therefore, giving users the option to have the artificially generated voice emanate from a physical location that is more akin to natural speech is an unmet need in the field.
The ability of an audio output module to be separable from a speech-generating device creates other useful consequences. The present invention seeks to increase not only output volume and tonal quality in the audio modules, but also the level of portability provided by the system. This can inject a level of discreteness in communication utilizing an artificial voice generation system. For example, in some settings, the artificial voice generating system user may not wish to communicate with everyone in the room, wishing to remain discrete. The user may be out to dinner with their spouse, in a more intimate setting. The user might be a student with special needs that must communicate with the teacher or an aide about circumstances that may be embarrassing to communicate to the entire class.
In other settings, portability could also increase safety and assist a user's loved ones. Parents of a disabled child may wish to complete tasks around the house, but have a difficult time leaving the communication range of their child's artificial voice generation system, fearing that they will need assistance. Portability in an audio output module would be extremely useful in these circumstances as the parent could carry the audio output module with them, enabling communication between the child and the parent.
Thus there are presently unmet and growing needs for improvements to existing artificial voice generating systems that will enable more effective communication by and with persons who are speech impaired.
The invention is embodied in an improved detachable audio output module used for projecting artificial voice sounds generated from a speech generating system. In a preferred embodiment, the module is composed of a machined housing having at least two chambers, a sound chamber and an interface chamber, and a cover plate. A power chamber and miscellaneous chambers for extraneous features may be included separately as well, or can be incorporated into the interface chamber. When the cover plate is attached to the housing and sealed, the sound chamber is cordoned off from outside air. Electrical connections between the sound, interface, and power components are made by wires passed through voids in the chamber walls. The voids are sized slightly larger than the wire, around which an insulating hollow tube of material is placed, and the insulated portion of the wire is fitted into the void, sealing the chambers apart. The sound chamber volume can thus be designed to properly damp the speaker system, while simultaneously driving it with high voltage sound signals. The audio output module can thus be designed as a low profile module, or to be worn as a pendant if the interface module is wireless.
The wearable audio output module may comprise a sound output housing with an interface chamber containing an interface module housed in the interface chamber, an acoustically isolated sound chamber containing a sound output speaker in communication with the interface module and with a sound generation port, said interface module within the sound output housing being in wireless communication with a speech generating system, an externally accessible control interface with controls for activating and controlling the sound output device, a pendant attachment point allowing the sound output device to be secured about the body of a user by a pendant attachment, with said pendant attachment securing the sound generation port in a position within about 30 degrees from the center point of the mount of the body of the user from the perspective of a listener, wherein the sound output device generates artificial voice of an improved quality upon activation of the sound output module, whereupon artificial voice output is generated by the speech generating system communication to the audio output module and delivering sound generating signals to the sound output speaker, while said sound output device is secured about the body of the user through the pendant attachment and the audio output is perceived to emanate from the mouth of the body of the user. The audio output module can also include a quick release cradle attachable to the body of a user or to a speech generating system, allowing the sound output device to be removably attached to the quick release cradle.
The invention is further embodied in a sound output device enclosure for in an artificial voice generating system, comprising a sound output housing further comprising an interface chamber having an external antenna connection for allowing an external connection from an independent transmitter to an electronic interface module housed in the interface chamber, a sound chamber, wherein the interface chamber and the sound chamber share a separating wall, an internal port through the separating wall through which a connecting wire may pass, a cover plate secured to the housing such that the cover plate forms an airtight seal and separating the interface chamber and the sound chamber, an external sound output housing surface that mates with a quick release cradle, and the interior of the housing and the cover plate being coated with a first layer conductive copper flake electromagnetic interference and a second layer nonconductive conformal clear coating. The sound output device enclosure further comprises an audio output module fitting within the interface chamber of the sound output housing in communication with an speech generating system, a sound output speaker fitting within the sound chamber of the sound output housing in communication with the audio output module and with a sound generation port, an externally accessible control interface with controls for activating and controlling the sound output device, and a pendant attachment point allowing the sound output device to be secured about the body of a user, wherein the sound output device generates artificial voice of an improved quality upon activation of the sound output module, whereupon artificial voice output is generated by the speech generating system communicating to the audio output module and delivering sound generating signals to the sound output speaker, while said sound output device is either secured about the body of the user through the pendant attachment point or by mating the sound output housing to the sound module cradle.
For a fuller understanding of the nature and advantages of the present invention, reference should be had to the following detailed description taken in connection with the accompanying drawings, in which:
Disclosed herein is an improved audio output module for use in an artificial voice generation system.
A common implementation of an artificial voice generating system is one using a portable tablet computer device to run speech generation software.
Integrated sound systems do not provide a volume or tonal quality sufficient for situations in which an artificial voice needs to be projected over other voices or significant ambient noise levels. To correct this problem, the audio output module 220 is used to produce the artificial voice sound signal in lieu of the speech generating system's 110 integrated sound system, which uses onboard speakers to project sound through the speaker aperture 116. Connecting the audio output module 220 to the speech generating system 110 allows the latter to bypass its integrated sound system, sending the artificial voice sound signals instead to the audio output module 220.
In a preferred embodiment of the present invention, the connection between the speech generating system 110 and the audio output module 220 is accomplished, for instance, via a standard USB connection 222, or through a connection on the reverse side of the audio output module 220 housing (not shown). Many methods of connection will be envisioned by those skilled in the art. The connection used will depend on particular design requirements and the capabilities of the speech generating system used. For example, if the speech generating system had a wireless transmitter that was capable of transmitting R/F, IR, or Bluetooth signals, the audio output module could be connected to the speech generating system wirelessly (see
Turning to
Returning to
The audio signal processing circuit 680 of
The wire 674 is electrically connected to the interface board 670. The interface board 670 is a circuit board with input and output components as needed for a particular application. In this particularly depicted embodiment, the interface board 670 includes a mini-USB connection 240, a ⅛″ analog jack 241, and a first 242 and second 243 USB connection, as depicted in
In some cases, the user of the system may have significant physical impairment, such that it is desirable to include in the speech generating system 110 functionality for remotely controlling other household devices. For example, a user may have remotely controllable blinds, lights, televisions, and ceiling fans, to name a few. In order to provide for remote control functionality, the ability to send and receive wireless signals in the audio output module 220 may be desirable. Thus, wireless system components can be included in the audio output module 220 to facilitate system compactness and reduce the number of necessary additional components.
In whatever manner users of the present invention ultimately configure the audio output module 220, the housing 321, and the cover plate 422, it is important that the sound chamber 623 is sufficiently sealed off from the air outside the chamber when the audio output module 220 is assembled. This is true regardless of the number of chambers used in the module beyond the sound chamber 623. By increasing the driving voltage to the speakers 616, the amplifier 680 increases the magnitudes of the speaker voice coil travel distance with respect to the front plate. In turn, this also increases the derivative or rate of change of the voltage signal, which results in an increased reactive restoring force required by the structure of the speaker. The increased stress on the speaker structure thus decreases a speaker's ability to reproduce accurate and high quality tones as it is mechanically difficult to cease voice coil movement immediately after the termination of an audio signal from the amplifier 680. That is, increasing the voltage magnitude of the audio signal to produce louder tones will result in a speaker that continues to vibrate after a source sound stops or changes. This directly affects the sound system's quality by coloring the original sound signal.
To counteract the problem with sound quality that is created by increasing the system's 220 output volume, damping forces must be introduced. The problems with current output modules stem from the requirement that they be connected to a speech generating system, such as one depicted in
The present invention introduces damping forces into the audio system by segregating the audio components from the interface components and any other desired components. This is accomplished by creating at least two separate chambers as shown in FIG. 6—the sound chamber 623 and the interface chamber 624. The separating wall 625 spatially separates the two chambers from one another when the cover plate 422 is attached. The only pathway between the two chambers is restricted to the wire pass-throughs indicated at 656′, 674′, and 676′. These wire pass-throughs 656′, 674′, and 676′ are grooves cut flush with the top of the separating wall 625. Wire insulators, such as the one depicted as 676, are placed around the wires utilizing the pass-through. For instance, the pass-through at 676′ is sized to be slightly larger than the wire 675 passing through it, and is then fitted with wire insulator 676, which fits snuggly into the pass-through 676′, sealing the chambers from one another.
Sealing off the sound chamber 623 allows for a housing design that can be optimized for the speakers' 616 inherent qualities. Those skilled in the art will appreciate that, given the characteristics of a particular speaker chosen as a component for the audio output module 220, one can easily determine a sound chamber 623 volume that will result in optimal tone quality. The sound chamber 623 may thus be designed to maintain a low overall profile to decrease module bulkiness, while simultaneously providing for the appropriate damping force and eliminating resonant frequencies from the system. The decrease in the profile of the system also allows the module to be utilized with computing devices that are ever-decreasing in size—sleek and lightweight devices such as mobile phones, or Apple iPad and other similar devices.
Referring to
Electronic systems such as the audio output module 220 can be susceptible to many types of wireless signals and magnetic fields. To protect the electronic components from failure and to increase the effectiveness of the wireless IR components, the preferred embodiment of the present invention is treated prior to assembly. The inside surface of the cover plate 422 and the inside surfaces of the housing 321 are first coated with a copper flake electromagnetic interference (EMI) shield spray-on coating to protect the electrical components of the audio output module 220 from electromagnetic conduction and radiation. The same surfaces are then coated with a nonconductive conformal clear coating to insulate the electrical components and prevent unwanted electrical connections.
Another embodiment of the present invention includes a separable audio output module, or sound output device. While high volume output is desired on many occasions, artificial voice generation system users also have a need for portability and high quality low volume sound for certain settings. In many situations, the physical location of the sound source may become as or more important than sound. In a preferred embodiment of the present invention, interface board 670 is configured with a Bluetooth (or other similar protocol) transceiver capable of communicating with a speech generating system via standard Bluetooth protocols for wireless data transmission.
The interface board is housed in the interface chamber 624, along with a rechargeable power source. The power source can alternatively be housed in a separate, third power chamber. In the preferred embodiment, all of the components of the audio output system are housed together in a single chamber. It is preferred, however, that the module remain configured with two chambers, as the power source requires a connection external to the module in order to be recharged. Because the interface board does not need a physical external connection when configured for wireless communication, it may be housed in either chamber.
Another benefit of detachability is an increase in the effectiveness of artificial voice generation systems. Detachability allows the user to wear the audio output module in close proximity to their mouths, creating a more natural sounding artificial voice. Locating the sound output in this manner would make the voice appear to emanate from the area from which voices are expected to emanate. A detachable, wearable embodiment of the invention can be attached to a lanyard or decorative chain, or may be worn on a shirt pocket or neck by way of a belt clip. The wearer is able to configure the length of the lanyard or chain so that the audio output module can be worn close to the head and neck area without infringing the wearer's comfort. Configurations of the invention in this manner would thus decrease or eliminate the initial confusion that some people experience when interacting with a user of an artificial voice generation system for the first time.
For example,
Extensive testing was conducted concerning the expectation of listeners with respect to the source of sound, particularly human speech. It is common instinctive habit of humans, upon recognizing the utterance of a human voice in the form of speech (as opposed to a musical composition produced by a radio, for instance) to turn to face the source of the spoken utterance. For those individuals who are utilizing a voice synthesizer in order to communicate, the listener will address the source of the spoken word, rather than the face of the speaker using a voice synthesizer. As it is well-known in the art that facial cues and other body language emanate from an individual who is speaking in order to communicate, when the listener focuses their attention on the source of the sound, and the source of the spoken word is an electronic voice synthesizing speaker not associated with the speaker, the facial cues and non-verbal communication is lost. Thus, it is an object of the present disclosure to enhance the level of effective communication between a user of a voice synthesizing system and the listener. In addition, it is unsettling for the user, to have attention drawn to either a disconnected inanimate object, or to their hand, mobility device or other location distant from the user's face. Through the testing conducted during the development of the present system, it was determined that placing the sound generating component as near as practical to the mouth of the user was desirable. Prior to this experimentation, it was unrecognized in the field that such allocation was of particular value.
Prior voice generating systems also were relatively bulky, and typically needed to rest on a surface, such as a tray table on a wheelchair, or be slung about the shoulder as a satchel. Other systems, such as voice boxes using the close association with the human trachea to generate a voice-like sound, have not been configured to allow a close connection between a high quality voice-generation, and the user's speech capabilities.
Turning to
As shown in
Basic geometry assists in understanding the preferred angles desired for the 20 zone of recognition. A common approach distance for human interactions is approximately 150 cm. Such a distance, with a sound emanation at 30 cm from the mouth would be approximately an 11 degree angle for 0 for an effective zone of recognition
A wearable sound output device can be hung from the neck, attached to clothing, or a lapel and is effective for use in an artificial voice generating system when a pendant attachment point allowing the sound output device to be secured about the body of the user in a position within about 30 degrees from the center point of the mouth of the body of the user from the perspective of a listener. It is more preferred that the position of angle 0 is within about 10 degrees. In such a situation, the sound output device generates artificial voice through the speech generating system, communicating to the audio output module and delivering sound generating signals, to the sound output speaker, while said sound output device is secured about the body of the user through the pendant attachment, the audio output is then perceived to emanate from the mouth of the body of the user.
Turning to
In one embodiment utilizing wireless audio data transfer, the audio output module is made of a top half 1221 and a bottom half 1222 that are secured together to create a sealed seam. The interface chamber 1245 houses a rechargeable battery 1235 and a circuit board 1236. A battery charging port 1201 is shown in this embodiment as a mini-USB port, however other types of power-supply ports may be used without departing from the scope of the invention. The battery charging port 1201 is used to connect a power supply to the rechargeable battery 1235 to provide for device operation independent of a wired power source for extended periods of time. Thus, the device can be worn around the neck, or placed generally away from the speech-generating device as necessary and convenient.
An externally accessible control interface, or button assembly 1207 may be constructed in a manner that contributes to the overall effectiveness of the device in that the sealing of the chambers is benefitted. For example, button assembly 1207 may be provided as a single piece sealed membrane overlay, contributing to effectiveness of the device's seal by providing less openings. The ribbon connector (not shown) passes through void 1215 and is used for communicative connection between the button assembly 1207 and the circuit board 1236, connecting as 1216. Using a single assembly, such as at 1207 provides a better seal which simultaneously allowing for a slimmer, thinner profile (i.e., depth) in the device, which is desired as bulky thick devices would be uncomfortable to wear.
The interface board 1235 may also be provided with an integrated wireless module 1225, such as a Bluegiga® integrated Bluetooth module. Providing wireless connectivity allows the audio output module to be used separately from the speech-generating device, and to be worn, for example. It also eliminates more ports to the outside of the module, which increases the ability to effectively seal the module.
The embodiment depicted in
It would be useful for wireless audio output modules to have the ability to detach from the speech generating system quickly. This would enable safe storage through attachment to the speech generating system, but would also improve usability by permitting the uses discussed above, and do so without much delay. A magnetic attachment between the audio output module and the speech generating system is therefore the preferred method of attachment. Other quick attachment methods of similar operation may be used without departing from the scope of the invention, magnetic attachment being preferred due to the lack of securing fasteners, straps, and other impediments to use by those with decreased motor skills.
Turning to
As shown in
Turning to
Also in the preferred embodiment utilizing wireless audio data transfer, the audio output module is made of a top half 2521 and a bottom half 2522 that are secured together to create a sealed seam. The halves 2521 and 2522 can be secured together using, for instance, multiple screws such as depicted at 2562 inserted through threaded hole 2564 in the bottom half 2522 and into the top half 2521. The interface chamber 2545 houses a rechargeable battery 2535 and a circuit board 2536. A battery charging port 2501 is shown in this embodiment as a mini-USB port accessible through void 2501′, however other types of power-supply ports may be used without departing from the scope of the invention. The battery charging port 2501 is used to connect a power supply to the rechargeable battery 2535 to provide for device operation independent of a wired power source for extended periods of time. Thus, the device can be worn around the neck, or placed generally away from the speech-generating device as necessary and convenient.
An externally accessible control interface, or button assembly 2507 may be constructed in a manner that contributes to the overall effectiveness of the device in that the sealing of the chambers is benefitted. For example, button assembly 2507 may be provided as a single piece sealed membrane overlay, contributing to effectiveness of the device's seal by providing less openings. The ribbon connector (not shown) passes through void 2515 and is used for communicative connection between the button assembly 2507 and the circuit board 2536, connecting as 2516. Using a single assembly, such as at 2507 provides a better seal which simultaneously allowing for a slimmer, thinner profile (i.e., depth) in the device, which is desired as bulky thick devices would be uncomfortable to wear.
The interface board 2535 may also be provided with an integrated wireless module 2525, such as a Bluegiga® integrated Bluetooth module. Providing wireless connectivity allows the audio output module to be used separately from the speech-generating device, and to be worn, for example. It also eliminates more ports to the outside of the module, which increases the ability to effectively seal the module.
The embodiment depicted in
The embodiments of the audio output module and cradle described in connection with
Detachable audio output devices also allow for selectable multi-directional audio output from a single speech generating device. Several detachable modules may be utilized in conjunction with a unique circuit and software design to direct audio output to discrete locations, and to selectively choose particular recipients of an intended communication from a group of many. For example, a user may wish to communicate personal care needs to an aide during a public event, such as when the user is attending a class. Non-verbal students sitting near the back of the classroom may also have one module on his person for communicating with those around him or her, and have a third audio output module located at the front of the class near the teacher. The ability to direct speech among particular output devices not located on the user's person provides for several improvements over current systems, so that the non-verbal user may provide output to his or her aide without providing output to the entire classroom, they may carry on private conversations without disturbing the teacher, or can answer questions at a normal volume, through the on-person module and the teacher module. Such abilities are highly desirable in many situations.
While the invention has been described with reference to preferred embodiments, those skilled in the art will understand that various changes may be made and equivalents may be substituted for elements thereof without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Since certain changes may be made in the above compositions and methods without departing from the scope of the invention herein involved, it is intended that all matter contained in the above descriptions and examples or shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense. In this application all units are in the metric system and all amounts and percentages are by weight, unless otherwise expressly indicated. Also, all citations referred herein are expressly incorporated herein by reference. All terms not specifically defined herein are considered to be defined according to Webster's New Twentieth Century Dictionary Unabridged, Second Edition. The disclosures of all of the citations provided are being expressly incorporated herein by reference. The disclosed invention advances the state of the art and its many advantages include those described and claimed.
This application is a continuation-in-part of an application filed subject to 35 USC §371, application Serial No. PCT/US2011/001959, filed Dec. 5, 2011.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2011/001959 | Dec 2011 | US |
Child | 14120588 | US |