This invention relates generally to emergency notification systems, and more particularly to a system and method for providing speech-to-text translation, preview, and transmission of emergency notification messages.
Typical building fire alarm systems include a number of fire detectors positioned throughout a building. Signals from those detectors are monitored by a system controller, which, upon sensing an alarm condition, sounds one or more audible alarms throughout the building. Flashing light strobes may also be positioned throughout the building to provide a visual alarm indication
While in many situations such audible alarms and strobes operate sufficiently to notify an alarm condition, in other situations these alarms are insufficient. For example, the audible alarm can be difficult to hear in areas with high noise or poor acoustic properties. In addition, audible alarms cannot be used in areas requiring quiet, such as for example explosives areas. Still further, audible alarms are of limited usefulness in areas with populations that include the deaf and hard of hearing.
In addition to audible alarms, spoken live voice messages can be passed from an operator directly through a building's speaker system. Such spoken live messages, however, are susceptible to a variety of problems, including operator misspeaking, inadvertent provision of incorrect instructions, and unintelligibility of instructions.
Current approaches to improving intelligibility of audible instructions in the fire alarm industry have focused on improving quality of equipment, quality of speakers, layout of speakers, and loudness of the speakers. Little focus has been placed on improving the intelligibility of the message itself.
Although voice message preview functionality exists in systems such as voicemail, such functionality has not been implemented in fire alarm systems. In addition, such voice message preview capabilities have not been combined with speech-to-text systems in any application.
It would, therefore, be desirable to provide a system and method for enabling generation of emergency instruction text messages using a speech-to-text functionality. It would also be desirable to provide a system and method for and verifying the accuracy of such text messages before they are transmitted to text displays viewable by building occupants.
An emergency notification system is disclosed. The emergency system can include a system controller, a microphone coupled to the system controller, and a speech-to-text engine coupled to the microphone and the system controller. The speech-to-text engine can be configured to receive audio data from the microphone and to convert the received audio data to text data. In one embodiment, the system can also include a local display for displaying the converted text data, and a text editor for editing the displayed text data. A text notification device is also included for displaying the text data as a text message. The text notification device can be coupled to the system controller, and can be disposed at a location remote from the system controller.
A method of providing a notification is disclosed. The method includes: receiving audio data, converting the received audio data to text data, displaying the converted text data to a user as a text message, editing the text message, and transmitting the edited text message for display on a remote text notification device. The edited text message can be an emergency notification provided to at least one occupant of a building in which an alarm condition has been identified.
A notification system is disclosed, including a workstation having a processor, a memory, a microphone, a display and a keyboard. The workstation is configured to receive an audible message via the microphone and to convert the audible message to a text message. The workstation can display the text message on the display to enable a user to edit the displayed text message using the keyboard. The system also includes a text notification device coupled to the workstation, the text notification device configured to display the edited text message.
The disclosed system and method provide improved emergency communications capabilities which reduce the chance that incorrect or unintelligible communications will be given during an emergency. In one embodiment, a user can speak into a microphone to record an audible message. The audible message is automatically converted to a text message, and the converted text message is displayed to the user. The user can review and edit the text message prior to sending it to building occupants. The display can be a computer monitor or other display device.
By automatically converting a voice message to a text message and displaying the text message locally, the user can validate the correctness of the message prior to sending the message to building occupants.
In some embodiments, the user interface includes functionality that highlights any word that the speech-to-text engine is unable to interpret, or indicates a low probability of correctness of the interpretation that is made. This can facilitate identification of unintelligible portions of the audio message, and can also allow the user to quickly correct a word that was not in the speech-to-text engine's dictionary.
As will be appreciated, the disclosed system and method can be implemented as part of an existing workstation, or it can be an add-on feature implemented in new workstations.
By way of example, a specific embodiment of the disclosed device will now be described, with reference to the accompanying drawings, in which:
The alarm system includes a fire panel 20 coupled to a detector network 22. The detector network 22 can include an addressable channel that connects to a plurality of initiation devices such as smoke detectors (S), pull stations (P), toxic or flammable gas detectors, heat detectors, or any of a variety of other types of hazard detectors, which may be distributed throughout the rooms 14 and hallway 16. In one embodiment, standard notification devices such as audible alarms and light strobe alarms can be provided connected in the detector network 22. When an alarm condition is sensed, the system controller 27 commands an alarm signal to be sent from the fire panel 20 to the appropriate devices through the detector network 22.
One or more textual notification devices 36, and one or more audible notification devices 37, can be provided in the detector network 22 integral to the alarm system 1. An example of a textual notification system is shown in U.S. Patent Application Publication No. 2007/0046455, titled “Fire Alarm Textual Notification Related Application,” the entirety of which is incorporated by reference herein. As with the initiation devices, each room 14 and the hallway 16 can include at least one textual notification device 36 and/or one audible notification device 37. Alarm messages can activated from the system controller 27 (
Although the illustrated embodiment shows the notification devices 36, 37 as being hardwired to the detector network 22, it will be appreciated that the devices 36, 37 may instead be wireless devices. For example, the audible notification device 37 can be a hard wired or wireless speaker. In addition, although he illustrated embodiment shows the audible notification device 37 as being hardwired to the detector network 22, it could instead be hardwired (or wireless) on a circuit that is separate from the detector network 22. The textual notification device 36 can be a hard-wired display using light emitting diode (LED) lights (or LCD, CRT, projection display, or the like) to display a commanded message. Alternatively, the textual notification device 36 can be the monitor of a desktop, laptop, or handheld computer, or it can be a pager, cell phone or any of a variety of other handheld device, including pagers.
In one non-limiting exemplary embodiment, the textual and audible notification devices 36, 37 are supervised via an addressable loop that is also connected to initiating and/or other notification devices installed in the network. One or more of the notification devices 36, 37 can be supervised, controlled, and powered directly from the fire panel 20. Alternatively, the notification devices 36, 37 may simply be receivers of messages transmitted by the fire panel 20, for example, where the textual notification devices 36 are cell phones, pagers, PDA's or desktop, laptop or handheld computers.
Referring to
The system controller 27 can be connected to, and controlled by, an external workstation 40 having a processor 41, a display 42 and a microphone 44. Such an arrangement facilitates centralized control of a multiplicity of fire panels 20 disposed on one or more buildings. In one embodiment, the external workstation 40 is a Simplex® TrueSite workstation that communicates with the system controller 27 via a communications link 45. The communication link 45 can be a PSTN (Public Switched Telephone Network), a cellular network such as, for example, a GSM (Global System for Mobile Communications) network for SMS and packet voice communication, General Packet Radio Service (GPRS) network for packet data and voice communication, or a wired data network such as, for example, Ethernet/Internet for TCP/IP, VOIP communication, etc. It will be appreciated that while the system controller 27 is described as being part of the fire panel 20, it could instead be part of the workstation 40, or it could be located separate from the fire panel 20 and the workstation 40.
In one embodiment, the textual notification device is hardwired or wireless LED signboard.
Additional lines in a multi-line signboard 36 can scroll additional details about the alarm state. An example of such scrolled instructional message can include “Do not use the elevators! Walk to the nearest stairwell exit and walk down to street level and exit the building;” because this is a scrolling message, the additional line depicted in
The audible notification device 37 can be used to send an audible version of the message that is displayed on the textual notification device 36. Alternatively, the audible notification device 37 may be used to send messages that complement, or are otherwise different from, the text messages displayed on the textual notification device 36. The audible notification device 37 can be used to transmit audible messages input by a user using microphone 44 at the workstation 40.
Instructional messages, whether they are to be displayed on the textual notification device 36 or broadcast using the audible notification device, are input to the system 1 by a user. In one embodiment, audible messages are input to the system 1 by a user speaking into a microphone 44 associated with the workstation.
In one embodiment, the system 1 includes the capability for automatically translating audible messages that are input via the microphone 44 into text that is viewable on the display 42 prior to transmission to the fire panel 20.
Logically, audio data input via the microphone 44 is input to the speech-to-text engine 46, which converts the audio data into text data, which is stored in memory 50 as a file having an appropriate format, such as .wav, .wma, .mp3, or any other appropriate format for storing and processing digitized audio. The text data is also displayed as viewable text on the display 42. The user can preview the viewable text (i.e., the text message) to verify its accuracy prior to sending the message to building occupants via the textual notification device 36.
Because the audio data is stored as a data file (i.e., .wav, .wma, .mp3), additional processing can be performed on the data. For example, additional processing can be aimed at making the message more intelligible. In one embodiment, such additional processing can include compression to decrease extremely loud portions of a message that can result in overdriving the speakers 37 and make the message unintelligible. Alternatively, or in combination, such additional processing can include increasing the volume of messages that are spoken too softly, or equalization to increase base or other frequencies to better reproduce voice on the speakers 37.
In addition, the converted text can be automatically translated into one or more alternative languages such as French or Spanish, and sent to the textual notification devices 36 in those languages. Furthermore, the converted text (either before or after editing) can be converted back to audio data which can then be transmitted to one or more audible notification devices 37 for issuance.
To edit a converted message (e.g., message 54), the user may move a cursor to the appropriate location in the message 54 and edit the text (add, delete, modify) in a manner associated with normal word processing. In this manner, the user can identify and correct any mistakes in the text translation prior to sending the text message to occupants via the textual notification device 36.
In some embodiments, it may be possible for the user to simply enter the text message by typing it on the keyboard 48. The typed message may then be converted to audio in the manner previously described, and the audio and text messages can be sent to the appropriate textual and audible notification devices 36, 37. Typically, this may take longer than embodiments in which a spoken message is automatically converted to text, but it may be efficient for very short messages.
The speech-to-text engine 46 can be any of a variety of known speech-to-text systems, including those marketed under the Dragon® Speech Recognition Software name, as well as systems offered by Microsoft®. In one embodiment, a Microsoft® Speech SDK 5.1 software development kit can be used to implement the speech-to-text engine 46 in workstation 40.
The speech-to-text engine 46 may be configured to identify portions of translated text that it either does not recognize, or that it recognizes but determines has less than a predetermined percentage likelihood of accuracy. In one embodiment, the Application Programming Interface (API) for the speech-to-text engine 46 may return a probability of correctness for each word that it interprets. In general, the speech-to-text engine 46 performs the recognition/identification function by comparing properties of an input audio clip to properties of particular words contained in an internal dictionary. Multiple words in the dictionary may have similarities to the individual words of the input audio, but fewer words (ideally one word) will be most similar to each input word, and thus will have a higher “score” than other candidate words. The highest scoring word is then presented on the display as corresponding text. Where more than one word is identified as corresponding to a word in the input audio (i.e., where several words have a similar score), the speech-to-text engine 46 may select the highest ranking candidate word, but may identify that candidate word to a user as requiring confirmation. This identification may be shown as highlighting of the suspect text. In one example, the speech-to-text engine 46 may be set to highlight specific text that it determines to have a less than 80% chance of being accurate. This predetermined percentage can be user selectable. In one embodiment, the speech-to-text engine 46 identifies for highlighting words that are not found in its internal dictionary. The speech-to-text engine 46 may also identify for highlighting sections of text that appear to be grammatically incorrect.
Although the workstation 40 is illustrated in
Referring to
It will be appreciated that the user, upon previewing the translated message, or upon reviewing the audible version of the message, may decide to re-record the message for any reason. The disclosed system 1 enables the user to preview the audible message to confirm its content and clarity. The system 1 also enables the user to preview and edit the text message to confirm its content. After the audible and text messages have been confirmed by the user, they can be sent to building occupants via the textual and audible notification devices 36, 37.
As previously noted, these messages may be transmitted via hard wired or wireless connections, via a public or private communications network, to any of a variety of textual notification devices 36, a non-limiting list of which include sign boards, cell phones, desktop, laptop or handheld computers, and pagers. In addition, the audible and text messages may be played and displayed locally and/or may be played and displayed at the fire panel 20 or other location. For example, the audible and text messages may be played remotely at a local fire and/or police department facility.
Some embodiments of the disclosed device may be implemented, for example, using a storage medium, a computer-readable medium or an article of manufacture which may store an instruction or a set of instructions that, if executed by a machine, may cause the machine to perform a method and/or operations in accordance with embodiments of the disclosure. Such a machine may include, for example, any suitable processing platform, computing platform, computing device, processing device, computing system, processing system, computer, processor, or the like, and may be implemented using any suitable combination of hardware and/or software. The computer-readable medium or article may include, for example, any suitable type of memory unit, memory device, memory article, memory medium, storage device, storage article, storage medium and/or storage unit, for example, memory (including non-transitory memory), removable or non-removable media, erasable or non-erasable media, writeable or re-writeable media, digital or analog media, hard disk, floppy disk, Compact Disk Read Only Memory (CD-ROM), Compact Disk Recordable (CD-R), Compact Disk Rewriteable (CD-RW), optical disk, magnetic media, magneto-optical media, removable memory cards or disks, various types of Digital Versatile Disk (DVD), a tape, a cassette, or the like. The instructions may include any suitable type of code, such as source code, compiled code, interpreted code, executable code, static code, dynamic code, encrypted code, and the like, implemented using any suitable high-level, low-level, object-oriented, visual, compiled and/or interpreted programming language.
As used herein, an element or step recited in the singular and proceeded with the word “a” or “an” should be understood as not excluding plural elements or steps, unless such exclusion is explicitly recited. Furthermore, references to “one embodiment” of the present invention are not intended to be interpreted as excluding the existence of additional embodiments that also incorporate the recited features. The term computer is not limited to just those integrated circuits referred to in the art as computers, but broadly refers to, microprocessors, microcontrollers, microcomputers, programmable logic controllers, application specific integrated circuits, and other programmable circuits, and these terms are used interchangeably herein.
While certain embodiments of the disclosure have been described herein, it is not intended that the disclosure be limited thereto, as it is intended that the disclosure be as broad in scope as the art will allow and that the specification be read likewise. Therefore, the above description should not be construed as limiting, but merely as exemplifications of particular embodiments. Those skilled in the art will envision other modifications within the scope and spirit of the claims appended hereto.
Number | Date | Country | |
---|---|---|---|
20130041646 A1 | Feb 2013 | US |
Number | Date | Country | |
---|---|---|---|
60713569 | Sep 2005 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 11514612 | Sep 2006 | US |
Child | 13207140 | US |