The presently disclosed technology generally pertains to systems and methods for linguistic analysis, and more particularly to features for automatically assigning symbols to text in an instructional application.
Many software-based reading and/or writing instructional applications utilize symbols in addition to text to represent words or other portions of language. Sometimes, instructional software authoring tools can help a user generate printed materials that combine text and symbols to help create symbol-based communication and/or educational tools. One example of a symbol-based desktop publishing software used for the creation of printed materials corresponds to BOARDMAKER® software offered by DynaVox Mayer-Johnson of Pittsburgh, Pa.
Symbol-based instructional software authoring tools have become useful not only for the generation of printed educational and communication materials, but also for integration with electronic devices that facilitate user communication and instruction. For example, electronic devices such as speech generation devices (SGDs) or Alternative and Augmentative Communication (AAC) devices can include a variety of features to assist with a user's communication.
Such devices are becoming increasingly advantageous for use by people suffering from various debilitating physical conditions, whether resulting from disease or injuries that may prevent or inhibit an afflicted person from audibly communicating. For example, many individuals may experience speech and learning challenges as a result of pre-existing or developed conditions such as autism, ALS, cerebral palsy, stroke, brain injury and others. In addition, accidents or injuries suffered during armed combat, whether by domestic police officers or by soldiers engaged in battle zones in foreign theaters, are swelling the population of potential users. Persons lacking the ability to communicate audibly can compensate for this deficiency by the use of speech generation devices.
In general, a speech generation device may include an electronic interface with specialized software configured to permit the creation and manipulation of digital messages that can be translated into audio speech output. The messages and other communication generated, analyzed and/or relayed via an SGD or AAC device may include symbols or text alone or in some combination. In one example, messages may be composed by a user by selection of buttons, each button corresponding to a graphical user interface element composed of some combination of text and/or graphics to identify the text or language element for selection by a user.
In order to better facilitate the use of communication “buttons” and other graphical interface features in SGD or AAC devices, as well as in other symbol-assisted reading and/or writing instructional applications, the automated creation and adaptation of such elements can be further improved. In light of the various uses of symbol-based communication technologies, a need continues to exist for refinements and improvements to address such concerns. While various implementations of speech generation devices and associated features have been developed, no design has emerged that is known to generally encompass all of the desired characteristics hereafter presented in accordance with aspects of the subject technology.
In general, the present subject matter is directed to various exemplary speech generation devices (SGDs) or other electronic devices having improved configurations for providing selected AAC features and functions to a user.
More specifically, the present subject matter provides improved features and steps for associating and automatically discovering and/or assigning symbols to selected text. Such associations can be advantageous because symbols may be used to represent words, names, phrases, sentences and other messages to provide some individuals with a communication environment in which vocabulary choices can be made effectively and independently. Symbols provide an opportunity for people who are not literate or who are still developing literacy skills to have an effective representation of words and thoughts for speech or written communication.
In one exemplary embodiment, a method of automatically discovering and assigning symbols for identified text in a software application includes a first step of receiving electronic signals from an input device identifying text for which symbol assignment is desired. Text may be provided by a user as electronic input to a processing device or may be selected from pre-existing, downloaded, imported or other electronic data accessible by a processing device. Additional data associated with such text also may be received or identified. For example, part of speech information and/or other identifiers, contextual information such as the sentence in which such text/word is used, the contextual topic in which such text/word is discussed, keywords from the surrounding context or other information may be provided. Some or all of the electronic information identified for a specific portion of text is analyzed to map the text to one or more word senses from a word sense model database.
If additional information such as part of speech information is provided in addition to the text, more specific word senses may be narrowed down by the system. If the information needed for mapping cannot be determined automatically because the information such as part of speech, context or other related information is initially unavailable, it may be possible to prompt a user to enter such information. For example, once text is identified and a determination is made that there are multiple matching word senses in a database, a graphical user interface may be provided to a user requesting needed information (part of speech, context, etc.). Alternatively, a graphical user interface may depict the different word senses that are found and provide features by which a user can select the appropriate word sense for their intended use of the text.
Matched word senses then may be analyzed further to determine if a matched word sense has an associated symbol. If so, then the identified matching symbol can be automatically associated with the text. Alternatively, the identified matching symbol may be displayed graphically to a user for confirmation of the association with the analyzed text. If multiple symbols are matched, then such multiple symbols may be displayed graphically to prompt user selection of the desired symbol. The symbol then may be displayed with or without the text as visual output to a user. For example, once an identified symbol is associated, the text may from that point forward be represented in the system as an icon including the symbol, with or without the associated text.
If no matching word sense has an associated symbol, then a determination may be made regarding whether selected related word senses have any associated symbols. Selection of related word senses can be structured relative to a given word sense by type of relation (e.g., “kind of”, “instance of”, “part of”, etc.). Some of those relations (e.g., “kind of”, “part of”, etc.) can be further defined by a direction of relation (e.g., general or specific), number of degrees of relational separation, etc. If one or more selected related word senses are determined to have an associated symbol, then some or all of such symbols can be associated with the identified text and displayed as visual output to a user. In some embodiments, the symbols for related words may be automatically or manually modified (e.g., to reflect the type of relation between the identified word sense and the related word sense). If selected word sense relations are exhausted and no associated symbols are found, then additional steps can be taken. For example, an optional step may involve providing a symbol menu or other graphical user interface to a user so that the user can manually select a pre-existing or imported symbol for the text, create a symbol from scratch or from predefined symbol selection or creation features, or modify an existing or imported symbol.
The symbols discussed herein may correspond to a graphical image, or may correspond to different file formats such as an audio file, video file or the like. In some examples, a symbol may be configured manually (by electronic user input) or automatically by the subject symbol assignment system features to include some combination of graphic image, sound, motion, action/behavior and/or other effects and/or specialized user customization. For example, text with an automatically associated symbol may be configured as a graphical interface element having an associated action, thus functioning as a “button” in graphical user interfaces. In a speech generation device, a button having a symbol and/or text may be selected by a user via a touch screen or other input device. The action resulting from this selection then may correspond to speaking the text corresponding to such symbol and/or placement of the selected text/symbol into a message window for further message composition.
It should be appreciated that still further exemplary embodiments of the subject technology concern hardware and software features of an electronic device configured to perform various steps as outlined above. For example, one exemplary embodiment concerns a computer readable medium embodying computer readable and executable instructions configured to control a processing device to implement the various steps described above or other combinations of steps as described herein.
In a still further example, another embodiment of the disclosed technology concerns an electronic device, such as but not limited to a speech generation device, including such hardware components as a processing device, at least one input device and at least one output device. The at least one input device may be adapted to receive electronic input from a user regarding selection or identification of text for which symbol assignment is desired, as well as additional optional electronic input regarding part of speech, context, or other information related to the identified text. The processing device may include one or more memory elements, at least one of which stores computer executable instructions for execution by the processing device to act on the data stored in memory. The instructions adapt the processing device to function as a special purpose machine that electronically analyzes the text relative to a word sense relation database and maps word senses to the identified text. A determination is then made as to whether the mapped word senses have any symbols associated with them or with selected related word senses (e.g., related as more specific and/or more general forms of the mapped word senses). Once one or more symbols are found, they may be provided on a display in combination with the text and/or other visual features or action items for user confirmation. The mapped symbol-to-text assignment is then stored for later use within the electronic device.
Additional aspects and advantages of the disclosed technology will be set forth in part in the description that follows, and in part will be obvious from the description, or may be learned by practice of the technology. The various aspects and advantages of the present technology may be realized and attained by means of the instrumentalities and combinations particularly pointed out in the present application.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate one or more embodiments of the presently disclosed subject matter. These drawings, together with the description, serve to explain the principles of the disclosed technology but by no means are intended to be exhaustive of all of the possible manifestations of the present technology.
Reference now will be made in detail to the presently preferred embodiments of the disclosed technology, one or more examples of which are illustrated in the accompanying drawings. Each example is provided by way of explanation of the technology, which is not restricted to the specifics of the examples. In fact, it will be apparent to those skilled in the art that various modifications and variations can be made in the present subject matter without departing from the scope or spirit thereof. For instance, features illustrated or described as part of one embodiment, can be used on another embodiment to yield a still further embodiment. Thus, it is intended that the presently disclosed technology cover such modifications and variations as may be practiced by one of ordinary skill in the art after evaluating the present disclosure. The same numerals are assigned to the same or similar components throughout the drawings and description.
The technology discussed herein makes reference to processors, servers, memories, databases, software applications, and/or other computer-based systems, as well as actions taken and information sent to and from such systems. One of ordinary skill in the art will recognize that the inherent flexibility of computer-based systems allows for a great variety of possible configurations, combinations, and divisions of tasks and functionality between and among components. For instance, computer-implemented processes discussed herein may be implemented using a single server or processor or multiple such elements working in combination. Databases and other memory/media elements and applications may be implemented on a single system or distributed across multiple systems. Distributed components may operate sequentially or in parallel. All such variations as will be understood by those of ordinary skill in the art are intended to come within the spirit and scope of the present subject matter.
When data is obtained or accessed between a first and second computer system, processing device, or component thereof, the actual data may travel between the systems directly or indirectly. For example, if a first computer accesses a file or data from a second computer, the access may involve one or more intermediary computers, proxies, or the like. The actual file or data may move between the computers, or one computer may provide a pointer or metafile that the second computer uses to access the actual data from a computer other than the first computer.
The various computer systems discussed herein are not limited to any particular hardware architecture or configuration. Embodiments of the methods and systems set forth herein may be implemented by one or more general-purpose or customized computing devices adapted in any suitable manner to provide desired functionality. The device(s) may be adapted to provide additional functionality, either complementary or unrelated to the present subject matter. For instance, one or more computing devices may be adapted to provide desired functionality by accessing software instructions rendered in a computer-readable form. When software is used, any suitable programming, scripting, or other type of language or combinations of languages may be used to implement the teachings contained herein. However, software need not be used exclusively, or at all. For example, as will be understood by those of ordinary skill in the art without required additional detailed discussion, some embodiments of the methods and systems set forth and disclosed herein also may be implemented by hard-wired logic or other circuitry, including, but not limited to application-specific circuits. Of course, various combinations of computer-executed software and hard-wired logic or other circuitry may be suitable, as well.
It is to be understood by those of ordinary skill in the art that embodiments of the methods disclosed herein may be executed by one or more suitable computing devices that render the device(s) operative to implement such methods. As noted above, such devices may access one or more computer-readable media that embody computer-readable instructions which, when executed by at least one computer, cause the at least one computer to implement one or more embodiments of the methods of the present subject matter. Any suitable computer-readable medium or media may be used to implement or practice the presently-disclosed subject matter, including, but not limited to, diskettes, drives, and other magnetic-based storage media, optical storage media, including disks (including CD-ROMS, DVD-ROMS, and variants thereof), flash, RAM, ROM, and other solid-state memory devices, and the like.
Referring now to the drawings, FIG. 1 illustrates exemplary steps in a method of automatically discovering and assigning symbols for identified text in a software application.
A first exemplary step 100 in the method of FIG. 1 involves receiving electronic signals from an input device identifying text for which symbol assignment is desired. In an optional second step 102, additional data associated with such text, such as part of speech information, contextual information or other related information, also may be received or identified.
In step 104, some or all of the electronic information identified in steps 100 and/or 102 is analyzed to map the identified text to one or more identified word senses from a word sense model database. Word senses generally correspond to the meanings of a word, such as when multiple meanings exist for the same word or text. For example, if the subject system and method receives the text “bat” from a user, a word sense model database is searched to map the text to word sense information. A search of this particular text may result in identification and mapping of the text “bat” to one or more word senses. For example, the following word senses and some or all of the related information listed in Table 1 may exist for the text “bat” in a word sense database.
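By way of illustration only, the following minimal Python sketch shows one way such a mapping might be structured. The database layout, sense identifiers and glosses are hypothetical placeholders (only a subset of the word senses for “bat” is shown), and an actual implementation would query a persistent word sense model database rather than an in-memory dictionary.

    # Hypothetical, in-memory stand-in for a word sense model database.
    # Sense identifiers, glosses and layout are illustrative only.
    WORD_SENSE_DB = {
        "bat": [
            {"id": "bat-mammal", "pos": "noun",
             "gloss": "nocturnal mouselike mammal"},
            {"id": "bat-club", "pos": "noun",
             "gloss": "club used for hitting a ball"},
            {"id": "bat-strike", "pos": "verb",
             "gloss": "to strike with, or as if with, a bat"},
        ],
    }

    def map_text_to_senses(text, pos=None):
        """Map identified text to candidate word senses; optional part of
        speech information narrows the candidates when it is available."""
        senses = WORD_SENSE_DB.get(text.lower(), [])
        if pos is not None:
            senses = [sense for sense in senses if sense["pos"] == pos]
        return senses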
The analysis set forth in step 104 may also include word sense disambiguation if pure textual analysis results in identification of multiple word senses. In general, word sense disambiguation involves identifying one or more most likely choices for a word sense used in a given context, when the word/text itself has a number of distinct senses. For example, part of speech information and/or word sense relations may be used in disambiguation. Alternatively, word sense disambiguation may include analyzing conditional probabilities, for example the probability that a user is concerned with a particular sense given the text/word being analyzed. In other words, conditional probabilities in the form p_i = p(sense_i | word), i = 1, 2, . . . , n, for n different word senses are considered in order to choose the word sense having the greatest probability of applicability. In other examples, word sense disambiguation may involve more sophisticated probabilistic models, such as models derived from a sense-tagged corpus.
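As a concrete illustration of the simplest of these approaches, the sketch below selects the sense maximizing the conditional probability p_i = p(sense_i | word), estimated from hypothetical sense-frequency counts; a production system might instead use a model trained on a sense-tagged corpus.

    # Hypothetical sense-frequency counts, e.g., tallied from a
    # sense-tagged corpus. The numbers are illustrative only.
    SENSE_COUNTS = {
        "bat": {"bat-mammal": 60, "bat-club": 25, "bat-strike": 15},
    }

    def most_probable_sense(word):
        """Choose the sense maximizing p_i = p(sense_i | word),
        estimated as count(sense_i, word) / count(word)."""
        counts = SENSE_COUNTS.get(word, {})
        total = sum(counts.values())
        if total == 0:
            return None
        # Dividing by the shared total leaves the argmax unchanged,
        # but the conditional probability is computed for clarity.
        return max(counts, key=lambda sense: counts[sense] / total)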
Consider the following example of word sense disambiguation using part of speech information. If additional information such as part of speech identification is provided in addition to the text “bat”, the automated analysis in step 104 may more specifically map the text “bat” to only noun or verb forms of the word. For example, if the part of speech is provided as a noun, then the six word senses in Table 1 can be narrowed down to three: word senses (1), (2) and (3) in the list above. If the sentence from which the text “bat” is selected or extracted is available, additional words may be analyzed to determine context. For example, if the sentence contains other keywords such as “Halloween”, then the text “bat” might be mapped to word sense (1) in the list above.
If the information needed for mapping cannot be determined automatically because the information such as part of speech, context or other related information is initially unavailable, it may be possible to prompt a user to enter such information. For example, once text is identified and a determination is made that there are multiple matching word senses in a database, a graphical user interface may be provided to a user requesting needed information (part of speech, context, etc.). Alternatively, a graphical user interface may depict the different word senses that are found and provide features by which a user can select the appropriate word sense for their intended use of the text.
In a still further alternative, a more specific determination of an appropriate word sense is made after step 106. For example, any identified word senses mapped in step 104, and any symbols associated with such identified word senses may be determined in step 106. After this point, the various symbol options for all possible identified word senses could be displayed to a user via a graphical user interface for user selection of a desired or appropriate symbol for the text identified in step 100.
Referring still to FIG. 1, a determination is made in step 106 as to whether any of the mapped word senses has an associated symbol. If so, then in step 108 the identified matching symbol can be automatically associated with the text, or displayed graphically to a user for confirmation of the association. If multiple symbols are matched, then such multiple symbols may be displayed graphically to prompt user selection of the desired symbol. The associated symbol then may be displayed with or without the text as visual output to the user.
If no matching word sense has an associated symbol, then a step 110 may involve an automated determination of whether selected related word senses have an associated symbol. In one embodiment, the determination made in step 110 may involve a first step of selecting one or more word senses that are related to the identified word senses and a second step of determining whether any of such selected related word senses has an associated symbol. The initial selection of word senses related to the identified word senses can be configured in a variety of fashions because relationships among word senses can be defined in a plurality of different ways. For example, word sense relations can be defined in accordance with such non-limiting examples as listed in Table 2 below.
It should be appreciated not only that word senses may be defined in terms of different relations, but also that some relations can be characterized even more specifically. For example, “kind of” and “part of” relations can further involve a direction of relation, such as more generally related or more specifically related. For example, word sense (1) from Table 1 defining “bat” as a mouselike mammal may be more generally related through a “kind of” relation to the word sense “mammal” or more specifically related through a “kind of” relation to the word sense “vampire bat.” These more general and specific relations applicable to some of the relations among words in a word sense model database also can be defined over multiple levels. For example, the “kind of” relation between “bat” and “mammal” may involve one level of separation. However, the “kind of” relation between “bat” and “vertebrate” may involve two levels of separation, namely one level from “bat” to “mammal” and a second level from “mammal” to “vertebrate.” As such, all word sense relations can be considered in terms of type (e.g., kind of, part of, instance of, etc.), while some of those types can be further characterized by direction (e.g., general or specific) and degree of separation (e.g., number of levels separating the related word senses).
Because there are so many ways in which the relations can be defined, the determination in step 110 may be preconfigured or customized based on one or more of the various types of relations, non-limiting examples of which have been presented in Table 2. For example, step 110 may consider all related word senses or only selected relations. One embodiment may involve determining if particular selected types of related word senses have associated symbols (e.g., only “kind of”, “part of”, “related to”, “similar to”, etc.). The determination in step 110 may involve even further distinctions, such as whether any more general or more specific “kind of” or “part of” word senses related to the identified text have associated symbols. Step 110 also may involve determining if any word senses related to the identified text by “part of”, “kind of” or similar relations within a predetermined number of degrees of relational separation (e.g., two or three levels) have associated symbols. One such configurable search is sketched below.
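By way of illustration, one such search can be realized as a breadth-first traversal of the word sense relation graph that follows only preconfigured relation types out to a maximum number of levels of separation. The graph structure, sense identifiers and file names below are illustrative assumptions, not the actual database layout.

    from collections import deque

    # Hypothetical relation graph and symbol store:
    # sense id -> list of (relation type, related sense id).
    RELATIONS = {
        "bat-mammal": [("kind of", "mammal"), ("related to", "halloween")],
        "mammal": [("kind of", "vertebrate")],
        "halloween": [("instance of", "holiday")],
    }
    SYMBOLS = {"mammal": "mammal.png", "holiday": "holiday.png"}

    def find_related_symbol(sense_id,
                            allowed=frozenset({"kind of", "part of"}),
                            max_levels=2):
        """Breadth-first search of related word senses, following only the
        allowed relation types out to max_levels degrees of separation,
        for the first related sense that has an associated symbol."""
        queue = deque([(sense_id, 0)])
        visited = {sense_id}
        while queue:
            current, level = queue.popleft()
            if current != sense_id and current in SYMBOLS:
                return current, SYMBOLS[current]
            if level < max_levels:
                for relation, neighbor in RELATIONS.get(current, []):
                    if relation in allowed and neighbor not in visited:
                        visited.add(neighbor)
                        queue.append((neighbor, level + 1))
        return None

    # Example: find_related_symbol("bat-mammal") follows the "kind of"
    # relation to "mammal" and returns ("mammal", "mammal.png").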
If a related word sense is determined to have an associated symbol in step 110, then that symbol can be associated with the new text and displayed as visual output to a user in step 108. Such visual display may result from automatic association of the identified text with the symbol for a related word sense or from presentation of the suggested symbol to a user for confirmation. Again, if multiple candidate word senses are found in step 110, then the possible candidates may be presented to a user for further selection.
In some embodiments, an optional step 111 can involve an automated modification to the symbol stored for a related word sense before it is associated with the identified text in step 108. The automated modification in step 111 can reflect the type of relation so as to enhance the symbol's appropriateness for the identified text. For example, given a word sense for “sharp” and a word sense for “dull” that are related to one another by an “opposite of” relation, and a situation where a symbol exists for “sharp” but not for “dull,” it may not be appropriate to show the “sharp” symbol for “dull” because it is the opposite related word sense. However, a modification of the “sharp” symbol with a slash or “X” symbol through it might be more appropriate and could be implemented in step 111. Additional variations implemented in step 111 could involve adding a name, number or identifying image to the related symbol to identify the type of relation between the identified text and the related symbol. For example, the plural version of a symbol could be modified by adding a plus sign (+) in the corner of the symbol. Alternatively, the plural version of a symbol could be modified by showing a composite symbol having several examples of the singular symbol. Or the number of degrees of relational separation for relations such as “part of” or “kind of” could be indicated with the symbol.
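Purely as an illustrative sketch, such automated modifications could be implemented with ordinary image operations, for example using the Pillow imaging library as below; the file paths, the “plural of” relation label and the choice of markings are assumptions for illustration.

    from PIL import Image, ImageDraw

    def modify_symbol_for_relation(symbol_path, relation):
        """Return a copy of a related word sense's symbol, marked to
        reflect its type of relation to the identified text."""
        image = Image.open(symbol_path).convert("RGBA")
        draw = ImageDraw.Draw(image)
        w, h = image.size
        if relation == "opposite of":
            # Draw a slash through the symbol (e.g., "sharp" -> "dull").
            draw.line((0, 0, w, h), fill="red", width=max(2, w // 20))
        elif relation == "plural of":
            # Add a plus sign in the corner for the plural version.
            x, y, r = w - w // 8, h // 8, w // 16
            draw.line((x - r, y, x + r, y), fill="black", width=3)
            draw.line((x, y - r, x, y + r), fill="black", width=3)
        return image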
If word sense relation criteria are exhausted and no associated symbols are found, then additional steps can be taken. For example, an optional step 112 may involve providing a symbol menu or other graphical user interface to a user so that the user can manually select a pre-existing or imported symbol for the text, create a symbol from scratch or from predefined symbol selection or creation features, or modify an existing or imported symbol. Once the new symbol is selected, created or modified by a user in step 112, it may then be associated with the identified text for subsequent display and implementation within an electronic device per step 108.
The symbols discussed herein may correspond to a graphical image, or may correspond to different file formats such as an audio file, video file or the like. In some examples, a symbol may be configured manually (by electronic user input) or automatically by the subject symbol assignment system features to include some combination of graphic image, sound, motion, action/behavior and/or other effects and/or specialized user customization. For example, text with an automatically associated symbol may be configured as a graphical interface element having an associated action, thus functioning as a “button” in graphical user interfaces. In a speech generation device, a button having a symbol and/or text may be selected by a user via a touch screen input device. The action resulting from this selection then may correspond to speaking the text corresponding to such symbol and/or placement of the selected text/symbol into a message window for further message composition.
Symbols that are associated with a particular word sense, text, or the like may be stored in the same database as the word sense model database previously mentioned or in a separate database. Additional discussion of such data storage will follow with reference to FIG. 4.
Referring again to the exemplary analysis of the text “bat,” FIG. 2 depicts exemplary graphical elements including symbols that may be discovered for different word senses of the text “bat.”
With more particular reference to exemplary analysis of the text “bat,” assume that the text “bat” is provided in step 100 as well as a part of speech indicator in step 102 that the text “bat” is being used or is intended for use as a noun. An analysis of a word sense database in step 104 may identify three word senses for the text “bat” used as a noun, namely word senses (1), (2) and (3) listed in Table 1 above. A determination is then made in step 106 as to whether any of these three word senses has any associated symbol(s). For example, word sense (1) of Table 1 identifying “bat” as a nocturnal mouselike mammal may have an associated symbol such as shown in graphical element 204 of FIG. 2, while another of the identified word senses may have an associated symbol such as shown in graphical element 201 of FIG. 2.
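Continuing the hypothetical sketches introduced earlier, this stage of the example might be driven as follows, where the symbol store and sense identifiers remain illustrative placeholders:

    # Steps 100/102: the text "bat" is identified, with part of
    # speech information indicating a noun.
    senses = map_text_to_senses("bat", pos="noun")

    # Step 106: determine whether any mapped sense has an associated symbol.
    matched = [(s, SYMBOLS[s["id"]]) for s in senses if s["id"] in SYMBOLS]

    if len(matched) == 1:
        sense, symbol = matched[0]   # associate automatically (step 108)
    elif matched:
        pass                         # prompt the user to choose a symbol
    else:
        # No direct symbol found: fall back to related senses (step 110).
        fallback = find_related_symbol("bat-mammal")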
Referring still to the “bat” example, it may be possible that none of the symbols shown in graphical elements 201 or 204 is discovered or available in the system. In that case, related word senses may be analyzed to discover possible symbols for the “bat” text. An exemplary schematic representation of a portion of the word senses related to some different word senses for “bat” is provided in FIG. 3.
With more particular reference to FIG. 3, word sense (1) from Table 1 defining “bat” as a nocturnal mouselike mammal may be mapped to relational information tracking from “bat” 301 through a chain of related word senses represented by elements 302-305, each of which is related to the preceding element by a “kind of” relation (e.g., “bat” 301 to “mammal” 302, and so on to successively more general word senses).
The same word sense (1) from Table 1 also may be mapped to relational information tracking from “bat” 301 to “Halloween” 306 to “holiday” 307 to “event” 308. Although the relations among elements 301-305 are homogeneous in the sense that they are all “kind of” relations, the relations among elements 301 and 306-308 are heterogeneous in nature. So, for example, relation 325 may be defined as a “related to” relation since “bat” 301 is related to “Halloween” 306. Relation 326 may be defined, for example, as an “instance of” relation since “Halloween” 306 is a specific instance of a “holiday” 307. Relation 327 may be defined as a “kind of” relation since a “holiday” 307 is a kind of “event” 308.
Referring still to FIG. 3, the illustrated relations among word senses may be used in step 110 to discover symbols for the identified text when the mapped word senses themselves have no directly associated symbols.
In the current example, step 110 depicted in FIG. 1 may involve determining whether any word senses related to the mapped word sense for “bat” by selected relation types within a predetermined number of levels of separation (e.g., the “kind of” relations among elements 301-305) have associated symbols. If such a symbol is found, it can be associated with the text “bat” and displayed per step 108, with or without the automated modification of optional step 111.
Referring now to FIG. 4, an exemplary electronic device 400 is illustrated that includes hardware components configured to implement the symbol assignment features discussed above.
Referring more particularly to the exemplary hardware shown in FIG. 4, electronic device 400 includes a computing device 401 having at least one processing device 402 and one or more memory/media devices 404a, 404b and 404c.
At least one memory/media device (e.g., device 404a in FIG. 4) stores computer-executable instructions for execution by processing device 402, while one or more other memory/media devices (e.g., devices 404b and 404c in FIG. 4) store data, such as input data received from a user and output data generated by the processing device.
The various memory/media devices of FIG. 4 may correspond to any of the computer-readable media described above, such as but not limited to magnetic-based storage media, optical storage media, flash, RAM, ROM and other solid-state memory devices, and the like.
In one particular embodiment of the present subject matter, memory/media device 404b is configured to store input data received from a user, such as but not limited to information corresponding to or identifying text (e.g., one or more words, phrases, acronyms, identifiers, etc.) for performing the desired symbol assignment analysis, and any optional related information such as part of speech, context and the like. Such input data may be received from one or more integrated or peripheral input devices 410 associated with electronic device 400, including but not limited to a keyboard, joystick, switch, touch screen, microphone, eye tracker, camera, or other device. Memory device 404a includes computer-executable software instructions that can be read and executed by processor(s) 402 to act on the data stored in memory/media device 404b to create new output data (e.g., audio signals, display signals, RF communication signals and the like) for temporary or permanent storage in memory, e.g., in memory/media device 404c. Such output data may be communicated to integrated and/or peripheral output devices, such as a monitor or other display device, or as control signals to still further components.
Additional actions taken by the processor(s) 402 within computing device 401 may access and/or analyze data stored in one or more databases, such as word sense database 406, language database 407 and symbol database 408, which may be provided locally relative to computing device 401 (as illustrated in FIG. 4) or remotely, for example in a location accessible to computing device 401 via a communications network.
In general, word sense database 406 and language database 407 work together to define all the informational characteristics of a given text/word. Word sense database 406 stores a plurality of entries that identify the different possible meanings for various text/word items, while the actual language-specific identifiers for such meanings (i.e., the words themselves) are stored in language database 407. In addition to the words, optional additional lexical information such as but not limited to definitions, parts of speech, different regular and/or irregular forms of such words, pronunciations and the like are stored in language database 407. The entries in the word sense database 406 are thus cross-referenced to entries in language database 407, which provide the actual labels for a word sense. As such, word sense database 406 generally stores semantic information about a given word while language database 407 generally stores the lexical information about a word.
The basic structure of the databases 406 and 407 is such that the word sense database is effectively language-neutral. Because of this structure and the manner in which the word sense database 406 functionally interacts with the language database 407, different language databases (e.g., English, French, German, Spanish, Chinese, Japanese, etc.) can be used to map to the same word sense entries stored in word sense database 406. Considering again the “bat” example, an entry for “bat” in an English language database (one particular embodiment of language database 407) may be cross-referenced to six different entries in word sense database 406, all of which are outlined in Table 1 above. However, an entry for “chauve-souris” in a French language database (another particular embodiment of language database 407) would be linked only to the first word sense in Table 1 corresponding to the semantic meaning of a nocturnal mouselike mammal, while an entry for “batte” in the same French language database would be linked to the second word sense in Table 1 corresponding to the meaning of a club used for hitting a ball.
The word sense database 406 also stores information defining the relations among the various word senses. For example, an entry in word sense database 406 may also store information associated with the word entry defining which word senses it is related to by various predefined relations as described above in Table 2. It should be appreciated that although relation information is stored in word sense database 406 in one exemplary embodiment, other embodiments may store such relation information in other databases such as the language database 407 or symbol database 408, or yet another database specifically dedicated to relation information, or a combination of one or more of these and other databases.
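For illustration only, a minimal data model consistent with this description might separate the semantic, lexical, relational and symbol information as follows; the class and field names are assumptions rather than the actual layout of databases 406, 407 and 408.

    from dataclasses import dataclass

    @dataclass
    class WordSense:        # word sense database 406 (language-neutral)
        sense_id: int
        gloss: str          # semantic description of the meaning

    @dataclass
    class SenseRelation:    # relation information among word senses
        source_id: int
        target_id: int
        relation: str       # "kind of", "part of", "instance of", ...

    @dataclass
    class Lexeme:           # language database 407 (one per language)
        word: str
        language: str
        pos: str            # part of speech and other lexical information
        sense_id: int       # cross-reference into word sense database 406

    @dataclass
    class Symbol:           # symbol database 408
        sense_id: int
        resource: str       # graphic, audio, video or other file

    # The same language-neutral sense can carry labels from different
    # language databases, per the "bat"/"chauve-souris" example above.
    nocturnal_mammal = WordSense(1, "nocturnal mouselike mammal")
    english_label = Lexeme("bat", "en", "noun", sense_id=1)
    french_label = Lexeme("chauve-souris", "fr", "noun", sense_id=1)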
In some embodiments of the subject technology, the information stored in word sense database 406 and language database 407 is customized according to the needs of a user and/or device. In other embodiments, preconfigured collective databases may be used to provide the information stored within databases 406 and 407. Non-limiting examples of preconfigured lexical and semantic databases include the WordNet lexical database created and currently maintained by the Cognitive Science Laboratory at Princeton University of Princeton, N.J., the Semantic Network distributed as part of the UMLS Knowledge Sources by the U.S. National Library of Medicine of Bethesda, Md., and other preconfigured collections of lexical relations. Such lexical databases and others store groupings of words into sets of synonyms that have short, general definitions, as well as the relations between such sets of words.
Part of speech data for each entry in a language database may also be provided from customized or preconfigured tagset sources. One example of a part of speech tagset that could be used for analysis in the subject text mapping and analysis is the CLAWS (Constituent Likelihood Automatic Word-tagging System) tagsets (e.g., CLAWS4, CLAWS5, CLAWS6, CLAWS7) as developed by UCREL of Lancaster University in Lancaster, United Kingdom.
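For example, fine-grained tags from such a tagset could be collapsed to the coarse part of speech used for word sense filtering. The sketch below uses a few tags from the CLAWS C5 (CLAWS5) tagset; the mapping itself is an illustrative assumption.

    # Collapse fine-grained CLAWS C5 tags to the coarse part of speech
    # used for word sense filtering; only a few representative tags
    # are shown.
    C5_TO_COARSE = {
        "NN1": "noun",       # singular common noun
        "NN2": "noun",       # plural common noun
        "VVB": "verb",       # base form of lexical verb
        "VVI": "verb",       # infinitive of lexical verb
        "AJ0": "adjective",  # general adjective
    }

    def coarse_pos(c5_tag):
        return C5_TO_COARSE.get(c5_tag)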
Symbol database 408 may correspond to a database of graphical images, as well as additional optional features such as audio files, video or animated graphic files, action items, or other features. One example of a symbol database for use with the subject technology corresponds to that available as part of the Boardmaker Plus! brand software available from DynaVox Mayer-Johnson of Pittsburgh, Pa.
It should be appreciated that the hardware components illustrated in and discussed with reference to FIG. 4 also may be provided as part of more specific embodiments of an electronic device, such as the exemplary speech generation device (SGD) 500 of FIG. 5.
Central computing device 501 may include all or part of the functionality described above with respect to computing device 401, and so a description of such functionality is not repeated. Memory device or database 504a of FIG. 5 may store some or all of the word sense, language and symbol information described above with reference to databases 406, 407 and 408.
Referring still to FIG. 5, central computing device 501 may be coupled to a variety of integrated or peripheral input and output devices that assist with user communication, examples of which are discussed below.
In general, the electronic components of an SGD 500 enable the device to transmit and receive messages to assist a user in communicating with others. For example, the SGD may correspond to a particular special-purpose electronic device that permits a user to communicate with others by producing digitized or synthesized speech based on configured messages. Such messages may be preconfigured and/or selected and/or composed by a user within a message window provided as part of the speech generation device user interface. As will be described in more detail below, a variety of physical input devices and software interface features may be provided to facilitate the capture of user input to define what information should be displayed in a message window and ultimately communicated to others as spoken output, text message, phone call, e-mail or other outgoing communication.
With more particular reference to exemplary speech generation device 500 of FIG. 5, visual output may be provided to a user by a display device 512, which in some embodiments is integrated with a touch screen 506 for receiving user input.
Display device 512 may correspond to one or more substrates outfitted for providing images to a user. Display device 512 may employ one or more of liquid crystal display (LCD) technology, light emitting polymer display (LPD) technology, light emitting diode (LED), organic light emitting diode (OLED) and/or transparent organic light emitting diode (TOLED) or some other display technology. Additional details regarding OLED and/or TOLED displays for use in SGD 500 are disclosed in U.S. Provisional Patent Application No. 61/250,274 filed Oct. 9, 2009 and entitled “Speech Generation Device with OLED Display,” which is hereby incorporated herein by reference in its entirety for all purposes.
In one exemplary embodiment, a display device 512 and touch screen 506 are integrated together as a touch-sensitive display that implements one or more of the above-referenced display technologies (e.g., LCD, LPD, LED, OLED, TOLED, etc.) or others. The touch-sensitive display can be sensitive to haptic and/or tactile contact with a user. A touch-sensitive display that is a capacitive touch screen may provide such advantages as overall thinness and light weight. In addition, a capacitive touch panel requires no activation force but only a slight contact, which is an advantage for a user who may have motor control limitations. Capacitive touch screens also accommodate multi-touch applications (i.e., a set of interaction techniques which allow a user to control graphical applications with several fingers) as well as scrolling. In some implementations, a touch-sensitive display can comprise a multi-touch-sensitive display. A multi-touch-sensitive display can, for example, process multiple simultaneous touch points, including processing data related to the pressure, degree, and/or position of each touch point. Such processing facilitates gestures and interactions with multiple fingers, chording, and other interactions. Other touch-sensitive display technologies also can be used, e.g., a display in which contact is made using a stylus or other pointing device. Some examples of multi-touch-sensitive display technology are described in U.S. Pat. No. 6,323,846 (Westerman et al.), U.S. Pat. No. 6,570,557 (Westerman et al.), U.S. Pat. No. 6,677,932 (Westerman), and U.S. Pat. No. 6,888,536 (Westerman et al.), each of which is incorporated by reference herein in its entirety for all purposes.
Speakers 514 may generally correspond to any compact high power audio output device. Speakers 514 may function as an audible interface for the speech generation device when computer processor(s) 502 utilize text-to-speech functionality. Speakers can be used to speak the messages composed in a message window as described herein as well as to provide audio output for telephone calls, speaking e-mails, reading e-books, and other functions. A volume control module 522 may be controlled by one or more scrolling switches or touch-screen buttons.
SGD hardware components also may include various communications devices and/or modules, such as but not limited to an antenna 515, cellular phone or RF device 516 and wireless network adapter 518. Antenna 515 can support one or more of a variety of RF communications protocols. A cellular phone or other RF device 516 may be provided to enable the user to make phone calls directly and speak during the phone conversation using the SGD, thereby eliminating the need for a separate telephone device. A wireless network adapter 518 may be provided to enable access to a network, such as but not limited to a dial-in network, a local area network (LAN), wide area network (WAN), public switched telephone network (PSTN), the Internet, intranet or ethernet type networks or others. Additional communications modules such as but not limited to an infrared (IR) transceiver may be provided to function as a universal remote control for the SGD that can operate devices in the user's environment, for example including TV, DVD player, and CD player.
When different wireless communication devices are included within an SGD, a dedicated communications interface module 520 may be provided within central computing device 501 to provide a software interface from the processing components of computer 501 to the communication device(s). In one embodiment, communications interface module 520 includes computer instructions stored on a computer-readable medium as previously described that instruct the communications devices how to send and receive communicated wireless or data signals. In one example, additional executable instructions stored in memory associated with central computing device 501 provide a web browser to serve as a graphical user interface for interacting with the Internet or other network. For example, software instructions may be provided to call preconfigured web browsers such as Microsoft® Internet Explorer or Firefox® internet browser available from Mozilla software.
Antenna 515 may be provided to facilitate wireless communications with other devices in accordance with one or more wireless communications protocols, including but not limited to BLUETOOTH, WI-FI (802.11 b/g), MiFi and ZIGBEE wireless communication protocols. In one example, the antenna 515 enables a user to use the SGD 500 with a Bluetooth headset for making phone calls or otherwise providing audio input to the SGD. The SGD also can generate Bluetooth radio signals that can be used to control a desktop computer, to which the SGD appears as a Bluetooth mouse and keyboard. Another option afforded by Bluetooth communications features involves the benefits of a Bluetooth audio pathway. Many users utilize an option of auditory scanning to operate their SGD. A user can choose to use a Bluetooth-enabled headphone to listen to the scanning, thus affording a more private listening environment that eliminates or reduces potential disturbance in a classroom environment without public broadcasting of a user's communications. A Bluetooth (or other wirelessly configured) headset can provide advantages over traditional wired headsets, again by overcoming the cumbersome nature of the traditional headsets and their associated wires.
When an exemplary SGD embodiment includes an integrated cell phone, a user is able to send and receive wireless phone calls and text messages. The cell phone component 516 shown in FIG. 5 thus may eliminate the need for the user to carry a separate telephone device.
Operation of the hardware components shown in FIG. 5 may be facilitated by various software interface features executed by central computing device 501, including graphical user interfaces presented on display device 512.
Buttons or other features can provide a user interface element by which a user can select additional interface options or language elements. Such user interface features then may be selectable by a user (e.g., via an input device, such as a mouse, keyboard, touchscreen, eye gaze controller, virtual keypad or the like). When selected, the user input features can trigger control signals that can be relayed to the central computing device within an SGD to perform an action in accordance with the selection of the user buttons. Such additional actions may result in execution of additional instructions, display of new or different user interface elements, or other actions as desired. As such, user interface elements also may be viewed as display objects, which are graphical representations of system objects that are selectable by a user. Some examples of system objects include device functions, applications, windows, files, alerts, events or other identifiable system objects.
User interface buttons or other elements also may correspond to language elements and can be activated by user selection to “speak” words or phrases. Speaking consists of playing a recorded message or sound or speaking text using a voice synthesizer. In accordance with such functionality, some user interfaces are provided with a “Message Window” in which a user provides text, symbols corresponding to text, and/or related or additional information, which then may be interpreted by a text-to-speech engine and provided as audio output via device speakers. Speech output may be generated in accordance with one or more preconfigured text-to-speech generation tools, in male or female and adult or child voices, such as but not limited to products offered for sale by Cepstral, HQ Voices offered by Acapela, Flexvoice offered by Mindmaker, DECtalk offered by Fonix, Loquendo products, VoiceText offered by NeoSpeech, products by AT&T's Natural Voices offered by Wizzard, Microsoft Voices, digitized voice (digitally recorded voice clips) or others.
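A highly simplified sketch of such “speak” behavior appears below, using the open-source pyttsx3 text-to-speech library as a stand-in for the commercial speech engines named above; the Button structure and message window handling are hypothetical illustrations, not the device's actual interface code.

    from dataclasses import dataclass
    import pyttsx3  # cross-platform text-to-speech library

    @dataclass
    class Button:
        text: str    # language element represented by the button
        symbol: str  # path to the associated symbol graphic

    def on_select(button, message_window):
        """Selecting a button places its text/symbol in the message window."""
        message_window.append(button)

    def speak_message(message_window):
        """Speak the composed message via a text-to-speech engine."""
        engine = pyttsx3.init()
        engine.say(" ".join(button.text for button in message_window))
        engine.runAndWait()

    # Compose "I like bats" from three buttons, then speak it.
    window = []
    for button in (Button("I", "i.png"), Button("like", "like.png"),
                   Button("bats", "bat.png")):
        on_select(button, window)
    speak_message(window)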
While the present subject matter has been described in detail with respect to specific embodiments thereof, it will be appreciated that those skilled in the art, upon attaining an understanding of the foregoing, may readily produce alterations to, variations of, and equivalents to such embodiments. Accordingly, the scope of the present disclosure is by way of example rather than by way of limitation, and the subject disclosure does not preclude inclusion of such modifications, variations and/or additions to the present subject matter as would be readily apparent to one of ordinary skill in the art.