Today, input method editors (IMEs) may be used to input non-Latin characters (e.g., Chinese characters) into a computer program using a conventional keyboard. IMEs are an indispensable tool for a significant portion of East Asian users. IMEs may also provide an English mode, wherein assistance is provided for non-native users creating English language documents. In the English mode, English assistance may be provided, such as word or spelling suggestions in response to problematic user typing. The assistance may also include automated spelling corrections, multiple word-choice suggestions and/or anticipated words suggestions, etc. Unfortunately, known IMEs are intrusive, and tend to distract a user with unwanted suggestions. Turning such IMEs on and off as suggestions are needed is awkward. Moreover, the user interfaces (UIs) of known IMEs are inflexible, and do not allow the user to optimally control intrusion of the IME.
Techniques to provide language assistance and operate an input method editor (IME) are disclosed. The IME may be configured to provide language assistance across a plurality of applications. In one example, an IME may be adapted for use by English-as-a-second-language (ESL) users. In a specific example, language assistance may be provided by first detecting a need to provide a suggestion to a user typing within an application. The suggestion may include a substitute word, substitute spelling or substitute grammar. The detection may be based on a probability that a current word or word-use is in error. If need for a suggestion is detected, a suggestion may be obtained for the word. The suggestion may be based on input from a cloud linguistic service (CLS). If network connectivity is unavailable, a local lexicon and/or language-model may be used to obtain the suggestion. Once obtained, the suggestion may be displayed to the user. The display may include adjustable user interface element(s), which allow the user to minimize intrusiveness of the user interface when not needed. The IME may provide opportunities for interaction with the user. For example, the interaction may allow may allow the user to accept or reject the suggestion, or copy the suggestion to a clipboard for use by a plurality of applications. The interaction may allow the user to relocate or minimize user interface elements.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter. The term “techniques,” for instance, may refer to device(s), system(s), method(s) and/or computer-readable instructions as indicated by the context above and throughout the document.
The detailed description is described with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The same numbers are used throughout the drawings to reference like features and components. Moreover, the figures are intended to illustrate general concepts, and not to indicate required and/or necessary elements.
The disclosure describes techniques for operation of an input method editor (IME) configured to provide language assistance across a plurality of applications. In one example, the IME provides an adjustable user interface that may be made unobtrusive if desired, to reduce cognitive overload of a user. The IME may be adapted for use by English-as-a-second-language (ESL) users. In one example, language assistance is provided by detecting a need to suggest a substitute word to a user who is typing within an application. The detection may be based on a probability that a current word is in error or by observation of the difficulty (typing speed, backspaces, etc.) of the user typing the word. If a need for assistance is detected, a suggestion may be obtained for the word. The suggestion may be obtained from a cloud linguistic service or from a local lexicon and/or language-model, if network connectivity is poor. Once obtained, the suggestion may be displayed to the user by a user interface. Aspects of the user interface may be adjustable by the user, and may particularly provide non-intrusive information to the user that does not distract from the user's focus on typing and input. Following the suggestion, interaction with the user allows the user to accept or reject the suggestion, and perform other functions, such as relocating user interface elements utilized by the display.
In one example, the client device 102 may include one or more processors 108 in communication with one or more memory devices 110. A network connection 112 may provide an interface to the network 104. The client device 102 may include and/or communicate with one or more peripheral devices. A first example of a peripheral device, a display 114, can operate as an input and output peripheral with a touch screen, or as an output peripheral without a touch screen. Thus, display 114 may be provided to receive touch input and present visual output. In a second example of peripheral device(s) 116, input peripheral devices can include a hard and/or soft keyboard, a mouse or other pointing device, a microphone, and/or gestural input devices such a one or more cameras or other gesture sensors. In a third example of peripheral device(s) 118, output peripheral devices can include one or more speakers, a printer, auxiliary monitor, or other output devices.
An application 120 may be defined within memory 110 and configured for operation and/or execution by the processor(s) 108. The application 120 may be configured for any of a wide variety of purposes, including word-processing and/or text-based data entry. Thus, application 120 may include a word processor, an Internet browser, an email client, or other application into which a user may type or otherwise input text.
In the example of
The IME DLL 122 may be defined “in-process,” i.e., within the application 120. The IME DLL 122 may be configured to receive input, such as keystrokes, and pass them on to an IME platform process 124 and to the application 120.
In the example of
The suggestion renderer 126 may be configured to provide and manage a user interface, such as seen in the examples of
Detection techniques 202 may detect or recognize a language problem associated with input entered by a user into an application (e.g., application 120 of
The detection techniques 202 may be configured to detect or recognize when suggestions should be sought for possible errors in spelling, word use, grammar or other language issues. In the example shown, the suggestions may be sought from the suggestion techniques 204A, 204B, as will be discussed infra. In one implementation, the IME may be part of an operating system, such as Windows® or another operating system. According to the example, characters (e.g., keystrokes resulting from user operating a keyboard) may be intercepted by the IME dynamic link library (e.g., IME DLL 122 of
The detection techniques 202 may maintain previously typed word(s) in memory, such as within the IME platform process 124 or other location. Such previously typed words may be used (e.g., by the IME platform process 124) to determine the appropriateness of a current word in terms of spelling, context of use, grammar and other linguistic considerations.
The detection techniques 202 may utilize a test or check to determine whether suggestions are needed to replace a word typed by the user. Such testing may be based on a probability of correctness of a word typed by the user and intercepted and recognized by the IME platform process 124. The probability may be based on multiple factors and testing, which may be performed by one or both of local checks (e.g., performed on the client device 102) and remote checks (e.g., performed by the cloud linguist services 106). In an example of local checks, the intercepted word may be checked against a lexicon (e.g., a dictionary), as well as its likelihood of appearing near previously written (possibly adjacent) words. Thus, a sequence of words may be examined and a determination made if each word is correct and/or if a suggestion for replacement is needed. The determination may be based at least in part on a statistical language model (e.g., unigram models, N-gram models, etc.).
The detection techniques 202 may include one or more statistical language models, which may operate on the IME platform process 124, on the cloud linguistic service(s) 106 or both. Consideration of network connectivity and speed may be used to determine if a local or cloud statistical model should be utilized. Generally, if sufficient network speed and bandwidth is available, statistical language model(s) on the cloud linguistic services 106 should be utilized. The cloud based models may have larger lexicon, better ability to process word sequences and better ability overall to detect errors in spelling, word use and/or grammar. Statistical language models operable in either location may detect errors utilizing factors such as input from a lexicon (dictionary, word list, technical vocabulary, etc.), typing speed and/or back-spacing or other corrections made by the user.
Thus, detection techniques 202 may be performed using IME-based techniques. Using such techniques, the IME DLL 122 may be loaded into each application in a configuration that will be able to intercept all keystrokes within the application and will be able to take into account specific input scopes (words, word sequences and/or word usages, etc.). The IME-based techniques also allow keystrokes to be processed in various ways, including forwarding all processing to a running application, a local service or an external service.
The detection techniques 202 may be utilized in conjunction with non-IME based mechanisms, as well as the IME-based mechanisms previously described. For example, applications programming interfaces (APIs) provided by an operating system of the client device may be used directly to monitor keystrokes either by polling or subscribing to global events.
Suggestion techniques 204A and 204B may be utilized to provide suggestions or alternatives to words indicated by the detection techniques 202 as having a high likelihood of error. The suggestion techniques may include remotely-based or cloud-implemented suggestion techniques 204A and locally-based suggestion techniques 204B. Depending on implementation, suggestion techniques 204A and 204B may be operated in an alternative, parallel and/or complementary manner(s). In one example, the remote suggestion techniques 204A, which may be based within the cloud linguistic services 106, may be the primary source of suggestions. The local suggestion techniques 204B may be utilized only when network connectivity does not allow use of the cloud-based suggestion techniques 204A. In some embodiments, the suggestion techniques 204A available in the cloud linguistic services 106 may provide greater capacity and performance than is available from the suggestion techniques 204B operating on the IME platform process 124. In particular, suggestion techniques that are based on a particular technology may be more effective when provided by the cloud-based suggestion techniques 204A, due to computational advantages of the cloud over the IME platform process 124. Example technologies may include language models based at least in part on unigram model technology, N-gram model technologies, etc. Cloud-based word suggestion-generation resources may be configured to leverage computing power of the cloud, including faster execution, massive lexicons and computationally complex algorithms for which implementation may be impossible on the IME platform process 124. In additional examples, cloud linguistic services 106 may arbitrate between multiple cloud-based services or resources to collect candidate suggestions to replace word errors. The arbitration may be performed by comparison of suggestion provider-based design patterns, provider-used technologies, and how those patterns compare to the suggestion process.
In one example, the cloud linguistic services 106 may return an ordered list of candidates to be suggested to the user to replace the word typed by the user. The suggestions on the list, obtained from cloud-based sources, may be derived from globally based statistical language model(s) and ranked by their respective confidence scores. The IME platform process 124 may also collect an ordered list from locally-determined suggestion candidates. The locally-based suggestion techniques 204B may be based at least in part on user-specific personalization within the application 120 (seen in
In other examples, candidate suggestions to replace word errors may be either entirely locally-based or entirely cloud-based. In one example of cloud-based suggestion generation, the suggestions can be queried from multiple cloud-based services and aggregated on the client. In the example, a number of data sources (e.g., cloud-based statistical language models, lexicons, etc.) may be varied according to a required speed of the system. In a contrasting example, a single cloud linguistics service may be used as a point of entry to the IME platform 124. Generally, by reducing the number of data sources, candidate suggestions for an erroneous word may be obtained more quickly.
Various display techniques shown by the example user interface 300 may be performed by the suggestion renderer 126 (as seen in
The user may misspell a word, violate a rule of grammar or otherwise enter text that indicates that the user would benefit from assistance. In the example of
In one example, only the single most preferred suggestion for replacement of the detected erroneous word is displayed. The preferred suggestion is provided if a confidence score is greater than a threshold value (e.g., 90%) that the suggestion is correct. Also, the preferred suggestion is provided if it can be obtained within a threshold period of time after the user typed the word thought to be erroneous.
The notification window 304 may provide a suggestion 312 for a misspelled word, misused word, or grammatical error, etc. Three examples are representative of ways that the suggestion may be utilized by the user. In a first example, the user may click command 314, to thereby replace the erroneously typed text with the suggestion 312. In a second example, the user may click command 316 to copy the suggested change to the clipboard. Once on the clipboard, the suggested text may be used in any application, and may be used with applications that fail to perform correctly if the command 314 selected. In a third example, the user may simply copy the text seen at suggestion 312 by typing it into any application. Additionally, the user may turn off the English assistance by operation of command 318.
An event indicator 402 may be configured to appear and/or change in appearance (e.g., color) to indicate an event. In one example, the event indicator may be an exclamation mark or other symbol, typically indicating concern and/or urgency. The event or issue indicated by the event indicator 402 may include detection of an error or potential error in spelling, word use, grammar or other aspect checked by the IME. To continue the example, upon appearance (or change in appearance) of the event indicator 402, the user may “mouse over” or otherwise select the event indicator to receive a suggestion from the IME. The suggestion may be provided by appearance of the notification window 304 or other user interface element. Thus, selection of the event indicator 402 may result in appearance of, or change in the content of, the notification window 304.
At operation 502, a need to suggest a word to replace a word typed by a user is detected. In some instances, the detection is in response to recognition that the word typed by the user is misspelled. Alternatively, the detection may result from recognition that the word is used in a grammatically incorrect manner or has otherwise been misused. In the examples of
At operation 504, a suggestion is obtained for the word, e.g., in response to detection of the need for a suggestion. The suggestion may be obtained over a network and from cloud-based resources (e.g., cloud linguistic services 106 of
At operation 506, the suggestion is displayed to the user, such as by operation of a user interface. In one example, the IME platform process 124 may provide data, and together with the suggestion renderer 126, may cause the suggestion to be presented. The user interface may be unobtrusive and/or may be regulated by the user to control a degree to which it intrudes on the user's operation of an application on the client device.
At operation 508, the user may interact with the user interface. In several examples, the user may provide input regarding: the detection process whereby a suggestion was found to be necessary; the suggestion process by which a replacement word was obtained; the display process by which the replacement word was presented to the user; or feedback as provided and/or supported by the IME. In the example of
In a further example, at operation 610 keystrokes are detected within an application by operation of an in-process IME dynamic link library (DLL). In the example of
At operation 616, local- and/or server-based testing may be performed on one or more received words. The testing may be performed using local and/or cloud-based lexicon(s), statistical language model(s) and/or cloud-based linguistic service(s). Local testing may be performed by the IME platform process 124 and cloud-based testing may be performed by cloud linguistic services 106. The testing may be coordinated, and/or the results of the testing received, at the IME platform process.
In the example of operation 708, multiple cloud-based services may be queried for correction candidates and the results aggregated. The results may be aggregated by the IME platform process 124 (
In the example of operation 710, the IME may determine if technologies employed by local and remote (e.g., server- or cloud-based) resources to suggest word replacements are similar. At operation 712, if the technologies are similar, then the remote resources are utilized. In one example, the IME platform and a cloud-based linguistic service may use a similar statistical language model, and the remote statistical model may be used. The computing power of the cloud may make this choice preferable.
In the example of operation 714, a suggestion may be obtained from a cloud linguistic service (CLS) that is configured to collect candidates for the suggestion to replace a word typed by the user. Such a suggestion may be obtained if network connectivity is adequate. Alternatively, at operation 716 a suggestion may be obtained from a local lexicon and/or language model if the network connectivity is not adequate.
In the example of operation 806, the user may be monitored by the IME to determine when the user stops typing. If the user stops typing, that may be a factor indicating or weighing toward uncertainty on the part of the user regarding the accuracy of one or more words the user has typed. Other factors indicating possible user confusion may include slowed typing, back-spacing and/or deleting, re-typing, and others. At operation 808, the user interface (e.g., the notification bubble 304 of
In the example of operation 810, unwanted intrusion on the user may be prevented by binding, fixing or moving the notification bubble 304 to a location selected by the user. At operation 812, the display of the word suggestion (e.g., the notification bubble) may be located adjacent to the IME bar. Thus, the user may move the notification bubble and/or the IME bar to a location on a display (e.g., a video display screen, display of a mobile device, etc.) that is less intrusive and/or preferred for any reason. In some examples, the notification bubble 304 and IME bar 308 move together (e.g., joined by the tail 306), and in other examples they are separately movable.
In the example of operation 814, the suggestion may be provided to the user in response to an indicator that a word suggestion is available. In the example of
In the example of operation 816, the suggestion for use in word replacement may be displayed if a confidence in the suggestion is greater than a threshold value. In the example, the threshold may be set at a level that does not interfere with the user unless the confidence level of a user error is sufficiently high.
In the example of operation 904, the IME may respond to user input to move the IME bar and/or the notification bubble. In the example of
In the example of operation 910, the IME may receive a rejection from the user of the suggestion made by the IME to replace a word(s) typed by the user. At operation 912, in response to the rejection, the IME may reduce a likelihood of presenting any suggestion in the future under similar circumstances. At operation 914, the IME may reduce a likelihood of presenting the rejected suggestion if a suggestion is to be presented in the future under similar circumstances.
As used herein, “computer-readable media” includes computer storage media and communication media. Computer storage media includes volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, program modules, or other data. Computer storage media includes, but is not limited to, random access memory (RAM), read only memory (ROM), electrically erasable programmable ROM (EEPROM), flash memory or other memory technology, compact disk ROM (CD-ROM), digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store information for access by a computing device. In contrast, communication media may embody computer-readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave. As defined herein, computer storage media does not include communication media.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described. Rather, the specific features and acts are disclosed as exemplary forms of implementing the claims.
This application is a 35 U.S.C. 371 National Stage application of International Application No. PCT/CN2013/081156, filed Aug. 9, 2013, the entire contents of which are incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2013/081156 | 8/9/2013 | WO | 00 |