1. Technical Field
Aspects of the invention relate to generating text in a handheld electronic device and to expediting the process, such as for example, where the handheld electronic device receives text from sources external to the device.
2. Background Information
Generating text in a handheld electronic device examples of which include, for instance, personal data assistants (PDA's), handheld computers, two-way pagers, cellular telephones, text messaging devices, and the like, has become a complex process. This is due at least partially to the trend to make these handheld electronic devices smaller and lighter in weight. A limitation in making them smaller has been the physical size of keyboard if the keys are to be actuated directly by human fingers. Generally, there have been two approaches to solving this problem. One is to adapt the ten digit keypad indigenous to mobile phones for text input. This requires each key to support input of multiple characters. The second approach seeks to shrink the traditional full keyboard, such as the “qwerty” keyboard by doubling up characters to reduce the number of keys. In both cases, the input generated by actuation of a key representing multiple characters is ambiguous. Various schemes have been devised to interpret inputs from these multi-character keys. Some schemes require actuation of the key a specific number of times to identify the desired character. Others use software to progressively narrow the possible combinations of letters that can be intended by a specified sequence of key strokes. This latter approach uses multiple lists that can contain, for instance, generic words, application specific words, learned words and the like.
An object of aspects of the invention is to facilitate generating text in a handheld electronic device. In another sense, an object is to assist the generation of text by processes that utilize lists of words, ideograms and the like by gathering new language objects from sources of text external to the handheld electronic device.
The generation of text in a handheld electronic device that utilizes lists of language objects, such as for example, words, abbreviations, text shortcuts, and in some languages ideograms and the like to facilitate text generation, adapts to the user's experience by adding new language objects gleaned from text received from sources external to the handheld electronic device. An exemplary external source of text is e-mail messages. Additional non-limiting examples include SMS (Short Message Service), MMS (Multi-Media Service) and instant messages.
More particularly, aspects of the invention are directed to a method of entering text into a handheld electronic device. The handheld electronic device has at least one application for receiving text from sources external to the handheld electronic device and a text input process that accesses at least one list of stored language objects to facilitate generation of text. The general nature of the method can be stated as including processing received text received from an external source comprising scanning the received text for any new language objects not in any list of stored language objects, and identifying any of the new language objects that fail to meet a number of specified characteristics that are at least partially determinative of a language.
Aspects of the invention also embrace a handheld electronic device having a plurality of applications including at least one that receives text from a source external to the handheld electronic device. The device also includes a user interface through which a user inputs linguistic elements and a text generator that has a first language object list and a new language object list and a text input processor. This text input processor comprises processing means selecting new language objects not in the first or new list and identifying any of the new language objects that fail to meet a number of specified characteristics that are at least partially determinative of a language, and means using selected language objects stored in the first list and the new list to generate the desired text from the linguistic elements input through the user interface. This handheld electronic device also includes an output means presenting the desired text to the user.
Turning to
Various types of text input processes 27 can be used that employ lists 29 to facilitate the generation of text. For example, in the exemplary handheld electronic device where the reduced “qwerty” keyboard produces ambiguous inputs, the text input process 27 utilizes software to progressively narrow the possible combination of letters that could be intended by a specified sequence of keystrokes. Such “disambiguation” software is known. Typically, such systems employ a plurality of lists of linguistic objects. By linguistic objects it is meant in the example words and in some languages ideograms. The keystrokes input linguistic elements, which in the case of words, are characters or letters in the alphabet, and in the case of ideograms, strokes that make up the ideogram. The list of language objects can also include abbreviations, and text shortcuts, which are becoming common with the growing use of various kinds of text messaging. Text shortcuts embraces the cryptic and rather clever short representations of common messages, such as, for example, “CUL8R” for “see you later”, “PXT” for “please explain that”, “SS” for “so sorry”, and the like. Lists that can be used by the exemplary disambiguation text input process 27 can include a generic list 31 and a new list 33. Additional lists 35 can include learned words and special word lists such as technical terms for biotechnology. Other types of text input processes 27, such as for example, prediction programs that anticipate a word intended by a user as it is typed in and thereby complete it, could also use word lists. Such a prediction program might be used with a full keyboard.
Known disambiguation programs can assign frequencies of use to the language objects, such as words, in the lists it uses to determine the language object intended by the user. Frequencies of use can be initially assigned based on statistics of common usage and can then be modified through actual usage. It is known for disambiguation programs to incorporate “learned” language objects such as words that were not in the initial lists, but were inserted by the user to drive the output to the intended new word. It is known to assign such learned words an initial frequency of use that is near the high end of the range of frequencies of use. This initial frequency of use is then modified through actual use as with the initially inserted words.
Aspects of the present invention are related to increasing the language objects available for use by the text input process 27. One source for such additional language objects is the e-mail application. Not only is it likely that new language objects contained in incoming e-mails would be used by the user to generate a reply or other e-mail responses, such new language objects could also be language objects that the user might want to use in generating other text inputs.
However, if any of the language objects examined at 47 are determined at 49 to be missing from the current lists, meaning that they are new language objects, processing continues to 51 where it is determined whether any of the new language objects can be considered to be in the current language being employed by the user on the handheld electronic device 1 to input text. An example of the processing at 51 is described in greater detail in
An exemplary language analysis procedure, such as is performed at 51, is depicted in detail in
On the other hand, continuing the example, if it is determined at 61 that in any line or other segment of text the threshold is exceeded, processing continues at 63 where the linguistic elements in all of the new language objects in the text are compared with a set of predetermined linguistic elements. A determination of the ratio of new language objects to language objects and the set of predetermined linguistic elements are non-limiting examples of specified characteristics that may be at least partially indicative of or particular to one or more predetermined languages.
If, for example, the current language is English, an exemplary set of predetermined linguistic elements indicative of the English language might include, for instance, the twenty-six Latin letters, both upper and lower case, symbols such as an ampersand, asterisk, exclamation point, question mark, and pound sign, and certain predetermined diacritics. If a new language object has a linguistic element other than the linguistic elements in the set of predetermined linguistic elements particular to the current language, the new language object is considered to be in a language other than the current language. If the English language is the current language used on the handheld electronic device 1, such as if the language objects stored in the lists 29 are generally in the English language, the routine 38 can identify and ignore non-English words.
If any new language objects are identified at 63 as having a linguistic element not in the set of predetermined linguistic elements, such new language objects are ignored, as at 65. The routine 38 then determines at 67 whether any non-ignored new language objects exist in the text. If yes, the routine 38 then ascertains at 68 whether a ratio of the ignored new language objects in the text to the new language objects in the text exceeds another threshold, for example fifty percent (50%). Any appropriate threshold may be applied. For instance, if the routine 38 determines at 68 that fifty percent or more of the new language objects were ignored at 65, processing returns to the queue at 41, as is indicated at the numeral 71 in
If it is determined at 67 that no non-ignored new language objects exist in the text, processing returns to the queue at 41 as is indicated in
The above process not only searches for new words in a received e-mail but also for new abbreviations and new text shortcuts, or for ideograms if the language uses ideograms. In addition to scanning e-mails for new words, other text received from sources outside the handheld electronic device can also be scanned for new words. This can include gleaning new language objects from instant messages, SMS (short message service), MMS (multimedia service), and the like.
While specific embodiments of the invention have been described in detail, it will be appreciated by those skilled in the art that various modifications and alternatives to those details could be developed in light of the overall teachings of the disclosure. Accordingly, the particular arrangements disclosed are meant to be illustrative only and not limiting as to the scope of the invention which is to be given the full breadth of the claims appended and any and all equivalents thereof.
The instant application is a continuation of and claims priority to U.S. patent application Ser. No. 11/119,455, filed Apr. 29, 2005 now U.S. Pat. No. 7,548,849, entitled “METHOD FOR GENERATING TEXT THAT MEETS SPECIFIED CHARACTERISTICS IN A HANDHELD ELECTRONIC DEVICE AND A HANDHELD ELECTRONIC DEVICE INCORPORATING THE SAME”, the entire contents of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4984177 | Rondel et al. | Jan 1991 | A |
4994966 | Hutchins | Feb 1991 | A |
5377280 | Nakayama | Dec 1994 | A |
5490061 | Tolin et al. | Feb 1996 | A |
5526259 | Kaji | Jun 1996 | A |
5608624 | Luciw | Mar 1997 | A |
5943643 | Van De Veen | Aug 1999 | A |
6002998 | Martino et al. | Dec 1999 | A |
6026410 | Allen et al. | Feb 2000 | A |
6208435 | Zwolinski | Mar 2001 | B1 |
6219731 | Gutowitz | Apr 2001 | B1 |
6233546 | Datig | May 2001 | B1 |
6286064 | King et al. | Sep 2001 | B1 |
6307548 | Flinchem et al. | Oct 2001 | B1 |
6321188 | Hayashi et al. | Nov 2001 | B1 |
6490492 | Fertig et al. | Dec 2002 | B1 |
6553103 | Forlenza et al. | Apr 2003 | B1 |
6801190 | Robinson et al. | Oct 2004 | B1 |
6810376 | Guan et al. | Oct 2004 | B1 |
6991162 | Chang | Jan 2006 | B2 |
7146381 | Allen et al. | Dec 2006 | B1 |
7162086 | Ikeda | Jan 2007 | B2 |
7174295 | Kivimaki | Feb 2007 | B1 |
7389124 | Fux et al. | Jun 2008 | B2 |
7403890 | Roushar | Jul 2008 | B2 |
7430716 | Weir | Sep 2008 | B2 |
7447627 | Jessee et al. | Nov 2008 | B2 |
7548849 | Fux et al. | Jun 2009 | B2 |
7620540 | Fux et al. | Nov 2009 | B2 |
20010029455 | Chin et al. | Oct 2001 | A1 |
20020045463 | Chen et al. | Apr 2002 | A1 |
20020049831 | Platner et al. | Apr 2002 | A1 |
20020103646 | Kochanski et al. | Aug 2002 | A1 |
20020111792 | Cherny | Aug 2002 | A1 |
20020116528 | Vale | Aug 2002 | A1 |
20030023422 | Menezes et al. | Jan 2003 | A1 |
20030040899 | Ogilvie | Feb 2003 | A1 |
20030073451 | Kraft | Apr 2003 | A1 |
20030078766 | Appelt et al. | Apr 2003 | A1 |
20030154071 | Shreve | Aug 2003 | A1 |
20030189731 | Chang | Oct 2003 | A1 |
20030200078 | Luo et al. | Oct 2003 | A1 |
20030204392 | Finnigan et al. | Oct 2003 | A1 |
20040034521 | Kawakura et al. | Feb 2004 | A1 |
20040078191 | Tian et al. | Apr 2004 | A1 |
20040102957 | Levin | May 2004 | A1 |
20040104896 | Suraqui | Jun 2004 | A1 |
20040155869 | Robinson et al. | Aug 2004 | A1 |
20040239533 | Bollman | Dec 2004 | A1 |
20040263486 | Seni | Dec 2004 | A1 |
20050071150 | Nasypny | Mar 2005 | A1 |
20050091030 | Jessee et al. | Apr 2005 | A1 |
20050108017 | Esser et al. | May 2005 | A1 |
20050114770 | Sacher et al. | May 2005 | A1 |
20050125217 | Mazor | Jun 2005 | A1 |
20050125218 | Rajput et al. | Jun 2005 | A1 |
20050197828 | McConnell et al. | Sep 2005 | A1 |
20050251382 | Chang et al. | Nov 2005 | A1 |
20050273332 | Scott et al. | Dec 2005 | A1 |
20050283540 | Fux et al. | Dec 2005 | A1 |
20060028450 | Suraqui | Feb 2006 | A1 |
20060041422 | Davis et al. | Feb 2006 | A1 |
20060119583 | Potera | Jun 2006 | A1 |
20060136223 | Brun et al. | Jun 2006 | A1 |
20060217965 | Babu | Sep 2006 | A1 |
20060247916 | Fux et al. | Nov 2006 | A1 |
20060247917 | Fux et al. | Nov 2006 | A1 |
20070239434 | Rubanovich et al. | Oct 2007 | A1 |
20070294078 | Kim et al. | Dec 2007 | A1 |
20080097745 | Bagnato et al. | Apr 2008 | A1 |
20090221309 | Fux et al. | Sep 2009 | A1 |
Number | Date | Country |
---|---|---|
1 225 184 | Nov 2002 | EP |
1320023 | Jun 2003 | EP |
1 480 421 | Nov 2004 | EP |
2 396 940 | Jul 2004 | GB |
9705541 | Feb 1997 | WO |
2004059958 | Jul 2004 | WO |
Entry |
---|
Lai, Yu-Shen et al. “Unknown Word and Phrase Extraction Using a Phrase-Like-Unit-Based Likelihood Ratio.” International Journal of Computer Processing of Oriental Languages. vol. 13, Issue 1: Mar. 2000. pp. 83-95. |
Beesley, Kenneth R., “Language Identifier: A Computer Program for Automatic Natural-Language Identification of On-Line Text”, Automated Language Processing Systems, ′Online!, pp. 1-21, XP002342527, URL http://www.xrce.xerox.com/competencies/content-analysis/tools/publis/langid.pdf> Sep. 5, 2005. |
Examination Report, dated Jul. 19, 211, from the Canadian Intellectual Property Office, in Application No. 2,606,328. |
Number | Date | Country | |
---|---|---|---|
20090221309 A1 | Sep 2009 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 11119455 | Apr 2005 | US |
Child | 12464552 | US |