The field of the present invention is electronic messaging devices that send and receive messages.
Electronic messaging devices, referred to as “massagers”, are used to send and receive messages between users and their contacts. Many cellular phones include messagers that send and receive SMS messages. Due to their compact sizes, messagers often have limited key pads with relatively few small keys. As such, multiple key presses are often required to input a single character of text. For example, to input the character “b”, a user may be required to press on a “2” key twice. Multiple key presses for single character input is a cumbersome process, and composing a 10-20 word message may take several minutes.
Predictive text technology was integrated within messagers in order to accelerate message composition. Using such technology, one or more text predictions are presented to a user, and the user may thereby input entire words by a single key press. For example, if a user has entered characters r-e-a, text predictions may include such words as “reach”, “react”, “read”, “ready”, “real”, “realize” and “really”. A single key press enables the user to select one of these words. Moreover, even if the user wants to input a different word then those predicted, it often saves time to select one of the predicted words that is close to the user's intended word, and to modify the text accordingly. Thus, if the user wants to input the word “realign”, it is more efficient for him to select the predicted word “realize”, and then backspace twice to delete the z-e and enter the characters g-n.
Prior art text prediction technology includes “dictionary based” and “non-dictionary based” prediction. Dictionary based prediction bases its prediction upon a dictionary of common words. Products that include dictionary based prediction include T9® developed by Tegic Communications of Seattle, Wash., iTap® developed by Motorola, Inc. of Schaumburg, Ill., eZiText® developed by Zi Corporation of Calgary, AB, and Adaptx™ developed by Keypoint Technologies, Ltd. of Glasgow, Scotland. The T9 text prediction technology is described in U.S. Pat. No. 6,011,554 to King et al.
Non-dictionary based prediction bases its prediction upon statistical information for a specific language. Products that include non-dictionary based prediction include LetterWise and Wordwise developed by Eatoni Ergonomics of New York, N.Y.
Aspects of the present invention concern text prediction for messagers based on a user message profile. The user message profile includes information about messages that a user has sent and received, and personal information about the user including inter alia the user's list of contacts, the user's scheduler, and user files stored in the messager's file system.
Unlike dictionaries and language statistics, the user message profile includes information that enables a text predictor to customize its predictions for a specific user.
Aspects of the present invention also concern text prediction for composing a reply to a received message. By parsing the received message to identify special words, phrases, questions and phone numbers in the received message, a text predictor can customize a response.
There is thus provided in accordance with an embodiment of the present invention an electronic messager with a predictive text editor, including a storage unit for storing a data structure associating, for each one of a plurality of a user's contacts, usage data for the user's history of usage of words in communications with the user contact, a data manager coupled with the storage unit for generating the data structure in the storage unit, and for updating the data structure as additional communications with each user contact are performed and additional usage data is obtained therefrom, and a text predictor coupled with the storage unit, for receiving as input a character string and a designated user contact, and for generating as output an ordered list of predicted words, based on usage data in the data structure associated with the designated user contact.
There is moreover provided in accordance with an embodiment of the present invention a method for predicting text while a message is being composed, including generating a data structure associating, for each one of a plurality of a user's contacts, usage data about the user's history of usage of words in communications with the user contact, updating the data structure as additional communications with the user contact are performed and additional usage data is obtained therefrom, and predicting text while the user is composing a message, including receiving as input a character string and a designated user contact, and generating as output an ordered list of predicted words, based on usage data in the data structure associated with the designated user contact.
There is further provided in accordance with an embodiment of the present invention a method for predicting text while a reply message is being composed, including receiving an incoming message for a user, parsing the incoming message to identify questions, phone numbers and special phrases therein, and presenting possible responses that the user may choose from while the user replies to the incoming message, based on the questions, phone numbers and special phrases identified by the parsing.
The present invention will be more fully understood and appreciated from the following detailed description, taken in conjunction with the drawings in which:
Aspects of the present invention relate to predictive text used by electronic messagers, such as mobile phones.
In accordance with the present invention, a user's messager maintains a user message profile. The user message profile includes information about incoming and outgoing message histories for each of the user's contacts. The user profile also includes the user's personal data, including inter alia the user's contact names, items in the user's scheduler, and files and file names in the messager's file system.
Reference is now made to
Messager 100 includes a text editor 150 for composing messages. Many compact messagers have limited space for only a small key pad 130 for inputting characters. As a trade-off for the compactness of key pad 130, several button presses are often required to input a single character, which is cumbersome. A user may spend several minutes composing a short message of 10-20 words.
To speed up the process of composing messages, messager 100 includes a text predictor 160, which predicts words and phrases based on characters that were input. For example, if a user has input the characters r-e-a, then text predictor 160 may provide a list of predicted words and phrases the user can select from to complete the characters, including inter alia “reach”, “react”, “read”, “ready”, “real” and “really”. The user can select one of the words in the list and thereby accelerate composing his message. In general, text predictor 160 receives a character string as input and produces on-the-fly a list of predicted words and phrases as output.
Conventional text predictors 160 use dictionaries to generate the list of predicted words and phrases. In accordance with the present invention, text predictor 160 predicts its words and phrases from a user message profile 170 generated and maintained in a storage unit of messager 100. User message profile 170 includes a data structure, such as the tree data structure described hereinbelow with reference to
Data manager 180 regularly updates the data structure of user message profile 170 dynamically, based on incoming and outgoing messages that the user has received and sent, respectively. Data manager 180 may also update message profile 170 based on personal user information, such as a list of the user's contacts, the contents of a user's scheduler, and user files stored within messager 100.
Implementation details for text predictor 160 are described hereinbelow with reference to
When the user is composing a message to a designated recipient contact, text predictor 160 bases its predictions on messages in user message profile 170 that were received from the designated contact and on messages that were sent to the designated contact, if such messages exist. If the user is composing a message to a new contact then user message profile 170 does not contain a history of messages for the new contact, and text predictor 160 bases its predictions on general messages in user message profile 170.
It will be appreciated by those skilled in the art that the data structure stored in user message profile 170 may also be populated by words detected in speech during a conversation between the user and a user's contact. Speech-to-text conversion is used to convert voice to text. Words extracted from the converted text are then added to user message profile 170.
Such speech-to-text conversion may be performed by a speech-to-text convertor component within messager 100 (not shown in
When the user is replying to a message received from a contact, text predictor 160 derives its predictions based on the contents of the received message. A text parser 190 identifies special words, phrases and questions in the received message, and text predictor 160 uses these results to present the user with reply text he can choose from. For example, if text parser 190 identifies a question beginning with “Where” in the received message, then text predictor 160 retrieves data from the user's scheduler. Thus, if the user responds to a message beginning with “Where” while the user is in a meeting that is posted in the user's scheduler as,
then the predicted response takes the form “I am in a meeting with John in my office between 8:00 AM and 9:00 AM.” Alternatively, if text parser 190 identifies a question beginning with “Where” in the received message, then text predictor 160 presents a list of locations the user can choose from, including his home, his office and his physical location as determined by a GPS unit, in case messager 100 contains a GPS unit (not shown).
If text parser 190 identifies a question beginning with “Who” in the received message, then text predictor 160 presents a list of people the user can choose from, including his contacts.
If text parser 190 identifies a question beginning with “When” in the received message, then text predictor 160 presents text beginning with “At . . . ”, and if the user chooses this text then text editor 150 automatically switches into a numeric input mode.
If text parser 190 identifies a question beginning with “Why” in the received message, then text predictor 160 presents a text reply beginning with “Since . . . ” or “Because . . . ”
If text parser 190 identifies a phone number in the received message, then text editor 150 enables the user to edit, save or dial the identified phone number.
If text parser 190 identifies a special phrase, such as “How are you?” in the received message, text predictor 160 presents text replies beginning with “I'm fine”, “I'm doing well” and “I'm tired” that the user can choose from.
Reference is now made to
If the user's new message is the first message the user is writing to the recipient contact, as determined at step 220, then at step 232 the message editor predicts text patterns based on word frequencies in the user's general message history.
Implementation details for steps 231 and 232 are described hereinbelow with reference to
At step 240 the user sends his new message, and at step 250 information about the sent message is added to the user's message profile for reference when subsequently predicting text.
Reference is now made to
At step 330 the message received at step 310 is parsed for the presence of questions that begin with “Wh”. In fact, because of their short lengths, many short messages such as SMS messages include questions that begin with “Where”, “Who”, “When” and “Why”. Depending on the outcome of step 330, processing proceeds to one of the pairs of steps 341 and 351, 342 and 352, etc.
If the message received at step 310 contains a question that begins with “Where”, as determined at step 341, then at step 351 the message editor offers a list of locations the user can choose from, including inter alia the user's home, the user's workplace, and the user's location as determined by GPS information. Alternatively, as described hereinabove, the message editor may generated a response based on the user's scheduler.
If the message received at step 310 contains a question that begins with “Who”, as determined at step 342, then at step 352 the message editor offers a list of people the user can choose from, including inter alia the user's contacts.
If the message received at step 310 contains a question that begins with “When”, as determined at step 343, then at step 353 the message editor offers to begin the reply message with “At . . . ”, and the characters are automatically switched to numerical mode.
If the message received at step 310 contains a question that begins with “Why”, as determined at step 344, then at step 354 the message editor offers to begin the reply message with “Because . . . ”.
If the message received at step 310 contains a phone number, as determined at step 345, then at step 355 the message editor offers to save, edit or dial the identified phone number.
If the message received at step 310 contains a special phrase, as determined at step 346, then at step 356 the message editor offers to formulate the reply according to pre-defined options. For example, if the incoming message contains the phrase “How are you?”, then possible replies may include “I'm fine, thanks” and “I'm tired”. If the incoming message contains a yes/no question, then possible replies may include “yes”, “no” and “perhaps”.
At step 360 the user sends the reply message that he composed, and at step 370 information about the sent message is added to the user's message profile.
Implementation Details
Reference is now made to
In addition to its character string, within each node 420 of tree 410 is also stored a linked list 440 corresponding to those words that have that character string as their prefix. Each linked list 440 includes words and their frequencies of use with the specific user's contact for whom the data structure is associated with. The linked lists 440 are ordered based on frequency of use. Data manager 180 is responsible for generating and maintaining tree 410 and linked lists 440, and for dynamically updating them when new messages are sent and received to and from the specific user's contact, respectively, and new words and frequencies are derived therefrom. When a word's frequency of use changes, or when a new word is added, data manager updates tree 410 and its linked lists 440 accordingly.
As mentioned hereinabove with reference to
For example, if text predictor 160 receives the character string “Ca” as input, then using tree 410 it references the linked list 440 at the node for “Ca”, and generates as output the ordered list of words (1) Cat, (2) Cable, (3) Car, (4) Camel. In case the output list is limited to three words, the above list is truncated to (1) Cat, (2) Cable, (3) Car.
It will be appreciated by those skilled in the art that linked lists 440 may contain pointers to words stored in memory, instead of the words themselves.
The data structure of
It is noted that the data structure of
An alternate data structure, instead of the tree structure illustrated in
For example, if text predictor 160 receives the character string “Ca” as input, then using the dictionary it looks-up the words Cable, Camel, Car and Cat, and sorts these words according to their frequencies of use; namely, (1) Cat (freq=9), (2) Cable (freq=7, (3) Car (freq=4), (4) Camel (freq=1). As above, if the output list is limited to three words, the above list is truncated to (1) Cat, (2) Cable, (3) Car.
In accordance with the present invention, such a dictionary data structure is generated and maintained for each of the user's contacts. It will be appreciated by those skilled in the art that storing tree data structures or dictionary data structures for a large number of contacts may require more memory than is available in messager 100. In such case, a first in first out (FIFO) policy is implemented to purge the oldest words and profiles in order to accommodate new words and profiles. For example, if a user has 200 contacts and if the average size of a dictionary for the contacts is 10,000 entries and if each entry requires 16 bytes of storage, then the required memory is 200*10,000*16 bytes=32 MB of storage. For messagers that include one or more GBs of memory, the required memory for the dictionaries is approximately 3% or less of the total capacity.
Comparing the tree data structure with the dictionary data structure, it will be appreciated that the data structure illustrated in
It will further be appreciated by those skilled in the art that various optimizations may be performed to enhance the performance of text predictor 160 and data manager 180, for both the tree data structure and the dictionary data structure embodiments. Thus, the output list of text predictor 160 may be sorted only relative to the first three characters, say, of the predicted words. Such partial sort reduces processing requirements for data manager 180 vis a vis the tree data structure, and for text predictor 160 vis a vis the dictionary data structure.
Additionally, the entries in the dictionary data structure may be pre-sorted for specific prefixes, thereby reducing on-the-fly processing requirements for text predictor 160 vis a vis the dictionary data structure.
The present invention may be embodied as an enhancement to existing text prediction, such as T9 text prediction, by fine-tuning the prediction to each specific user contact. T9 bases its prediction on key strokes. For example, when a user presses on “228”, predictions such as “Cat”, “Bat”, “Act” are derived, since the “2” key represents “a”, “b” and “c”, and the “8” key represents “t”, “u” and “v”. The T9 predictions may also include words that have prefixes that correspond to “228”, such as “Cats”, “Bats”, “Actor”, “Acting”. The predictions are sorted by frequency of use. The present invention enhances T9 prediction inter alia by generating and sorting predictions according to frequencies of use for a specific user contact.
The present invention may also be embodied as a stand-alone text predictor. In distinction to T9, when the present invention is embodied as a stand-alone predictor, predictions are based on characters that are input, instead of key strokes per se. For example, when a user presses on “222-2”, for example, corresponding to “c-a”, predictions include words that have “ca” as prefix, such as “Cat”, “Cable”, “Car”, “Camel”, as in
In reading the above description, persons skilled in the art will realize that there are many apparent variations that can be applied to the methods and systems described. Although the present invention has been described with reference to text messages, such as short message service (SMS) messages, it also applies to other modes of communication, including inter alia e-mail messages and multi-media messaging service (MMS) messages. The data structure in
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made to the specific exemplary embodiments without departing from the broader spirit and scope of the invention as set forth in the appended claims. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.
This application is a continuation-in-part of assignee's application U.S. Ser. No. 11/975,489, filed on Oct. 19, 2007 now abandoned, entitled METHOD AND SYSTEM FOR PREDICTING TEXT.
Number | Name | Date | Kind |
---|---|---|---|
5628055 | Stein | May 1997 | A |
5748512 | Vargas | May 1998 | A |
6201867 | Koike | Mar 2001 | B1 |
6243578 | Koike | Jun 2001 | B1 |
6377965 | Hachamovitch et al. | Apr 2002 | B1 |
6392640 | Will | May 2002 | B1 |
6690947 | Tom | Feb 2004 | B1 |
6898283 | Wycherley et al. | May 2005 | B2 |
7085542 | Dietrich et al. | Aug 2006 | B2 |
7111248 | Mulvey et al. | Sep 2006 | B2 |
7149550 | Kraft et al. | Dec 2006 | B2 |
7194285 | Tom | Mar 2007 | B2 |
7360151 | Froloff | Apr 2008 | B1 |
7487147 | Bates et al. | Feb 2009 | B2 |
7610194 | Bradford et al. | Oct 2009 | B2 |
7630980 | Parikh | Dec 2009 | B2 |
7650348 | Lowles et al. | Jan 2010 | B2 |
7679534 | Kay et al. | Mar 2010 | B2 |
7788327 | Naito et al. | Aug 2010 | B2 |
20040141004 | Cabezas et al. | Jul 2004 | A1 |
20040153963 | Simpson et al. | Aug 2004 | A1 |
20040153975 | Williams et al. | Aug 2004 | A1 |
20040233930 | Colby, Jr. | Nov 2004 | A1 |
20050070225 | Lee | Mar 2005 | A1 |
20050114770 | Sacher et al. | May 2005 | A1 |
20050159184 | Kerner et al. | Jul 2005 | A1 |
20050210115 | Naito et al. | Sep 2005 | A1 |
20050228774 | Ronnewinkel | Oct 2005 | A1 |
20060025091 | Buford | Feb 2006 | A1 |
20060105722 | Kumar | May 2006 | A1 |
20060241353 | Makino et al. | Oct 2006 | A1 |
20070004450 | Parikh | Jan 2007 | A1 |
20070016862 | Kuzmin | Jan 2007 | A1 |
20070018957 | Seo | Jan 2007 | A1 |
20070074131 | Assadollahi | Mar 2007 | A1 |
20070161404 | Yasujima et al. | Jul 2007 | A1 |
20080109735 | Vuong | May 2008 | A1 |
20080140886 | Izutsu | Jun 2008 | A1 |
20080243736 | Rieman et al. | Oct 2008 | A1 |
20090058688 | Thorn | Mar 2009 | A1 |
Number | Date | Country |
---|---|---|
1871075 | Dec 2007 | EP |
0059247 | Oct 2000 | WO |
0186922 | Nov 2001 | WO |
03103174 | Dec 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20090106695 A1 | Apr 2009 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 11975489 | Oct 2007 | US |
Child | 11986600 | US |