1. Technical Field
The present invention relates in general to an improved data processing system, and in particular to a method and system for enhanced communication on a data processing system. Still more particularly, the present invention relates to a method and system for enhanced communication by providing language translation of received messages on a data processing system.
2. Description of the Related Art
Language translation applications (software) are well known in the art. With the fast evolving global electronic network following the opening up of world-wide communication channels such as the Internet or World Wide Web, transmission of data and non-data communication in languages other than that of the receiving entity/system is becoming more and more common. The World Wide Web has introduced near instantaneous world wide communications; however, although the Web has helped to bring us all closer together, it has not removed the language barriers which exist in intercultural communications.
Email communication makes up a significant and fast growing portion of the communications environment which exist on the Internet. More and more users globally are communicating via email which is considerably cheaper than regular telephone network or other related communications systems.
Because of the global nature of the internet e-mail network, messages are often transmitted and received in languages other than the language of the recipient of the message. A message from Japan or Holland sent to a U.S.A. recipient would be received in Japanese or Dutch, respectively. With English as the preferred language of communication in the U.S.A., it is very probable that the recipient would be unable to comprehend the email without first translating it into English. Similarly, if the data processing system referencing the data or communication is configured in English, then it would not be able to reference the information without first translating it into English.
Currently, the only way to translate a message of this type is to transfer the text from the location it is received from to a language translation environment.
This often involves a cut and paste procedure wherein the user first cuts the foreign text from the receiving location, opens up a language translation software on the data processing system or alternatively going to a language translation environment on the web and then pastes the text into the a viewport within the language translation environment. The recipient then requests that the text be translated into a desired language.
AltaVista (TM) Translation utilizing Systran technology is one of the more common web locations which provide a language translation service. AltaVista's Translation Assistant provides a free service to the Internet community. Systran, a leader in the field of machine translations, provides a machine translation of the message received from one language to another selected language.
In machine translations, a data processing system is configured for translation utilizing language translation software. AltaVista Translation Assistant thus provides a user with a tool to translate a grammatically correct document into something comprehensible, although not perfect.
Systran Software, Inc. is the leading developer of natural language machine translation, having developed 28 language pairs. The Systran interface screen prompts a user to provide plain text in an input location. The user may then select the language pair he/she desires to be translated to/from (eg. French to English). The user then clicks on the “translate” button to complete the process. The text entered is translated into its English form and displayed to the user below the original French form. Other implementations of the Systran software permits for an entire URL content to be translated. Additionally, a language translation menu item may be added to the user's internet browser which access the Systran location directly.
Considerable research has been carried out on computer systems for the automatic translation of text. To date all translations have to be invoked by the user as described above. This is time consuming and also redundant, particularly when the user is already aware that he/she desires to read all communications in a particular language or alternatively where the data processing system has been configured for interfacing in a particular language.
It would therefore be desirable to have a method and system for automatically translating all incoming communication which is in a different language (foreign language) into the language desired by the user and/or configured on the data processing system. It would be further desirable to have translation immediately upon receipt without user manipulation or request for each individual communication.
It is therefore one object of the present invention to provide an improved data processing system.
It is another object of the present invention to provide a method and system for enhanced communication on a data processing system.
It is yet another object of the present invention to provide a method and system for enhanced communication by providing language translation of received messages on a data processing system.
The foregoing objects are achieved as is now described. A method is disclosed for allowing recipient translation of information received by a data processing system configured with a user interface having a human language which is different from the human language of the transmitted information. The method comprises of first selecting a desired human language for user interface with resident software applications on the data processing system. Then the method intercepts incoming communication to the data processing system which involves transmission of human language-based information and then analyzes the incoming communication to determine if it is in a different human language than the desired human language. When the human language of the incoming communication differs from the desired human language, the method dynamically translates the human language of the incoming communication into the desired human language. Finally, the method transmits the translated incoming communication in the desired human language to an output device of the data processing system.
The above as well as additional objects, features, and advantages of the present invention will become apparent in the following detailed written description.
The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself however, as well as a preferred mode of use, further objects and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:
With reference now to the figures, and in particular with reference to
Data processing system 10 may have many additional components which are not shown such as serial, parallel, and USB ports for connection to, e.g., modems or printers. In the preferred embodiment of the invention, communication to the data processing system is made possible via a modem connected to a land line (telephone system) which is in turn connected to a network provider such as an internet service provider (ISP). Additionally, data processing system may be connected to a network via an ethernet/network card. Communicated data arrives at the modem or network card and is processed to be received by the data processing system's CPU or other software application.
Those skilled in the art will further appreciate that there are other components that might be utilized in conjunction with those shown in the block diagram of
Conventional data processing systems often employ a graphical user interface (GUI) to present information to the user. The GUI is created by software that is loaded on the data processing system, specifically, the data processing system's operating system acting in conjunction with application programs. Two well-known GUIs include OS/2 (a trademark of International Business Machines Corp.) and Windows (a trademark of Microsoft Corp.).
This invention implements a front end translation mechanism that first intercepts incoming communication, such as email, transmitted in a different language, then translates it into a desired language providing the recipient with a message in the desired language. For the purposes of this invention incoming communications include email messages, files transmitted via file transfer protocol (FTP), internet/web pages, chat or newsgroup communications, and terminal emulation. Those skilled in the art recognize that this list may include other forms of incoming communication similar to those listed above.
The implementation of the present invention occurs on a typical data processing system as described above. In the preferred embodiment, data processing system is enhanced to provide instantaneous translations upon receipt of a communication in a different language. The operating system (OS) of the data processing system such as Microsoft's Windows '98 provides the data processing system with its software configurations. These software configurations includes the desired language set for interfacing on the data processing system. Most systems are pre-configured (hardware) in the language of its expected end market. In the preferred embodiment, OS contains an option within its language configuration which permits a user to request automatic translations of all incoming communication in a foreign language. In the preferred embodiment, the user selects the option for automatic translations to implement the invention.
In one embodiment of the invention, data processing system is loaded with a language translation package which provides internal translation software to the user. This software may be purchased separately from the data processing system and OS from a language translation company such as SYSTRAN of La Jolla, Calif., USA.
Where communication is received via an internet site with the user having a connection ID, the translation may be completed within the internet environment by sending the communication to a web site such as AltaVista's to complete the translation and return the translated message. An example of this application is detailed below.
Alternatively, the option for automatic translation may exist within a language translation software package. The language translation package may include software algorithm to request if the user wishes automatic translations and then reconfigure the data processing system to permit automatic translation.
In the preferred embodiment of the invention, the receiving data processing system is equipped with an email engine, such as Eudora. The engine is resident software for displaying and manipulating email messages. It contains a queue which sorts the mail, looks at it, and provides options to reply, etc. to an outgoing server utilizing Simple Mail Transfer Protocol (SMTP). Those skilled in the art are familiar with the workings of an email engine. An email message is routed to a Post Office Protocol (POP) server on which the mail is stored until accessed. When the user logs unto his mail account utilizing the email engine, and connects to the POP server, the incoming email messages are downloaded unto the user's data processing system into the email engine's In-box. In the preferred embodiment, the user specifies the POP server and the language to have the email arrive in and the translation occurs at the user/client side.
In one embodiment of the invention, web-based communication and email are provided with a language tag which identifies the language in which the communication was created to a recipient data processing system. Transport Control Protocol/Internet Protocol (TCP/IP) is the protocol utilized to link users together on the Internet, intranet and other networks. It allows businesses to communicate with each other, share e-mail across the globe, send files from one city to another, and exchange ideas through special forums such as newsgroups. It also permits data exchange and advertising over the web as well as terminal emulation. TCP/IP email also utilizes a POP server and e-mail engine to receive mail.
The present invention utilizes several of the features/specifications of TCP/IP communication to identify the language of the incoming communication. Through TCP/IP technology, the internal system of the sending data processing system is able to recognize that the receiving system probably speaks another language. Additional sources of language information includes IP header, the source and destination IP addresses, IP addressing (username), masking, domain name system, time stamping, address resolution protocol and File Transfer Protocol (FTP).
In one embodiment of the invention, a database is maintained of all IP addresses and the IP address of the received message is looked up in the database to determine which country it originated from and the corresponding native language in which the mail was written. This embodiment, however, like others described herein, utilizes further identification procedures to verify the originating language. Those skilled in the art will appreciate that the TCP/IP provides a plurality of identification measures by which the language type of the originating message may be determined and further that a plurality of identification steps may be required to accurately determine an originating language.
Communications transmitted as Hypertext Markup Language (HTML) documents and extended Markup Language (XML) documents, for example, contains an elaborate tag system that permits the receiving system to determine which messages may require translation as soon as the message arrives. These tags are language codes attached to the message header which may be utilized to identify the message as written in a different language.
In one illustrative embodiment in which e-mail communication is received by the user on an email engine, a series of steps necessary to implement the invention are disclosed. Email engine is created by software resident on the data processing system. Email engine typically consists of GUIs which provide a display area and a number of options for user interface.
In another embodiment of the present invention, a user selects a list of email addresses from which e-mail in foreign languages are received. In this embodiment, the user is provided with an option on the email engine to automatically translate email from these particular addresses into the desired language of the user. It is understood by those skilled in the art that variations exist in the embodiments of the present invention but that all these variations fall within the scope of the present invention.
In the preferred embodiment, once the user configures the data processing system or email engine, all incoming messages in a foreign language are translated at the time the message is downloaded unto the data processing system and prior to the user receiving the message. In one embodiment, this process involves sending the message from the inbox to a language translation center such as AltaVista's web site, dynamically requesting the translation, and storing the translated message along with the original message in the inbox. Another embodiment utilizes resident translation software packages which are loaded unto the data processing system and accessed to process the translations.
The illustration below depicts a message received in one language and automatically translated. The user receives the output below when he opens the message.
In the preferred embodiment of the invention, the translation is completed and displayed sentence by sentence. The translated text is displayed in a different color and/or other attribute so that it is clearly presented and the recipient may follow/decipher the machine translation.
The process above describes a series of determination steps to identify the language. In this illustrative embodiment, this series of steps help to provide a more accurate determination of the language of the incoming communication. In the preferred embodiment, the results of the analysis gives a best match of the originating language. In another embodiment of the present invention, once a language is identified, then the process jumps to the translation steps.
One method of identifying the language is by analyzing character sets. In the preferred embodiment, character sets, as utilized in the present example, are not limited to Roman character sets. It is understood that alternative character sets, for example Japanese, exist and are incorporated in the analysis process. In the preferred embodiment of the present invention, all language sets are contemplated and may be translated.
While the invention has been particularly shown and described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention. For example, it is understood that any number of determining steps may be implemented to determine the language of the incoming message and that all such methods fall within the scope of the invention.
Number | Name | Date | Kind |
---|---|---|---|
5546500 | Lyberg | Aug 1996 | A |
5884246 | Boucher et al. | Mar 1999 | A |
5884247 | Christy | Mar 1999 | A |