The present invention relates to a machine translation (automatic interpretation) system, a machine translation (automatic interpretation) method, and a machine translation (automatic interpretation) program, and in particular, to those superior in the handling of proper nouns.
As an application example of a machine translation/automatic interpretation system, there exists a regional proper noun dictionary receiving system disclosed by the present applicant in Japanese Patent Laid-Open No. 2003-330916. According to the publication, the regional proper noun dictionary receiving system includes a portable terminal device including a position detecting function to detect a geographical current position of the terminal and a server including a database of regional proper noun dictionaries in which information including pronunciation information associated with regionally unique nouns is described. According to the publication, the portable terminal device first transmits positional information obtained by the position detecting function to the server by use of a transmission function. When the positional information is received from the portable terminal device, the server conducts matching between the positional information and regional proper noun dictionaries to select a regional proper noun dictionary suitable for the positional information, and then downloads dictionary data thus selected to the portable terminal device. Then, the portable terminal device outputs a regional proper noun using the dictionary data obtained from the server. The regional proper noun dictionary receiving system can be used not only in our country but also in countries we visit in the world to obtain a regional proper noun thereof.
Patent document 1: Japanese Patent Application Laid-Open No. 2003-330916, FIG. 1
As described above, although it is known that a machine translation/automatic interpretation system becomes more useful with a large number of dictionaries including necessary information, proper nouns contained in these dictionaries are place names, personal names, names of peculiar items, and the like employed in the language under consideration, and a translated word itself has no meaning or a translated word is only a transliteration or transcription in many cases. When energy and storage space needed to create dictionaries for these proper nouns are taken into consideration, it is unrealistic with respect to the cost, feasibility, and the like to beforehand create the dictionaries for nouns unique to every countries and regions of the world.
The present invention has been devised in consideration of the above situation and it is an object thereof to provide a machine translation automatic interpretation method capable of translating proper nouns without beforehand preparing dictionaries associated with proper nouns and a system, an apparatus, and a program for implementing the machine translation automatic interpretation method and also to provide a machine translation automatic interpretation method capable of remarkably reducing the labor and the cost for the handling of the proper nouns and a system, an apparatus, and a program for implementing the machine translation automatic interpretation method.
In accordance with a first viewpoint of the present invention providing means for solving the problem, there is provided a machine translation system in which proper noun information including proper noun groups described in a translation target language is accessed; for the proper noun information accessed as above, corresponding character strings are automatically generated in a language at least other than the target language to create a proper noun user dictionary to present the dictionary to the user; receiving an input, a corresponding character string contained in a translation objective sentence is translated into proper nouns in the target language. More specifically, the machine translation system in accordance with the present invention includes means for generating proper noun user dictionary data and creates, by the means, a proper noun user dictionary including information necessary for the machine translation using proper nouns in the target language. Additionally, more preferably, the machine translation system in accordance with the present invention includes a proper noun user dictionary storage section to store proper noun user dictionary therein. By adopting such a configuration, it is possible to create and to enhance the user dictionary for the machine translation using data including only easily attainable proper nouns in the target language to thereby translate sentences including proper nouns.
In accordance with the present invention, translation/interpretation processing with a corresponding word translation function superior in proper noun translation can be executed without beforehand preparing dictionaries associated with proper nouns. More straightforwardly, since it is not required to beforehand create dictionaries for the machine translation at least for proper nouns, the cost of creation and possession of dictionaries can be reduced.
Subsequently, description will be given in detail of a best mode for carrying out the present invention. The machine translation system in accordance with the present invention is formed in its favorable embodiment as an information processing system including an input device (1 of
The storage section (3 of
The target language proper noun storage section (31 of
The proper noun user dictionary storage section (32 of
The machine translation dictionary storage section (33 of
The machine translation apparatus includes a proper noun user dictionary data generating section (21 of
The proper noun user dictionary data generating section (21 of
The machine translation section (22 of
Next, referring to the drawings, description will be given in detail of operation of the machine translation system configured as described above. First, description will be given of operation where the machine translation system outputs a proper noun user dictionary according to the proper noun information and urges input of a translation objective sentence.
First, the proper noun user dictionary data generating section (21 of
In this connection, “Aachen” and “Aaland” are generally expressed in the original language (Japanese) as “ah-hen” (city) in Germany and “ohrando” (island) in Finland. However, in the present invention, it is not required to strictly transliterate the words, which will be clarified by the explanation described later.
The proper noun user dictionary data generating section (21 of
In this stage, pairs of the proper nouns and the corresponding character strings of the machine translation system are displayed. The user inputs, by referring to the displayed information, a sentence including character strings corresponding to the proper nouns of the target language.
The machine translation section (22 of
Next, the machine translation section (22 of
When the original language is, for example, Japanese, if a translation object sentence “ahchen ni ikitai” is inputted, the machine translation section (22 of
As above, in accordance with the present embodiment, for at least proper nouns, items described in the target language can be accessed and be read in to thereby conduct the machine translation. Therefore, it is not required to beforehand create dictionary information for the machine translation, and the storage capacity for the dictionary information can be reduced.
Next, description will be given in detail of a first embodiment by referring to drawings.
A personal computer includes a keyboard as the input device 1, a magnetic disk storage device as the storage section 3, and a display as the output device 4. Moreover, the personal computer includes a central processing unit which reads in and executes predetermined programs to function as the proper noun user dictionary data generating section 21 and the machine translation section 22. Furthermore, the magnetic disk storage device has stored the target language proper noun groups, the proper noun user dictionary, and the machine translation dictionary respectively in the target language proper noun storage section 31, the proper noun user dictionary storage section 32, and the machine translation dictionary storage section 33.
In this paragraph, description will be given on the assumption that the target language proper noun storage section 31 has stored, as the proper nouns of the target language, proper nouns such as “KAXI”, “SOSDK”, “WEKIND”, and “LIDRE”.
The proper noun user dictionary data generating section 21 includes a table to convert the target language proper nouns into phonetic symbols and phonetic characters in the original language according to the respective characters constituting each of the nouns and a connective relationship between the characters. According to the table, the data generating section 21 generates a notation (corresponding character string) in the original language according to a proper noun in the target language and stores the notation in the proper noun user dictionary storage section 32.
In the storage section 32, the notations (corresponding character strings) generated by the data generating section 21 in the original language are stored together with the notations in the target language and additionally the dictionary information necessary for the machine translation.
For example, the proper noun user dictionary data generating section 21 decomposes a proper noun “KAXI” in the target language to create phonetic symbols as “KA”→“ka”+“XI”→“si”, and stores the corresponding character string “kasi” in the proper noun user dictionary storage section 32 (reference is to be made to
Next, the machine translation system displays a proper noun in the target language together with a notation (corresponding character string) of the proper noun in the original language (step A4).
When a predetermined translation execution operation is conducted, the machine translation section 22 reads the machine translation dictionary and the proper noun user dictionary from the machine translation dictionary storage section 33 and the proper noun user dictionary storage section 32 respectively (step A5, step A6), and reads a translation object sentence in the original language inputted from the input device 1 (step A7). The machine translation section 22 conducts a predetermined analysis for the inputted translation object sentence and converts the sentence described in the original language into a text in the target language using the machine translation dictionary and the proper noun user dictionary (step A8). Finally, the machine translation section 22 outputs the text in the target language as the translation result to the display (step A9).
For example, assume that the user inputs from the keyboard “kono michiha sosuzuku ni tsuujite imasuka”, viewing the display (reference is to be made to
As can be clear from the above description, in accordance with the present invention, it is possible to obtain a good translation result for proper nouns without preparing a proper noun dictionary in advance. Furthermore, in accordance with the present invention, existing proper noun information is converted into phonetic characters and phonetic symbols beforehand, and then is presented to the user to receive user's inputs, which makes it possible to identify a proper noun in the target language. Therefore, for proper nouns, a correct translation result can be produced without correctly inputting a notation in the target language, a translated word in the original language, or the like.
In the description of the first embodiment of the present invention, the proper noun user dictionary data generating section 21 includes a table to convert the target language proper nouns into phonetic symbols and phonetic characters in the original language with characters constituting each of the nouns and adjacent relationships between the characters. As a second embodiment of the present invention, description will be given of a configuration not using the table. Since the second embodiment of the present invention is substantially equal to the above first embodiment, description will be given of sections different from the first embodiment.
The proper noun user dictionary data generating section 21 of the machine translation system in accordance with the second embodiment of the present invention generates a name that is capable of uniquely identifying each proper noun according to a predetermined rule, regardless of the contents of characters of a proper noun in the target language.
For example, the data generating section 21 generates, for a set of proper nouns in the target language read out from the target language proper noun storage section 31, notations (corresponding character strings) as “Place name 1”, “Place name 2”, “Place name n”. And the data generating section 21 stores pairs in the proper noun user dictionary storage section 32 and displays them on the display.
As can be seen from the above description, the present invention does not necessarily require the table to generate the proper noun user dictionary data, but it is only necessary that the proper nouns in the target language can be displayed for the user to identify them. Therefore, when compared with the first embodiment described above, the second embodiment is realized in a simpler process, and it is possible to construct the system having a high processing speed at a low cost. Furthermore, when the user is not familiar with the phonetic characters and the phonetic symbols, though reading or inputting may be difficult, a correct translation result can be produced for the proper noun under consideration as described above. That is, the system generates notations (corresponding character strings) such as “place name n”, “Point n”, and “Destination n” for the user and receives user's input.
Next, description will be given of a third embodiment as a modification of the first embodiment of the present invention described above. The third embodiment of the present invention is constructed as shown in
The electronic map storage section 34 of the embodiment has stored therein, in addition to a map image, map information to display proper nouns such as place names, traffic facility names, and street names at predetermined positions.
The proper noun user dictionary data generating section 21 generates, for each proper noun included in the map information of the electronic map storage section 34, a notation (corresponding character string) in the original language as in the first embodiment. And the data generating section 21 stores pairs of notations and proper nouns in the proper noun user dictionary storage section 32 and displays them on the electronic map.
As can be clear from the above description, in accordance with the present embodiment, it is possible to effectively use, as a translation dictionary, the map including proper nouns. For example, even if place names, traffic facility names, street names, and the like of the electronic map information, an electronic route map, and a street map are indicated in a particular language (target language) and the user cannot understand the place names and the like in the language, it is possible as described above to display on the map the corresponding character strings which can be inputted by the user and hence instructions can be received. This means that an electronic map where character sections are generated in a foreign language or the like and products of applications of the map (map information system; GIS, car navigation system, etc.) can be easily modified for the user who cannot understand the foreign language.
Moreover, as in the second embodiment described above, the proper noun user dictionary data generating section 21 may generate names with which each of place name information (proper noun) can be uniquely identified according to a predetermined rule, regardless of the characters of proper nouns in the target language.
Next, description will be given of a fourth embodiment in which the machine translation system in accordance with the present invention acquires proper noun information via a network.
Referring to
Having received the proper noun information, the machine translation system generates, by the proper noun user dictionary data generating section 21, proper noun user dictionary data as in the respective embodiments described above (step B5) to store the data in the proper noun user dictionary (step B6) and displays it on the display (step B7). After this point, a process of translating a translation object sentence inputted from the user is similar to that of each embodiment described above.
As can be clear from the above description, the present invention can adopt not only a configuration to use the proper noun information possessed by the machine translation system, but also a configuration in which the proper noun information on the network is downloaded and is used for the translation. For example, when the machine translation system in accordance with the present invention is mounted on a portable information terminal having a GPS function, it is possible to obtain proper noun information corresponding to the current position obtained from the GPS so that the information is stored in the proper noun user dictionary or is used. In this case, for example, it is possible to create and to use a dictionary including place names and the like of respective places in the world according to necessity. And, in the situation, it is not required to beforehand prepare dictionary data, which has been described in Japanese Patent Application Laid-Open No. 2003-330916.
Next, description will be given of a fifth embodiment in which pictures are taken and proper noun information is obtained from image data in a machine translation system. The machine translation system in accordance with the fifth embodiment of the present invention is configured as shown in
The imaging section 25 is image input means such as a digital camera and a scanner, and the character recognizing section 26 is means which scans an image to cut out a character string section contained in the image and which compares the section with a standard character pattern to convert it into text data in consideration of a connection state or the like according to necessity.
Description will be given of operation of the machine translation system in accordance with the present embodiment. When the predetermined triggers such as reception of a conversion processing request from the user and a translation execution request for image data are activated, the proper noun user dictionary data generating section 21 of the machine translation system acquires through the character recognizing section 26 a character string from an image obtained by the imaging section 25 to generate proper noun user dictionary data in a manner similar to that of each embodiment described above.
As can be clear from the above description, in accordance with the present invention, it is possible to photograph a map, a book, or the like including proper nouns to effectively use it as a translation dictionary. For example, a sightseeing guide map, a route map, a road map generated in a particular language (target language), its title section and index section, a guidebook, a telephone directory, or the like can be photographed and included in the proper noun user dictionary to thereafter execute the translation processing.
Next, description will be given of a sixth embodiment proposing a detailed configuration of the proper noun user dictionary data generating section 21 of the machine translation system in accordance with the present embodiment. The sixth embodiment of the present invention is constructed as shown in
The target language audio synthesizing section 27 is a means to produce an audio output using proper noun information stored in the target language proper noun storage section 31, and the original language audio recognizing section 28 is means which compares the audio output of the proper noun information with standard audio patterns in the original language to convert the information into text data, relationships of adjacent characters or the like being considered according to necessity, to store the data in the proper noun user dictionary storage section 32.
Description will be given of operation of the machine translation system in accordance with the present embodiment. When the predetermined triggers such as reception of a conversion processing request from the user or a translation execution request for image data are activated, the target language audio synthesizing section 27 of the machine translation system reads the proper noun information stored in the target language proper noun storage section 31 to produce an audio output. The original language audio recognizing section 28 compares the audio output of the proper noun information with standard audio patterns in the original language to convert the information into text data, relationships of adjacent characters or the like being taken account of according to necessity, and stores the data in the proper noun user dictionary storage section 32.
As above, the present invention can configure, depending on a combination of the target language and the original language, the proper noun user dictionary data generating section 21 by using an existing audio synthesizing section and an existing audio recognizing section.
Incidentally, in the embodiment described above, exemplified is an example in which the proper noun user dictionary data generating section 21 of the first embodiment is replaced by a combination of the target language audio synthesizing section 27 and the original language audio recognizing section 28; however, it is also possible to replace the data generating section 21 of the second to fifth embodiments by the target language audio synthesizing section 27 and the audio recognizing section 28.
Next, description will be given of a seventh embodiment in which the present invention is applied to an automatic interpretation system. The embodiment is constructed as shown in
Moreover, in the present embodiment, a microphone is connected as the input device 1, first and second output devices 41 and 42 are arranged as the output devices, and a display and a speaker are respectively connected thereto.
Description will be given of operation of the automatic interpretation system in accordance with the present embodiment. Although the operation is substantially equal to that of the first embodiment, in place of the translation objective sentence input processing by the machine translation section 22 in each embodiment described above, there is executed processing in which the automatic interpretation section 22 conducts audio recognition for the audio from the microphone to detect and to translate a display item (corresponding character string) and then conducts audio synthesis in the target language to output by the speaker a result of the audio synthesis.
As above, the present invention is applicable not only to machine translation in which a character image is a primary object, but also to automatic interpretation in which voice and sound are primarily communicated. Furthermore, it is also possible to replace the machine translation section 22 of the embodiments 2 to 6 and the sixth embodiment with the automatic interpretation section 29.
Description has been given of the respective embodiments to which the present invention is applied as above; however, the machine translation/automatic interpretation system and the machine translation/automatic interpretation method in accordance with the present invention are usable, according to the principle and the effect thereof, not only for machine translation/automatic interpretation in a narrow sense, but also for various user interfaces including conversion processing similar to them. Therefore, the present invention is not limited to the machine translation system in a narrow sense and a program for the same, but also is applicable to various terminals, servers, and devices which can execute the machine translation method and the automatic interpretation method in accordance with the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2004-079080 | Mar 2004 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP05/03847 | 3/7/2005 | WO | 9/15/2006 |