This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2016-052394, filed Mar. 16, 2016, the entire contents of all of which are incorporated herein by reference.
Embodiments described herein relate generally to a display assist apparatus, method, and program.
As a scene to display a speech recognition result as a subtitle, for example, there is a scene to display a speech recognition result as a subtitle for the purpose of information assurance in a meeting for a participant who cannot follow a speech and has difficulty in understanding the contents. Additionally, as a scene to display a translation result as a subtitle, there is a scene to display a machine translation result or manual translation result between different languages as a subtitle in, for example, a conference system used in a conference with participants using different languages as mother tongues.
When displaying subtitles as described above, if the subtitles are sequentially switched along with the progress of contents, or an old subtitle already displayed is gradually pushed out of the screen by a subtitle to be newly displayed, a user can see the subtitle for only a limited time. Hence, there exists a technique of dividing the sentence of a speech recognition result or translation result to be displayed as a subtitle to make the contents of the subtitle clear or improve translation quality.
A display assist apparatus, method, and program according to embodiments will now be described with reference to the accompanying drawings. Note that in the following embodiments, portions denoted by the same reference numerals perform the same operations, and a repetitive description will appropriately be omitted.
In an actual operation, however, if the divided sentences of a speech recognition result or translation result are directly displayed, it is difficult to grasp the separation positions between the elements of a subtitle. Additionally, since the sentence structure cannot sufficiently be analyzed, the understanding of a user who refers to the contents of the subtitle cannot keep up with the display, and information transmission may be impeded.
The present embodiments have been made to solve the above-described problem, and have as its object to provide a display assist apparatus, method, and program capable of assisting understanding of contents.
In general, according to one embodiment, a display assist apparatus includes an acquisition unit, a first processor, a second processor, and a display controller. The acquisition unit acquires a character string. The first processor divides the character string into first segments each of which is a segment representing a semantic cluster and generates a plurality of divided character strings. The second processor detects, for the character string, second segments each of which is a segment larger than the each of the first segments. The display controller performs display control to make a distinction between the first segments and the second segments when displaying the plurality of divided character strings.
A display assist apparatus according to the first embodiment will be described with reference to the block diagram of
A display assist apparatus 100 according to the first embodiment includes an acquisition unit 101, a first processor 102, a second processor 103, and a display controller 104.
The acquisition unit 101 acquires an input character string based on input from a user. As the input from the user, various generally used methods such as keyboard input, handwritten character recognition, and input to a microphone for receiving speech are applicable.
If the input from the user is done by speech, the acquisition unit 101 acquires the character string of the speech recognition result of the speech as the input character string, and acquires pause information as well. The acquisition unit 101 acquires, as the pause information, a state in which a silent period in the speech continues for a predetermined time or more. If a setting is done to start speech input after a button is pressed at the time of microphone input, the acquisition unit 101 may acquire the pause information by detecting, for example, ON/OFF of the button.
If the input from the user is done by text input such as keyboard input or handwritten character recognition processing, the acquisition unit 101 acquires determination information together with the input character string. The acquisition unit 101 acquires the pressing of the enter key or the input of a full stop or a period as the determination information. If a screen display such as a determination button configured to determine the input is included in the user interface, the acquisition unit 101 may acquire a touch or click by a mouse or the like on the display as the determination information.
The first processor 102 receives the input character string and the pause information or determination information from the acquisition unit 101. The first processor 102 performs morphological analysis for the input character string, and divides the input character string that has undergone the morphological analysis into first language segments (to be also simply referred to as first segments hereinafter) that are language segments each representing a semantic cluster based on the pause information or determination information, thereby generating a plurality of divided character strings.
As the morphological analysis, any general morphological analysis technology such as CKY or longest match principle is usable. Note that if the input character string acquired by the acquisition unit 101 already has information concerning a morphological analysis result, the first processor 102 need not perform the morphological analysis.
The first processor 102 adds a role label to each divided character string. Examples of the role label are a label representing the type of a case when a phrase serving as a case element is used, a label representing a simple sentence, a label representing a phrase located at a sentence end when a verbal phrase with a tense at a sentence end is used, a label representing a context when a conjunction or adverb representing the structure or context of a sentence or an expression corresponding to these is used, a label representing a parallel element when a parallel element is used, and a label representing a pause when a pause is used as one first language segment.
The second processor 103 receives the plurality of divided character strings with role labels from the first processor 102. The second processor 103 detects second language segments (to be also simply referred to as second segments hereinafter) that are language segments larger than the first language segments from the divided character strings. The second processor 103 adds an ending label to a divided character string at the end of a second language segment. Examples of the ending label are a label representing the end of a clause (to be also referred to as a clause end) or the end of a sentence (to be also referred to as a sentence end) and a label representing a compound sentence. The second processor 103 detects the label of a first language segment added to an input divided character string and the arrangement of pauses, determines the clause end or sentence end, and adds an ending label to the corresponding divided character string.
The display controller 104 receives the plurality of divided character strings with role labels (and ending labels) from the second processor 103. When displaying the plurality of divided character strings, the display controller 104 performs display control to make a distinction between the first language segments and the second language segments based on the role labels and the ending labels. The display control can be any control to make the relationship between the plurality of divided character strings distinguishable and allow the user to easily understand it. For example, when displaying a plurality of divided character strings on a display or the like, to make a first language segment distinguishable, indent display is done, the font color is changed, a decoration such as underlining or italicizing is applied, a blank line or a separator line is inserted after a divided character string as the end of a second language segment is displayed, or a graphic such as a so-called balloon surrounding a displayed first language segment is drawn. Details of the display control will be described later.
Next,
In a table 200 shown in
The first language segment detection pattern 201 is a pattern used to divide an input character string into first language segments. Here, character string patterns each appearing at a sentence head when an input character string is divided into phrases and character string patterns each appearing at an end when an input character string is divided into phrases are shown. The role label 202 is a label representing the feature of the first language segment detection pattern 201. Here, the role label 202 is a label representing the type of a case. The first processor 102 determines whether a morpheme string that is a character string as the morphological analysis result of an input character string coincides with the first language segment detection pattern 201. Upon determining that the morpheme string coincides with the first language segment detection pattern 201, the first processor 102 generates a divided character string by setting a separation position at the end of the morpheme string, and adds the role label 202 corresponding to the coincident first language segment detection pattern 201 to the divided character string.
More specifically, for example, the first language segment detection pattern 201 “sentence head//” and the role label 202 “[sentence adverb]” are associated. Note that in
Note that if pause information or determination information exists immediately after a morpheme located at the end of a divided character string, the information may be added to the divided character string as a role label.
Next,
By the first processor 102, a processing result of divided character strings 301 to 310 as shown in
Next,
In a table 400 shown in
Here, the second language segment detection pattern 401 is a character string pattern that is, here, a language segment larger than a phrase and appears at a clause end or a sentence end in terms of grammar. The ending label 402 is a label representing a clause end or a sentence end. More specifically the second language segment detection pattern 401 “//” and the ending label 402 “<<clause end>>” are associated.
The second processor 103 determines whether a divided character string coincides with the second language segment detection pattern 401. Upon determining that the divided character string coincides with the second language segment detection pattern 401, the second processor 103 adds the ending label 402 corresponding to the coincident second language segment detection pattern 401 to the divided character string.
An example of the processing result of the second processor 103 will be described next with reference to
By the second processor 103, a processing result of divided character strings 501 to 510 as shown in
More specifically, when the table 400 is referred to, the divided character string 503 “ . . . ///” coincides with the second language segment detection pattern 401 “//”. Hence, the corresponding ending label 402 “<<clause end>>” is added to the divided character string 503. Similarly, when the table 400 is referred to, the divided character string 510 “///” coincides with the second language segment detection pattern 401 “///”. Hence, the corresponding ending label 402 “<<sentence end>>” is added to the divided character string 510.
The operation of the display controller 104 will be described next with reference to the flowchart of
In step S601, the display controller 104 sets the number of indents to 0.
In step S602, the display controller 104 displays a separator line at the start of display.
In step S603, the display controller 104 inserts the set number of indents and displays a divided character string. Note that in the processing of the first time, since the number of indents is 0, the display controller 104 displays the divided character string from the beginning of the line.
In step S604, the display controller 104 determines whether the divided character string displayed in step S603 is a sentence end, that is, whether a sentence end label is added to the divided character string. If a sentence end label is added, the process advances to step S608. If no sentence end label is added, the process advances to step S605.
In step S605, the display controller 104 determines whether the divided character string displayed in step S603 is a clause end, that is, whether a clause end label is added. If a clause end label is added, the process advances to step S609. If no clause end label is added, the process advances to step S606.
In step S606, the display controller 104 increments the set number of indents by one.
In step S607, the display controller 104 determines whether a next divided character string exists. If a next divided character string exists, the process returns to step S603 to repeat the same processing as described above. If a next divided character string does not exist, the process advances to step S610.
In step S608, since the end of the sentence can be known by the sentence end label, the display controller 104 displays a separator line and sets the number of indents to 0. After that, the process returns to step S603 to repeat the same processing as described above.
In step S609, since the end of the clause can be known by the clause end label, the display controller 104 displays a blank line and sets the number of indents to 0. After that, the process returns to step S603 to repeat the same processing as described above.
In step S610, the display controller 104 displays a separator line. The operation of the display controller 104 thus ends.
A detailed example of display control by the display controller 104 according to the first embodiment will be described with reference to
Here, an example in which display control is performed for the processing result of the second processor 103 shown in
For the divided character string 502, the number of indents is 1. Hence, the divided character string 502 is displayed from a display start position that is moved rightward by one interval (to be referred to as one indent) defined as an indent (display 702). Note that since
Since the divided character string 502 has neither a sentence end label nor a clause end label, like the divided character string 501, the number of indents is incremented by one and changed to 2.
For the next divided character string 503, the number of indents is 2. Hence, the divided character string 503 is displayed from a position moved rightward by two indents (display 703).
Since a clause end label is added to the divided character string 503, a blank line 704 is displayed after the display 703, and the number of indents is reset to 0 by the process of step S609.
If the same processing as described above is performed for the divided character strings 504 to 509, displays 705 to 708, a blank line 709, and displays 710 to 712 are displayed.
Note that when processing the last divided character string 510, since the divided character string 510 has a sentence end label, the display 712 is displayed, and then, a separator line 713 at the end is displayed by the process of step S610.
Note that after display of a divided character string with a clause end label, not a blank line but a separator line distinguishable from those at the start and end of the sentence may be displayed. That is, any display format capable of making a distinction between a clause end and a sentence end is usable.
Other examples of the display control of the display controller 104 will be described next with reference to
Here, assume that the divided character strings 501 to 510 shown in
In a case in which a character string up to one clause end is long, if the processing is performed according to the flowchart of
Alternatively, as shown in
According to the above-described first embodiment, when displaying divided character strings, display control is done to make a distinction between first language segments and second language segments, thereby performing display such that the difference between segments such as phrase segments, clause segments, and sentence segments can be seen. This can increase the visibility of a character string displayed as a subtitle or a telop and assist the user in understanding the contents.
A display assist apparatus according to the second embodiment will be described with reference to the block diagram of
A display assist apparatus 900 according to the second embodiment includes an acquisition unit 101, a first processor 102, a second processor 103, a display controller 104, and an expression convertor 901.
The operations of the acquisition unit 101, the first processor 102, and the second processor 103 are the same as in the first embodiment, and a description thereof will be omitted here.
The expression convertor 901 receives divided character strings with role labels (and ending labels) from the second processor 103, and converts the expression of a divided character string corresponding to a conversion rule to another expression. Note that the expression convertor 901 may receive not divided character strings processed by the second processor 103 but a character string that has undergone morphological analysis from the first processor 102, and convert, based on the conversion rule, the expression of the input character string that has undergone the morphological analysis. At this time, the first processor 102 sends the converted input character string to the second processor 103.
The display controller 104 performs display control for the converted divided character string.
An example of a conversion pattern that the expression convertor 901 refers to will be described next with reference to
In a table 1000 shown in
More specifically, the conversion target pattern 1001 “” and the conversion pattern 1002 “(blank)” are associated. That is, if a filler “” exists, “” is deleted.
Note that the example of
For example, as for a divided character string 1101, fillers are removed from a divided character string 501 “///” in the example of
As for a divide character string 1107, a divided character string 507 “//2/////////” with a clause end label in the example of
According to the above-described second embodiment, the expression convertor converts the expression of a divided character string to perform sentence arrangement processing, thereby displaying a subtitle more easily readable by the user and assisting the user in understanding the contents. If an expression is converted into another expression such as a dialect, a wider range of variations in subtitle display can be implemented.
A display assist apparatus according to the third embodiment will be described with reference to the block diagram of
A display assist apparatus 1300 according to the third embodiment includes an acquisition unit 101, a first processor 102, a second processor 103, a display controller 104, an expression convertor 901, and a machine translator 1301.
The operations of the acquisition unit 101, the first processor 102, the second processor 103, and the expression convertor 901 are the same as in the second embodiment, and a description thereof will be omitted here.
The machine translator 1301 receives a plurality of divided character strings converted as needed from the expression convertor 901, and machine-translates the plurality of divided character strings from a first language to a second language. As the method of machine translation, any translation engine used in general such as a rule based machine translation engine, an example based machine translation engine, or a statistic machine translation engine can be used.
The display controller 104 performs display control for the plurality of machine-translated divided character strings.
An example of the processing result of the machine translator 1301 will be described next with reference to
A detailed example of display control by the display controller 104 according to the third embodiment will be described with reference to
Note that when translating the first language to the second language, it may be preferable to change the length of a divided character string as a translation segment depending on the type of the second language. For example, in the Japanese-English translation and in a case in which Japanese is translated to Chinese (to be referred to as Japanese-Chinese translation), the translation segment is set according to each language. Hence, it is preferable to change the positions (divided character string separation positions) to separate an input character string into divided character strings.
A first example in which the divided character string separation positions of the first language change depending on the difference in the grammar according to the type of the second language will be described with reference to
In the Japanese-English translation shown in
Conversely, in the Japanese-English translation shown in
As for the determination the translation segment as described above, the machine translator 1301 receives information (to be also referred to as target language information) about the type of the second language, and connects or divides divided character strings in the second language segment based on the preset translation segment rule of the language. The target language information may be acquired based on a user designation. If the second language is determined in advance, the first processor 102 may refer to the translation segment rule and generate divided character strings in the process of generating divided character strings.
A second example in which the divided character string separation positions of the first language change depending on the difference in the grammar of the second language will be described next with reference to
Generally, in the Japanese-Chinese translation, Chinese corresponding to Japanese contents can be displayed as a subtitle because the number of characters is often smaller in Chinese than in Japanese. On the other hand, in the Japanese-English translation, the number of characters in English may be larger than in Japanese. Hence, translation of inessential contents may be inhibited to prevent characters from extending off the display space for subtitles.
In the Japanese-Chinese translation shown in
The machine translator 1301 defines a keyword not to be translated to a translated sentence in advance as a translation segment rule and performs the translation processing as shown in
Other examples of the processing of the machine translator 1301 will be described with reference to
Depending on the type of the second language, a natural sentence can sometimes be obtained as a translation corresponding to one divided character string by reflecting the translation result on the translated sentences of two divided character strings. For example as shown in
On the other hand, as shown in
The machine translator 1301 defines a keyword that should appear in the translated sentences of two divided character strings in advance as a translation segment rule and performs the translation processing as shown in
According to the above-described third embodiment, contents translated from the first language to the second language are displayed, thereby allowing even a user who can understand the second language to more easily grasp the contents and assisting the user in understanding the contents.
A display assist apparatus according to the fourth embodiment will be described with reference to the block diagram of
A display assist apparatus 1900 according to the fourth embodiment includes an acquisition unit 101, a first processor 102, a second processor 103, a display controller 104, an expression convertor 901, a machine translator 1301, and a word order determiner 1901.
The operations of the acquisition unit 101, the first processor 102, the second processor 103, the expression convertor 901, and the machine translator 1301 are the same as in the third embodiment, and a description thereof will be omitted here.
The word order determiner 1901 receives divided character strings that have undergone translation processing from the machine translator 1301 and determines the display order of the plurality of divided character strings based on the word order determination rule of the second language. That is, the word order determiner 1901 rearranges the plurality of divided character strings to obtain a natural order according to the grammatical order of the second language. In addition, the word order determiner 1901 re-adds ending labels again as needed. Note that the word order determiner 1901 may rearrange the plurality of divided character strings in the stage of the first language if the order is unnatural because of inversion or the like.
The display controller 104 performs display control for the rearranged divided character strings.
A first example of the word order determination rule that the word order determiner 1901 refers to will be described next with reference to
In a word order determination rule table 2000 shown in
More specifically, for example, the word order pattern 2001 of Japanese “[sentence adverb]→[object]→[predicate]” and the word order pattern 2002 of English “[sentence adverb]→[predicate]→[object]” are associated with each other.
An example of the processing result of the word order determiner 1901 will be described next with reference to
In the above-described example of
Based on the word order determination rule, the word order determiner 1901 rearranges the divided character strings 1401 to 1403 in an order of [sentence adverb]→[predicate]→[object]. In addition, the word order determiner 1901 re-adds the ending label to the last divided character string in the second language segment. As a result, divided character strings 2101 to 2103 are arranged in an order of “first [sentence adverb]”→“we will introduce [predicate]”→“about machine translation [object]<<clause end>>”. Note that the re-addition of the ending label may be done by the second processor 103.
A detailed example of display control by the display controller 104 according to the fourth embodiment will be described next with reference to
As shown in
According to the above-described fourth embodiment, the word order determiner 1901 rearranges the plurality of divided character strings to obtain a natural order according to the grammatical order of the second language, thereby displaying a more natural subtitle for a user who uses the second language and assisting the user in understanding the contents.
In the fifth embodiment, assume a case in which a plurality of speakers use the first language.
A display assist apparatus according to the fifth embodiment is implemented using a display assist apparatus mentioned in any one of the above-described embodiments.
In addition to an input character string, an acquisition unit 101 according to the fifth embodiment acquires speaker information that is unique to each speaker and is used to identify a speaker who inputs (utters) the input character string. As a method of acquiring speaker information, for example, the speaker information may be acquired by preparing a microphone connected to the acquisition unit 101 for each speaker. Alternatively, the speaker information may be acquired by identifying a speaker using a general speaker identifying technology using beam forming or a speech feature amount.
A first processor 102 receives the input character string and the speaker information from the acquisition unit 101, and adds, to each of a plurality of divided character strings obtained in the same way as in the above-described embodiments, a speaker label used to classify a divided character string for each speaker based on the speaker information.
A display controller 104 receives the plurality of divided character strings with speaker labels from a second processor 103. When displaying the plurality of divided character strings, the display controller 104 performs display control to make a distinction between first language segments and second language segments while making a distinction between speakers based on the speaker labels.
As shown in
As the speaker label determination method, for example, if user identification information (an IP address or user identification information including a user ID) can be obtained in advance based on the speaker information acquired by the acquisition unit 101, the speaker may be identified in accordance with the identification information. Alternatively, labels such as speaker A and speaker B that enable to make a distinction between different pieces of speaker information may be added.
The operation of the display controller 104 according to the fifth embodiment will be described next with reference to the flowchart of
The processes of steps S601, S603 to S607, and S609 are the same as in the above-described embodiment, and a description thereof will be omitted.
In step S2401, the display controller 104 prepares a new balloon.
In step S2402, since a sentence end label is added to a divided character string, the display controller 104 ends the current balloon and sets the number of indents to 0. The process returns to step S2401 to repeat the same processing as described above. Accordingly, one sentence is expressed by one balloon.
In step S2403, since the processing has ended for all divided character strings, the balloon is ended. The process returns to step S2401 to repeat the same processing as described above.
Note that processing of making the distinction between speakers clearer may be performed by, for example, changing the outline or outline color of a balloon for each speaker. In the fifth embodiment, a balloon is assumed to be used to distinguish a speaker. However, any display is usable as long as each speaker can be distinguished.
The operation of the display controller 104 according to the first modification of the fifth embodiment will be described with reference to the flowchart of
The processes of steps S604, S605, S607, S2401, and S2403 are the same as in the above-described embodiment, and a description thereof will be omitted.
In step S2601, the display controller 104 displays the contents of a divided character string in a box that is smaller than a balloon and fits in the balloon.
In step S2602, the display controller 104 ends the current balloon and returns to step S2401 to repeat the same processing as described above.
In step S2603, the display controller 104 displays a blank line and returns to step S2601 to repeat the same processing as described above.
As shown in
The second modification of the fifth embodiment will be described with reference to the flowchart of
The processes of steps S603 to S605, S607, S2401, S2403, and S2603 are the same as in the above-described embodiment, and a description thereof will be omitted.
In step S2801, if a clause end label or a sentence end label is not added, the display controller 104 displays a full stop at the end of the divided character string.
In step S2802, the display controller 104 displays a comma at the end of the divided character string, and ends the current balloon. After that, the process returns to step S2401 to repeat the same processing as described above.
In
According to the above-described fifth embodiment, for inputs from a plurality of users, the utterances are separated for each speaker using balloons. In addition, display control as in the above-described embodiment is performed for the divided character string in each balloon, thereby facilitating distinction between the speakers and assisting the user in understanding the contents.
An instruction shown in the processing procedures of the above-described embodiments can be executed based on a program that is software. When a general-purpose computer system stores the program in advance and loads it, the same effects as those of the above-described display assist apparatuses can be obtained. Each instruction described in the above embodiments is recorded in a magnetic disk (for example, flexible disk or hard disk), an optical disk (for example, CD-ROM, CD-R, CD-RW, DVD-ROM, DVD±R, DVD±RW, or Blu-ray® Disc), a semiconductor memory, or a similar recording medium as a program executable by a computer. Any storage format is employable as long as the recording medium is readable by a computer or an embedded system. When the computer loads the program from the recording medium, and causes the CPU to execute the instruction described in the program based on the program, the same operation as the display assist apparatuses according to the above-described embodiments can be implemented. When the computer acquires or loads the program, it may be acquired or loaded via a network, as a matter of course.
An OS (Operating System) operating on the computer or MW (middleware) such as database management software or a network may execute part of each processing for implementing the embodiments based on the instruction of the program installed from the recording medium to the computer or embedded system.
The recording medium according to the embodiments is not limited to a medium independent of the computer or embedded system, and also includes a recording medium that stores or temporarily stores the program transmitted by a LAN or the Internet and downloaded.
The number of recording media is not limited to one. The recording medium according to the embodiments also incorporates a case in which the processing of the embodiments is executed from a plurality of media, and the media can have any arrangement.
Note that the computer or embedded system according to the embodiments is configured to execute each processing of the embodiments based on the program stored in the recording medium, and can be either a single device formed from a personal computer or microcomputer or a system including a plurality of devices connected via a network.
The computer according to the embodiments is not limited to a personal computer, and also includes an arithmetic processing device or microcomputer included in an information processing apparatus. Computer is a general term for apparatuses and devices capable of implementing the functions of the embodiments by the program.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Number | Date | Country | Kind |
---|---|---|---|
2016-052394 | Mar 2016 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
6272461 | Meredith et al. | Aug 2001 | B1 |
6279018 | Kudrolli et al. | Aug 2001 | B1 |
6625508 | Watanabe | Sep 2003 | B1 |
6625608 | Watanabe | Sep 2003 | B1 |
6968506 | Yacavone et al. | Nov 2005 | B2 |
7006967 | Kahn | Feb 2006 | B1 |
7739116 | Miyamoto et al. | Jun 2010 | B2 |
8090570 | Waibel et al. | Jan 2012 | B2 |
8433580 | Sugiyama et al. | Apr 2013 | B2 |
8918311 | Johnson et al. | Dec 2014 | B1 |
9116989 | Ehlen et al. | Aug 2015 | B1 |
9460713 | Moreno Mengibar et al. | Oct 2016 | B1 |
20020161579 | Saindon et al. | Oct 2002 | A1 |
20030152293 | Bresler | Aug 2003 | A1 |
20050080631 | Abe et al. | Apr 2005 | A1 |
20070185704 | Yoshimura et al. | Aug 2007 | A1 |
20080046229 | Maskey et al. | Feb 2008 | A1 |
20080077390 | Nagao | Mar 2008 | A1 |
20080091407 | Furihata et al. | Apr 2008 | A1 |
20080300852 | Johnson et al. | Dec 2008 | A1 |
20080300872 | Basu et al. | Dec 2008 | A1 |
20090076793 | Hoefelmeyer | Mar 2009 | A1 |
20100324894 | Potkonjak | Dec 2010 | A1 |
20110202334 | Abir | Aug 2011 | A1 |
20110213607 | Onishi | Sep 2011 | A1 |
20110231474 | Locker et al. | Sep 2011 | A1 |
20130144597 | Waibel | Jun 2013 | A1 |
20130211818 | Sakamoto et al. | Aug 2013 | A1 |
20130262076 | Kamatani et al. | Oct 2013 | A1 |
20140095151 | Sakamoto et al. | Apr 2014 | A1 |
20140201637 | Na et al. | Jul 2014 | A1 |
20140244235 | Michaelis | Aug 2014 | A1 |
20140297276 | Tachimori | Oct 2014 | A1 |
20150081271 | Sumita et al. | Mar 2015 | A1 |
20150081272 | Kamatani et al. | Mar 2015 | A1 |
20150154183 | Kristjansson et al. | Jun 2015 | A1 |
20150271442 | Cronin et al. | Sep 2015 | A1 |
20150309994 | Liu | Oct 2015 | A1 |
20160078020 | Sumita et al. | Mar 2016 | A1 |
20160085747 | Kamatani et al. | Mar 2016 | A1 |
20160092438 | Sonoo | Mar 2016 | A1 |
20160170970 | Lindblom et al. | Jun 2016 | A1 |
20160275967 | Sumita et al. | Sep 2016 | A1 |
20160314116 | Kamatani et al. | Oct 2016 | A1 |
20170053541 | Tsyrina | Feb 2017 | A1 |
Number | Date | Country |
---|---|---|
H05-151256 | Jun 1993 | JP |
H05-176232 | Jul 1993 | JP |
H05-189480 | Jul 1993 | JP |
H06-141240 | May 1994 | JP |
H08-212228 | Aug 1996 | JP |
H08-263499 | Oct 1996 | JP |
H10-234016 | Sep 1998 | JP |
H10-247194 | Sep 1998 | JP |
3009642 | Feb 2000 | JP |
3059398 | Jul 2000 | JP |
2001-027995 | Jan 2001 | JP |
2001-075957 | Mar 2001 | JP |
2001-175280 | Jun 2001 | JP |
2001-224002 | Aug 2001 | JP |
2002-010222 | Jan 2002 | JP |
2002-342311 | Nov 2002 | JP |
2005-064600 | Mar 2005 | JP |
2007-018098 | Jan 2007 | JP |
2007-034430 | Feb 2007 | JP |
2008-083376 | Apr 2008 | JP |
2010-044171 | Feb 2010 | JP |
2011-182125 | Sep 2011 | JP |
2012-181358 | Sep 2012 | JP |
2012-203154 | Oct 2012 | JP |
2013-164515 | Aug 2013 | JP |
2013-206253 | Oct 2013 | JP |
2014-071769 | Apr 2014 | JP |
2015-060127 | Mar 2015 | JP |
2015-072701 | Apr 2015 | JP |
2015-187738 | Oct 2015 | JP |
2015-201215 | Nov 2015 | JP |
2016-057986 | Apr 2016 | JP |
2016-062357 | Apr 2016 | JP |
2016-177013 | Oct 2016 | JP |
2016-186646 | Oct 2016 | JP |
2016-206929 | Dec 2016 | JP |
Entry |
---|
Finch, et al., “An exploration of segmentation strategies in stream decoding.” Proc. IWSLT. 2014. |
Kolss, et al., “Simultaneous German-English lecture translation.” IWSLT. 2008. |
Kolss, et al., “Stream decoding for simultaneous spoken language translation.” Interspeech. 2008. |
Oda, et al., “Optimizing Segmentation Strategies for Simultaneous Speech Translation.” ACL (2). 2014. |
Sridhar, et al., “Corpus analysis of simultaneous interpretation data for improving real time speech translation.” Interspeech. 2013. |
Sridhar, et al., “Segmentation Strategies for Streaming Speech Translation.” HLT-NAACL. 2013. |
Zheng, et al., “Implementing SRI's Pashto speech-to-speech translation system on a smart phone.” Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE, 2010. |
Number | Date | Country | |
---|---|---|---|
20170270080 A1 | Sep 2017 | US |