Machine translation has entered the marketplace as a beneficial technology for every day tasks and for business purposes. For instance, commercial applications are used on the Internet for the automatic translation of Web pages and news articles. In addition, companies offer software that automatically translates Web sites for localization purposes. Other commercial software provides translations of business documents, such as memoranda and e-mails.
Microsoft's Writing Wizard™ is an exemplary educational tool that uses machine translation to assist with business writing. The Writing Wizard™ suggests idiomatic phrases to English language learners based on submitted foreign language phrases and/or word collocation. In addition, software products incorporating machine translation are beginning to be used and/or developed by large public school districts, such as the school districts in Los Angeles, New York and Philadelphia, to provide translations of school-related documents to parents who do not speak English. However, no currently available software program provides a user with the ability to automatically produce a plurality of translation exercises that permit the evaluation of language learners.
What are needed are a method and system for automatically generating a plurality of translation exercises that language learners having a particular primary language have difficulty translating.
A need exists for providing such a method and system in a learning environment and/or to an instructor.
A need exists for a method of reducing the time required for an instructor to generate assessment items for a language assessment.
A further need exists for a method of generating an increased number of assessment items directed to a particular translation difficulty for language learners having a particular primary dialect.
The present disclosure is directed to solving one or more of the above-listed problems.
Before the present methods, systems and materials are described, it is to be understood that this invention is not limited to the particular methodologies, systems and materials described, as these may vary. It is also to be understood that the terminology used in the description is for the purpose of describing the particular versions or embodiments only, and is not intended to limit the scope of the invention which will be limited only by the appended claims.
It must also be noted that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise. Thus, for example, reference to a “sentence” is a reference to one or more sentences and equivalents thereof known to those skilled in the art, and so forth. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art. Although any methods, materials, and devices similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, the preferred methods, materials, and devices are now described. All publications mentioned herein are incorporated by reference. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
In an embodiment, a method of developing a translation exercise may include receiving a grammatical structure, for each of one or more text segments in a first language, translating the text segment in the first language into a corresponding text segment in a second language, and selecting a selected text segment as a prompt for a translation exercise. The selected text segment may include one or more of a text segment in the second language having the grammatical structure, and a text segment in the first language corresponding to a text segment in the second language having the grammatical structure.
In an embodiment, a processor-readable storage medium may contain one or more programming instructions for performing a method of developing a translation exercise. The method of developing a translation exercise may include receiving a grammatical structure, for each of one or more text segments in a first language, translating the text segment in the first language into a corresponding text segment in a second language, and selecting a selected text
segment as a prompt for a translation exercise. The selected text segment may include one or more of a text segment in the second language having the grammatical structure, and a text segment in the first language corresponding to a text segment in the second language having the grammatical structure.
Aspects, features, benefits and advantages of the embodiments of the present invention will be apparent with regard to the following description, appended claims and accompanying drawings where:
The automatic generation of assessment items may enable teachers of language learners to increase the number of items they can create for assignments, such as practice exercises and assessments, over the number of items they may otherwise be able to generate manually. In an embodiment, such translation exercises may be created to provide language-learning students with structures that are typically difficult to decipher. The difficulty of such structures may be dependent upon the primary language of the students and the language being learned.
The texts may be input 115 into a tokenizer that identifies the words in each text that correspond to a text segment. In an embodiment, a text segment may correspond to a sentence within a text. In an alternate embodiment, each text may represent a text segment. Other and/or additional definitions of a text segment may be used within the scope of this disclosure. A machine translation system may then translate 120 the text segments into the non-native language. In an embodiment, a translation may be performed for each text segment.
Each word in the text-segment-based non-native language translations may be identified and tagged 125 as being a particular syntactic part-of-speech, such as a noun, a verb and the like. In an embodiment, the output of the part-of-speech tagger may have the following form for each processed word: $word_$TAG, where $word is the translated word and $TAG is a part-of-speech tag. Exemplary part-of-speech tags may include NNP for a proper noun and VBZ for a singular verb. An exemplary text segment as parsed by the part-of-speech tagger may appear as follows (if the translation is performed into English):
A regular expression program may be used to capture 130 syntactic patterns that relate to each grammatical structure selectable by the user. For example, in the above structure, the regular expression program may be directed to select a text segment that includes the consecutive string “$word_NNP+$word_VBZ.” The translation of a text segment including the appropriate regular expression may then be captured. Other programs or methods of identifying syntactic patterns that correspond to a grammatical structure may also be used within the scope of this disclosure.
The original, non-translated native language text segment may be returned 135 as a candidate text segment for the translation exercise. In this manner, native language text may be selected as a candidate text segment if the translation of the text into the non-native language includes the requested structure. A user may use one or more of the candidate text segments to create translation exercises from the language learner's native language into the non-native language.
In an alternate embodiment, the translation of a text segment that is captured by, for example, the regular expression program may be used as a candidate text segment for a translation exercise. In this embodiment, a user may use one or more of the candidate text segments to create translation exercises from a non-native language into the language learner's native language.
In an embodiment, a translation exercise may include a plurality of sentences, one or more paragraphs and/or an entire text. Each translation exercise may include one or more structures that pose difficulty for a group of non-native language speakers.
A disk controller 204 interfaces with one or more optional disk drives to the system bus 228. These disk drives may be external or internal floppy disk drives such as 210, CD ROM drives 206, or external or internal hard drives 208. As indicated previously, these various disk drives and disk controllers are optional devices.
Program instructions may be stored in the ROM 218 and/or the RAM 220. Optionally, program instructions may be stored on a computer readable medium such as a floppy disk or a digital disk or other recording medium, a communications signal or a carrier wave.
An optional display interface 222 may permit information from the bus 228 to be displayed on the display 224 in audio, graphic or alphanumeric format. Communication with external devices may optionally occur using various communication ports 226. An exemplary communication port 226 may be attached to a communications network, such as the Internet or an intranet.
In addition to the standard computer-type components, the hardware may also include an interface 212 which allows for receipt of data from input devices such as a keyboard 214 or other input device 216 such as a remote control, pointer and/or joystick.
An embedded system may optionally be used to perform one, some or all of the described operations. Likewise, a multiprocessor system may optionally be used to perform one, some or all of the described operations.
It will be appreciated that various of the above-disclosed and other features and functions, or alternatives thereof, may be desirably combined into many other different systems or applications. It will also be appreciated that various presently unforeseen or unanticipated alternatives, modifications, variations or improvements therein may be subsequently made by those skilled in the art which are also intended to be encompassed by the disclosed embodiments.
This Application claims priority to and incorporates herein by Reference to U.S. Provisional Application No. 60/655,125 titled Method and System for Automated Item Development for Language Learners, filed on Feb. 22, 2005 with the United States Patent and Trademark Office.
Number | Name | Date | Kind |
---|---|---|---|
3382588 | Serrell et al. | May 1968 | A |
3501851 | Price, Jr. et al. | Mar 1970 | A |
3761877 | Fernald | Sep 1973 | A |
4067122 | Fernandez et al. | Jan 1978 | A |
4360345 | Hon | Nov 1982 | A |
4475239 | VanRaamsdonk | Oct 1984 | A |
4518361 | Conway | May 1985 | A |
4547161 | Manning | Oct 1985 | A |
4593904 | Graves | Jun 1986 | A |
4686522 | Hernandez et al. | Aug 1987 | A |
4813013 | Dunn | Mar 1989 | A |
4931018 | Herbst et al. | Jun 1990 | A |
4947322 | Tenma et al. | Aug 1990 | A |
5002491 | Abrahamson et al. | Mar 1991 | A |
5010495 | Willetts | Apr 1991 | A |
5011413 | Ferris et al. | Apr 1991 | A |
5040131 | Torres | Aug 1991 | A |
5100329 | Deesen et al. | Mar 1992 | A |
5211564 | Martinez et al. | May 1993 | A |
5503560 | Stentiford | Apr 1996 | A |
6119077 | Shinozaki | Sep 2000 | A |
6431875 | Elliott et al. | Aug 2002 | B1 |
6711543 | Cameron | Mar 2004 | B2 |
6865532 | Anderson | Mar 2005 | B2 |
6988063 | Tokuda | Jan 2006 | B2 |
7085707 | Milner | Aug 2006 | B2 |
20020018984 | Ho et al. | Feb 2002 | A1 |
20020031751 | Wen et al. | Mar 2002 | A1 |
20020115048 | Meimer | Aug 2002 | A1 |
20030023423 | Yamada et al. | Jan 2003 | A1 |
20030154067 | Wen et al. | Aug 2003 | A1 |
20030208353 | Ueda | Nov 2003 | A1 |
Number | Date | Country | |
---|---|---|---|
20060190242 A1 | Aug 2006 | US |
Number | Date | Country | |
---|---|---|---|
60655125 | Feb 2005 | US |