1. Technical Field
The embodiments of the present disclosure relate to management technology, and particularly to a patent application file conversion system and method.
2. Description of Related Art
Simplified Chinese are based on traditional Chinese. Most words in simplified Chinese have the same meaning as the words in traditional Chinese. However, there are still some other differences between traditional Chinese and simplified Chinese. For example, the word “” in simplified Chinese corresponds to the word “
” in traditional Chinese, which both means claim in English. At present, translation between the simplified Chinese and the traditional Chinese is to translate every character one by one. However, the translated words may be not correct after the translation. For example, the word “
” in simplified Chinese is translated into the word it “
” in traditional Chinese, but the translated word “
” are not correct in traditional Chinese. Thus, a user may have to take a lot of time to check and recheck the translated words to make sure they make sense and are the correct translated words. For example, if the translated word is “
” in traditional Chinese, the user may manually correct the translated word from “
” to “
” in traditional Chinese. Therefore, there is room for improvement in the art.
The disclosure is illustrated by way of examples and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment in this disclosure are not necessarily to the same embodiment, and such references mean “at least one.”
In general, the word “module”, as used herein, refers to logic embodied in hardware or firmware, or to a collection of software instructions, written in a programming language, such as, JAVA, C, or assembly. One or more software instructions in the modules may be embedded in firmware, such as in an EPROM. The modules described herein may be implemented as either software and/or hardware modules and may be stored in any type of non-transitory computer-readable medium or other storage device. Some non-limiting examples of non-transitory computer-readable media include CDs, DVDs, BLU-RAY, flash memory, and hard disk drives.
The database 5 also stores a simplified Chinese word list 600 as shown in ” in simplified Chinese means “claim” in English, and “
” does not exist in simplified Chinese, thus “
” is the error word in simplified Chinese. The one or more error words and correct words in traditional Chinese word list 601 are traditional Chinese, and the one or more error words in traditional Chinese word list 600 are wrong in traditional Chinese, because the one or more error words in traditional Chinese does not obey syntax of traditional Chinese, for example, in traditional Chinese, the word “
” in traditional Chinese means “claim” in English, and “
” does not exist in traditional Chinese, thus “
” is the error word in traditional Chinese. Additionally, each error word corresponds to a correct word. For example, as shown in
” corresponds to the correct word “
” which means “claim” in English, and the error word “
” corresponds to the correct word “
” which means “database” in English. For example, as shown in
” corresponds to the correct word “
” which means “claim” in English, and the error word “
” corresponds to the correct word “
” which means “database” in English. The error words and the correct words are predetermined by a user, and the user can add or delete the error words and correct words in simplified Chinese word list 600 and the traditional Chinese word list 601.
The database 5 further stores a simplified Chinese file format 700 as shown in ></
>” as shown in
></
>” as shown in
The reading module 210 reads a patent application file from the database 5.
The insertion module 220 inserts content of the patent application file into a first file format. In one embodiment, the insertion module 220 determines if the patent application is written in simplified Chinese or in traditional Chinese. If the patent application is written in simplified Chinese, and the first file format is selected as the simplified Chinese file format 700, the content of the patent application file is inserted into the simplified Chinese file format 700. If the patent application is written in traditional Chinese, and the first file format is selected as the traditional Chinese file format 701, the content of the patent application file is inserted into the traditional Chinese file format 701.
The conversion module 230 converts the content of the patent application file in the first file format using a language converting application programming interface (API). The language converting API includes a traditional Chinese character set and a simplified Chinese character set, and is used to convert characters between the traditional Chinese and the simplified Chinese. For example, the traditional Chinese character “” is converted to traditional Chinese character “
” by the language converting API. In one embodiment, if the first file format is the simplified Chinese file format 700, the conversion module 230 converts the content of the patent application file in the first file format from the simplified Chinese to the traditional Chinese, the converted content is in traditional Chinese. If the first format is the traditional Chinese file format 701, the conversion module 230 converts the content of the patent application file in the first file format from the traditional Chinese to the simplified Chinese, the converted content is in simplified Chinese.
The insertion module 220 inserts the converted content from the first file format into a second file format. In one embodiment, the first file format and the second file format are different. For example, when the first file format is the simplified Chinese file format 700, the second file format is the traditional Chinese file format 701. When the first file format is the traditional Chinese file format 701, the second file format is the simplified Chinese file format 700.
The determination module 240 determines if the converted content in the second file format includes error words. In one embodiment, when the second file format is the simplified Chinese file format 700, the determination module 240 determines if one or more error words in simplified Chinese word list 600 exist in the second file format. When the second file format is the traditional Chinese file format 701, the determination module 240 determines if one or more error words in traditional Chinese word list 601 exist in the second file format.
The replacement module 250 replaces the error words with the correct words corresponding to the error words, in response to a determination that the second file format includes error words. In one embodiment, when the second file format includes error words in simplified Chinese word list 600, the replacement module 250 replaces the error words with the correct words in simplified Chinese word list 600. When the second file format includes error words in traditional Chinese word list 601, the replacement module 250 replaces the error words with the correct words in traditional Chinese word list 601.
In step S10, the reading module 210 reads a patent application file from a database 5. The patent application file may be written in simplified Chinese or be written in traditional Chinese.
In step S20, the insertion module 220 inserts content of the patent application file into a first file format. In one embodiment, if the patent application is written in simplified Chinese, and the first file format is the simplified Chinese file format 700, the content of the patent application file is inserted into the simplified Chinese file format 700. In one embodiment, if the patent application file is written in simplified Chinese, the patent application file is consisted with many parts consistent with the rules and regulations of the SIPO, such as an abstract, a technical filed, a summary, a description of related art, a brief description of the drawings, a detailed description, and claims, which is similar to a US patent application. Each part corresponds to two tags in simplified Chinese file format, and the insertion module 220 inserts the content of the patent application into the first file format according to the tags. For example, the abstract of the patent application file corresponds to the two tags “<></
>”, the abstract (e.g., XXXX) of the patent application file is inserted between the two tags “<
>XXXX</
>.”
If the patent application is written in traditional Chinese, and the first file format is the traditional Chinese file format 701, the content of the patent application file is inserted into the traditional Chinese file format 701. In one embodiment, if the patent application file is written in traditional Chinese, the patent application file is consisted with many parts consistent with the rules and regulations of the SIPO, such as an abstract, a technical filed, a Chinese summary, an English summary, a description of related art, a brief description of the drawings, a detailed description, and claims. Each part corresponds to two tags in traditional Chinese file format, and the insertion module 220 inserts the content of the patent application into the first file format according to the tags. For example, the abstract of the patent application file corresponds to the tags “<></
>”, the abstract (e.g., YYYY) of the patent application file is inserted between the two tags “<
>YYYY</
>.”
In step S30, the conversion module 230 converts the content of the patent application file in the first file format using a language converting API. In one embodiment, if the first format is the simplified Chinese file format 700, the conversion module 230 converts the content of the patent application file in the first file format from the simplified Chinese to the traditional Chinese, then the converted content is in traditional Chinese. If the first format is the traditional Chinese file format 701, the conversion module 230 converts the content of the patent application file in the first file format from the traditional Chinese to the simplified Chinese, the converted content is in simplified Chinese.
In step S40, the insertion module 220 inserts the converted content from the first file format into a second file format. In one embodiment, the first file format and the second file format are different. For example, when the first file format is the simplified Chinese file format 700, and the second file format is the traditional Chinese file format 701. When the first file format is the traditional Chinese file format 701, and the second file format is the simplified Chinese file format 700.
In step S50, the determination module 240 determines if the converted content in the second file format includes error words. In one embodiment, when the second file format is the simplified Chinese file format 700, if one or more error words in simplified Chinese word list 600 exist in the second file format, the procedure goes to step S60. Otherwise, if no error word in simplified Chinese word list 600 exists in the second file format, the procedure ends. When the second file format is the traditional Chinese file format 701, if one or more error words in traditional Chinese word list 601 exist in the second file format, the procedure goes to S60. Otherwise, if no error word in traditional Chinese word list 601 exists in the second file format, the procedure ends.
In step S60, the replacement module 250 replaces the error words with the correct words corresponding to the error words. In one embodiment, when the second file format includes the error word “,” the replacement module 250 replaces the error word with the correct words “
.” In one embodiment, when the second file format includes the error word “
,” the replacement module 250 replaces the error word with the correct word “
.”
Although certain inventive embodiments of the present disclosure have been specifically described, the present disclosure is not to be construed as being limited thereto. Various changes or modifications may be made to the present disclosure without departing from the scope and spirit of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
2012105642814 | Dec 2012 | CN | national |