1. Technical Field
Embodiments of the present disclosure relate generally to document analysis, and more particularly to a system and method for analyzing official notices of an electronically filed patent application.
2. Description of Related Art
In recent years, numerous patent offices, such as the United States Patent and Trademark Office (USPTO), European Patent Office (EPO), State Intellectual Property Office of People's Republic of China (SIPO), and Japanese Patent Office (JPO), have popularized electronic filing of patent applications. The patent office may issue one or more filing receipts when a patent application is filed electronically, or send one or more official notices during the examination process of the patent application. Often, the official notices must be processed manually, which can include viewing the notices, acquiring information therefrom, such as an application number, filing date, and fee payment, and updating file information of the patent application correspondingly. Such conventional processing is redundant and inefficient, and errors are difficult to avoid.
What is needed, therefore, is a system and method which can analyze official notices of an electronically filed patent application automatically, so as to increase efficiency and accuracy.
The disclosure is illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment in this disclosure are not necessarily to the same embodiment, and such references mean at least one.
The database 3 stores basic information of patent applications and official notices issued from the patent office websites 2, and processes the basic information. Processing the basic information includes reading, writing, deleting, modifying, updating, and backing up the basic information, for example. In one embodiment, the basic information may include an application number, a filing date, a title of the invention, status of fee payment, and the name of the examiner, for example. Each of the client computers 4 is operable to receive one or more official notices of an electronically filed patent application from one patent office website 2 during the examination process of the patent application, and display analysis results generated by the official notice analysis system 10.
In one embodiment, the official notice analysis system 10 includes a download module 101, a file conversion module 102, a character identification module 103, a fault tolerance module 104, and a data analysis module 105. One or more computerized codes of the function modules 101-105 may be stored in the storage system 12 and executed by the at least one processor 11. In general, the word “module,” as used herein, refers to logic embodied in hardware or firmware, or to a collection of software instructions, written in a programming language, such as, for example, Java, C, or assembly. One or more software instructions in the modules may be embedded in firmware, such as an EPROM. The modules described herein may be implemented as software and/or hardware modules and may be stored in any type of computer-readable medium or other storage system.
The download module 101 is operable to download an official notice of the electronically filed patent application from one of the patent office websites 2 according to a download command from one of the client computers 4. In addition, the download module 101 is operable to identify a document type of the official notice according to an official patent code predefined by a patent office. In one embodiment, the official patent code may be “APP.FILE.REC” or “CNTF”. If the official patent code is “APP.FILE.REC”, the download module 101 determines that the official notice is the filing receipt of the electronically filed patent application. If the official patent code is “CNTF”, the download module 101 determines that the official notice is the office action for the electronically filed patent application.
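By way of illustration only, the document type identification may be sketched as a lookup from official patent code to document type. The two codes are those given above; the function name, the label strings, and the handling of unrecognized codes are assumptions made for this sketch and do not limit the disclosure.

```python
from typing import Optional

# The two codes come from the description; the label strings are illustrative.
DOCUMENT_TYPES = {
    "APP.FILE.REC": "filing receipt",
    "CNTF": "office action",
}


def identify_document_type(official_code: str) -> Optional[str]:
    """Return the document type for an official patent code, or None if unknown."""
    # Normalizing whitespace and case is an assumption, not stated in the text.
    return DOCUMENT_TYPES.get(official_code.strip().upper())
```

Returning None for an unrecognized code leaves it to the caller to decide whether such a notice is skipped or routed for manual review.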
The file conversion module 102 is operable to convert the official notice to an image format document using a document conversion function. In one embodiment, the image format document may be a JPG format document, or a TIF format document. The document conversion function may be an application programming interface (API) provided by the Microsoft Windows operating system.
The character identification module 103 is operable to identify character data from the image format document using a character identification function. In one embodiment, the character data may include information regarding the application number, the filing date, the title of the invention, the inventor, the applicant, and the examiner, for example. The character identification function may be executed by a WINDOWS OCR function, for example.
The fault tolerance module 104 is operable to determine whether the identified character data is erroneous, and to correct the character data being determined to be erroneous. For example, when converting the official notice to the image format document, a number “1” may be read as an erroneous character “I”, and a number “0” may be read as an erroneous character “O.”
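A minimal sketch of such a correction follows, under the assumption that the correction is applied only to fields expected to be numeric (for example, a US-style application number). The substitution table contains only the two confusions named above; the function names and the set of permitted separators are hypothetical.

```python
# Look-alike letters named in the description, mapped to the digits they replace.
OCR_DIGIT_FIXES = str.maketrans({"I": "1", "O": "0"})


def looks_erroneous(value: str, separators: str = "/,.- ") -> bool:
    """Flag a numeric field that contains anything other than digits and separators."""
    return any(not (ch.isdigit() or ch in separators) for ch in value)


def correct_numeric_field(value: str) -> str:
    """Replace look-alike letters with digits in a field expected to be numeric."""
    return value.translate(OCR_DIGIT_FIXES)
```

For example, a field read as “I2/345,O67” would be flagged by looks_erroneous and corrected to “12/345,067”. Applying the same substitution to free-text fields such as the title would corrupt legitimate letters, which is why the sketch restricts it to numeric fields.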
The data analysis module 105 is operable to extract patent information of the electronically filed patent application from the character data using a regular expression. In one embodiment, the regular expression is an expression that describes a set of character data, and is used to extract the patent information from the character data. For example, a set of three character data “Hide,” “Hine,” and “Hinge” may be described by the regular expression “Hi(d|ng?)e.” In the regular expression, the vertical bar “|” is used to separate alternatives. The question mark “?” indicates that the preceding element (here, “g”) occurs zero times or one time in the character data.
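The following sketch first checks the example expression against the three words, and then shows how expressions with capture groups might extract fields from recognized text. The field labels (“Application Number:”, “Filing Date:”) and the number and date formats are assumptions for illustration; actual notices may use different wording and layouts.

```python
import re

# The example expression from the description.
WORD_PATTERN = re.compile(r"Hi(d|ng?)e")

# Hypothetical field patterns; labels and formats are assumed, not from the text.
FIELD_PATTERNS = {
    "application_number": re.compile(r"Application\s+Number\s*:\s*(\d{2}/\d{3},\d{3})"),
    "filing_date": re.compile(r"Filing\s+Date\s*:\s*(\d{2}/\d{2}/\d{4})"),
}


def extract_patent_info(character_data: str) -> dict:
    """Return the fields whose pattern is found in the recognized text."""
    info = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = pattern.search(character_data)
        if match:
            info[field] = match.group(1)  # the captured value only
    return info
```

WORD_PATTERN.fullmatch accepts “Hide,” “Hine,” and “Hinge” and rejects a word such as “Hike.” Fields whose pattern is not found are simply omitted from the result.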
The data analysis module 105 is further operable to obtain basic information of the patent application from the database 3, and compare the patent information with the basic information. The data analysis module 105 generates analysis results of the official notice according to the comparison, and sends the analysis results to the client computer 4 for the user to review the patent information of the patent application.
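One possible form of the comparison is sketched below, assuming that both the extracted patent information and the basic information from the database 3 are available as field-to-value mappings. The function name, the status labels, and the shape of the result are hypothetical.

```python
def compare_patent_info(patent_info: dict, basic_info: dict) -> dict:
    """Compare extracted fields with stored fields and report a status per field."""
    results = {}
    for field in sorted(set(patent_info) | set(basic_info)):
        extracted = patent_info.get(field)
        stored = basic_info.get(field)
        if extracted is None:
            status = "missing in notice"
        elif stored is None:
            status = "missing in database"
        elif extracted == stored:
            status = "match"
        else:
            status = "mismatch"
        results[field] = {"notice": extracted, "database": stored, "status": status}
    return results
```

Reporting a status for every field, rather than only the mismatches, lets the user reviewing the results see which fields were actually checked.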
In block S30, the download module 101 downloads an official notice of an electronically filed patent application from one of the patent office websites 2 according to a download command input from one of the client computers 4. In block S31, the download module 101 identifies a document type of the official notice according to an official patent code predefined by a patent office. Details of the various document types are described above.
In block S32, the file conversion module 102 converts the official notice to an image format document using a document conversion function. In one embodiment, the image format document may be a JPG format document, or a TIF format document. The document conversion function may be a WINDOWS API function, for example. Details of the conversion are described in
In block S33, the character identification module 103 identifies character data from the image format document using a character identification function. In one embodiment, the character data may include information regarding the application number, the filing date, the title of the invention, the inventor, the applicant, and the examiner, etc. The character identification function may be a WINDOWS OCR function, for example.
In block S34, the fault tolerance module 104 determines whether the identified character data is erroneous, and corrects character data being determined to be erroneous. For example, when converting the official notice to the image format document, a number “1” may be read as an erroneous character “I”, and a number “0” may be read as an erroneous character “O.” Details of the correction are described in
In block S35, the data analysis module 105 extracts patent information of the electronically filed patent application from the character data using a regular expression. As mentioned above, the regular expression is an expression that describes a set of character data, and is used to extract the patent information from the character data.
In block S36, the data analysis module 105 integrates the patent information to generate completed patent information. For example, if a patent application has several inventors, the data analysis module 105 integrates information of all the inventors to generate the completed inventor information.
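As a sketch of the integration, assume the extraction yields a sequence of (field, value) pairs in which a field such as the inventor may appear more than once. The function name, the pair representation, and the choice of which fields are multi-valued are assumptions for illustration.

```python
def integrate_patent_info(fragments, multi_valued=("inventor",)):
    """Merge (field, value) pairs into one record, collecting repeated fields in lists."""
    completed = {}
    for field, value in fragments:
        if field in multi_valued:
            names = completed.setdefault(field, [])
            if value not in names:  # drop repeats, keep first-seen order
                names.append(value)
        else:
            # For single-valued fields, keep the first value encountered.
            completed.setdefault(field, value)
    return completed
```

With three inventor entries, two of them identical, the completed record holds the application number once and a list of the two distinct inventor names.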
In block S37, the data analysis module 105 obtains basic information of the electronically filed patent application from the database 3, and compares the patent information with the basic information. In block S38, the data analysis module 105 generates analysis results of the official notice according to the comparison, and sends the analysis results to the client computer 4 via e-mail. The analysis results may be displayed on a display screen of the client computer 4 for the user to review.
Although certain inventive embodiments of the present disclosure have been specifically described, the present disclosure is not to be construed as limited thereto. Various changes or modifications may be made to the present disclosure without departing from the scope and spirit of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
200910312803.X | Dec 2009 | CN | national |