This application is based on and claims priority under 35 USC 119 from Japanese Patent Application No. 2019-149848 filed Aug. 19, 2019.
The present disclosure relates to information processing apparatuses and non-transitory computer readable media.
For example, Japanese Unexamined Patent Application Publication No. 2010-61551 discloses an application-document digitalizing system having an image forming apparatus and an information processing apparatus. The image forming apparatus is capable of transmitting application document data generated as a result of scanning an application document. The image forming apparatus includes an application-document-data acquiring unit that acquires application document data obtained as a result of scanning one or more sets of application documents each constituted of one or more pages, and an application-document-data transmitting unit that transmits the application document data acquired by the application-document-data acquiring unit to the information processing apparatus. The image forming apparatus also includes a recognition-result receiving unit that receives a recognition result including segmentation information of the application document data from the information processing apparatus, and a recognition-result display unit that displays the recognition result including the segmentation information of the application document data received by the recognition-result receiving unit. The information processing apparatus includes an application-document-data receiving unit that receives the application document data transmitted from the image forming apparatus, and an image recognition unit that performs predetermined image recognition on the application document data received by the application-document-data receiving unit. The information processing apparatus also includes segmentation-information generating unit that generates segmentation information for segmenting the application document data into application document data for each set in accordance with a recognition result obtained by the image recognition unit, and a recognition-result transmitting unit that transmits the recognition result including the segmentation information generated by the segmentation-information generating unit to the image forming apparatus.
Sometimes, a recognition process is performed on a document set having multiple pages by reading the pages consecutively in a one-by-one fashion, and the pages are sorted out into sets as electronic data. In that case, the document set may sometimes have an error, such as a redundant page or a missing page in the document set or a page of a different inscriber or an unknown page mixed in the document set, due to being mishandled by the user. From the document set having such an error, an appropriate data set is not obtainable.
Aspects of non-limiting embodiments of the present disclosure relate to an information processing apparatus and an information processing program with which, if a combination in a data set obtained by reading and sorting out a document set is improper, a data set with a proper combination may be obtained from the data set including the improper combination.
Aspects of certain non-limiting embodiments of the present disclosure address the above advantages and/or other advantages not described above. However, aspects of the non-limiting embodiments are not required to address the advantages described above, and aspects of the non-limiting embodiments of the present disclosure may not address advantages described above.
According to an aspect of the present disclosure, there is provided an information processing apparatus including a processor. The processor is configured to perform a process. The process includes disassembling each of multiple first data sets in units of pages if a combination in the first data set is improper. The multiple first data sets are obtained by reading and sorting out multiple document sets each containing multiple pages of documents. The process also includes reassembling an adequate combination as a second data set if a page group obtained as a result of the disassembling includes the adequate combination.
An exemplary embodiment of the present disclosure will be described in detail based on the following figures, wherein:
An exemplary embodiment of the present disclosure will be described in detail below with reference to the drawings.
As shown in
The server apparatus 10 is connected to each of the checker terminal apparatuses 40A, 40B, and so on, the image reading apparatus 60, and the administrator terminal apparatus 70 via a network N. The server apparatus 10 is, for example, a general-purpose computer, such as a server computer or a personal computer (PC). The network N is, for example, the Internet, a local area network (LAN), or a wide area network (WAN).
The image reading apparatus 60 has a function of acquiring an image by optically reading, for example, a form formed of a paper medium, and transmitting the acquired image (referred to as “form image” hereinafter) to the server apparatus 10. The form used is, for example, one of various types of forms including multiple items, such as an address field and a name field. With regard to each of these multiple items, for example, handwritten text or printed text is inscribed on this form. As will be described in detail later, the server apparatus 10 performs an optical character recognition (OCR) process on the form image received from the image reading apparatus 60 so as to acquire a recognition result with respect to an image corresponding to each of the multiple items. This recognition result includes, for example, a text string indicating a string of one or more text characters. On the form, a region where text corresponding to an item is inscribable is defined by a frame, and the text inscribable region is defined as a recognition target region. By performing the OCR process on the defined region, a text string with respect to an image corresponding to each of the multiple items is acquired.
The checker terminal apparatus 40A is to be operated by a checker (user) U1 who performs a checking process, and the checker terminal apparatus 40B is to be operated by a checker U2 who performs a checking process. If these multiple checker terminal apparatuses 40A, 40B, and so on are not to be distinguished from one another, the checker terminal apparatuses 40A, 40B, and so on may be collectively referred to as “checker terminal apparatuses 40” hereinafter. Furthermore, if these multiple checkers U1, U2, and so on are not to be distinguished from one another, the checkers U1, U2, and so on may be collectively referred to as “checkers U” hereinafter. The checker terminal apparatuses 40 are, for example, general-purpose computers, such as personal computers (PC), or portable terminal apparatuses, such as smartphones or tablet terminals. Each checker terminal apparatus 40 has a checking-process application program (also referred to as “checking-process application” hereinafter) installed therein and to be used by the corresponding checker U for performing a checking process, and generates and displays a checking-process user interface (UI) screen. The checking process involves checking a recognition result of, for example, text included in a form image or checking and correcting a recognition result.
The administrator terminal apparatus 70 is a terminal apparatus that is to be operated by a system administrator SE and in which form definition data is set via a form definition screen (not shown) by the system administrator SE. The administrator terminal apparatus 70 is, for example, a general-purpose computer, such as a personal computer (PC), or a portable terminal apparatus, such as a smartphone or a tablet terminal.
If a certainty factor of a recognition result obtained by recognizing an image of each item (referred to as “item image” hereinafter) included in a form image is lower than a threshold value, the server apparatus 10 performs a manual checking process. If the certainty factor is higher than the threshold value, the server apparatus 10 outputs the recognition result as a final recognition result without performing a manual checking process.
If the above-described checking process is to be performed, the server apparatus 10 associates the item image with the text string obtained as a result of the OCR process, and performs control for causing the checker terminal apparatus 40 to display the item image and the text string on the UI screen. The checker U checks whether or not the text string corresponding to the item image is correct while viewing the item image. If the check result is correct, the checker U keeps the text string as-is. If the checked result is incorrect, the checker U inputs a correct text string to the UI screen. The checker terminal apparatus 40 transmits the text string input via the UI screen as a check result to the server apparatus 10. Based on the check result from the checker terminal apparatus 40, the server apparatus 10 outputs a final recognition result and performs control for causing the checker terminal apparatus 40 to display the final recognition result on the UI screen.
As shown in
The controller 11 includes a central processing unit (CPU) 11A, a read-only memory (ROM) 11B, a random access memory (RAM) 11C, and an input-output interface (I/O) 11D. These units are connected to one another via a bus.
The I/O 11D is connected to functional units including the storage unit 12, the display unit 13, the operation unit 14, and the communication unit 15. These functional units are communicable with the CPU 11A via the I/O 11D.
The controller 11 may serve as a second controller that partially controls the operation of the server apparatus 10, or may serve as a part of a first controller that entirely controls the operation of the server apparatus 10. The blocks of the controller 11 may partially or entirely be, for example, an integrated circuit (IC), such as a large scale integration (LSI) circuit, or an IC chip set. The blocks may be individual circuits or may partially or entirely be an integrated circuit. The blocks may be integrated with each other, or one or some of the blocks may be separately provided. In each of the blocks, a part thereof may be separately provided. The integration of the controller 11 is not limited to LSI and may be a dedicated circuit or a general-purpose processor.
The storage unit 12 is, for example, a hard disk drive (HDD), a solid state drive (SSD), or a flash memory. The storage unit 12 stores therein an information processing program 12A according to this exemplary embodiment. The information processing program 12A may alternatively be stored in the ROM 11B.
The information processing program 12A may be preinstalled in, for example, the server apparatus 10. The information processing program 12A may be realized by being stored in a nonvolatile storage medium or by being distributed via the network N, and by being installed in the server apparatus 10, where appropriate. Examples of the nonvolatile storage medium include a compact disc read-only memory (CD-ROM), a magneto-optical disk, an HDD, a digital versatile disc read-only memory (DVD-ROM), a flash memory, and a memory card.
The display unit 13 is, for example, a liquid crystal display (LCD) or an organic electroluminescence (EL) display. The display unit 13 may integrally have a touchscreen. The operation unit 14 is provided with, for example, an operation input device, such as a keyboard and a mouse. The display unit 13 and the operation unit 14 receive various types of commands from a user of the server apparatus 10. The display unit 13 displays various types of information, such as a result of a process executed in accordance with a command received from the user and a notification about a process.
The communication unit 15 is connected to the network N via the Internet, a LAN, or a WAN, and is communicable with the image reading apparatus 60, the checker terminal apparatuses 40, and the administrator terminal apparatus 70 via the network N.
As mentioned above, sometimes, a recognition process is performed on a document set having multiple pages by reading the pages consecutively in a one-by-one fashion, and the pages are sorted out into sets as electronic data. In that case, the document set may sometimes have an error due to being mishandled by the user. From the document set having such an error, an appropriate data set is not obtainable. The term “document set” used here is defined as a set containing multiple pages of paper documents. The term “data set” used here is defined as a set containing data (read data) of multiple pages obtained by reading the document set and sorting out the pages based on a certain rule. This data set is obtained as a result of sorting out the read data of the pages of the document set based on a recognition result obtained by performing an OCR process on the read data of the pages of the document set.
Although the form mentioned above is described as an example of a document in this exemplary embodiment, the document is not limited to a form and may include, for example, a normal document.
The CPU 11A of the server apparatus 10 according to this exemplary embodiment executes the information processing program 12A stored in the storage unit 12 by loading the information processing program 12A to the RAM 11C, thereby functioning as the units shown in
As shown in
The storage unit 12 according to this exemplary embodiment is provided with, for example, a form-data storage unit 12B that stores form data and a page storage unit 12C that stores improper data in units of pages.
The image reading apparatus 60 acquires read data by reading multiple form sets including multiple pages of forms, and transmits the acquired read data to the server apparatus 10.
The recognition processor 20 acquires a recognition result by executing an OCR process on the read data received from the image reading apparatus 60 in accordance with predetermined setting contents of form definition data. In this case, the recognition processor 20 acquires meta-information related to multiple pages of the read data as a result of performing the OCR process. This meta-information is at least one of a form page number, a layout, a specific field, an image patch, a form identification (ID), handwriting, and an inscriber ID. In detail, for example, each page of a form image is given a bar code or a two-dimensional code. By reading the bar code or the two-dimensional code, for example, a form ID, a page number, and an inscriber ID are acquired. A layout is information indicating the page configuration. In the case of the layout, the page configuration is stored in correspondence with the number of pages. A specific field is information indicating the location of the specific field. In the case of the specific field, the location of the specific field is stored in correspondence with the number of pages. An image patch is information indicating a specific image at a specific location. In the case of the image patch, the specific image at the specific location is stored in correspondence with the number of pages. Handwriting is information indicating the handwriting of an inscriber. The recognition processor 20 outputs the recognition result and the meta-information in correspondence with the read data.
The form-data registration unit 21 sorts out the read data, corresponding to the recognition result and the meta-information and output from the recognition processor 20, based on the recognition result. Each sorted piece of the read data is defined as a first data set. For example, it is assumed that A-1/3, A-2/3, A-3/3, B-1/3, and B-2/3 are obtained as recognition results of multiple form sets. A and B denote form IDs, and 1/3 to 3/3 denote page numbers. In this case, the read data is sorted into two first data sets, namely, an A set 1/3 to 3/3 and a B set 1/3 to 2/3. The form-data registration unit 21 stores the multiple first data sets obtained as a result of sorting out the read data into the form-data storage unit 12B.
The improperness determination unit 22 determines whether or not a combination in each of the multiple first data sets stored in the form-data storage unit 12B is improper by using the meta-information. For example, in the example of the A set and the B set mentioned above, the A set is determined as being adequate since 1/3 to 3/3 are all available, whereas the B set is determined as being improper since 3/3 is missing.
If the combination in each of the multiple first data sets is improper based on the determination result obtained by the improperness determination unit 22, the page processor 23 disassembles each first data set in units of pages. If a page group obtained as a result of the disassembling includes an adequate combination, the page processor 23 performs a process for reassembling the adequate page combination as a second data set. The expression “disassembles each first data set in units of pages” implies that a file of a first data set is disassembled into multiple pages. The expression “reassembling the adequate page combination as a second data set” implies that the adequate page combination is made into a file of the second data set.
The display controller 24 performs control for displaying the multiple pages obtained as a result of the page processor 23 disassembling the first data set and for displaying information indicating the cause of improperness of the first data set, for example, as shown in
If there is a page missing from the first data set, the page registration unit 25 stores the multiple pages of the first data set into a predetermined folder (referred to as “improperness folder” hereinafter). This improperness folder is provided in the page storage unit 12C. Furthermore, if there is an excess page included in the first data set, the page registration unit 25 stores the excess page in the improperness folder. In this case, the page processor 23 performs a process for reassembling the remaining page or pages excluding the excess page deleted from the first data set as a second data set.
Each of the pages of page groups stored in the improperness folder is given meta-information. For example, the page processor 23 performs a process of using the meta-information given to each of the pages of page groups to identify an adequate combination from the page groups. The display controller 24 performs control for displaying the adequate combination identified by the page processor 23 as a second data set in an identifiable manner. In this case, if any of the pages in the second data set is selected, the display controller 24 may perform control for displaying information indicating the content of the selected page in an expanded fashion.
Based on the meta-information of the page selected from a list of the page groups stored in the improperness folder, the page processor 23 may perform a process for searching through the page groups for a candidate for an adequate combination. In this case, the display controller 24 performs control for displaying the candidate for an adequate combination found by the page processor 23 in an identifiable manner. When displaying the candidate for an adequate combination in an identifiable manner, the display controller 24 may perform display control such that the meta-information used in the search for the pages serving as the candidate for an adequate combination is given to each of the pages. Moreover, the page processor 23 may perform a process for deriving a handwriting similarity indicating a similarity between the handwriting on the page selected from the list of the page groups and the handwriting on another page. For deriving the handwriting similarity, a known method is used in which the possibility of the handwriting being identical increases with increasing handwriting similarity (indicated with, for example, %). In this case, the display controller 24 may perform control for displaying levels of handwriting similarity for pages serving as candidates for adequate combinations in an identifiable manner.
The correction-data registration unit 26 stores corrected data, obtained as a result of correcting a page group stored in the improperness folder, into the form-data storage unit 12B.
Next, the operation of the server apparatus 10 according to this exemplary embodiment will be described with reference to
First, when the server apparatus 10 is commanded to execute an OCR process, the CPU 11A activates the information processing program 12A to execute the following steps.
In step 100 in
In step 101, the CPU 11A performs an OCR process on the read data acquired in step 100 so as to acquire a recognition result. In this case, meta-information is also acquired in accordance with the OCR process. As mentioned above, meta-information is at least one of a form page number, a layout, a specific field, an image patch, a form ID, handwriting, and an inscriber ID.
In step 102, the CPU 11A sorts out the read data into multiple first data sets based on the recognition result acquired in step 101, and stores the sorted first data sets into the form-data storage unit 12B.
In step 103, the CPU 11A executes an improperness determination process on each of the multiple first data sets sorted in step 102.
In step 120 in
In step 121, the CPU 11A sets the number of pages in the first data set acquired in step 120 to zero.
In step 122, the CPU 11A acquires layout information of each page of the first data set.
In step 123, the CPU 11A acquires a page (referred to as “current page” hereinafter) from the first data set.
In step 124, the CPU 11A increments the number of pages in the first data set.
In step 125, the CPU 11A extracts meta-information of the current page acquired in step 123.
In step 126, the CPU 11A determines whether or not the current page acquired in step 123 is the first page based on the meta-information extracted in step 125. If it is determined that the current page is the first page (i.e., if a positive determination result is obtained), the process proceeds to step 127. If it is determined that the current page is not the first page (i.e., if a negative determination result is obtained), the process proceeds to step 129.
In step 127, the CPU 11A determines whether or not the current number of pages and the page number match. If it is determined that the current number of pages and the page number match (i.e., if a positive determination result is obtained), the process proceeds to step 128. If it is determined that the current number of pages and the page number do not match (i.e., if a negative determination result is obtained), the process proceeds to step 133.
In step 128, the CPU 11A determines whether or not the first data set has a subsequent page. If it is determined that the first data set has a subsequent page (i.e., if a positive determination result is obtained), the process proceeds to step 123. If it is determined that the first data set does not have a subsequent page (i.e., if a negative determination result is obtained), the process returns to step 104 in
In step 129, the CPU 11A determines whether or not the form ID of the current page and the form ID of the first page are the same. If it is determined that the form ID of the current page and the form ID of the first page are the same (i.e., if a positive determination result is obtained), the process proceeds to step 130. If it determined that the form ID of the current page and the form ID of the first page are not the same (i.e., if a negative determination result is obtained), the process proceeds to step 132.
In step 130, the CPU 11A determines whether or not the handwriting on the current page and the handwriting on the first page are the same. For the handwriting determination, a known technique is used, but the technique is not particularly limited. If it is determined that the handwriting on the current page and the handwriting on the first page are the same (i.e., if a positive determination result is obtained), the process proceeds to step 127. If it is determined that the handwriting on the current page and the handwriting on the first page are not the same (i.e., if a negative determination result is obtained), the process proceeds to step 131.
In step 131, the CPU 11A sets a different inscriber flag to the current page, and proceeds to step 128.
In step 132, the CPU 11A sets a different form flag to the current page, and proceeds to step 128.
In step 133, the CPU 11A determines whether or not the current number of pages and the previous page number match. If it is determined that the current number of pages and the previous page number match (i.e., if a positive determination result is obtained), the process proceeds to step 134. If it is determined that the current number of pages and the previous page number do not match (i.e., if a negative determination result is obtained), the process proceeds to step 135.
In step 134, the CPU 11A sets a redundancy flag to the previous page and the current page, and proceeds to step 128.
In step 135, the CPU 11A determines whether or not the current number of pages and the subsequent page number match. If it is determined that the current number of pages and the subsequent page number match (i.e., if a positive determination result is obtained), the process proceeds to step 136. If it is determined that the current number of pages and the subsequent page number do not match (i.e., if a negative determination result is obtained), the process proceeds to step 137.
In step 136, the CPU 11A sets an insufficiency flag to the current page, increments the number of pages by one, and proceeds to step 128.
In step 137, the CPU 11A sets an unknown flag to the current page, and proceeds to step 128.
Referring back to
In step 105, the CPU 11A acquires a first data set.
In step 106, the CPU 11A determines whether or not the first data set acquired in step 105 is improper. If it is determined that the first data set is improper (i.e., if a positive determination result is obtained), the process proceeds to step 107. If it is determined that the first data set is not improper, that is, if it is determined that the first data set is adequate (i.e., if a negative determination result is obtained), the process proceeds to step 112.
In step 107, the CPU 11A disassembles the first data set in units of pages, and performs control for displaying the first data set disassembled in units of pages on, for example, the checker terminal apparatus 40. In detail, as illustrated in
In the example in
In step 108, the CPU 11A determines whether the first data set has a page missing therefrom or the first data set contains an excess page. As mentioned above, an excess page is any one of a redundant page, a page of a different inscriber, and an unknown page. If it is determined that the first data set has a page missing therefrom (i.e., in the case of a missing page), the process proceeds to step 109. If it is determined that the first data set contains an excess page (i.e., in the case of an excess page), the process proceeds to step 110.
In step 109, the CPU 11A stores the multiple pages of the first data set into the improperness folder, for example, as shown in
In contrast, in step 110, the CPU 11A stores only the excess page of the first data set into the improperness folder, for example, as shown in
In step 111, the CPU 11A reassembles the remaining page or pages excluding the excess page removed from the first data set as an adequate second data set.
In step 112, the CPU 11A determines whether or not the process has been executed on all of the first data sets. If it is determined that the process has not been executed on all of the first data sets (i.e., if a negative determination result is obtained), the process proceeds to step 105. If it is determined that the process has been executed on all of the first data sets (i.e., if a positive determination result is obtained), the sequential process based on the information processing program 12A ends.
Next, a process for storing an improper page or pages of a first data set into the improperness folder (referred to as “improperness-folder storing process” hereinafter) will be described in detail with reference to
A UI screen 41 and a UI screen 42 in
A UI screen 43 and a UI screen 44 in
A UI screen 45, a UI screen 46, and a UI screen 47 in
Next, a process for displaying a list of page groups stored in the improperness folder (referred to as “improper-page-list displaying process” hereinafter) will be described with reference to
First, when the server apparatus 10 is commanded to execute the improper-page-list displaying process, the CPU 11A activates the information processing program 12A to execute the following steps.
In step 140 in
In step 141, the CPU 11A acquires an improper page group from the improperness folder.
In step 142, the CPU 11A determines whether form IDs of the pages match with respect to the improper page group acquired in step 141.
In step 143, the CPU 11A determines whether inscriber IDs of the pages match with respect to the improper page group acquired in step 141.
In step 144, the CPU 11A searches for a page group with the same form ID or the same inscriber ID.
In step 145, the CPU 11A gives a group ID to the page group obtained as a result of the search in step 144.
In step 146, the CPU 11A performs control for displaying the page group having the same group ID, given thereto in step 145, in an identifiable manner on the checker terminal apparatus 40, as shown in
The improper-page-list screen 48 shown in
As shown in
As shown in
Next, another example of the improper-page-list displaying process will be described with reference to FIGS. 14 and 15.
First, when the server apparatus 10 is commanded to execute the improper-page-list displaying process, the CPU 11A activates the information processing program 12A to execute the following steps.
In step 150 in
In step 151, the CPU 11A acquires an improper page group from the improperness folder.
In step 152, the CPU 11A performs a handwriting-similarity imparting process on the improper page group acquired in step 151.
In step 160 in
In step 161, the CPU 11A determines whether or not page A exists. If it is determined that page A exists (i.e., if a positive determination result is obtained), the process proceeds to step 162. If it is determined that page A does not exist (i.e., if a negative determination result is obtained), the process returns to step 153 in
In step 162, the CPU 11A acquires one page (referred to as “page B” hereinafter) other than page A.
In step 163, the CPU 11A determines whether or not page B exists. If it is determined that page B exists (i.e., if a positive determination result is obtained), the process proceeds to step 164. If it is determined that page B does not exist (i.e., if a negative determination result is obtained), the process returns to step 160 and is repeated thereafter.
In step 164, the CPU 11A calculates a handwriting similarity between pages, namely, page A and page B. As mentioned above, the possibility of the handwriting being identical increases with increasing handwriting similarity (indicated with, for example, %).
In step 165, the CPU 11A imparts the handwriting similarity with page A to page B. The process then returns to step 162 and is repeated thereafter.
Referring back to
In step 154, the CPU 11A determines whether or not an arbitrary page has been selected from the improper-page-list screen. If it is determined that an arbitrary page has been selected (i.e., if a positive determination result is obtained), the process proceeds to step 155. If it is determined that an arbitrary page has not been selected (i.e., if a negative determination result is obtained), the process enters a standby state at step 154.
In step 155, the CPU 11A searches through the improper page group included in the improper-page-list screen for a page with the same form ID or the same inscriber ID as the page selected in step 154.
In step 156, the CPU 11A determines whether or not a page with the same form ID or the same inscriber ID exists based on the search result obtained in step 155. If it is determined that a page with the same form ID or the same inscriber ID exists (i.e., if a positive determination result is obtained), the process proceeds to step 157. If it is determined that a page with the same form ID or the same inscriber ID does not exist (i.e., if a negative determination result is obtained), the process proceeds to step 158.
In step 157, the CPU 11A performs control for displaying the page with the same form ID or the same inscriber ID in an identifiable manner on the improper-page-list screen. In detail, for example, the color of the relevant page is changed so as to be varied from the color of other pages.
In step 158, the CPU 11A searches through the improper page group included in the improper-page-list screen for a page with handwriting similar to that on the page selected in step 154. For example, a page with a handwriting similarity of 50% or higher is searched for.
In step 159, the CPU 11A determines whether or not a page with similar handwriting exists based on the search result obtained in step 158. If it is determined that a page with similar handwriting exists (i.e., if a positive determination result is obtained), the process proceeds to step 160. If it is determined that a page with similar handwriting does not exist (i.e., if a negative determination result is obtained), the information processing program 12A ends.
In step 160, the CPU 11A performs control for displaying the page with similar handwriting in an identifiable manner on the improper-page-list screen, and the sequential process based on the information processing program 12A ends. In detail, for example, the color of the relevant page is changed so as to be varied from the color of other pages. Furthermore, levels of handwriting similarity may be made identifiable by, for example, setting the color density of a page with a handwriting similarity ranging between 50% and 70% inclusive to 50% and setting the color density of a page with a handwriting similarity ranging between 70% and 100% inclusive to 70%.
Next, another example of the improper-page-list displaying process will be described in detail with reference to
On an improper-page-list screen 49A in
Specifically, the CPU 11A performs control for displaying candidates for combinations of adequate pages in an identifiable manner, as shown on the improper-page-list screen 49B in
As mentioned above, the CPU 11A performs a process for deriving a handwriting similarity indicating a similarity between the handwriting on the selected page (i.e., page 1 at the upper left corner in the example in
Next, a process for combining adequate pages selected from the improper-page-list screen (referred to as “adequate-page combining process” hereinafter) will be described in detail with reference to
On an improper-page-list screen 50 in
Next, a process for storing the combined page group into a checking-process folder (referred to as “combined-page-group storing process” hereinafter) will be described in detail with reference to
On an improper-page-list screen 52 in
On an improper-page-list screen 53 in
According to this exemplary embodiment, if a combination in a data set obtained by reading and sorting out a document set is improper, the data set containing the improper combination is disassembled and is reassembled into a data set with a proper combination. Therefore, even if a combination in the document set is improper, a data set with a proper combination may be obtained.
In the above exemplary embodiment, information processing executed by the CPU loading a software program may be executed by various types of processors other than the CPU. In this case, examples of the processor include a programmable logic device (PLD) whose circuit configuration is changeable after being manufactured, such as a field-programmable gate array (FPGA), and a dedicated electrical circuit serving as a processor having a circuit configuration specifically designed for executing a specific process, such as an application specific integrated circuit (ASIC). Furthermore, this information processing may be executed by one of these types of processors, or may be executed with a combination of two or more of the same type or different types of processors (e.g., a combination of multiple FPGAs or a combination of a CPU and an FPGA). More specifically, the hardware structure of each of these types of processors is an electrical circuit constituted of a combination of circuit elements, such as semiconductor elements.
A server apparatus has been described above as an example of an information processing apparatus according to an exemplary embodiment. The exemplary embodiment may be in the form of a program for causing a computer to execute the functions of the units included in the server apparatus. The exemplary embodiment may be in the form of a non-transitory computer-readable storage medium storing the program therein.
Furthermore, the configuration of the server apparatus described in the above exemplary embodiment is merely an example, and may be modified in accordance with conditions within the scope of the exemplary embodiment.
Moreover, the flow of the process according to the program described in the above exemplary embodiment is merely an example. An unnecessary step or steps may be deleted, a new step or steps may be added, or the processing sequence may be changed within the scope of the exemplary embodiment.
In the above exemplary embodiment, the program is executed so that the process according to the exemplary embodiment is realized based on a software configuration by using the computer. Alternatively, the exemplary embodiment may be realized in accordance with, for example, a hardware configuration or a combination of a hardware configuration and a software configuration.
In the embodiment above, the term “processor” refers to hardware in a broad sense. Examples of the processor include general processors (e.g., CPU: Central Processing Unit), dedicated processors (e.g., GPU: Graphics Processing Unit, ASIC: Application Integrated Circuit, FPGA: Field Programmable Gate Array, and programmable logic device).
In the embodiment above, the term “processor” is broad enough to encompass one processor or plural processors in collaboration which are located physically apart from each other but may work cooperatively. The order of operations of the processor is not limited to one described in the embodiment above, and may be changed.
The foregoing description of the exemplary embodiment of the present disclosure has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the disclosure to the precise forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in the art. The embodiment was chosen and described in order to best explain the principles of the disclosure and its practical applications, thereby enabling others skilled in the art to understand the disclosure for various embodiments and with the various modifications as are suited to the particular use contemplated. It is intended that the scope of the disclosure be defined by the following claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2019-149848 | Aug 2019 | JP | national |