This application is based on and claims priority under 35 USC 119 from Japanese Patent Application No. 2017-222147 filed Nov. 17, 2017.
The present invention relates to a document processing apparatus and a non-transitory computer readable medium.
According to an aspect of the invention, there is provided a document processing apparatus including a reception unit and a display control unit. The reception unit receives specification of a region in an electronic document by a user. The display control unit performs control such that a candidate for attribute information is displayed from a string in the region received by the reception unit, based on determination information as information for determining a type of attribute information.
Exemplary embodiments of the present invention will be described in detail based on the following figures, wherein:
Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to drawings.
A document management system according to an exemplary embodiment of the present invention includes, as illustrated in
Attribute information adding software is installed into the document processing apparatuses 10 to 12. An attribute information adding program is executed by the attribute information adding software, so that attribute information is added to document data as an electronic document. Accordingly, classification of plural pieces of document data may be achieved.
Furthermore, the document processing apparatuses 10 to 12 may transmit, receive, browse, and correct document data generated by adding attribute information thereto and files in which such document data are stored.
Furthermore, the server apparatus 14 is connected to the document processing apparatuses 10 to 12 via the network 1. Document data generated by adding attribute information thereto by the document processing apparatuses 10 to 12 and files in which such document data are stored may be stored in the server apparatus 14. The document processing apparatuses 10 to 12 are able to read document data generated by adding attribute information thereto and files in which such document data are stored, the document data and files being stored in the server apparatus 14. Therefore, the document processing apparatuses 10 to 12 are able to transfer the document data and files via the server apparatus 14.
As illustrated in
The CPU 16 controls an operation of the document processing apparatus 10 by executing a predetermined process based on an attribute information adding program stored in the memory 17 or the storage device 19. In this exemplary embodiment, the CPU 16 is explained as a unit that reads and executes the attribute information adding program stored in the memory 17 or the storage device 19. However, the program may be stored in a storage medium such as a compact disc-read only memory (CD-ROM) or the like and provided to the CPU 16.
The document processing apparatus 10 functions as a document information registration unit 22, a determination information registration unit 23, a region specification reception unit 24, an attribute information determination unit 25, a correction unit 26, a display control unit 27, and the like when the CPU 16 as a controller executes an attribute information adding program 30 stored in the storage device 19.
The storage device 19 stores the attribute information adding program 30, document information 31, format registration information 33, proper noun registration information 34, and the like.
The attribute information adding program 30 is a program that causes the CPU 16 to operate as the document information registration unit 22, the determination information registration unit 23, the region specification reception unit 24, the attribute information determination unit 25, the correction unit 26, the display control unit 27, and the like.
The document information 31 is, for example, information such as text information, image information, and moving image information, and includes document information generated by adding attribute information thereto.
The format registration information 33 and the proper noun registration information 34 are used as determination information, which is information for determining the type of attribute information. The format registration information 33 and the proper noun registration information 34 are registered in advance in the storage device 19.
The format registration information 33 is format information corresponding to the type of attribute (attribute name). A format for determining the type of attribute information is registered in the format registration information 33. For example, format information such as “Month Day, Year” or “MM/DD/YY” is registered for an attribute name of “date”. Furthermore, format information such as “AA Corporation”, “AA Co., Ltd.”, “AA Company Limited”, or “AA Limited” is registered for an attribute name of “name of business partner”. Furthermore, format information such as “xx Yen”, “Yxx”, or “$xx” is registered for an attribute name of “amount”.
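For illustration only, the format registration information 33 may be pictured as a table that maps each attribute name to one or more format patterns. The following is a minimal sketch under stated assumptions: the use of regular expressions, the name `FORMAT_REGISTRATION`, and the function `identify_attribute_name` are hypothetical and are not taken from the exemplary embodiment, which does not specify how formats are stored or matched.

```python
import re

# Hypothetical registry: attribute name -> list of compiled patterns.
# Each pattern is an illustrative rendering of a format named in the text
# ("Month Day, Year", "MM/DD/YY", "AA Corporation", "xx Yen", "Yxx", "$xx").
FORMAT_REGISTRATION = {
    "date": [
        re.compile(r"[A-Z][a-z]{2,8}\.? \d{1,2}, \d{4}"),  # Month Day, Year
        re.compile(r"\d{1,2}/\d{1,2}/\d{2}(?:\d{2})?"),     # MM/DD/YY
    ],
    "name of business partner": [
        re.compile(r".+ (?:Corporation|Co\., Ltd\.|Company Limited|Limited)"),
    ],
    "amount": [
        re.compile(r"\d[\d,]* Yen"),  # xx Yen
        re.compile(r"[Y$]\d[\d,]*"),  # Yxx or $xx
    ],
}


def identify_attribute_name(text):
    """Return the first attribute name whose format matches the whole string, else None."""
    for name, patterns in FORMAT_REGISTRATION.items():
        if any(p.fullmatch(text) for p in patterns):
            return name
    return None
```

In this sketch, a string that matches no registered format yields `None`, which corresponds to the case where the type of attribute information cannot be determined from the format alone.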
A proper noun such as a string that may be registered as an attribute or a string that is frequently used as attribute information, for example, “ABC Corporation”, “DEF Co., Ltd.”, or the like is registered as the proper noun registration information 34.
The document information registration unit 22 registers new document information in the document information 31 in the storage device 19 in response to a registration request.
The determination information registration unit 23 registers new determination information in the storage device 19 in response to a registration request. Specifically, the determination information registration unit 23 registers new format information in association with the type of attribute and stores the registered format information in the format registration information 33. Furthermore, the determination information registration unit 23 registers a new type of attribute and stores the registered type in the format registration information 33. Furthermore, the determination information registration unit 23 registers new strings such as a string that may be registered as an attribute and a string that is frequently used as attribute information and stores the registered strings in the proper noun registration information 34.
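The three registration operations described above (a new format for a type of attribute, a new type of attribute, and a new string) may be sketched as follows. This is a hypothetical illustration: the class `DeterminationInformation` and its method names are assumptions introduced here, and in-memory containers stand in for the storage device 19.

```python
from dataclasses import dataclass, field


@dataclass
class DeterminationInformation:
    """Hypothetical in-memory stand-in for the registered determination information."""
    formats: dict = field(default_factory=dict)     # attribute name -> list of format strings
    proper_nouns: set = field(default_factory=set)  # registered strings

    def register_attribute_type(self, name):
        # Register a new type of attribute with no formats yet.
        self.formats.setdefault(name, [])

    def register_format(self, name, fmt):
        # Register new format information in association with a type of attribute.
        self.register_attribute_type(name)
        if fmt not in self.formats[name]:
            self.formats[name].append(fmt)

    def register_proper_noun(self, string):
        # Register a string that may be used as attribute information.
        self.proper_nouns.add(string)

    def is_registered(self):
        # True if at least one format or one proper noun has been registered.
        return bool(self.proper_nouns) or any(self.formats.values())
```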
The region specification reception unit 24 receives specification of a region by a user on a view screen for document data.
The attribute information determination unit 25 extracts a string in a region specified by the region specification reception unit 24, based on determination information such as the format registration information 33, the proper noun registration information 34, or the like stored in the storage device 19.
The correction unit 26 corrects a string in a region specified by the region specification reception unit 24, based on determination information such as the format registration information 33, the proper noun registration information 34, or the like stored in the storage device 19. That is, in a case where a region specified by the region specification reception unit 24 is not an appropriate region or there is an error, the correction unit 26 performs correction based on determination information such as the format registration information 33, the proper noun registration information 34, or the like. For example, the correction unit 26 performs correction so that a string in a specified region becomes the same as a string registered as the proper noun registration information 34. Furthermore, in a case where an unwanted string is included in a region specified by the region specification reception unit 24, based on a comparison with the proper noun registration information 34, the correction unit 26 deletes the unwanted string.
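One possible way to realize such a correction is sketched below. It is an assumption-laden illustration, not the method of the exemplary embodiment: the function name `correct_string`, the substring test for an excess, and the use of `difflib.get_close_matches` with a similarity cutoff for a shortage or minor error are all choices made here for the sake of the example.

```python
import difflib


def correct_string(extracted, proper_nouns, cutoff=0.8):
    """Return the registered proper noun that best explains `extracted`, or `extracted` unchanged."""
    stripped = extracted.strip()
    if stripped in proper_nouns:
        return stripped
    # Excess: a registered string is contained in the selection, so the
    # surrounding unwanted characters (for example a leading comma) are dropped.
    contained = [noun for noun in proper_nouns if noun in stripped]
    if contained:
        return max(contained, key=len)
    # Shortage or minor error: fall back to the closest registered string,
    # if any is similar enough (the cutoff value is an arbitrary assumption).
    close = difflib.get_close_matches(stripped, list(proper_nouns), n=1, cutoff=cutoff)
    return close[0] if close else extracted
```

With the strings “ABC Corporation” and “DEF Co., Ltd.” registered, a selection of “, ABC Corporation” would be corrected to “ABC Corporation” in this sketch, while a selection unrelated to any registered string would be returned as is.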
The display control unit 27 performs control such that a string extracted from among strings in a region specified by the region specification reception unit 24 is displayed as a candidate for attribute information, based on determination information such as the format registration information 33, the proper noun registration information 34, or the like stored in the storage device 19. That is, the display control unit 27 performs control such that a string extracted from among strings in a region is automatically input to an input field on a setting screen for an attribute name as the type of attribute corresponding to the string and the input string is displayed as a candidate for attribute information, based on determination information such as the format registration information 33, the proper noun registration information 34, or the like.
Furthermore, in the case where plural pieces of determination information such as the format registration information 33, the proper noun registration information 34, and the like stored in the storage device 19 are included in a region specified by the region specification reception unit 24, the display control unit 27 performs control such that strings corresponding to the plural pieces of determination information are extracted and the extracted strings are displayed as candidates for attribute information. That is, the display control unit 27 performs control such that strings extracted from among strings in a region are automatically input to input fields on a setting screen for attribute names as the types of attribute corresponding to the strings and the input strings are displayed as candidates for attribute information, based on determination information such as the format registration information 33, the proper noun registration information 34, and the like.
Furthermore, the display control unit 27 performs control such that a string corrected by the correction unit 26 is displayed as a candidate for attribute information.
Next, an example of a display screen for a case where attribute information adding software is activated and a file is expanded will be described in detail with reference to
When the attribute information adding software is activated, a screen for adding attribute information to document data is displayed on the display screen. Specifically, when the attribute information adding program is executed, various functions to be executed on document data are displayed as tools in a tool bar 40 on the display screen. Furthermore, a view screen 41 for document data, a setting screen 42 for adding attribute information, and the like are displayed.
In the document processing apparatus 10, determination information such as the format registration information 33, the proper noun registration information 34, and the like that are registered in advance is used. Therefore, simply by specifying a region in the document data displayed on the view screen 41 in a text selection mode in accordance with an operation by a user, a string that is desired to be added as attribute information may be automatically input to an input field 43 on the setting screen 42 and displayed as a candidate for attribute information.
Next, an operation of the determination information registration unit 23 for setting and registering new determination information in the format registration information 33, the proper noun registration information 34, and the like in the storage device 19 will be explained with reference to
On the display screen illustrated in
A user is able to register determination information on the attribute name registration screens illustrated in
Specifically, on the display screen illustrated in
Furthermore, on the display screens illustrated in
Furthermore, on the display screen illustrated in
Next, a setting operation at the document processing apparatus 10 for adding an attribute to document data will be described with reference to
First, on the display screen illustrated in
When the “OK” button 47 is clicked in a state in which the check box 46 for “select appropriate attribute value” is clicked on the environment setting screen (Yes in step S11), it is determined whether or not determination information such as the format registration information 33, the proper noun registration information 34, or the like is registered (step S12). In the case where it is determined that determination information is not registered (No in step S12), the process ends, and the manual input mode is entered.
In the case where it is determined that determination information is registered (Yes in step S12), the proper noun registration information 34 is read from the storage device 19 (step S13), the format registration information 33 is read (step S14), and an automatic input mode for allowing a candidate for attribute information to be automatically input using the determination information such as the format registration information 33 and the proper noun registration information 34 is entered.
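The decision made in steps S11 to S14 may be summarized by the following sketch. The function `select_input_mode` and its parameters are hypothetical. In particular, treating an unchecked check box (No in step S11) as leading to the manual input mode is an assumption made here, since only the outcomes of step S12 are stated above.

```python
def select_input_mode(auto_select_checked, formats, proper_nouns):
    """Return "automatic" or "manual" following the decisions of steps S11 and S12."""
    # Step S11: was "select appropriate attribute value" checked when OK was clicked?
    if not auto_select_checked:
        return "manual"  # assumption: behavior for No in step S11
    # Step S12: is any determination information registered?
    if not formats and not proper_nouns:
        return "manual"  # No in step S12
    # Steps S13 and S14 would read the registered information here.
    return "automatic"
```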
Next, an operation for adding attribute information to document data in the document processing apparatus 10 will be described in detail with reference to
First, a text selection mode is executed by a user (step S100), and specification of a region 61 including a string that is desired to be used as attribute information is received by the region specification reception unit 24 (step S101). Specifically, for example, when the user clicks the cursor 45 that is placed on a text selection button 60 illustrated in
Then, a string is extracted from the specified region (step S102), and it is determined whether or not the extracted string is the format registration information 33 (step S103). For example, it is determined whether or not the extracted string is a string such as “Limited” or “Co., Ltd.” or a string including Arabic numerals such as 1, 2, and 3 or Chinese characters expressing numerals. Specifically, when “ABC Corporation” is extracted from the specified region 61, it is determined whether or not the extracted character string is the format registration information 33.
Then, in the case where it is determined that the extracted string is the format registration information 33 (Yes in step S103), attribute information is identified based on the format registration information 33 (step S104). Specifically, when it is determined that the extracted “ABC Corporation” is the format registration information 33, an attribute name of “name of business partner” corresponding to format information of “Corporation” in “ABC Corporation” is identified.
In the case where attribute information is identified based on the format registration information 33 (step S104) or it is determined that the extracted string is not the format registration information 33 (No in step S103), it is determined whether or not the extracted string is the proper noun registration information 34 registered in advance (step S105).
In the case where it is determined that the extracted string is the proper noun registration information 34 registered in advance (Yes in step S105), the extracted string is compared with the proper noun registration information 34 and it is determined whether or not the extracted string is to be corrected (step S106). For example, it is determined whether or not an unwanted string is included in the extracted string, whether or not there is a shortage or excess in the extracted string, and the like.
In the case where it is determined that the extracted string is to be corrected (Yes in step S106), the extracted string is corrected based on the proper noun registration information 34 (step S107). That is, correction is performed so that the extracted string becomes the same as the string that is registered as the proper noun registration information 34. Specifically, for example, in the case where the string extracted from a specified region is “, ABC Corporation”, correction is performed so that a comma “,” is deleted and the extracted string thus becomes the same as “ABC Corporation” that is registered as the proper noun registration information 34.
In the case where it is determined that there is no need to correct the extracted string (No in step S106), in the case where the extracted string is corrected based on the proper noun registration information 34 (step S107), or in the case where it is determined that the extracted string is not the proper noun registration information 34 (No in step S105), it is determined whether or not attribute information displayed as a candidate is identified (step S108).
In the case where it is determined that the attribute information is identified (Yes in step S108), the string identified as attribute information is automatically input to the input field 43 of the setting screen 42 for attribute information and is displayed (step S109). Specifically, a type of attribute “name of business partner” is identified based on determination information from “ABC Corporation” extracted from the specified region 61. Then, “ABC Corporation” is automatically input to the input field 43 for the attribute name “name of business partner” of the setting screen 42 for attribute information and is displayed as a candidate for attribute information, as illustrated in
In the case where it is determined that attribute information is not identified (No in step S108), a string is manually input, by a user operation, to the input field 43 of the setting screen 42 for attribute information and is displayed (step S110).
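The flow of steps S102 to S110 for a single string may be sketched as follows. This is a simplified illustration under assumptions: `FORMATS`, `PROPER_NOUNS`, and `propose_candidate` are hypothetical names, only one attribute name is registered, the format check is a suffix match, and the attribute is regarded as identified only when a format matches.

```python
import re

# Hypothetical registered determination information.
FORMATS = {
    "name of business partner": re.compile(
        r"(?:Corporation|Co\., Ltd\.|Company Limited|Limited)$"
    ),
}
PROPER_NOUNS = ["ABC Corporation", "DEF Co., Ltd."]


def propose_candidate(extracted):
    """Return (attribute name, candidate string), or (None, None) when manual input is needed."""
    text = extracted.strip()
    # Steps S103 and S104: identify the attribute name from the registered formats.
    attribute_name = None
    for name, pattern in FORMATS.items():
        if pattern.search(text):
            attribute_name = name
            break
    # Steps S105 to S107: correct the string so that it equals a registered proper noun.
    candidate = text
    for noun in PROPER_NOUNS:
        if noun in text:
            candidate = noun
            break
    # Step S108: if nothing was identified, the user inputs the value manually (step S110).
    if attribute_name is None:
        return None, None
    # Step S109: the candidate would be input to the field for this attribute name.
    return attribute_name, candidate
```

For the selection “, ABC Corporation”, this sketch returns the attribute name “name of business partner” together with the corrected string “ABC Corporation”.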
Next, another exemplary embodiment of the present invention will be described with reference to
In this exemplary embodiment, a case where plural strings exist in a region 71 specified on the view screen 41 for document data will be described.
When the text selection mode is executed and a range is specified by dragging, with a cursor, for example, the region 71 that is desired to be added as attribute information to document data displayed on the view screen 41, as illustrated in
Specifically, a range from a space to a pause such as a punctuation mark of a sentence is recognized as a region of a string corresponding to a sentence including plural strings and is resolved into parts of speech such as a proper noun and a particle. In this case, a language written without a space between words may also be recognized, and a space or the like may also be recognized. That is, from the specified region 71, plural strings such as “Jul. 16, 2017”, “DEF Co., Ltd.”, and “ABC Corporation” are extracted.
The plural strings resolved according to parts of speech are acquired, and it is determined whether or not each of the extracted strings is the format registration information 33, based on the format registration information 33 stored in the storage device 19.
In the case where it is determined that each of the extracted “Jul. 16, 2017”, “DEF Co., Ltd.”, and “ABC Corporation” is the format registration information 33, an attribute name of “date” corresponding to the format information of “Jul. 16, 2017” is identified as attribute information, and an attribute name of “name of business partner” corresponding to the format information of “ABC Corporation” and “DEF Co., Ltd.” is identified as attribute information.
Then, it is determined whether or not each of the extracted “Jul. 16, 2017”, “DEF Co., Ltd.”, and “ABC Corporation” is the proper noun registration information 34 registered in advance. In the case where each of the extracted “Jul. 16, 2017”, “DEF Co., Ltd.”, and “ABC Corporation” is the proper noun registration information 34 and it is determined, by comparison with the proper noun registration information 34, that the extracted string needs to be corrected, the extracted string is corrected based on the proper noun registration information 34.
Then, it is determined whether or not attribute information is identified. In the case where it is determined that attribute information is identified, the string identified as attribute information is automatically input to the input field 43 of the setting screen 42 for attribute information and is displayed. Specifically, attribute information is identified based on determination information such as the format registration information 33, the proper noun registration information 34, and the like from “Jul. 16, 2017”, “DEF Co., Ltd.”, “ABC Corporation”, and the like extracted from the specified region 71, and “7/16/2017” is automatically input to the input field 43 for an attribute name of “date” on the setting screen 42 for attribute information and is displayed as a candidate for attribute information, as illustrated in
Then, a user registers attribute information by manual input or correction if necessary while viewing a screen on which candidates for attribute information are displayed, so that attribute information may be added to document data.
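The handling of a region containing plural strings may be illustrated by the sketch below. It rests on assumptions: regular-expression scanning stands in for the resolution into parts of speech described above, the names `MONTHS`, `DATE`, `PARTNER`, and `extract_candidates` are hypothetical, and the sample sentence used in the usage note is invented for the example. The date is rewritten as month/day/year so that “Jul. 16, 2017” becomes “7/16/2017”, as in the description.

```python
import re

# Month abbreviations mapped to month numbers (Jan -> 1, ..., Dec -> 12).
MONTHS = {
    abbreviation: number
    for number, abbreviation in enumerate(
        ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
         "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"],
        start=1,
    )
}

# Illustrative patterns for two attribute names.
DATE = re.compile(r"\b([A-Z][a-z]{2})\.? (\d{1,2}), (\d{4})\b")
PARTNER = re.compile(
    r"\b[A-Z][A-Za-z]* (?:Corporation|Co\., Ltd\.|Company Limited|Limited)"
)


def extract_candidates(region_text):
    """Return a dict mapping each attribute name to the candidates found in the region."""
    candidates = {"date": [], "name of business partner": []}
    for match in DATE.finditer(region_text):
        month = MONTHS.get(match.group(1))
        if month is not None:
            day, year = int(match.group(2)), match.group(3)
            candidates["date"].append(f"{month}/{day}/{year}")
    for match in PARTNER.finditer(region_text):
        candidates["name of business partner"].append(match.group(0))
    return candidates
```

Calling `extract_candidates("On Jul. 16, 2017, DEF Co., Ltd. and ABC Corporation signed the contract.")` would yield one candidate for “date” and two candidates for “name of business partner”, each of which could then be placed in the corresponding input field for confirmation by the user.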
In the foregoing exemplary embodiment, a configuration in which the setting screen 42 for attribute information is displayed on the UI device 20 and processing is executed has been described. However, the present invention is not limited to this. For example, by selecting document data and causing an execution bar to be displayed, for example, by right-clicking a mouse, an execution screen may be displayed, and processing may be executed.
Furthermore, in the foregoing exemplary embodiment, a configuration in which prior to adding attribute information to document data using determination information such as the format registration information 33, the proper noun registration information 34, and the like, format information and a string to be used as determination information are registered in advance in the format registration information 33 and the proper noun registration information 34 on the environment setting screen has been described. However, the present invention is not limited to this. On the setting screen 42 for adding attribute information, registration may be performed by displaying a screen for asking whether or not to register a string extracted from a specified region in the format registration information 33 or the proper noun registration information 34.
In the foregoing exemplary embodiments, a case where the present invention is applied to attribute information adding software has been described. However, the present invention is not limited to this. The present invention may also be applied in the same manner to any type of software including a configuration in which an editing operation is performed on document data or the like.
For example, the present invention may be applied in the same manner to software that performs an editing operation on document data at a portable information terminal apparatus or the like such as a smartphone or a tablet terminal apparatus as well as software that edits document data at a personal computer.
Furthermore, in an exemplary embodiment, a program executed by an information processing apparatus may be provided by being stored in a computer-readable recording medium such as a magnetic recording medium (a magnetic tape, a magnetic disk (an HDD, a flexible disk (FD), etc.)), an optical recording medium (an optical disk (a compact disk (CD), a digital versatile disk (DVD)), etc.), a magneto-optical recording medium, a semiconductor memory (a flash ROM etc.), or the like. Furthermore, the above program may be downloaded via a network such as the Internet.
The foregoing description of the exemplary embodiments of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in the art. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, thereby enabling others skilled in the art to understand the invention for various embodiments and with the various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents.
| Number | Date | Country | Kind |
|---|---|---|---|
| 2017-222147 | Nov 2017 | JP | national |