This invention relates to content or document management, and more specifically to a content management system and method in which unstructured content may be imported into the system for routine workflow processing.
Many enterprise organizations, such as financial and insurance companies, utilize automated workflow management (WM) systems and methods to process documents, images, multimedia, or other information (hereafter referred to as content). For an insurance company, information processed by automated workflow operations may include insurance claims, issuances of new policies, coverage adjustments, and updating of customer accounts. Such information is typically imported into the system, but the importation task may become challenging when the information resides in an unstructured file format or structure. Volumes of content information obtained from multiple branch offices of an organization are usually batch-processed at a central processing center, so the task of acquiring information into the automated workflow system could become insurmountable when attempting to import and process unstructured content.
Accordingly, it is desirable to provide a business or other enterprise with the flexibility to support unstructured or complex file structures when importing content into a content management system for automated workflow processing. Further, it is desirable to add this flexibility to an existing import process of a content management system by associating the unstructured content or file to be imported with user-configurable importation rules.
The present invention, which we call Universal Imports, allows a user to import unstructured content with limited information about its data structure, to use the limited information to derive more detailed information about the place to store components of the unstructured content in the management system, to store the unstructured content in the management system using information provided by importation rules, to tag the imported content with additional indices for subsequent access and processing, and to create a work item for subsequent automated workflow processing with respect to the imported content.
According to the present invention, a business enterprise advantageously may build import rules to fit a specific importation need instead of being required to follow a rigid set of rules as in previous implementations. Formatting rules, for example, of an index file approximating the format of the unstructured content may be configurable to facilitate the importation process. In addition, unlike previous import methods and systems, the present invention supports complex file structures such as nested folders and attributes at any level. Previous methods and systems also did not support tasks/diaries at any level of automated content processing or a lookup/update ability. By contrast, the present invention allows an enterprise to import large volumes of unstructured data into a content management system that may require a complex list of rules and/or file structure.
According to a first aspect of the present invention, there is provided in a content management system having a repository of information organized according to an index file, an improvement comprising a method of importing unstructured content into the repository which includes providing a template (e.g., an XML template) of configurable import rules to enable retrieval of information components of the unstructured content; ascertaining at least one structural attribute of the unstructured content; enabling a user to configure import rules of the template according to the ascertained structural attribute(s); accessing and examining information components of the unstructured content according to the attribute(s); optionally tagging information components of the unstructured content according to a value of the accessed and examined information components; importing information components of the unstructured content into the repository of the content management system according to indices of the index file; identifying a workflow task with respect to the information components of the imported content; and processing a workflow task of the content management system relative to the imported content.
Other aspects of the method include iteratively defining import rules for the unstructured content according to structural information learned in a prior importation step in order to refine or make more definite the import rules, and associating information components of the unstructured content with respective indices of the index file prior to importing components of the unstructured content into the repository.
In accordance with another aspect of the invention, there is provided in a content management system a method of importing unstructured content comprising establishing indexing criteria for use in the content management system wherein the indexing criteria are defined to support a workflow processing scheme; examining the unstructured content to determine a preliminary file structure; providing a template of user-configurable import rules; configuring the import rules of the template according to the preliminary file structure of the unstructured content; importing the content into the content management system according to the import rules; and performing a workflow task with respect to the imported content. Additional aspects include the steps of providing a record for each page in a records database used to store and retrieve pages of information of the unstructured content, and indexing each page of the imported content to provide a reference useful to retrieve each page in the content management system.
In yet another aspect of the invention, there is provided a content management system useful for importing unstructured content wherein the system comprises a template of configurable import rules; a user interface module to provide a user interface that enables a user to configure the template; a repository to store information; a retrieval module to access and retrieve information components from a storage medium containing the unstructured content where the accessing and retrieval are performed according to the template of configurable import rules; an indexing module responsive to the retrieval module to store information components of the unstructured content in the repository according to indices of an index file; and a workflow processing module that accesses the repository to process information components of the imported, unstructured content.
The system may further include a tagging module responsive to the retrieval module to tag respective information components of the unstructured content according to a value thereof, wherein the indexing module effects storage of the components in the repository according to the tag. Information components may be tagged according to a field reference value or a literal value, and the retrieval module may retrieve information components of the unstructured content according to a data field of a record and/or a delimiter of the unstructured content. In addition, the user interface module may enable the user to iteratively reconfigure the import rules of the template based on the nature or character of imported records of the unstructured content observed during a prior importation. Further, the user interface module may enable the user to reconfigure indices of the index file in order to alter the structure of the repository in which information components of the unstructured content are stored.
Other aspects of the invention will become apparent upon review of the following description taken in connection with the accompanying drawings. The invention, though, is pointed out with particularity by the appended claims.
Universal Imports advantageously provides a user with the ability to create file structure, to create/update files and file attributes, to create/update folders and folder attributes, to create/update documents and document attributes, to create pages, to create/update tasks and task attributes at any level, to create/update task notes, to create diaries at any level, to import into nested or repeatable folders, to add documents to any folder, to add pages to any document, and to add/remove file marks and add page marks. In practice, Universal Imports is implemented in an ImageRight Content Management and Workflow Management System that is specially designed for insurance and financial companies, which system and software are commercially available from Vertafore, Inc. of Bothell, Wash.
A list of configurable importation rules in an XML template provides the primary tool used to accomplish the flexibility of the present invention. The template allows a workflow administrator or other user to define a set of rules that dictate where the import process obtains the unstructured information and how users of the content management system will work with that information. The template follows the XML specifications published by the W3C (http://www.w3.org/XML/).
Satellite nodes 14, 16, and 18 serve respective users 20, 22, 24, 26, 28, 30 of the remote satellite offices where human users may also process files, import content, or perform work assignments. Each of the nodes 12, 14, 16, and 18 may include data processing devices or servers that manage, store, and/or effect transfer of files and other information locally or remotely via a network 19, as well as a user interface (e.g., display, keyboard, mouse, etc.) to enable a user to communicate with the system. These nodes also generate graphical user interfaces on a display device, subsequently described, that enable users to define or dynamically define processing parameters for performing a principal workflow task and various subtasks thereof. In particular, processors at one or more of the nodes 12, 14, 16, and 18 include executable program modules to implement the process steps set forth in
To begin, method 31 of
Universal Imports allows a user to examine existing structure based on one or more attributes of that structure. The lookup rules may contain only that which is needed to find the structure. An example may be an attribute on a policy folder called CLAIMANT CODE. If the CLAIMANT CODE is adequate to make that structure unique, then that is all that is needed, as illustrated by the following XML code segment.
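By way of a minimal sketch only, a lookup of this kind may be modeled as matching a rule's attribute criteria against existing folders. The element names (Lookup, Folder, Attribute), the sample folder data, and the function find_folders are hypothetical and chosen for illustration; they are not taken from the ImageRight template schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical lookup rule: element and attribute names are illustrative
# assumptions, not the actual template schema.
LOOKUP_RULE = """<Lookup>
  <Folder type="POLICY">
    <Attribute name="CLAIMANT CODE" value="C-1001"/>
  </Folder>
</Lookup>"""

# Stand-in for structure that already exists in the content management system.
EXISTING_FOLDERS = [
    {"type": "POLICY", "attributes": {"CLAIMANT CODE": "C-1000", "STATE": "WA"}},
    {"type": "POLICY", "attributes": {"CLAIMANT CODE": "C-1001", "STATE": "OR"}},
]


def find_folders(rule_xml, folders):
    """Return the folders whose type and attributes satisfy every criterion in the rule."""
    rule = ET.fromstring(rule_xml).find("Folder")
    wanted = {a.get("name"): a.get("value") for a in rule.findall("Attribute")}
    return [
        folder
        for folder in folders
        if folder["type"] == rule.get("type")
        and all(folder["attributes"].get(k) == v for k, v in wanted.items())
    ]


# Only the criteria needed to make the structure unique are supplied.
matches = find_folders(LOOKUP_RULE, EXISTING_FOLDERS)
```

In this sketch the rule carries a single attribute because, by assumption, CLAIMANT CODE alone identifies the folder; additional Attribute elements would narrow the match further.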
A next step 36 of the exemplary method 31 includes enabling a user via a user interface, for example, to configure import rules of the XML or other template according to the ascertained structural attribute(s) of the unstructured content. Formatting rules within the template are configurable through a user interface, such as provided by an ImageRight Enterprise Management Console, commercially available from Vertafore, Inc. Using the console, the user may specify whether the index file is fixed width or delimited. Fixed width means that the values have an offset or starting position and have a length, such as illustrated in
Step 38 of the exemplary method includes accessing and examining information components of the unstructured content according to one or more structural attributes. Step 40 includes optionally tagging one or more information records or components of the unstructured content according to a value of the accessed and examined information components. Tagging the content means setting up attributes associated with components of the file structure and setting those values using the template. In addition to the Lookup section noted above, the user may be provided with a section to create or update content information, as illustrated below.
The method further includes a step 42 of importing information components of the unstructured content into a repository of the content management system according to indices of the index file.
An index file provides information on where to place the unstructured content into the content management system. Indices of the index file may, for example, identify a location, drawer, file type, file number, folder type and document type, such as illustrated by the following format.
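As a hedged sketch, a single index record carrying these six indices may be read in either of the two formats the console allows, delimited or fixed width. The field order, the pipe delimiter, the offsets and lengths, and the sample values are assumptions made for illustration and do not reflect a documented ImageRight layout.

```python
# Field order and fixed-width layout below are illustrative assumptions.
FIELDS = [
    "location", "drawer", "file_type",
    "file_number", "folder_type", "document_type",
]

# (offset, length) of each field in a hypothetical fixed-width record.
FIXED_LAYOUT = {
    "location": (0, 4),
    "drawer": (4, 4),
    "file_type": (8, 4),
    "file_number": (12, 10),
    "folder_type": (22, 4),
    "document_type": (26, 4),
}


def parse_delimited(record, delimiter="|"):
    """Split a delimited index record into named indices."""
    values = [value.strip() for value in record.split(delimiter)]
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    return dict(zip(FIELDS, values))


def parse_fixed_width(record, layout=FIXED_LAYOUT):
    """Slice a fixed-width index record using each field's offset and length."""
    return {
        name: record[offset:offset + length].strip()
        for name, (offset, length) in layout.items()
    }


delimited_record = "SEA|POL|AUTO|PA-000123|CORR|APPL"
fixed_record = "SEA POL AUTOPA-000123 CORRAPPL"

from_delimited = parse_delimited(delimited_record)
from_fixed = parse_fixed_width(fixed_record)
```

Both calls yield the same mapping of index names to values, so later import steps need not depend on which formatting rule the administrator selected.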
The exemplary method 31 further includes a step 44 of identifying (which includes creating and/or selecting) a workflow task with respect to the information components of the imported content; and a step 46 of processing a workflow task of the content management system relative to the imported content. A workflow simulates a business process of the enterprise. The task proceeds from step to step until it reaches the end of the workflow, that is, the end of the business process. A common example of an automated workflow process would be processing a new application for insurance, and a common step within that process would be indexing policy information. Indexing may include associating the new application with an existing policy or creating a new policy.
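The step-to-step progression of a task may be pictured with the following minimal sketch. The WorkItem class, the content identifier, and the step names other than indexing policy information are hypothetical and serve only to illustrate a task advancing until the workflow ends.

```python
from dataclasses import dataclass, field

# Hypothetical steps of a new-application workflow; only the indexing step
# is named in the description, the others are assumed for illustration.
NEW_APPLICATION_WORKFLOW = [
    "Receive Application",
    "Index Policy Information",
    "Review Application",
]


@dataclass
class WorkItem:
    """A task created for imported content that moves through workflow steps."""
    content_id: str
    steps: list
    position: int = 0
    history: list = field(default_factory=list)

    @property
    def finished(self):
        return self.position >= len(self.steps)

    def advance(self):
        """Complete the current step and move to the next one."""
        if self.finished:
            raise RuntimeError("workflow already completed")
        self.history.append(self.steps[self.position])
        self.position += 1


item = WorkItem(content_id="import-0001", steps=NEW_APPLICATION_WORKFLOW)
while not item.finished:
    item.advance()
```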
The Appendix shows an example template that contains the rules for an import process. As noted, the template allows for field reference, literal, or other values. Literal values can be used if the information does not need to be dynamic for each import file. Field reference values can be configured by the user as a list of named fields along with details of where those values reside. The named fields may be referenced from the template and then used to retrieve data during the import processing. The following segment of code in the appendix shows an example of literal and field reference value types.
The literal value in this example will not change unless the administrator changes the template. The field reference value may change with every import record.
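The distinction between the two value types may be sketched as follows. The element names (Attributes, Attribute, Literal, FieldRef), the attribute and field names, and the function resolve_attributes are assumptions made for illustration and are not drawn from the template in the Appendix.

```python
import xml.etree.ElementTree as ET

# Hypothetical template fragment: one literal value and one field reference.
TEMPLATE = """<Attributes>
  <Attribute name="SOURCE"><Literal>BRANCH IMPORT</Literal></Attribute>
  <Attribute name="CLAIMANT CODE"><FieldRef>ClaimantCode</FieldRef></Attribute>
</Attributes>"""


def resolve_attributes(template_xml, record):
    """Resolve each template attribute against one import record.

    A Literal is copied as written in the template; a FieldRef is looked up
    by name in the named fields parsed from the current record.
    """
    resolved = {}
    for attribute in ET.fromstring(template_xml).findall("Attribute"):
        literal = attribute.find("Literal")
        field_ref = attribute.find("FieldRef")
        if literal is not None:
            resolved[attribute.get("name")] = literal.text
        elif field_ref is not None:
            resolved[attribute.get("name")] = record[field_ref.text]
    return resolved


# Two import records with different values for the same named field.
first = resolve_attributes(TEMPLATE, {"ClaimantCode": "C-1001"})
second = resolve_attributes(TEMPLATE, {"ClaimantCode": "C-2002"})
```

Under these assumptions the SOURCE attribute is identical for both records, while CLAIMANT CODE follows the named field of each record.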
The name in the template matches one of the items in the table. The administrator also has the ability to exclude data that was previously required but not needed in the import process. This greatly simplifies the user's creation of the index files along with any future maintenance required for the import process. This example demonstrates the complexity that can be achieved through Universal Imports that cannot be achieved through legacy imports.
Although the invention has been described relative to exemplary hardware and software modules, it is within the skill of the ordinary artisan based on the teachings herein to alter, modify, or rearrange various elements of the apparatus and method without departing from the scope of the invention. Accordingly, the invention is defined by the appended claims rather than by what is shown or described herein.