1. Field of the Invention
The present invention relates generally to the manipulation of stored data, more particularly to systems and methodologies for the capture, transmission, management, storage, retrieval and display of document images in a shared-system environment using the Internet or other network.
2. Background of the Present Invention
Since the introduction of paper, there has been the problem of storing documents and making them readily available for later use. As society entered the Information Age, an ever growing mountain of paper documents became increasingly difficult to store and manage. Certain document-intensive industries, such as banking, have come under increasing pressure to manage this problem.
With the advent of the computer and increasing data storage capabilities, text and image-based data are now being electronically stored at an even greater pace. Since geographical images require considerably more storage space and processing power to manipulate than a simpler text-based system, conventional commercial computer systems heretofore have been unable to adequately service this growing segment of the industry, e.g., due to inadequate storage capacities on other technological bottleneck. An additional problem with image-based information is the inability at present to search the graphical image itself and the need to correlate the image with sufficient relevant text-based data to permit search or query capability and retrieval.
Conventional models for document imaging systems involve usage of imaging equipment and software at a single central facility, e.g., at a hospital or bank, under the control and direction of a central computer at that facility. Under this model, however, companies having multiple offices, desiring to centralize their records, have to ship their documents (either physically or electronically) to a central computer for centrally storing all of the documents and permitting access via phone or other dedicated lines.
Despite the advent of networking, e.g., local area networks or LANs and now the Internet, this central computer model has nonetheless retained hold. With the emergence of the Internet as a platform for commerce, however, new paradigms of operation became possible. Instead of companies investing heavily in equipment and manpower to support the scanning, indexing and storage of their own documents, companies could eliminate this entire overhead by outsourcing these and other data management functions. Applicants have recognized the need for this and other such services and have designed an improved system and methodology for servicing this heretofore unrecognized but greatly desired need.
It is, therefore, an object of the present invention to provide an improved system and methodology for document storage, management and retrieval.
It is also an object of the present invention to provide an improved remote distributed capture system, eliminating the need to ship documents to a central point for processing.
The present invention is directed to a document imaging platform system and methodology for capturing, transmitting, storing, retrieving and displaying documents in a shared-system environment using the Internet or other network. Through utilization of thumbnail images along with full images, transmission of multipage documents is facilitated, avoiding system bottlenecks. Document security is hierarchically based with document control being available to system users in addition to system administrators.
The disclosed invention will be described with reference to the accompanying drawings, which show important sample embodiments of the invention and which are incorporated in the specification hereof by reference, wherein:
The numerous innovative teachings of the present application will be described with particular reference to the presently preferred exemplary embodiments. However, it should be understood that this class of embodiments provides only a few examples of the many advantageous uses of the innovative teachings herein. In general, statements made in the specification of the present application do not necessarily delimit any of the various claimed inventions. Moreover, some statements may apply to some inventive features but not to others.
With reference now to
At the user side, designated generally in
During capture, designated generally by the reference numeral 102, the document image is temporarily stored in a local cache directory 118 for subsequent transfer. It should, of course, be understood that after capture, the document may be indexed, providing a variety of textual indicators useful in later identifying that particular document from a potential myriad of similar-looking documents.
Indexes applied to and associated with each image are stored in the central database repository 134, and are referenced by the image identification number assigned during scanning. With further reference to
With reference again to
It should, of course, be understood that the Web server 124 used for communications between the user side and the server side should be configured for maximum encryption or other security algorithms to maintain data privacy, and hinder eavesdropping or other potential intrusions.
It should be understood that the respective capture stations 102 are assigned a unique system-wide identification number, which allows the central servers to reliably know where data is coming from. Once a batch has been created, the central site, i.e., the repository, maintains an audit record of everything that happens to the batch throughout its life-cycle, and this audit is available real-time to the user. Transfer of batch data happens real-time during scan and index. Transmission of images is offloaded to an unattended application, the cache controller 120. The cache controller 120 receives all instructions from the central server, which tells it which images need to be transferred and which batches are eligible for deletion from the remote cache. The central server also provides an operations person the ability to schedule when each individual remote site 118 can send images (the transmission window), allowing Applicants to level-load the network bandwidth.
It should be readily understood that additional devices may be employed to forward non-indexed images to the central image repository 132 for storage therein. For example and with reference again to
In a similar fashion, a networked digital scan device 112, such as a Digital Sender device made by Hewlett Packard, or a Document Centre device 114, such as made by Xerox, may be employed to scan and forward document images as electronic mail attachments to the electronic mail server 128, as discussed hereinabove. Additionally, an electronic mail application (designated generally by the reference numeral 116) could be used to forward document images as attachments to the electronic mail server 128. As discussed, all of the transmissions to the e-mail server 128 are forwarded to the image repository 132 by the e-mail import device 129. It should be understood that the e-mail import device 129 also created image records in the database repository 134, and identifies these image records as non-indexed. The non-indexed records are then available for indexing by any capture station 102 having access to the central system.
Whereas the above describes various mechanisms for the capture and transference of images and index records to the central repository, additional functions permit management and manipulation of the images on the repository side.
With reference again to
Contents of the system journal 142 are available through an Audit and Reports interface (designated generally by the reference numeral 108) and provide the user with ad hoc report generation capabilities on any system activity.
With further reference to
In addition to the storage of full images of documents within the repository, smaller versions thereof, i.e., thumbnail images, are also stored. Upon document capture 102 and creation, the respective images and indices corresponding thereto are transmitted to the repository, where the respective thumbnail images are created and stored in a single file by system identification number. All document images (pages) are preferably stored in a single image format (no multipage TIFFs). When a user selects a document for viewing, the first page of the document is sent in full along with preferably all thumbnail images for all pages. Since the thumbnail images are considerably smaller than the original or full image size, e.g., the thumbnail image being less than about one kilobyte in size, the user has the ability to see a representation of all of the pages of the document in order to make a further viewing selection.
It should be readily apparent to one skilled in the art that minimizing data transference in this fashion greatly improves system performance and avoids unnecessary downloads.
Other repository-side functions are available to manipulate the images, including image processing techniques and other image cleanup techniques, described hereinabove in connection with reference numeral 146. Barcode processing can also be performed at the repository side to automatically extract index values from one or more barcodes affixed to the document images, also described hereinabove in connection with reference numeral 148.
Documents stored on the system are, of course, made available for search and display by a user. For example, and with reference again to the system configuration illustrated in
The advantages of this system configuration over prior document centralization methods are manifest. Instead of shipping countless pounds or tons of documents to a centralized facility for scanning, indexing and storage, this functionality can be distributed with scanning and perhaps indexing being localized endeavors, e.g., at bank branches, and electronic storage being centralized at the headquarters of an organization. Outsourcing one or more of these functions, expensive equipment, know-how and manpower can save a company considerable sums and generate efficiencies within the organization by simplifying the processing of and access to such data. Further, the cost of riding the technological wave of new software and hardware, perhaps the bleeding edge thereof, can be avoided through such outsourcing, leaving the responsibility for technological advancement and capability in the hands of those skilled in the area.
With reference now to
Both the API 202 and the Application Server 220 govern contact with a backend program 222, e.g., the aforedescribed repository interface 130 in
With further reference to
In addition to offering an improved paradigm over conventional document retention schemes, the present invention is also directed to improvements in the accessing of such documents, offering new techniques in security. As is understood in the art, security issues in the single facility model are governed by an administrator who directly controls the administration of the entire system.
The present invention employs the hierarchical concept of an account, a domain, an application and index fields to categorize the information. For example, an account represents a contract with a customer, e.g., a university, to provide document imaging services. A domain is a facet of the account, e.g., a department within the university such as student records, and an application would represent instances of the domain, e.g., admissions or transcript records. The final layer of granularity is the index field, which defines documents in applications, e.g., student name. Instead of the system administrator for the account controlling access at all levels, i.e., no granularity of control, control or access can be granted to domains or applications, distributing security to end users in multiple tiers. In other words, the system and methodology of the present invention places full control of the lookup configuration directly in the user's hands and requires no special programming to implement.
An advantage of this approach is ready reconfigurability by the user instead of an administrator. A form of distributed security is possible where only viable index fields permissible to that user are presented and others masked. One mechanism for employing this aspect of the present invention is having the user themselves use Open Database Connectivity (OBDC) protocols to define the index fields or lookups. By using a standard interface accessible to a variety of database formats, such as ODBC, the user instead of a system administrator can control or configure what they see and how.
It should further be understood that although the present invention is currently implemented in Visual BASIC with ActiveX controls, additional software tools may be employed to practice the principles of the present invention. For example, at least one such software tool is JAVA, which would offer additional benefits to this innovation.
As will be recognized by those skilled in the art, the innovative concepts described in the present application can be modified and varied over a wide range of applications. Accordingly, the scope of patented subject matter should not be limited to any of the specific exemplary teachings discussed, but is instead defined by the following claims.
Number | Name | Date | Kind |
---|---|---|---|
5675507 | Bobo, II | Oct 1997 | A |
5699458 | Sprague | Dec 1997 | A |
5737599 | Rowe et al. | Apr 1998 | A |
5781785 | Rowe et al. | Jul 1998 | A |
5819301 | Rowe et al. | Oct 1998 | A |
5943679 | Niles et al. | Aug 1999 | A |
6073148 | Rowe et al. | Jun 2000 | A |
6105042 | Aganovic et al. | Aug 2000 | A |
6262732 | Coleman et al. | Jul 2001 | B1 |
6275829 | Angiulo et al. | Aug 2001 | B1 |
6324545 | Morag | Nov 2001 | B1 |
6350066 | Bobo, II | Feb 2002 | B1 |
6389159 | Gilman et al. | May 2002 | B2 |
6437797 | Ota | Aug 2002 | B1 |
6457026 | Graham et al. | Sep 2002 | B1 |
6496206 | Mernyk et al. | Dec 2002 | B1 |
6582475 | Graham et al. | Jun 2003 | B2 |
6613100 | Miller | Sep 2003 | B2 |
6665838 | Brown et al. | Dec 2003 | B1 |
6674472 | Tsutsui | Jan 2004 | B1 |
6700612 | Anderson et al. | Mar 2004 | B1 |
6781711 | Rijavec et al. | Aug 2004 | B1 |
6938051 | Burger et al. | Aug 2005 | B1 |
20050022133 | Sakamoto et al. | Jan 2005 | A1 |