This nonprovisional application claims priority under 35 U.S.C. § 119(a) on European Patent Application No. 04106116.9, filed on Nov. 26, 2004, the entirety of which is incorporated herein by reference.
1. Field of the Invention
The invention relates to a method of providing a digital document file based on a physical document using a desk top document scanning system. The method includes the steps of scanning over a field of interest and detecting manual gestures made by a user that indicate a usage of scanning results.
2. Description of Background Art
U.S. Pat. No. 5,511,148 discloses a feedback feature implemented in a copying environment through effecting a projection on a working plane, whilst effecting a certain manipulation by pointing and tapping with fingers on the working plane. The reference relates to the creating and processing of documents, whereas the present invention is directed to a scanning environment proper that wants to effect selecting among various different fields of usage, and thus selectively forwarding scanned data to a subsystem and/or software application associated to the selected field of usage.
U.S. Pat. No. 5,732,227 discloses a hand gesture-operated system including a display surface to be used as a desktop, on which document images and so-called “actual objects” may be displayed. Actual objects designate file handling operations, such as file storage, fax processing, keyboard entry, etc. Document images may be dragged to such an object to initiate the associated operation, by an operator using hand gestures over the display surface. However, this background art document does not disclose actual document scanning for obtaining document images. Rather, document images are generated digitally from document files and displayed to facilitate handling of them under hand gesture control. In this respect, the gesture processing is much more similar to the use of a mouse/cursor on a computer screen desktop than to scanner control.
Furthermore, the present invention recognizes the high worth of intuitive manipulation on an easy-to-understand level that requires little or no critical movements from a user.
In consequence, amongst other things, it is an object of the present invention to effect such selecting in a straightforward and uncomplicated manner, that would enhance possibilities for using documents and the like presented on a desk top.
A first aspect of the present invention is directed to a method of providing a digital document file based on a physical document, using a desk top document scanning system. The method comprises the steps of scanning over a field of interest and detecting manual gestures made by a user that indicate a usage of scanning results; detecting in said field of interest a substantially steady non-pointing first manual gesture by a user; determining an intended usage application selection from said gesture; executing a document scanning operation within the field of interest; and forwarding results of said scanning operation to the selected usage application as determined from the gesture.
In the above method, the gesture is substantially steady, which means that no prescribed motion is necessary to recognize the gesture. The gesture needs not point to a particular spot such as would be the case on a preformatted form, and the operation can thus be used for any document or document-like item, such as text written on an envelope or label. The field of usage does pertain to the usage of the document as such, which may contain text, graphics, images, and other. Generally, the size is such as fitting on a desk top and therefore, rather limited, such as no larger than standard A2, but this particular size is no express limitation.
In particular, said field of interest may be re-defined by detecting a second manual gesture by a user which second manual gesture is presented at said field of interest. To a certain extent, such detecting can imply both detecting proper and interpreting.
A second aspect of the present invention relates to a system that is arranged for implementing the method of the first aspect of present invention. The second aspect of the present invention is directed to a desk top document scanning system for operating in combination with a plurality of scan data usage applications. The system comprises a scanning facility for scanning over a field of interest; a detecting facility, connected to said scanning facility, that is arranged for detecting a substantially steady first manual non-pointing gesture by a user, which gesture is presented at said field of interest as representing said usage; a selection determining facility, connected to the detecting facility, for determining a selection of a said usage application, based on said detected gesture; and a forwarding facility for forwarding results of scanning a document placed in the field of interest selectively to a selected one of said usage applications.
Further scope of applicability of the present invention will become apparent from the detailed description given hereinafter. However, it should be understood that the detailed description and specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
These and further features, aspects and advantages of the invention will be discussed more in detail hereinafter with reference to the disclosure of preferred embodiments of the invention, and in particular with reference to the appended Figures that illustrate:
b-1d illustrate action gestures (as opposed to the selecting gesture of
As an alternative to the area selection procedure as described in relation with
In a practical implementation, the area is first selected (if appropriate), and thereupon the Action gesture is detected.
For a multi-page document, the first page is presented, the area is selected, and then a so-called “set” gesture is entered, which is formed by, e.g., four extended fingers. The two hand poses are repeated for every page. Alternatively, the area selection gesture may be left out for the succeeding pages. After the last page has been entered, the Action gesture is presented by the user. In this case, the pages are scanned after the gestures proper. However, a different sequence would well be feasible.
By itself, recognition of the hand shape proper is well-known to persons skilled in the art. Known methods are, e.g., template matching, contour matching, Eigenface Matching, and neural network application. This aspect is; however, not part of the present invention.
In a practical embodiment, the camera that is used for the scanning process generates, for instance, 12 images per second. Regarding operating parameters of this embodiment, after selecting the region of interest, at least one image of the following 10 images must be interpretable as an Action command, with a matching score of at least +0.8 in a range from −1 to +1.
Selecting the region of interest needs to give rise to at least five recognized locations, because such would already be sufficient for interpretation of a rectangular area.
An action gesture must yield a matching score of 0.8 or up in at least 8 from 10 successive images. The recognition must be relatively secure, because it will start executing a scanning process immediately. This is particularly important in scanning multi-page documents, since additional, erroneous, images in the sequence are annoying. Further, some motion may occur during the detecting. However, the pose itself must remain substantially unchanged. Of course, other parameters would apply to other embodiments, security level wanted, etc.
Various alternative camera locations are feasible, such as fixed to or pending from the office ceiling, etc.
Further to the executing of the gesture, the system finds the location information and calculates the region of interest, S54, possibly depending on the manner in which the gesture is executed (cf.
In the third place, if the Action command gesture specifying the selected field of usage is entered by the user after the location information gesture, in block S56 any necessary post-processing steps dedicated to the selected field of usage are determined. From then on, certain postprocessing steps may follow (cf.
In a basic embodiment of the invention, the scanner system is a personal gadget dedicated to one user. In that case, the destinations used by the e-mail and archive applications, and also for printing, may be pre-programmed as the e-mail address of the user and a dedicated directory within the user's computer system, respectively.
In a more elaborate embodiment, the scanner system may be a shared appliance in a multi-user environment. In that case, it would be preferable to include a user recognition function in the system. e.g., the scanner might be provided with a reader for, possibly remotely readable, identity cards, such as cards including an RFID tag, or with a device for recognizing biometrical characteristics, such as a fingerprint reader. Such elements could easily be incorporated in the construction of the scanner system, as already mentioned above as the Other Facilities 88. Also, an identity card may carry a machine-readable code, such as a bar code, and may be presented to the scanner, that can read it and so identify the user.
Also, and preferably, the system might be able to recognize a user by analysing the biometrical characteristics of the user's hand as a part the process of analysing the gesture. It is well-known from scientific research that hands of different persons are sufficiently different to enable identification by analysing the dimensions of fingers, phalanges and knuckles, especially in limited groups of people.
In this embodiment, the system may include a pre-programmed database of users with their identifying data and their preferred scan data destinations, such as an e-mail address and archiving storage locations, or a preferred printer. When a user presents his hand at the scanner field-of-view, or enters his identity data otherwise, he will automatically be recognized and his preferred scan data destination looked-up and applied.
Of course, a shared scanner may also be connected to a computer standing at its side and implementing a conventional user interface for selecting a destination.
From the above, it would be clear that the scanning procedures may be executed in various different manners. For example, the scanning proper and the two tiers of gestures may be effected in various different orders, which need not be uniform in a particular application. Furthermore, a single gesture pair may control the processing of a sequence of scans or pages. In such case, the pages are presented after the gestures. The page sequence may be started and terminated by specific gestures. Another specific gesture may be used as an ignore or cancel signal; in particular, the latter may again be a moving gesture. In principle, the number of gestures made by a single hand is relatively large, even while taking into account that various combinations are difficult or impossible for certain persons. Note that in particular the thumb has various distinctive poses possible. The gestures may be made by the right hand alone, or by either left or right hand, both hands then yielding the same or different meanings. In principle, even a two-handed gesture would be feasible, such as a cross. The color of the hand is in particular arbitrary, but some care may have to be taken to distinguish the hand color from the background.
Now, the present invention has hereabove been disclosed with reference to preferred embodiments thereof. Persons skilled in the art will recognize that numerous modifications and changes may be made thereto without exceeding the scope of the appended Claims. In consequence, the embodiments should be considered as being illustrative, and no restriction should be construed from those embodiments, other than as have been recited in the Claims.
Number | Date | Country | Kind |
---|---|---|---|
04106116.9 | Nov 2004 | EP | regional |