The present invention relates generally to electronic devices having digital camera functionality. More particularly, the present invention relates to digital camera devices which utilize optical character recognition to name image files.
Digital cameras are quickly becoming the principal photography device for most households. Indeed, many electronic devices are being integrated with digital camera functionality. For example, mobile telephones which include a digital camera are becoming increasingly common. The digital format allows users to easily and economically take and share large numbers of photographs. As a result, there is a need for an organizational system in storing the digital images.
One common problem that arises with current storage systems concerns the naming of each electronic file. In general, digital cameras save image files using a file name that is determined manually or via a default naming system. Manual entry normally allows for a more meaningful name to be assigned to a file. However, this is a rather arduous task with most digital cameras due to the relatively small size of the user interface. In addition to manual naming, most digital cameras include a default naming mechanism. Various systems are known in the art for providing a default name for files. Perhaps the most common default naming system involves a consecutive naming system where each file is named with a number in a consecutive sequence.
However, default naming, even where consecutive, often provides little or no useful information regarding the file. This makes it hard to recognize a specific image based on the file name. In fact, this problem is exacerbated as the memory size of devices grows, since this will often result in many more images stored in the memory of the device. In addition, where a user transfers files from a digital camera to another electronic device, such as a computer, different files often have the same name. This is particularly true where the default naming system involves a small series of numbers, or where the default naming system resets often, thus generating different images with the same file name. Also, image galleries on personal computers or web servers can hold a plethora of pictures, making it difficult to locate a specific picture where the default name is not known. Therefore, it would be beneficial to have an image named according to some memorable feature which serves to identify the content of the image to a user.
The present invention provides for the naming of digital camera image files using optical character recognition (OCR). A device incorporating the present invention can identify characters in an image and use those characters for creating a file name for storing the image in a memory unit. OCR refers to the branch of computer science that involves reading text from an image and translating the images into a form that the computer can manipulate (for example, into ASCII codes).
When an image is captured, a digital camera of the present invention automatically performs OCR for the image and determines if there is any text in the image. If text is found, then the image is either named according to the text, or the text can be proposed to a user, with the user allowed to select and/or edit the file name. This ability to select the name is especially useful in situations where there is more than one text string in an image. Thus, the name of the image is more easily recognizable than when default names are used. A user is able to better identify, organize, and find images with the help of more appropriate file names with the present invention.
In one embodiment of the present invention, more than one text item may be present in an image. The image is named according to a selected criteria or a group of criteria such as, but not limited to, the text size, text color, text length, the position of the text in the image, and combinations thereof. In one embodiment of the present invention, a user may select from a plurality of settings which determine what naming scheme is used.
These and other objects, advantages, and features of the invention, together with the organization and manner of operation thereof, will become apparent from the following detailed description when taken in conjunction with the accompanying drawings, wherein like elements have like numerals throughout the several drawings described below.
A generic digital camera constructed according to one embodiment of the present invention is shown at 10 in
When a digital file capturing an image 28 is created, the at least one lens 12 focuses the image 28 onto the at least one image sensor 18 which electronically records light reflected from the image 28. The camera processor 16 then breaks this electronic information down into digital data (via an analog-to-digital conversion) for a digital image which can be stored on a memory unit, such as the primary memory unit 14 and/or the secondary memory unit 20, as a file. The digital camera 10 also includes a data communication port 22 to enable the transmission of digital images from the digital camera 10 to a remote terminal, such as a personal computer 24. The data communication can be in either wired or wireless form and can be configured for USB, Bluetooth, infrared, or other connections. The digital camera 10 also includes one or more input buttons 26 for entering information and/or taking a picture, although input buttons 26 could also be remote from the digital camera 10.
The digital camera of the present may be one component of another device such as a video camera, a mobile telephone, a personal digital assistant, a watch, or an audio player. When the digital camera is a component of another device, various parts may be common to the devices. For example, in one embodiment of the present invention, a mobile telephone includes a digital camera component and a telephone component, both of which may share a housing, memory, OCR, processor, etc.
All OCR systems include an optical scanner for reading text and software for analyzing images. As used in this application, OCR refers to all types of optical scanning systems such as, but not limited to, OCR, intelligent character recognition (ICR), and optical mark reading (OMR). In one embodiment of the present invention, the optical scanner comprises the digital camera 10. In addition, most OCR systems use a combination of hardware (such as specialized circuit boards) and software to recognize characters, although some systems function entirely through software. In accordance with the principles of the present invention, OCR can be used to identify characters and/or words in an image. There are two common methods used for OCR: matrix matching and feature extraction.
In one embodiment of the invention, matrix matching is utilized. Matrix matching compares what the OCR module 21 sees as a character with a library of character matrices of dots. When a character matches one of these prescribed matrices of dots within a given level of similarity, the computer labels that image as the corresponding ASCII character. Matrix matching works best when the OCR encounters a limited repertoire of type styles, with little or no variation within each style.
In another embodiment, feature extraction is utilized. Feature extraction is OCR without a reliance on matching to predetermined templates. Feature matching is typically referred to as ICR or Topological Feature Analysis. This method relies on the software to perform an “intelligent” analysis of the image. For example, in one embodiment, an OCR module using feature extraction looks for general features, such as open areas, defined shapes, horizontal, diagonal, and vertical lines, and line intersections. In general, matrix matching works best when the image contains only basic text fonts, sizes, and variations of text. In contrast, feature extraction generally provides superior results where the characters are less predictable.
In accordance with the principles of the present invention, file naming with OCR can work as part of capturing a new image, or as part of browsing existing images. In one embodiment, the OCR module 21 operates on an image immediately after it is captured and processed into a digital format by the camera processor 16. In an exemplary embodiment, the user is prompted at step 215 to save the image with a file name suggested from the text recognized by the OCR module 21. In one exemplary embodiment, following suggestion of a file name at step 215, the user may accept the proposed new name (step 217), reject the new name (step 219), or manually change the name (step 221). In another embodiment, the image is automatically saved at step 223 using the assembled words from step 213.
In another embodiment of the invention, the OCR module 21 operates to analyze a digital image file when the digital image files is selected to be viewed. A suggested new file name for the image is provided based upon the OCR. The image may then be saved using this new, more-informative file name. In one embodiment, the prior file having the default name is deleted when the image is renamed based upon OCR.
In an exemplary embodiment, an image file is saved using a combination of information from the OCR module 21 and default data, such as a consecutive numbering system or time/date/year information.
In one embodiment of the present invention, a user may select from a plurality of settings which determine what naming scheme is used. The image is named according to a selected criteria or group of criteria such as, but not limited to, the text size, text color, text length, the position of the text in the image, and combinations thereof.
More than one text item may be present in an image. In one embodiment, the present invention includes one or more criteria for determining which of a plurality of text items in an image to use for naming the digital file. In one embodiment, a user may select one or more criteria for selecting one or more of the plurality of text to use in naming the digital image.
The following non-limiting examples illustrate operation of the invention. In one hypothetical situation, a user takes a holiday picture in the front of a hotel's main door using his or her camera-phone. While saving the picture, the phone's OCR module identifies the word “Hilton,” and proposes “Hilton June 23” as the default name for the picture. The user accepts the proposed file name and the image file is stored as “Hilton June 23.”
In another hypothetical scenario, a user browses images saved under a default naming system in a digital camera's memory. By background process, the phone OCR module is analyzing the pictures being browsed, and proposes a new name when the user opens a specific image.
The present invention is described in the general context of method steps, which may be implemented in one embodiment by a program product including computer-executable instructions, such as program code, executed by computers in networked environments.
Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Computer-executable instructions, associated data structures, and program modules represent examples of program code for executing steps of the methods disclosed herein. The particular sequence of such executable instructions or associated data structures represent examples of corresponding acts for implementing the functions described in such steps.
Software and web implementations of the present invention could be accomplished with standard programming techniques, with rule based logic, and other logic to accomplish the various database searching steps, correlation steps, comparison steps and decision steps. It should also be noted that the words “component” and “module” as used herein, and in the claims, is intended to encompass implementations using one or more lines of software code, and/or hardware implementations, and/or equipment for receiving manual inputs.
The foregoing description of embodiments of the present invention have been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the present invention to the precise form disclosed, and modifications and variations are possible in light of the above teachings or may be acquired from practice of the present invention. The embodiments were chosen and described in order to explain the principles of the present invention and its practical application to enable one skilled in the art to utilize the present invention in various embodiments, and with various modifications, as are suited to the particular use contemplated.