1. Technical Field
This application relates generally to electronic enhancement of printed material for immediate viewing. More specifically, the application relates to using a handheld electronic device containing a camera to provide magnification and enhancement to written text, even if the text is embedded within non-text material.
2. Description of Related Art
Ever since the invention of the printing press, people have increasingly relied on the printed word for knowledge and ease of communications. With reliance on the printed word, however, persons who have difficulty focusing on close material, such as those with presbyopia, or who find themselves in low lighting conditions can find themselves unable to read what is before them.
A number of devices exist for aiding vision in reading. For those with severe problems, reading machines provide a closed-circuit television, with a lens that is generally focused on a flat surface where books, pictures, and other printed matter can be placed. The reading machines can provide up to 30× or 40× magnification of printed matter. On the other hand, they are at least the size of a small television and cost several thousand dollars, making them impractical for limited budgets and for traveling away from home. Reading glasses and even lighted magnifying glasses are a more practical answer for most situations, but the user has to make an effort to have these devices with them, and may not bother if the need is irregular. When the need does arise, the user often must rely on others for help in deciphering the written material. A better solution is desirable.
Mobile telephones and personal digital assistants have become almost ubiquitous, with many of the newer devices combining the original capabilities with other devices, such as digital cameras. The addition of software that a) distinguishes print from graphics and b) magnifies and enhances the print can allow an electronic device containing a digital camera and display to provide magnification and enhancement of the printing, even when the words are embedded in or overlying graphics. At least some embodiments of the invention include the capability to recognize the language of the text and to provide a translation via online resources. At least some embodiments also include the ability to recognize different currencies and to provide conversions to another currency via online resources.
The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:
Hardware
Although the capability exists to add a digital camera into a large number of electronic devices, the most common devices to have these are mobile phones and personal digital assistants (PDAs). Wireless phones are fast becoming universal for anyone who can afford them, while PDAs are popular with those whose schedules are crowded. Both of these devices are small enough to be carried almost constantly by the user and hence are available at times when a stand-alone camera is not generally accessible.
With reference now to
Turning to
Baseband processor 132 provides for receiver and transmitter operations and is also referred to as a transceiver. In particular, baseband processor 132 handles all of audio, signal, and data processing needed to receive and send data using RF transmissions or Bluetooth transmissions. Application processor 134 provides the processing power for other functions within camera phone 130. For example, calculators, calendars, alarms, camera functions, and directories are provided through application processor 134. Flash/SRAM 136 is a storage device in which various instructions for providing the functions within camera phone 130 are located and provide upgrades. Flash card 138 is a storage device in which user data and applications may be stored. An example of flash card 138 is a secure digital card.
A pathway for the transmission of voice and other types of data is through RFIC 140. Additionally, short-range transmissions may be sent or received through Bluetooth unit 146. Bluetooth unit 146 conforms to Bluetooth wireless specification, which defines the link layer and application layer for product developers. Both of these transmissions are made through antenna 144 in this illustrative example.
Color LCD 148 provides a display for pictures and other data for camera phone 130. Camera 150, in this example, is a complementary metal oxide semiconductor (CMOS) camera that may be built into camera phone 130 or connected to camera phone 130 as a module, such as IC card 152. IC card 152 also may contain other application specific functions, such as a global positioning system (GPS) or other functions, such as a modem or additional memory.
Camera 150 forms the camera module of camera phone 130, while the other components form the digital phone module of camera phone 130 in these illustrative examples. Instructions or circuits are added to camera phone 150 to allow for control of the digital camera and allow the processing and display functions according to a preferred embodiment of the present invention.
Devices such as the disclosed mobile telephone/camera of
With reference now to
PDA 200 may also include keypad 204, speaker 206, and antenna 208. Keypad 204 may be used to receive user input in addition to using screen 202. Speaker 206 provides a mechanism for audio output, such as presentation of an audio file. Antenna 208 provides a mechanism used in establishing a wireless communications link between PDA 200 and a network. Add-on camera 212 is shown inserted into the expansion slot of PDA 200. PDA 200 also preferably includes a graphical user interface that may be implemented by means of systems software residing in computer readable media in operation within PDA 200.
Turning now to
An operating system runs on processor 204 and is used to coordinate and provide control of various components within PDA 200. The operating system may be, for example, a commercially available operating system such as Windows Mobile, which is available from Microsoft Corporation, or Palm OS from PalmSource Inc. Instructions for the operating system and applications or programs are located on storage devices, such as storage 212, and may be loaded into main memory 206 for execution by processor 204. Camera 236 is shown by dotted lines to illustrate that it is removable. Once inserted into the expansion slot, camera 236 is attached to bus 206.
Those of ordinary skill in the art will appreciate that the hardware in
Image Processing
The use of the inventive processing will now be discussed with reference to
Once the discriminator program has isolated the foreground material, a pattern recognition algorithm is used to identify the text portion of the image (step 314). This program can be simple, to merely distinguish text from images or the program can be an optical character recognition program, which recognizes letters and words. Having discerned the text from other material on the page, it is then possible to provide magnification and enhancement of the text to make it easily readable by the user (step 316). In one embodiment, an image-rendering algorithm is used to magnify the text as needed, while providing smoothing of the enlarged text as magnification increases. In an alternate embodiment, once the text is recognized, it can be reformatted into a different font or the font size increased to aid readability. Finally, the enhanced and enlarged image is displayed on the screen (step 318). A scroll button or a combination of other buttons on the device can be used to allow the user to scroll down or across the resulting text.
Options
In a preferred embodiment of the invention, an options page allows the user to adjust the settings on the smart magnification process. For example, the user can set a desired magnification level or a desired font size for display. As previously mentioned, the aggressiveness of the discrimination algorithm can be set by the user or allowed to default to preset values. In at least one embodiment, default values for all options are set when the phone is initialized. This can include an automatic setting for the discrimination algorithm.
When a character recognition program is used, the text can be scanned to determine the language of the photographed document. Most mobile phones and PDAs contain a language setting, which the user can set to their preferred language. If text is encountered in the document that are not in the chosen language, the inventive program can optionally provide a translation of the foreign word(s). The translation can then be inserted into the text after the translated words or can be used to replace them. The processing and storage capabilities of the device are generally limited, but the translation itself does not need to be done by the device, since it can interface to outside sources having much greater capabilities. Alternatively, a software module can provide the translation service within the device. Likewise, when currency amounts are present in the photographed document, the inventive program can utilize outside sources, such as currency exchange rates, to provide a conversion into a currency most familiar to the user. Again, the default value for the desired currency can be taken from the users' preferred language; alternatively, the desired currency can be set in the options page for the inventive program.
The options page can also provide the user a choice of the display they wish to receive. For example, in a darkened room, such as many restaurants in the evening, the user may find it easier to read white text on a black background, rather than the traditional black text on white background. Other variations can also be designed.
In at least one embodiment of the invention, a lens especially designed for the disclosed photographing of text can be used in addition to or instead of the original lens provided in the device.
The inventive device and method provide users with an ability to magnify and enhance text for reading, using only a device that many carry with them on a daily basis. This provides added value to the device and helps the user to cope in previously awkward situations.
The description of the present invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiment was chosen and described in order to best explain the principles of the invention, the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.