The present invention relates to a method for browsing multiple images on image display devices.
Given an image capture device, it is highly desirable to view the captured images stored on the device. In some cases, the primary function of the capture device is to view images stored therein.
As these devices become increasingly popular, and as their storage capacities increase the number of images to be viewed or selected increases. In contrast, as hard disk drives and other storage devices become smaller the size of the display becomes limiting for overall device size. Because of the physical size of these devices, their displays are necessarily limited in both size (they must fit on the device) and resolution (the human eye can resolve only a finite number of pixels at a given distance). Selecting, browsing, and otherwise accessing images on such small displays is not well supported by existing user interfaces. Current approaches to browsing and managing large image collections almost universally use the ‘light table’ metaphor, where images are represented as reduced-resolution ‘thumbnail’ images, often presented on a large, scrollable 2-D region. Images can be marked by scrolling until a desired thumbnail is visible, then selecting the desired thumbnail using a pointing device such as a mouse.
The current approaches are not suitable for small displays for a number of reasons. A primary drawback is that the reduced size and resolution of a small display does not permit further reduction in the thumbnail images. Consider that sizes small enough to permit many thumbnails to be visible will be too small to see the individual pictures. Larger size images will allow only a very few thumbnails to be shown at once. Additionally, the scrolling and selecting operations typically require a mouse or pointer not usually found on small devices.
A hyperbolic non-linear function has been used successfully in the Hyperbolic Browser developed at PARC for browsing trees and hierarchies (Lamping, J., R. Rao, and P. Pirolli, ‘A Focus+Context Technique Based on Hyperbolic Geometry for Visualizing Large Hierarchies’ in Proc. CHI 95, ACM Conference on Human Factors in Computing Systems 1995, ACM: New York). Browsing images using thumbnails is extremely well known in the art. Variants on this include warping the images and/or thumbnails using perspective (Juha Lehikoinen and Antti Aaltonen Saving Space by Perspective Distortion When Browsing Images on a Small Screen, OZCHI 2003, 26-28 Nov. 2003, Brisbane, Australia) or other distortions (Y. K. LY. K. Leung and M. D. Apperley A Review and Taxonomy of Distortion-Oriented Presentation Techniques. In ACM Transactions on Computer-Human Interaction (TOCHI), vol. 1 issue 2 (June 1994), pp 126-140), or using variable thumbnail sizes (Johnson, B., Shneiderman, B. Treemaps: a space-filling approach to the visualization of hierarchical information structures. In Proc. of the 2nd International IEEE Visualization Conference pp. 284-291 San Diego, October 1991 and Shingo Uchihashi, Jonathan Foote, Andreas Girgensohn, and John Boreczky. Video Manga: Generating Semantically Meaningful Video Summaries. In Proceedings ACM Multimedia (Orlando, Fla.) ACM Press, pp. 383-392, 1999).
This invention is a novel user interface for accessing multiple digital images. One embodiment of the invention, is applicable to small-format displays incorporated in handheld imaging devices such as digital cameras, camera-equipped cell phones, PDAs, and video cameras.
In this invention, algorithms to show multiple images at the maximum possible resolution are defined. Instead of reducing the resolution of each image, the portion that is actually shown is reduced. Selecting which part of each image to be shown is the subject of the invention.
Preferred embodiments of the present invention will be described in detail based on the following figures, wherein:
It may be that a static image produced in this way will be unsatisfactory, as the majority of the images can be obscured by the images at higher positions in the stacking order. In one embodiment of this invention, methods to dynamically change the revealed regions of each image in a fluid and rapid manner are proposed. Thus over a small amount of time many images can be fully viewed, even if only part of an image is exposed at a time.
In one embodiment of the invention, a ‘composite’ image is formed by one or more overlapping images with regions partially removed.
In an embodiment of the invention, parameters can be changed over time to reveal the different images and to change the region of each included in the ‘composite’ image. In an embodiment of the invention, ‘scrub’ is such a parameter (in analogy with the video editing technique to move precisely forward and backward in time). In an embodiment of the invention, ‘scrubbing’ selects the central image and thereby the neighboring images, i.e., the ‘set of display images’. Changing the ‘scrub’ parameter successively includes new images from the stack while hiding previously visible ones in the ‘composite’ image. Scrubbing moves the visible images up and down the stack (see
‘Zoom’ is another parameter, which can be used to control the displayed ‘set of display images’.
User Interaction
In an embodiment of the invention, the user is able to smoothly and quickly adjust the ‘zoom’ and ‘scrub’ parameters over a natural range. In an embodiment of the invention, the range allows that all images can be included in the ‘composite’ image over time and thereby viewed. Control over each parameter is ideally provided by a smoothly variable input device, such as a slider, dial, thumbwheel, or one axis of a joystick, mouse, stylus, or similar pointing device. In an embodiment of the invention, an interface that may be particularly suitable for small form-factor devices are tilt sensors; tilting left or right can ‘scrub’ deeper or shallower into the image collection, while tilting forwards or backwards can increase or decrease the ‘zoom’. In an embodiment of the invention, fully ‘zooming’ into a particular image is used to ‘select’ that image for further operations, such as marking, printing, copying, or deletion.
Image View Regions
The extent of the visible region is determined for the ‘composite image’, given values for the ‘scrub’ and ‘zoom’ parameters. In different embodiments of the invention, there are a large number of possible mappings. Consider one in detail so that it can be understood and extended. This mapping is explained using the ‘stack analogy’. For simplicity, visible regions are considered as rectangles that are the same height as the display (however, as discussed above in an embodiment of the invention, all images can be resized by padding or cropping to the best aspect ratio). Seen from the side, the images look like an array of parallel lines, as shown in
Moving the intercept line up and down changes the ‘scrub’, as successive images are revealed or concealed as shown in
In an embodiment of the invention, ‘zoom’ and ‘scrub’ are orthogonal or independent; that is, the ‘zoom’ (angle) can be changed without affecting the ‘scrub’ (height), and vice-versa. In practice, this is a generally desirable property.
Nonlinear Boundaries
In an embodiment of the invention, image boundaries are more interesting than the equal-sized rectangles displayed in
In an embodiment of the invention, a non-linear function can work better than the equal-sized rectangles of
depth=round(zoom*atanh((2x−width)/width+scrub) equation 1
In an embodiment of the invention, image boundaries can change non-linearly in time and space. A full exploration of possible mappings can be appreciated by one of skill in the art. Additional embodiments of the invention that have functional and/or aesthetic value are presented. In an embodiment of the invention, a non-linear image boundary function is used as shown in
This representation has the advantage that all pictures become ‘clumped up’ in the corners and it is possible to estimate how many pictures are in the collection by the density of boundaries, even if the individual images are not visible. In addition, the image can easily be rotated 90 degrees. This can assist searching for image features in a particular region (for example if a user searches for a face in the top left corner, the boundary representation can be rotated so the top left is always in the central image and thus visible).
The foregoing description of preferred embodiments of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the claims and their equivalents.
Various embodiments of the invention may be implemented using a processor(s) programmed according to the teachings of the present disclosure, as will be apparent to those skilled in the computer art. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art. The invention may also be implemented by the preparation of integrated circuits and/or by interconnecting an appropriate network of component circuits, as will be readily apparent to those skilled in the art.
Various embodiments include a computer program product which can be a storage medium (media) having instructions and/or information stored thereon/in which can be used to program a general purpose or specialized computing processor(s)/device(s) to perform any of the features presented herein. The storage medium can include, but is not limited to, one or more of the following: any type of physical media including floppy disks, optical discs, DVDs, CD-ROMs, micro drives, magneto-optical disks, holographic storage devices, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, PRAMS, VRAMs, flash memory devices, magnetic or optical cards, nano-systems (including molecular memory ICs); paper or paper-based media; and any type of media or device suitable for storing instructions and/or information. Various embodiments include a computer program product that can be transmitted in whole or in parts and over one or more public and/or private networks wherein the transmission includes instructions and/or information, which can be used by one or more processors to perform any of the features, presented herein. In various embodiments, the transmission may include a plurality of separate transmissions.
Stored on one or more computer readable media, the present disclosure includes software for controlling the hardware of the processor(s), and for enabling the computer(s) and/or processor(s) to interact with a human user or other device utilizing the results of the present invention. Such software may include, but is not limited to, device drivers, interface drivers, operating systems, execution environments/containers, user interfaces and applications.
The execution of code can be direct or indirect. The code can include compiled, interpreted and other types of languages. Unless otherwise limited by claim language, the execution and/or transmission of code and/or code segments for a function can include invocations or calls to other software or devices, local or remote, to do the function. The invocations or calls can include invocations or calls to library modules, device drivers, interface drivers and remote software to do the finction. The invocations or calls can include invocations or calls in distributed and client/server systems.
This application claims priority under 35 U.S.C. 119(e) to U.S. Provisional Application No. 60/735,054 filed Nov. 8, 2005, by Jonathan Foote, entitled METHODS FOR BROWSING MULTIPLE IMAGES (Attorney Docket No. FXPL-01111US0 MCF/AGC) which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60735054 | Nov 2005 | US |