This application also relates to the following co-pending applications, each of which is incorporated herein by reference:
U.S. patent application Ser. No. 10/675,724, filed Sep. 30, 2003;
U.S. patent application Ser. No. 10/675,823, filed Sep. 30, 2003;
U.S. patent application Ser. No. 11/127,326, filed May 12, 2005;
U.S. patent application Ser. No. 11/128,543, filed May 12, 2005;
U.S. patent application Ser. No. 10/831,436, filed Apr. 23, 2004;
U.S. patent application Ser. No. 11/126,637, filed Apr. 15, 2005;
U.S. patent application Ser. No. 11/151,167, filed Jun. 10, 2005;
U.S. patent application Ser. No. 11/069,512, filed Mar. 1, 2005;
U.S. patent application Ser. No. 10/987,288, filed Nov. 12, 2004;
U.S. patent application Ser. No. 11/364,933, filed Mar. 1, 2006;
U.S. patent application Ser. No. 11/607,181, filed Dec. 1, 2006;
U.S. patent application Ser. No. 11/769,671, filed Jun. 27, 2007; and
U.S. patent application Ser. No. 11/865,112, filed Oct. 1, 2007.
With the proliferation of digital cameras and memory cards, consumers are taking more images than ever. However, people rarely consume and repurpose their images beyond individual image prints. It is not that richer storytelling and sharing experiences lack perceived value—people enjoy receiving media creations such as image collages, calendars and books. Rather, the problem is that for most users, converting an image collection into an artifact that captures the story or memory is difficult, because the tools available are either too complicated to learn, or oversimplified to the point that they lack sufficient flexibility. Consider the example of creating a collage. Most users do not have access to truly flexible image manipulation and layout software, let alone the time and inclination to develop their own techniques. As a result, in typical solutions, flexibility is traded for the ease of use offered by rigid templates.
What are needed are improved systems and methods for authoring image collages.
In one aspect, the invention features a method in accordance with which a user interface is displayed. The user interface includes a catalog area, a collage mock-up area, and a mode select interface control operable to select an operational state of the user interface. Thumbnails of respective images are shown in the catalog area. A layout of a subset of the images is presented in the collage mock-up area. An instance of a respective one of multiple types of user input gestures with respect to a target object displayed in the user interface is received. The target object is an instance of a respective one of multiple object types. In response to the receipt of the user input gesture instance and a determination that the user interface is in a first operational state, an instance of a first action type is performed based on the type of the received user input gesture and the object type of the target object. In response to the receipt of the user input gesture instance and a determination that the user interface is in a second operational state, an instance of a second action type is performed based on the type of the received user input gesture and the object type of the target object.
The invention also features apparatus operable to implement the inventive method described above and computer-readable media storing computer-readable instructions causing a computer to implement the inventive method described above.
Other features and advantages of the invention will become apparent from the following description, including the drawings and the claims.
In the following description, like reference numbers are used to identify like elements. Furthermore, the drawings are intended to illustrate major features of exemplary embodiments in a diagrammatic manner. The drawings are not intended to depict every feature of actual embodiments nor relative dimensions of the depicted elements, and are not drawn to scale.
An “image collage” is a composition of images on a page.
An “image” broadly refers to any type of visually perceptible content that may be rendered on a physical or virtual page. Images may be complete or partial versions of any type of digital or electronic image, including: an image that was captured by an image sensor (e.g., a video camera, a still image camera, or an optical scanner) or a processed (e.g., filtered, reformatted, enhanced or otherwise modified) version of such an image; a computer-generated bitmap or vector graphic image; a textual image (e.g., a bitmap image containing text); and an iconographic image. In the illustrated embodiments, each of the images has a respective aspect ratio, which is the ratio of image height to image width. Each variable-area image may be assigned a respective positive scalar-valued nominal size. The term “nominal size” (also referred to as “relative area”) refers to a designated or theoretical size that may or may not vary from the actual or rendered size, where the “size” of an image is the amount of area of a page that is occupied by the image. In some embodiments, the user is allowed to set the nominal size values that are assigned to the images. In other embodiments, the image collage authoring system automatically assigns the nominal size values to the graphic objects.
A “thumbnail” is a reduced-resolution version of an image.
As used herein, the term “page” refers to any type of discrete area in which graphic objects may be laid out, including a physical page embodied by a discrete physical medium (e.g., a piece of paper) on which a layout of graphic objects may be printed, and a virtual, digital or electronic page containing a layout of graphic objects that may be presented to a user by, for example, an electronic display device.
A “user input gesture” is any type of input that is received from a user and may be interpreted as a command. The input may correspond to any type of input that is generated by a pointing device that is capable of inputting commands into a computer. Exemplary pointing devices include hand-manipulated pointing devices, such as computer mice, joysticks, trackballs, touchpads, and keyboards, which commonly are used to input instructions into a computer by manipulating the pointing device. Such pointing devices allow a user to control movement of a cursor (i.e., a virtual pointer) across a computer screen, select or move an icon or other virtual object displayed on the computer screen, and open and close menu items corresponding to different input commands.
A “computer” is any machine, device, or apparatus that processes data according to computer-readable instructions that are stored on a computer-readable medium either temporarily or permanently.
An “object” is any type of discrete element in a user interface that has state and behavior, and may be selected or otherwise usefully treated separately from other elements of the user interface. Exemplary objects include active objects (e.g., buttons and hyperlinks) that trigger the performance of an action, and passive objects (e.g., image objects) on which actions are performed. When used to characterize an object, the term “target” is a label that refers to an object that is to be or has been selected or an object that is to be or has been affected by an action.
As used herein, the term “includes” means includes but not limited to, and the term “including” means including but not limited to. The term “based on” means based at least in part on.
The embodiments that are described herein provide an image collage authoring system that allows a user to retain control over the position and sizes of the images in a collage and to access image analysis based functionality that alleviates tedious and difficult tasks that commonly are associated with image selection, editing, and layout. These embodiments include a user interface that is designed to minimize the number and complexity of new concepts the users need to learn. The user interface provides a seamless integration between the images with which the user is engaged and the functionality used to control the images. The user interface has two modes of interaction: a basic mode that provides access to a basic set of intuitive features; and an enhanced mode that provides access to more complex functionality. The interaction model provides intelligent context-based interpretations of user inputs that enable the user to direct automatic selection, editing, and composition of images in accordance with the user's specific purpose, while avoiding overwhelming the user with decision-making and control settings when proactive suggestions can be made automatically.
In accordance with the method of
In the catalog area 28, the image collage authoring system 10 shows thumbnails 34 of respective ones of the images 16 (
The image collage authoring system 10 presents a layout 42 of a subset of the images in the collage mock-up area 30 (
The image collage authoring system 10 receives an instance of a respective one of multiple types of user input gestures with respect to a target object that is displayed in the user interface 22 (
The image collage authoring system 10 performs a context-dependent action that depends on the type of user input gesture, the type of object towards which the user input gesture is directed, and the operational mode of the user interface 22. For example, in response to the receipt of the user input gesture instance and a determination that the user interface 22 is in a first operational state, the image collage authoring system 10 performs an instance of a first action type based on the type of the received user input gesture and the object type of the target object (
In the illustrated embodiments, the mode select interface control 32 allows the user to select between a basic operational mode that provides access to a set of basic image collage authoring functions, and an enhanced operational mode that provides access to more complex enhanced functions. The basic functions tend to be more intuitive, manual types of functions, whereas the enhanced functions tend to be more directed or automated functions. The separation of the basic and enhanced interface functions into two discrete operational modes allows the user to easily compartmentalize the two sets of functions in his or her mind and thereby readily and intuitively comprehend and remember a larger set of interface tools than would be possible if the functions were not separated in this way.
The user interface 22 also provides the user with visible feedback that reminds the user of the current operational mode. For example, in some embodiments, the pointer used by the user to interact with the user interface 22 is different for each of the operational modes. In the illustrated embodiments, the user's pointer corresponds to a standard pointer (e.g., an arrow pointer) in the basic operational mode, and corresponds to a different pointer (e.g., the magic wand pointer shown in
A. Introduction
The collage generator module 54 includes a selection component 58, an editing component 60, and a layout component 62. The selection component 58 makes proactive suggestions about which images are to be added to the collage and helps users find similar or related images. The editing component 60 applies a conservative yet effective auto-crop to the images and enhances their tone and color automatically. The layout component 62 provides alternative layout suggestions and accommodates changes the user makes to the individual images and the layout, all while satisfying various constraints.
The user interface module 56 provides seamless access to the functionalities of all the components 58-62 in a natural and intuitive way so that users do not have to memorize what the automation tools do or how they work. In addition, the user interface creates a fluid transition experience between fully manual functions and fully automatic functions.
1. User Interface Module
In the illustrated embodiment, the user interface module 56 defines a minimal set of computer mouse operations, including left button single click, drag, drop, and mouse wheel scrolling. The effect of a computer mouse operation depends on the context in which it is made (e.g., the object upon which it is made and which mode, basic or enhanced, the user interface 22 is in).
In some embodiments, a list of albums is shown in the catalog area 28 when the application starts. The user can click on an individual album to load the set of images that are associated with the selected album. The album is represented by the album icon 40 and the associated images 34 are presented as an image strip, as shown in
In general, the user interface 22 may provide access to a wide variety of different functions in the basic and enhanced modes of operations.
In some embodiments, the collage mock-up area 30 includes a user-selectable mock-up area background object over respective portions of which the images of the collage layout are presented. The mock-up area background object is operable to change the layout of the images presented in the collage mock-up area. In particular, in response to a determination that the user has made a point-and-click gesture with respect to the mock-up area background object, the collage generator module 12 changes the layout from a current layout to a new layout of the images in the subset. When the user interface 22 is in the basic operational mode, the layout is changed by maintaining relative sizes and positions of the images in the layout while changing the layout between (i) a straight layout in which respective edges of adjacent ones of the images in the subset are parallel across respective dimensions of the layout and (ii) a tilted layout in which respective edges of adjacent ones of the images in the subset are non-parallel across respective dimensions of the layout. When the user interface 22 is in the enhanced operational mode, the layout is changed by changing ones of the images in the collage in terms of at least one of relative size and relative position.
In some embodiments, in response to a determination that the user interface is in the basic operational mode and the received user input gesture instance is a point-and-click gesture with respect to a target one of the images presented in the collage mock-up area 30, the collage generator module 12 selects the target image. In response to a determination that the user interface is in the enhanced operational mode and the received user input gesture instance is a point-and-click gesture with respect to a target one of the images presented in the collage mock-up area 30, the collage generator module 12 creates a modified instance of the target image and replaces the target image with the modified instance in the layout. In some embodiments, the collage generator module 12 creates the modified instance of the target image by performing at least one of: (i) automatically cropping the target image, and (ii) automatically enhancing the target image.
As explained above, in some embodiments of the user interface 22, the catalog area 28 includes an album area and an image area, where the album area includes the album icon 40, which is associated with a collection of images corresponding to the thumbnails 34 that are shown in the image area. In response to a determination that the user interface is in the enhanced operational mode and the received user input gesture instance is a point-and-click gesture with respect to the album icon, the collage generator module 12 automatically selects an image from the collection, adds the selected image to the images in the subset to produce a new subset of images, and determines a new layout of the images in the new subset of images. The user interface 22 presents the new layout in the collage mock-up area. In response to a determination that the user interface is in the enhanced operational state and the received user input gesture instance is a point-and-click gesture with respect to a target one of the images represented by the thumbnails shown in the image area, the collage generator module 12 automatically rearranges the thumbnails shown in the image area according to similarity between the target image and other ones of the images in the collection.
In some embodiments, the user interface 22 presents a respective view of each of the images in the collage through a respective frame that defines a boundary around the view of the image. In response to a determination that the user interface is in the basic operational mode and the received user input gesture instance is a drag-and-drop gesture defining a movement from a first position over a target one of the images within a target one of the frames to a second position within the target frame, the collage generator module 12 repositions the target image within the target frame to define a different respective view of the target image through the frame. In response to a determination that the user interface is in the basic operational mode and the received user input gesture instance is a drag-and-drop gesture defining a movement from a first position over a first one of the images within a first one of the frames to a second position over a second one of the images within a second one of the frames, the collage generator module 12 swaps positions of the first and second images in the layout and the user interface 22 presents a view of the first image through the second frame and presents a view of the second image through the first frame. In response to a determination that the user interface is in the basic operational mode and the received user input gesture instance is a scroll gesture with respect to a target one of the images in the layout, the collage generator module 12 re-sizes a region of the target image that is presented through the respective frame.
In some embodiments, in response to a determination that the user interface is in the enhanced operational mode and the received user input gesture instance is a drag-and-drop gesture defining a movement from a first position over a first one of the images in the layout to a second position over a second one of the images in the layout, the collage generator module 12 swaps relative positions of the first and second images in the layout. In this process, the collage generator module 12 determines a new layout of the images that exchanges the relative positions of the first and second images and maintains relative positions of all other ones of the images in the subset. The user interface 22 presents the new layout in the collage mock-up area.
In some embodiments, in response to a determination that the user interface is in the enhanced operational mode and the received user input gesture instance is a drag-and-drop gesture defining a movement from a first position over a selected one of the thumbnails to a second position over a target one of the images in the layout, the collage generator module 12 replaces the target image in the layout with the image corresponding to the selected thumbnail. In this process, the collage generator module 12 replaces the target image with the image corresponding to the selected thumbnail in the subset to produce a new subset of images and determines a new layout of the images in the new subset of images. The user interface 22 presents the new layout in the collage mock-up area. In some embodiments, the collage generator module 12 determines the new layout with the image corresponding to the selected thumbnail positioned in an equivalent relative position in the layout as the target image.
In some embodiments, in response to a determination that the user interface is in the enhanced operational state and the received user input gesture instance is a scroll gesture with respect to a target one of the images in the layout, the collage generator module 12 re-sizes the target image to a new size and determines a new layout of the images that accommodates the new size of the target image and maintains relative positions of the images in the layout. The user interface 22 presents the new layout in the collage mock-up area.
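By way of illustration only, the context-dependent interpretation of gestures described above may be sketched as a lookup keyed on operational mode, gesture type, and target object type. The enumeration names, the action labels, and the `dispatch` function are hypothetical and are not taken from any described embodiment; drag-and-drop gestures are omitted because they involve both a source and a destination object.

```python
from enum import Enum


class Mode(Enum):
    BASIC = "basic"
    ENHANCED = "enhanced"


class Gesture(Enum):
    CLICK = "point-and-click"
    SCROLL = "scroll"


class Target(Enum):
    BACKGROUND = "mock-up area background"
    COLLAGE_IMAGE = "image in the layout"
    ALBUM_ICON = "album icon"
    THUMBNAIL = "thumbnail"


# (mode, gesture, target type) -> short label of the action described in the text.
ACTIONS = {
    (Mode.BASIC, Gesture.CLICK, Target.BACKGROUND): "toggle straight/tilted layout",
    (Mode.ENHANCED, Gesture.CLICK, Target.BACKGROUND): "change relative sizes/positions",
    (Mode.BASIC, Gesture.CLICK, Target.COLLAGE_IMAGE): "select image",
    (Mode.ENHANCED, Gesture.CLICK, Target.COLLAGE_IMAGE): "auto-crop and/or auto-enhance image",
    (Mode.ENHANCED, Gesture.CLICK, Target.ALBUM_ICON): "add suggested image and re-layout",
    (Mode.ENHANCED, Gesture.CLICK, Target.THUMBNAIL): "sort thumbnails by similarity",
    (Mode.BASIC, Gesture.SCROLL, Target.COLLAGE_IMAGE): "resize region shown in frame",
    (Mode.ENHANCED, Gesture.SCROLL, Target.COLLAGE_IMAGE): "resize image and reflow layout",
}


def dispatch(mode, gesture, target):
    """Return the action label for a gesture, or 'no action' if none is defined."""
    return ACTIONS.get((mode, gesture, target), "no action")
```

In this sketch the same gesture on the same object type resolves to a different action depending only on the mode, which mirrors the first-state and second-state behavior described above.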
Table 1 provides a summary of the basic functions that are available in an exemplary embodiment of the user interface 22.
In the basic operational mode, a user can drag an image from the image strip to the collage mock-up area 30 to add an image to the collage. A new collage layout is generated immediately. To remove an image from the collage mock-up area 30, the user simply drags and drops the image outside of collage mock-up area 30. Clicking on the background of the collage mock-up area 30 (i.e., the regions of the collage mock-up area unobscured by the image frames 45) toggles between a straight layout (shown in
Table 2 provides a summary of the enhanced functions that are available in the exemplary embodiment of the user interface 22.
In the enhanced operational mode, when a user drags and drops existing images into the collage mock-up area 30, the image collage authoring system 52 preserves image aspect ratios and adjusts the layout 42 to accommodate the switch (see
2. Collage Generator Module
The collage generator module 54 includes a selection component 58, an editing component 60, and a layout component 62.
a. Selection Component
Among the most tedious and time-consuming tasks in making a collage are sorting the image collection appropriately, and selecting images that best represent the collection. The selection component 58 uses analysis-based mechanisms for finding similar images and for recommending images that best represent the collection.
i. Fast Image Similarity
In the illustrated embodiments, an image collection is represented in the image strip area of the catalog area 28. The images are presented as a single row of images along the bottom of the user interface 22. The image strip can be navigated by scrolling horizontally. In many contexts, it is advantageous to order images according to time stamp; however, in others, such an ordering is either impossible or inadequate. Firstly, metadata, including time stamps, may be absent or inconsistent. For example, it may not have been recorded; it may have been stripped or modified in previous editing; or time stamps from different clocks can disagree. Secondly, users often consider criteria other than time. For example, when assembling a collage from a museum tour, the time dimension may not be as important as the distribution of exhibits visited. In general, sorting images by content similarity can help users quickly find shots of subjects or scenes, regardless of available metadata. In the illustrated embodiments, the default order of the image strip is according to filename, which usually correlates with time.
As shown in
In general, any type of image similarity based sorting process may be used to sort the images in the image strip. These sorting processes typically are based on one or more of the following types of content similarity metrics, which can be roughly classified according to feature granularity as follows: (a) global features such as color histogram; (b) region-based features extracted from segmented images; and (c) key-point features extracted from interest-point detectors such as SIFT (see, e.g., Lowe, D. G., Distinctive image features from scale-invariant keypoints. IJCV, 2004). Generally, finer granularity leads to more accurate results, but at the cost of greater computation.
In some embodiments, a region-based image similarity sorting process is used. In this process, segmentations of the images are generated using a fast algorithm that is described below in sub-section IV.A.2.b. As a result, each image is represented by an image-dependent set of color clusters. Content similarity is then measured using the Earth Mover's Distance (EMD) (see, e.g., Rubner, Y., Tomasi, C. and Guibas, L. J. A Metric for Distributions with Applications to Image Databases. ICCV, 1998), which solves for the minimal transportation cost that must be paid to transform one color distribution into the other.
ii. Automatic Image Suggestion
The selection component 58 is designed to address two specific image selection scenarios: auto-population, and incremental population. In the auto-population scenario, the goal is to automatically generate a complete collage, as a starting point. In the incremental population scenario, the goal is to select a new image, from a cluster that is not already represented on the collage if possible. In some embodiments, both scenarios follow the same process. First, the image collection is partitioned into clusters of duplicates. This may be followed by a second partitioning, if necessary, to arrive at a set of “suggestion clusters”. Finally, a suggested image is selected from each suggestion cluster.
In the auto-populate scenario, we first determine the number of images to appear in the collage. A maximum number of images T>0 is set beforehand based on the size of the collage, to avoid a crowded result. If the number of duplicate clusters is less than or equal to T, then the suggestion clusters are the duplicate clusters. Otherwise, the suggestion clusters are determined by splitting the sequence at the greatest T−1 similarity gaps.
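The splitting step may be sketched as follows, purely for illustration. The sketch assumes that the duplicate clusters are held in sequence order and that a dissimilarity value (a “gap”) is available between each pair of adjacent clusters; the function name `suggestion_clusters`, its parameters, and the way gaps are supplied are hypothetical.

```python
def suggestion_clusters(duplicate_clusters, gaps, max_images):
    """Merge an ordered list of duplicate clusters into at most max_images groups.

    duplicate_clusters: ordered list of lists of image identifiers.
    gaps[i]: assumed dissimilarity between cluster i and cluster i + 1.
    max_images: the maximum number of images T (> 0) allowed on the collage.
    """
    if max_images <= 0:
        raise ValueError("max_images must be positive")
    if len(gaps) != max(len(duplicate_clusters) - 1, 0):
        raise ValueError("expected one gap between each pair of adjacent clusters")

    # Few enough clusters: each duplicate cluster is its own suggestion cluster.
    if len(duplicate_clusters) <= max_images:
        return [list(c) for c in duplicate_clusters]

    # Otherwise cut the sequence at the T - 1 largest gaps.
    by_size = sorted(range(len(gaps)), key=lambda i: gaps[i], reverse=True)
    cut_after = sorted(by_size[:max_images - 1])

    result, start = [], 0
    for i in cut_after:
        result.append([img for c in duplicate_clusters[start:i + 1] for img in c])
        start = i + 1
    result.append([img for c in duplicate_clusters[start:] for img in c])
    return result
```

Cutting at the T−1 largest gaps always yields exactly T suggestion clusters, so one suggested image per cluster fills the collage without exceeding the preset maximum.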
In the incremental suggestion scenario, the set of duplicates is the set of suggestion clusters. When the user issues a command to add a new suggested image, each suggestion cluster represented by an image on the collage is removed from consideration, and a rotating counter is used to identify the next suggestion cluster.
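One possible reading of the rotating-counter behavior is sketched below for illustration. The class name `IncrementalSuggester`, the representation of clusters as lists of image identifiers, and the choice to advance the counter just past the returned cluster are assumptions, not details given in the description.

```python
class IncrementalSuggester:
    """Pick the next suggestion cluster that has no image on the collage."""

    def __init__(self, clusters):
        self.clusters = [list(c) for c in clusters]
        self.counter = 0  # rotating position in the cluster list

    def next_cluster(self, images_on_collage):
        """Return the index of the next unrepresented cluster, or None."""
        on_collage = set(images_on_collage)
        n = len(self.clusters)
        for step in range(n):
            idx = (self.counter + step) % n
            # Skip clusters already represented by an image on the collage.
            if not on_collage.intersection(self.clusters[idx]):
                self.counter = (idx + 1) % n
                return idx
        return None  # every cluster is already represented
```

Because the counter keeps rotating, repeated requests walk through different clusters instead of returning to the same one each time.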
When the suggestion clusters have been determined, each image is assigned a composite score that is a weighted combination of two metrics described below: typicality within its suggestion cluster, and image sharpness. For each suggestion cluster, the image with the highest composite score is deemed the “best representative.”
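The selection of a best representative may be sketched as below. The equal default weights, the function name `best_representative`, and the assumption that both metrics have already been scaled to comparable ranges are illustrative choices only; the description does not specify the weights.

```python
def best_representative(cluster, typicality, sharpness, w_typ=0.5, w_sharp=0.5):
    """Return the image in the cluster with the highest composite score.

    typicality, sharpness: dicts mapping image identifier -> metric value,
    assumed to be pre-scaled to comparable ranges (an assumption of this sketch).
    w_typ, w_sharp: hypothetical weights of the weighted combination.
    """
    def composite(img):
        return w_typ * typicality[img] + w_sharp * sharpness[img]

    return max(cluster, key=composite)
```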
Duplicate Detection
In some embodiments, duplicate detection is based on similarity alone.
In the illustrated embodiments, time information is used if it is available so as to leverage the fact that duplicate shots are often taken close in time. In these embodiments, two different binary classifiers were trained based on manually labeled pairs of consumer images. Each classifier is capable of deciding whether two arbitrary images are duplicates: one using both similarity and time, and the other using only similarity. Content similarity is measured using the fast algorithm from section IV.A.2.a.i. In some embodiments, a Support Vector Machine (SVM) (see, e.g., Chang, C. C. and Lin, C. J. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm) is used to train the duplicate detectors with a linear kernel. Ten-fold cross validation was used to evaluate the accuracy of the resulting detectors.
In other embodiments, key-point based algorithms are used for near duplicate detection (see, e.g., Ke, Y., Sukthankar, R. and Huston, L. An efficient parts-based near-duplicate and sub-image retrieval system. ACM Multimedia, 2004, and Zhang D. Q. and Chang S. F. Detecting image near-duplicate by stochastic attributed relational graph matching with learning. ACM Multimedia, 2004).
Typicality Metric
The most typical image shares the most information with all other images in the cluster. This is equivalent to finding the sample that maximizes its average similarity to the rest of the images in the same cluster. In effect, this metric is used to filter outliers and minimize the propagation of clustering errors to the image suggestion algorithm.
Sharpness Metric
Image quality is a general concept that has many dimensions. For example, a good image should have good exposure, contrast, and color, in addition to good composition, focus on the subject, and pleasing facial expressions.
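As an illustration, the typicality of each image may be computed as its average similarity to the other images in its cluster. The `similarity` callable stands in for whatever pairwise measure is used and is an assumption of this sketch, as is the convention of scoring a single-image cluster as 0.0.

```python
def typicality_scores(cluster, similarity):
    """Average similarity of each image to the rest of its cluster."""
    n = len(cluster)
    scores = []
    for i in range(n):
        if n == 1:
            scores.append(0.0)  # arbitrary convention for a singleton cluster
            continue
        total = sum(similarity(cluster[i], cluster[j]) for j in range(n) if j != i)
        scores.append(total / (n - 1))
    return scores


def most_typical(cluster, similarity):
    """Return the image with the highest average similarity to the others."""
    scores = typicality_scores(cluster, similarity)
    return cluster[max(range(len(cluster)), key=scores.__getitem__)]
```

An outlier has low similarity to every other member, so its average is low and it is unlikely to be chosen as the representative.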
Blur in images often results from motion or lack of focus. Regardless of the cause, blur weakens the major edges in images. For example, in
where strength(s) is the average edge strength of the top 10% strongest edges and entropy(h) is the entropy of the normalized edge strength histogram. Unblurred images have stronger edges and a more peaked edge strength distribution, and therefore a larger strength(s) and a smaller entropy(h), resulting in a larger Q value.
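The two quantities named above may be computed as in the following sketch, which takes a flat list of non-negative edge strengths as input. How edge strengths are extracted, the number of histogram bins, and in particular the ratio used in `sharpness_proxy` are assumptions: the ratio is merely one combination that grows with strength(s) and shrinks with entropy(h), and it is not asserted to be the expression for Q.

```python
import math


def top_edge_strength(strengths, fraction=0.10):
    """Average of the strongest `fraction` of the edge strengths."""
    if not strengths:
        raise ValueError("no edge strengths given")
    k = max(1, math.ceil(len(strengths) * fraction))
    top = sorted(strengths, reverse=True)[:k]
    return sum(top) / k


def histogram_entropy(strengths, bins=16):
    """Entropy (in bits) of the normalized edge strength histogram."""
    lo, hi = min(strengths), max(strengths)
    if hi == lo:
        return 0.0  # all values fall into a single bin
    counts = [0] * bins
    for s in strengths:
        idx = min(int((s - lo) / (hi - lo) * bins), bins - 1)
        counts[idx] += 1
    total = len(strengths)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def sharpness_proxy(strengths, eps=1e-9):
    """Hypothetical combination: larger for strong, peaked edge distributions."""
    return top_edge_strength(strengths) / (histogram_entropy(strengths) + eps)
```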
b. Editing Component
Consumer photographers frequently pay little attention to scene composition. Oftentimes, the subject is too small, with excess empty space; or distractions at the edges attract the eye away from the main subject area. Appropriate cropping can significantly enhance the visual impact of many images. A by-product of cropping is often a change in aspect ratio which better suits the image content, and typically produces more interesting collage layouts as a result of the variety of aspect ratios.
i. Auto-Crop Function
In some embodiments, automatic image cropping involves two steps: a) image saliency analysis to identify the subject; and b) positioning of crop boundaries to include the subject area in an aesthetically pleasing way.
In general, any of a wide variety of processes for automatically identifying salient regions of interest (ROIs) in images may be used. Some embodiments use a multi-resolution center-surround difference technique (see, e.g., Itti, L., Koch, C., and Niebur, E. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. on PAMI, 20(11): 1254-1259, 1998). Other embodiments adopt a segmentation based approach as an alternative to saliency; segmentation produces crisp region boundaries which facilitate the optimization of crop boundary position.
Referring to
The saliency map 72 is augmented with face detection to identify the heads and shoulders of people in the image. The bounding box of all detected people is called the “people box”. In general, any type of face detection process may be used. An exemplary face detection process is described in Viola, P. and Jones, M. Robust Real-Time Face Detection. IJCV, 2004.
Crop boundary locations are selected by first finding a “minimum crop rectangle” 74 in the saliency map 72. Then the possible rectangle locations which include the minimum crop rectangle are searched using an optimization criterion to select an output crop rectangle 76, as shown in
The minimum crop rectangle is created by first forming “subject boxes,” which are rectangular areas that contain adjacent subject regions. Overlapping subject boxes are merged. Each subject box is scored using the sum of the areas of its subject regions which do not touch the image boundary. The bounding box of these regions is called the “core” of the subject box. The minimum crop rectangle is initially set to the core of the subject box with the highest score. This is expanded to include the people box and the central 15% of the image area. To prevent erroneous cropping of unusual images, the central 25% of the image area is added if the minimum crop rectangle is less than 20% of the image area, or if the area of subject regions in the minimum crop rectangle is less than 10% of the image area.
The optimization search finds the crop that minimizes a combination of penalties for: large crop area; inclusion of distractions; proximity to minimum crop rectangle; proximity to strong region edges parallel to a crop edge; and crossing strong region edges. The penalty function finds crop borders which leave space around the subject, while still producing a reasonably tight crop, rather than simply cropping the ROI (see, e.g., Ma, M. and Guo, J. Automatic Image Cropping for Mobile Device with Built-in Camera. IEEE Consumer Communications and Networking Conference, 2004). For efficiency, some embodiments use a coarse search to find an approximate best crop, followed by a local fine search. Integral images are used to calculate the penalty criteria efficiently during the search.
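The expansion rules in the preceding paragraph may be sketched as follows. Rectangles are (left, top, right, bottom) tuples; the interpretation of “the central N% of the image area” as a centered rectangle with the image's own proportions, and the passing of the subject-region area as a precomputed argument, are assumptions of this sketch.

```python
import math


def union(a, b):
    """Bounding box of two (left, top, right, bottom) rectangles."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def area(r):
    return max(0.0, r[2] - r[0]) * max(0.0, r[3] - r[1])


def central_rect(width, height, fraction):
    """Centered rectangle covering `fraction` of the image area (assumed shape)."""
    scale = math.sqrt(fraction)
    w, h = width * scale, height * scale
    left, top = (width - w) / 2, (height - h) / 2
    return (left, top, left + w, top + h)


def minimum_crop_rect(core, people_box, width, height, subject_area):
    """Expand the best subject-box core into the minimum crop rectangle.

    subject_area: area of the subject regions inside the rectangle, assumed
    to be computed elsewhere (hypothetical parameter).
    """
    rect = core
    if people_box is not None:
        rect = union(rect, people_box)
    rect = union(rect, central_rect(width, height, 0.15))

    # Safeguard for unusual images: also include the central 25%.
    image_area = width * height
    if area(rect) < 0.20 * image_area or subject_area < 0.10 * image_area:
        rect = union(rect, central_rect(width, height, 0.25))
    return rect
```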
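For reference, an integral image (summed-area table) lets the sum of any per-pixel quantity over a candidate rectangle be read with four table lookups, which is what makes evaluating many candidate crops inexpensive. The sketch below shows the general technique on a plain list-of-lists grid; which per-pixel quantities feed the individual penalties is not specified here, and the function names are illustrative.

```python
def integral_image(grid):
    """Build a summed-area table with one extra row and column of zeros."""
    rows = len(grid)
    cols = len(grid[0]) if rows else 0
    table = [[0] * (cols + 1) for _ in range(rows + 1)]
    for y in range(rows):
        row_sum = 0
        for x in range(cols):
            row_sum += grid[y][x]
            table[y + 1][x + 1] = table[y][x + 1] + row_sum
    return table


def rect_sum(table, left, top, right, bottom):
    """Sum of grid[y][x] for top <= y < bottom and left <= x < right."""
    return (table[bottom][right] - table[top][right]
            - table[bottom][left] + table[top][left])
```

After the one-time table construction, each rectangle query costs the same regardless of the rectangle's size, so both the coarse and the fine search can score candidates in constant time per term.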
ii. Automatic Lighting/Color Enhancement
In addition to composition problems, consumer images frequently have suboptimal exposure and lighting. For most image creativity applications, color and tone editing is a must-have function.
In some embodiments, the editing component 60 provides one or more image enhancement options that automatically improve images that have contrast and shadow defects. In general, any of a wide variety of different image enhancement processes may be applied to the image, including those that bring dark subjects out of the shadows, lighten underexposed images, improve overall contrast, and add saturation to some color regions.
c. Layout Component
In general, the layout component 62 may use any of a wide variety of different image layout processes, subject to any number of layout criteria. In some embodiments, the layout component 62 arranges images on a rectangular canvas subject to the following primary criteria:
In these embodiments, the layout component 62 encodes the composite as a binary tree 82 that induces a recursive partition of the canvas as illustrated in
The layout component 62 associates each tree having the form illustrated in
The aspect ratio of an image is its height divided by its width. In this case, the coefficients in the linear system are all either 0, ±1, aspect ratios, or negated aspect ratios. As a result, the layout is “continuous”: a small change to the aspect of an image results in a small change to the layout. For this reason, the process of determining a new layout from a tree structure that has been modified in response to a command from the user is referred to as “reflow.”
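The continuity property can be illustrated with a simplified Python sketch. It assumes there are no borders or spacing between images, in which case the aspect ratio of each tree node follows directly from its children and no general linear solve is needed; it is not the layout process of the embodiments. The names Leaf, Cut, aspect, and reflow are hypothetical, and the convention that an "H" cut stacks its children while a "V" cut places them side by side is an assumption of the sketch.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    name: str
    aspect: float  # height divided by width


@dataclass
class Cut:
    kind: str  # "H": children stacked vertically; "V": children side by side
    first: "Node"
    second: "Node"


Node = Union[Leaf, Cut]


def aspect(node):
    """Aspect ratio (height / width) of the bounding box of a subtree."""
    if isinstance(node, Leaf):
        return node.aspect
    a, b = aspect(node.first), aspect(node.second)
    if node.kind == "H":
        return a + b  # equal widths, heights add
    return 1.0 / (1.0 / a + 1.0 / b)  # equal heights, widths add


def reflow(node, left, top, width, out):
    """Record (left, top, width, height) for every leaf under node."""
    height = aspect(node) * width
    if isinstance(node, Leaf):
        out[node.name] = (left, top, width, height)
    elif node.kind == "H":
        reflow(node.first, left, top, width, out)
        reflow(node.second, left, top + aspect(node.first) * width, width, out)
    else:
        first_width = height / aspect(node.first)
        reflow(node.first, left, top, first_width, out)
        reflow(node.second, left + first_width, top, width - first_width, out)
```

In this sketch, editing the tree (for example, changing a leaf's aspect ratio after a crop) and calling reflow again yields the new layout, and because every quantity is a continuous function of the leaf aspect ratios, a small change to one aspect ratio moves the boxes only slightly.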
The layout component 62 creates layouts and reflows layouts very quickly, permitting interactive editing and preview. A summary of the commands supported by the layout component 62 is given in Table 4. These commands include commands for adding, deleting, replacing, cropping and swapping images.
As indicated by
An image can be made larger or smaller in the context of a layout by manipulating the aspect ratios of all the other images, as illustrated in
The layout component 62 first determines target dimensions for the selected image by multiplying the current height and width by the side-length factor. Changes to the dimensions of the selected image are translated into target dimensions for the root bounding box. The layout component 62 can now determine new heights and widths for the remaining images such that target dimensions for both selected image and root bounding box will be realized upon reflow. For example, in the case of growing, images that are separated from the selected image by a horizontal (vertical) cut will have their aspect ratios reduced (increased).
Additional details regarding the construction and operation of the layout component 62 are described in U.S. patent application Ser. No. 11/769,671, filed Jun. 27, 2007, and in Atkins, C. B. Blocked Recursive Image Composition. ACM Multimedia, 2008.
Embodiments of the image collage authoring system 10 may be implemented by one or more discrete modules (or data processing components) that are not limited to any particular hardware, firmware, or software configuration. In the illustrated embodiments, these modules may be implemented in any computing or data processing environment, including in digital electronic circuitry (e.g., an application-specific integrated circuit, such as a digital signal processor (DSP)) or in computer hardware, firmware, device driver, or software. In some embodiments, the functionalities of the modules are combined into a single data processing component. In some embodiments, the respective functionalities of each of one or more of the modules are performed by a respective set of multiple data processing components.
The collage generator module 12, the user interface module 14, and the display 24 may be co-located on a single apparatus or they may be distributed across multiple apparatus; if distributed across multiple apparatus, the collage generator module 12, the user interface module 14, and the display 24 may communicate with each other over local wired or wireless connections, or they may communicate over global network connections (e.g., communications over the internet).
In some implementations, process instructions (e.g., machine-readable code, such as computer software) for implementing the methods that are executed by the embodiments of the image collage authoring system 10, as well as the data it generates, are stored in one or more machine-readable media. Storage devices suitable for tangibly embodying these instructions and data include all forms of non-volatile computer-readable memory, including, for example, semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices, magnetic disks such as internal hard disks and removable hard disks, magneto-optical disks, DVD-ROM/RAM, and CD-ROM/RAM.
In general, embodiments of the image collage authoring system 10 may be implemented in any one of a wide variety of electronic devices, including desktop computers, workstation computers, and server computers.
A user may interact (e.g., enter commands or data) with the computer system 120 using one or more input devices 130 (e.g., a keyboard, a computer mouse, a microphone, a joystick, and a touch pad). Information may be presented through the user interface 22, which is displayed to the user on the display 24 (implemented by, e.g., a display monitor), which is controlled by a display controller 150 (implemented by, e.g., a video graphics card). The computer system 120 also typically includes peripheral output devices, such as speakers and a printer. One or more remote computers may be connected to the computer system 120 through a network interface card (NIC) 136.
As shown in
The embodiments that are described herein provide an image collage authoring system that allows a user to retain control over the positions and sizes of the images in a collage and to access image-analysis-based functionality that alleviates tedious and difficult tasks that commonly are associated with image selection, editing, and layout. These embodiments include a user interface that is designed to minimize the number and complexity of new concepts that users need to learn. The user interface provides a seamless integration between the images with which the user is engaged and the functionality used to control the images. The user interface has two modes of interaction: a basic mode that provides access to a basic set of intuitive features; and an enhanced mode that provides access to more complex functionality. The interaction model provides intelligent context-based interpretations of user inputs that enable the user to direct automatic selection of images in accordance with the user's specific purpose, while avoiding overwhelming the user with decision-making and control settings when proactive suggestions can be made automatically.
Other embodiments are within the scope of the claims.
Number | Name | Date | Kind |
---|---|---|---|
5136686 | Koza | Aug 1992 | A |
5475805 | Murata | Dec 1995 | A |
5499366 | Rosenberg et al. | Mar 1996 | A |
5552982 | Jackson et al. | Sep 1996 | A |
5555362 | Yamashita et al. | Sep 1996 | A |
5592605 | Asuma et al. | Jan 1997 | A |
5659770 | Yamada | Aug 1997 | A |
5706457 | Dwyer | Jan 1998 | A |
5712995 | Cohn | Jan 1998 | A |
5729254 | Marks et al. | Mar 1998 | A |
5760786 | Marks et al. | Jun 1998 | A |
5760788 | Chainini et al. | Jun 1998 | A |
5796401 | Winer | Aug 1998 | A |
5801687 | Peterson et al. | Sep 1998 | A |
5889523 | Wilcox et al. | Mar 1999 | A |
5898434 | Small et al. | Apr 1999 | A |
5920315 | Santos-Gomez | Jul 1999 | A |
5956738 | Shirakawa | Sep 1999 | A |
6005560 | Gill et al. | Dec 1999 | A |
6008809 | Brooks | Dec 1999 | A |
6055522 | Krishna et al. | Apr 2000 | A |
6057842 | Knowlton et al. | May 2000 | A |
6081262 | Gill et al. | Jun 2000 | A |
6097389 | Morris et al. | Aug 2000 | A |
6111586 | Ikeda et al. | Aug 2000 | A |
6121970 | Guedalia | Sep 2000 | A |
6128010 | Baxter et al. | Oct 2000 | A |
6256108 | Dziesietnik et al. | Jul 2001 | B1 |
6301586 | Yang et al. | Oct 2001 | B1 |
6380954 | Gunther | Apr 2002 | B1 |
6415306 | Seaman | Jul 2002 | B2 |
6448956 | Berman et al. | Sep 2002 | B1 |
6563602 | Uratani et al. | May 2003 | B1 |
6596032 | Nojima et al. | Jul 2003 | B2 |
6636648 | Loui et al. | Oct 2003 | B2 |
6636650 | Long et al. | Oct 2003 | B1 |
6671405 | Savakis et al. | Dec 2003 | B1 |
6701306 | Kronmiller et al. | Mar 2004 | B1 |
6727909 | Matsumura et al. | Apr 2004 | B1 |
6738494 | Savakis et al. | May 2004 | B1 |
6748097 | Gindele et al. | Jun 2004 | B1 |
6771292 | Sharp | Aug 2004 | B2 |
6771801 | Fisher et al. | Aug 2004 | B1 |
6977665 | Yokouchi | Dec 2005 | B2 |
7013432 | Taylor et al. | Mar 2006 | B2 |
7019864 | Delhoune et al. | Mar 2006 | B2 |
7093263 | Sexton et al. | Aug 2006 | B1 |
7096445 | Pucci et al. | Aug 2006 | B1 |
7124360 | Drenttel et al. | Oct 2006 | B1 |
7145674 | Hayes | Dec 2006 | B2 |
7148990 | Atkins | Dec 2006 | B2 |
7149968 | Ackerschewski et al. | Dec 2006 | B1 |
7184167 | Ito et al. | Feb 2007 | B1 |
7207735 | Narusawa et al. | Apr 2007 | B2 |
7281199 | Nicol et al. | Oct 2007 | B1 |
7290006 | Xie et al. | Oct 2007 | B2 |
7340676 | Geigel et al. | Mar 2008 | B2 |
7434159 | Lin | Oct 2008 | B1 |
7640516 | Atkins | Dec 2009 | B2 |
7644356 | Atkins et al. | Jan 2010 | B2 |
7900149 | Hatcher et al. | Mar 2011 | B2 |
7921111 | Beddow | Apr 2011 | B1 |
8010548 | Beddow | Aug 2011 | B1 |
8065627 | Atkins | Nov 2011 | B2 |
8291314 | Atkins | Oct 2012 | B2 |
20010033296 | Fullerton et al. | Oct 2001 | A1 |
20020051208 | Venable | May 2002 | A1 |
20020059322 | Miyazaki et al. | May 2002 | A1 |
20020064302 | Massengill | May 2002 | A1 |
20020070982 | Hill et al. | Jun 2002 | A1 |
20020122067 | Geigel et al. | Sep 2002 | A1 |
20020124048 | Zhou | Sep 2002 | A1 |
20020161777 | Smialek | Oct 2002 | A1 |
20020196271 | Windl et al. | Dec 2002 | A1 |
20030001879 | Lin et al. | Jan 2003 | A1 |
20030007014 | Suppan | Jan 2003 | A1 |
20030033288 | Shanahan et al. | Feb 2003 | A1 |
20030046349 | Burgin et al. | Mar 2003 | A1 |
20030046401 | Abbott et al. | Mar 2003 | A1 |
20030059121 | Savakis et al. | Mar 2003 | A1 |
20030072486 | Loui et al. | Apr 2003 | A1 |
20030128877 | Nicponski | Jul 2003 | A1 |
20030152289 | Luo | Aug 2003 | A1 |
20030160824 | Szumla | Aug 2003 | A1 |
20030162159 | Sheehan | Aug 2003 | A1 |
20040122760 | Clearwater et al. | Jun 2004 | A1 |
20040122805 | Sang et al. | Jun 2004 | A1 |
20040122858 | Clearwater | Jun 2004 | A1 |
20050071781 | Atkins | Mar 2005 | A1 |
20050071783 | Atkins | Mar 2005 | A1 |
20050138570 | Good et al. | Jun 2005 | A1 |
20050168779 | Tsue et al. | Aug 2005 | A1 |
20050168782 | Kobashi et al. | Aug 2005 | A1 |
20050240865 | Atkins | Oct 2005 | A1 |
20050243381 | Hill et al. | Nov 2005 | A1 |
20050261881 | Jackson | Nov 2005 | A1 |
20060026508 | Balinsky et al. | Feb 2006 | A1 |
20060026515 | Balinsky | Feb 2006 | A1 |
20060053370 | Hitaka et al. | Mar 2006 | A1 |
20060103667 | Amit et al. | May 2006 | A1 |
20060103891 | Atkins | May 2006 | A1 |
20060109516 | Catalan et al. | May 2006 | A1 |
20060136839 | Makela | Jun 2006 | A1 |
20060150092 | Atkins | Jul 2006 | A1 |
20060181548 | Hafey et al. | Aug 2006 | A1 |
20060181736 | Quek et al. | Aug 2006 | A1 |
20060195784 | Koivisto et al. | Aug 2006 | A1 |
20060200758 | Atkins | Sep 2006 | A1 |
20060224952 | Lin | Oct 2006 | A1 |
20060233536 | Shuhami | Oct 2006 | A1 |
20060259856 | Atkins | Nov 2006 | A1 |
20060259857 | Atkins | Nov 2006 | A1 |
20060279566 | Atkins et al. | Dec 2006 | A1 |
20070008321 | Gallagher et al. | Jan 2007 | A1 |
20070097421 | Sorensen et al. | May 2007 | A1 |
20070208996 | Berkner et al. | Sep 2007 | A1 |
20070234883 | Koizumi | Oct 2007 | A1 |
20080065737 | Burke et al. | Mar 2008 | A1 |
20080082912 | Atkins | Apr 2008 | A1 |
20080094420 | Geigel et al. | Apr 2008 | A1 |
20080134094 | Samadani et al. | Jun 2008 | A1 |
20080215985 | Batchelder et al. | Sep 2008 | A1 |
20080222560 | Harrison | Sep 2008 | A1 |
20080270930 | Slosar | Oct 2008 | A1 |
20080313533 | Hoyer et al. | Dec 2008 | A1 |
20090002764 | Atkins et al. | Jan 2009 | A1 |
20090089660 | Atkins et al. | Apr 2009 | A1 |
20090147297 | Stevenson | Jun 2009 | A1 |
20090307623 | Agarawala et al. | Dec 2009 | A1 |
20100124378 | Das et al. | May 2010 | A1 |
20100275152 | Atkins et al. | Oct 2010 | A1 |
20100329575 | Scalise et al. | Dec 2010 | A1 |
Number | Date | Country |
---|---|---|
1186992 | Mar 2002 | EP |
1503336 | Feb 2005 | EP |
2378340 | May 2003 | GB |
01191270 | Aug 1989 | JP |
09185728 | Jul 1997 | JP |
10-293838 | Nov 1998 | JP |
2002-142092 | May 2002 | JP |
2002288669 | Oct 2002 | JP |
2003101749 | Apr 2003 | JP |
2003-274139 | Sep 2003 | JP |
WO-9810356 | Mar 1998 | WO |
WO-0139019 | May 2001 | WO |
WO-0237939 | May 2002 | WO |
WO-02084582 | Oct 2002 | WO |
Entry |
---|
Atkins, C.Brian, “Blocked Recursive Image Composition,” ACM Multimedia, Oct. 2008, 4 pages. |
Chen, Jun-Cheng et al., “Tiling Slideshow,” ACM Multimedia, Oct. 2006, 10 pages. |
Diakopoulos, Nicholas and Essa, Irfan, “Mediating Photo Collage Authoring,” UIST, Oct. 2005, 4 pages. |
Fogarty, James et al., “Aesthetic Information Collages: Generating Decorative Displays that Contain Information,” UIST, 2001, 10 pages. |
Girgensohn, Andreas and Chiu, Patrick, “Stained Glass Photo Collages,” UIST, Oct. 2004, 2 pages. |
Graham, Adrian et al., “Time as Essence for Photo Browsing Through Personal Digital Libraries,” JCDL, Jul. 2002, 10 pages. |
Kerne, Andruid et al., “combinFormation: A Mixed-Initiative System for Representing Collections as Compositions of Image and Text Surrogates,” JCDL, Jun. 2006, 10 pages. |
Rother, Carsten et al., “AutoCollage,” ACM Transactions on Graphics, vol. 25, Issue 3, Jul. 2006, 6 pages. |
Xiao, Jun et al., “Mixed-Initiative Photo Collage Authoring,” ACMMM, Oct. 2008, 11 pages. |
C. Brian Atkins; “Adaptive Photo Collection Page Layout”; HP Labs; Palo Alto, CA 94304; 2004 Intl Conference, Oct. 24, 2004; <http://www.hpl.hp.com/researchlisl/layout>. |
D.F. Wong, et al.; “A new algorithm for floorplan design”; Proc. Design Automation Conference; pp. 101-107, 1986. |
Eldan Goldenberg, “Automatic layout of variable-content print data”, MSc Dissertation, School of Cognitive & Computing Sciences, University of Sussex, Brighton, UK (2002). |
International Searching Authority, International Search Report and Written Opinion, issued in PCT/US2010/054153, mailed Dec. 28, 2010. |
Joe Geigel, et al.; “Automatic page layout using genetic algorithms for electronic albuming”; Proceedings of Electronic Imaging 2001 (Jan. 2001). |
Joe Geigel, et al.; “Using Genetic Algorithms for Album Page Layouts”; Multimedia; IEEE, Volume: 10, Issue: 4, pp. 16-27, (Oct.-Dec. 2003). |
Number | Date | Country | |
---|---|---|---|
20100199227 A1 | Aug 2010 | US |