Individuals and organizations are rapidly accumulating large collections of digital image content, including still images, text, graphics, animated graphics, and full-motion video images. This content may be presented individually or combined in a wide variety of different forms, including documents, catalogs, presentations, still photographs, commercial videos, home movies, and metadata describing one or more associated digital content files. As these collections grow in number and diversity, individuals and organizations increasingly will require systems and methods for organizing and presenting the digital content in their collections. To meet this need, a variety of different systems and methods for organizing and presenting digital image content have been proposed.
For example, there are several digital image collage generation systems that enable users to create digital photo albums manually. These systems typically provide tools for organizing a collection of images and laying out these images on one or more pages. Among the common types of tools for manually creating a digital photo album are tools for selecting a subset of images in the collection that will appear on a page of an album, a graphical user interface for manually rearranging the images on the page, and basic image editing tools for modifying various characteristics, such as size and orientation, of the images that will appear in the album. Users typically find the process of generating a digital photo album using fully manual digital image albuming systems to be tedious and time consuming.
Some automated digital image collage generation systems allow users to organize digital images into album pages in accordance with dates and times specified in the metadata associated with the images. These systems also typically allow users to annotate the images appearing in the digital photo album pages. Some automated digital image albuming systems provide various predefined layout templates that a user may select to create a digital photo album. In these systems, the user assigns images from the collection to various predefined image locations on a selected layout template, and the system automatically adjusts the size, placement, rotation, and framing of the images in accordance with parameters specified for the various predefined image locations on the selected template. Other digital image collage generation systems are designed to produce album pages automatically with minimal input from the user.
What are needed are improved systems and methods of generating image collages.
In one aspect, the invention features a method in accordance with which a regular mesh of rectangular cells of uniform size is defined. The regular mesh aligned with a non-rectangular layout boundary such that a count of ones of the cells having at least a prescribed fraction of their size contained within the layout boundary is maximized. A layout of frames within the layout boundary is determined based on the locations of the cells with respect to the layout boundary, wherein the frames define respective size dimensions and positions of respective views of graphic objects on a page. The views of the graphic objects arranged in accordance with the layout of the frames on the page are outputted.
The invention also features apparatus operable to implement the method described above and computer-readable media storing computer-readable instructions causing a computer to implement the method described above.
In the following description, like reference numbers are used to identify like elements. Furthermore, the drawings are intended to illustrate major features of exemplary embodiments in a diagrammatic manner. The drawings are not intended to depict every feature of actual embodiments nor relative dimensions of the depicted elements, and are not drawn to scale.
I. Definition of Terms
A “graphic object collage” is a composition of graphic objects on a page.
The term “graphic object” refers broadly to any type of visually perceptible content that may be rendered on a physical or virtual page, including images and text. Image-based graphic objects (or simply “images”) may be complete or partial versions of any type of digital or electronic image, including: an image that was captured by an image sensor (e.g., a video camera, a still image camera, or an optical scanner) or a processed (e.g., filtered, reformatted, enhanced or otherwise modified) version of such an image; a computer-generated bitmap or vector graphic image; a textual image (e.g., a bitmap image containing text); and an iconographic image. The term “graphic object” encompasses both a single-element graphic object and a multi-element graphic object formed from a cohesive group or collection of one or more graphic objects. The assignment of single-element graphic objects to a particular multi-element graphic object signifies that the constituent single-element graphic objects are related. In general, the types of the single-element graphic objects in a multi-element graphic object may be the same or different. In the illustrated embodiments, each of the graphic objects has a respective aspect ratio, which is the ratio of graphic objects height to graphic objects width. Each variable-area graphic object may be assigned a respective positive scalar-valued nominal size. The term “nominal size” (also referred to as “relative area”) refers to a designated or theoretical size that may or may not vary from the actual or rendered size, where the “size” of a graphic object is the amount of area of a page that is occupied by the graphic object. In some embodiments, the user is allowed to set the nominal size values that are assigned to the graphic objects. In other embodiments, the graphic object collage generation system automatically assigns the nominal size values to the graphic objects.
As used herein, the term “page” refers to any type of discrete area in which graphic objects may be laid out, including a physical page embodied by a discrete physical medium (e.g., a piece of paper) on which a layout of graphic objects may be printed, and a virtual, digital or electronic page containing a layout of graphic objects that may be presented to a user by, for example, an electronic display device.
A “computer” is any machine, device, or apparatus that processes data according to computer-readable instructions that are stored on a computer-readable medium either temporarily or permanently. A “computer operating system” is a software component of a computer system that manages and coordinates the performance of tasks and the sharing of computing and hardware resources. A “software application” (also referred to as software, an application, computer software, a computer application, a program, and a computer program) is a set of instructions that a computer can interpret and execute to perform one or more specific tasks. A “data file” is a block of information that durably stores data for use by a software application.
As used herein, the term “includes” means includes but not limited to, the term “including” means including but not limited to. The term “based on” means based at least in part on. The term “ones” means multiple members of a specified group.
II. Overview
The embodiments that are described herein provide systems and methods that enable graphic object collages that conform to non-rectangular layout boundaries to be generated.
In accordance with the method of
Referring to
The modules 14-18 of the graphic object collage generation system 10 are not limited to a specific hardware or software configuration, but rather they may be implemented in any computing or processing environment, including in digital electronic circuitry or in computer hardware, firmware, device driver, or software. For example, in some implementations, these modules may be embedded in the hardware or software of any one of a wide variety of digital and analog electronic devices, including desktop and workstation computers, digital still image cameras, digital video cameras, printers, scanners, and portable electronic devices (e.g., mobile phones, laptop and notebook computers, and personal digital assistants). In some implementations, computer process instructions for implementing the modules 14-18 and the data generated by the modules 14-18 are stored in one or more machine-readable media. Storage devices suitable for tangibly embodying these instructions and data include all forms of non-volatile memory, including, for example, semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices, magnetic disks such as internal hard disks and removable disks, magneto-optical disks, and CD-ROM.
III. Exemplary Embodiment of the Graphic Object Collage Generation System
A. Introduction
The page layout generator module 54 includes a layout component 62. The layout component 62 determines the layout of frames within the layout boundary based on the locations of the cells with respect to the layout boundary (
The layout boundary processing module 56,includes a mesh definition component 64 that defines the regular mesh of rectangular cells that is aligned with a non-rectangular layout boundary such that a count of ones of the cells having at least a prescribed fraction of their size contained within the layout boundary is maximized (see
The user interface module 58 provides seamless access to the functionalities of the graphic object collage generation system 52 in a natural and intuitive way so that users do not have to memorize what the automation tools do or how they work. In addition, the user interface creates a fluid transition experience between fully manual functions and fully automatic functions.
B. Layout Boundary Processing Module
In the layout boundary processing module 56, the mesh definition component 64 defines the regular mesh of rectangular cells that is aligned with a non-rectangular layout boundary such that a count of ones of the cells having at least a prescribed fraction of their size contained within the layout boundary is maximized (see
In accordance with the method of
For the current location of the mesh bounding box 74 with respect to the layout boundary 18, the mesh definition component 64 tallies a respective count OverlapCount[0][0] in the embodiment shown in
The mesh definition component 64 repositions the mesh bounding box 74 in a new current location relative to the layout boundary 18 (
If the current location of the mesh boundary box 76 is within a specified maximum adjustment range (
The following pseudo code describes an embodiment of the method of
In some embodiments, the mesh definition component 64 determines whether or not the layout boundary is symmetric in one or more dimensions. In some embodiments, the mesh definition component 64 determines whether or not the layout boundary is symmetric based on a comparison of the center of mass components of the layout boundary to the center of mass components of the bounding box of the layout boundary. If the horizontal (e.g., x) center of mass coordinates are substantially the Same, then the layout boundary is considered to by symmetric in the horizontal dimension; if the vertical (e.g., y) center of mass coordinates are substantially the same, then the layout boundary is considered to by symmetric in the vertical dimension
In response to a determination that the layout boundary is symmetric in, for example, the horizontal dimension and/or the vertical dimension, the mesh definition component 64 selects the set of cells contained by the mesh bounding box that is symmetrically distributed about the center of mass of the layout boundary in the horizontal dimension and/or the vertical dimension. In these embodiments, this symmetry constraint is prioritized over the cell maximization constraint described above. In some of these embodiments, this prioritization is achieved by restricting the mesh definition component 64 such that it only considers mesh bounding box positions that satisfy the symmetry constraint.
In some embodiments, the mesh definition component 64 divides an initial layout boundary (referred to herein as an “inclusive layout boundary”) into boundary segments. For each of the non-rectangular ones of the boundary segments, the mesh definition component 64 defines a respective regular mesh of rectangular cells of uniform size that is aligned with a non-rectangular layout boundary such that a count of ones of the cells having at least a prescribed fraction of their size contained within the layout boundary is maximized. The mesh definition component 64 typically uses one of the mesh definitions described herein in order to define a respective regular mesh for each of the non-rectangular boundary segments. In this process, for each of these non-rectangular boundary segments, the mesh definition component 64 typically performing the defining of a respective regular mesh of rectangular cells of a respective uniform size (
In some embodiments, the process of dividing an inclusive boundary into boundary segments involves identifying parallel linear partitions of the inclusive layout boundary that are aligned with respective portions of the inclusive layout boundary, and dividing the inclusive layout boundary along the partitions. There are many ways to detect prominent lines in the initial layout boundary. A exemplary line detection process is describe in Duda, R. O. and P. E. Hart, “Use of the Hough Transformation to Detect Lines and Curves in Pictures,” Comm. ACM, Vol. 15, pp. 11-15 (January, 1972). Once detected, the prominent lines are used as a basis for segmenting the inclusive boundary.
In some embodiments, for each of the non-rectangular boundary segments 92-100, the mesh definition component sets the respective uniform size of the cells such that a distance between adjacent ones of the parallel linear partitions is an integral multiple of a rectangular dimension of the cells, as shown in
The layout boundary processing module 56 passes the page layout generator module 54 a respective specification of the non-rectangular boundary segments 92-100 along with an associated regular mesh definition.
In some embodiments, the mesh definition component 64 also defines a respective regular mesh for the rectangular boundary segment 102 using any of the mesh definition methods described herein. In other embodiments, the layout boundary processing module 56 passes the page layout generator module 54 a specification of the boundary segment 102 without an associated regular mesh definition.
C. Page Layout Generator Module
The page layout generator module 54 includes a layout component 62, which determines the layout of frames within the layout boundary based on the locations of the cells with respect to the layout boundary (
In some embodiments, the layout component 62 determines the layout of graphic object frames (see
In some embodiments, the graphic objects are ranked with at least some of the graphic objects are ranked higher than other lower-ranked ones of the graphic objects. The graphic objects may be ranked, for example, manually by a user or automatically by an image analysis system. The rank can be base on various factors such as image quality, faces of people, and expression of the faces. In these embodiments, the layout component 62 assigns the higher ranked ones of the graphic objects to respective ones of the frames that are aligned with respective ones of the rectangular cells that are completely contained by the layout boundary 18, and assign the lower-ranked ones of the graphic objects to respective ones of the frames that are aligned with respective ones of the cells with respective fractions of their size outside the layout boundary.
For partial mesh cells along the edge of the layout boundary 18, edges of the layout boundary 18 can be used as clipping paths for cropping the views of the graphic objects 12 assigned to the partial cells. For example, in some embodiments, one or more of the lower-ranked ones of the graphic objects are cropped such that their respective views fit within the respective ones of the frames with a non-rectangular border cropped to a portion of the non-rectangular layout boundary. If more than a specified fraction (e.g., 50%) of a cell's area is inside the layout boundary 18, a graphic object (e.g., a graphic object that can be cropped) can be placed in the cell, and cropped and scaled based on content analysis in order to ensure that important contents (e.g. face regions) are preserved and shown through the frame instead of being clipped out. In some embodiments, if a cell has less than a specified fraction (e.g., 50%) of its area within the layout boundary 18, the cell is filled with an average color of the graphic object content in one or more adjacent cells; in other embodiments, a frame of a respective one (typically a lower ranked one) of the graphic objects is assigned to the partial cell; in still other embodiments, the graphic object assigned to an adjacent cell can by duplicated in the partial cell.
Referring to
The layout component 62 generates each layout of graphic objects using any of a wide variety of different image layout processes, subject to any number of layout criteria. In some embodiments, the layout component 62 arranges the graphic objects 12 within a rectangular canvass subject to the following primary critera:
(a) graphic object aspect ratios are respected; and
(b) graphic object borders (β) and spacing (σ) between adjacent graphic object borders are precisely specified.
Criterion (a) reflects the assumption that the process should not impose overlap or other hiding of image content unless specifically directed to do so; while criterion (b) provides accommodation for graphic object borders or other features of graphic design going into the final presentation. The layout component 62 creates a hew layout also based on the following secondary criteria:
(c) graphic object areas should be proportional to positive relative area values supplied with the graphic object; and
(d) graphic object should occupy a maximum amount of the area within the rectangular canvass.
In these embodiments, the layout component 62 encodes the graphic object layout 120 as a binary tree 124 that induces a recursive partition of the canvas as illustrated in
The layout component 62 associates each tree having the form illustrated in
Additional details regarding the construction and operation of the layout component 62 are described in U.S. patent application Ser. No. 12/366,616, filed Feb. 5, 2009, and U.S. patent application Ser. No. 11/769,671, filed Jun. 27, 2007, and in Atkins, C. B. Blocked Recursive Image Composition, ACM Multimedia, 2008.
IV. Exemplary Operating Environment
Embodiments of the graphic object collage generation system 10 may be implemented by one or more discrete modules (or data processing components) that are not limited to any particular hardware, firmware, or software configuration. In the illustrated embodiments, these modules may be implemented in any computing or data processing environment, including in digital electronic circuitry (e.g., an application-specific integrated circuit, such as a digital signal processor (DSP)) or in computer hardware, firmware, device driver, or software. In some embodiments, the functionalities of the modules are combined into a single data processing component. In some embodiments, the respective functionalities of each of one or more of the modules are performed by a respective set of multiple data processing components.
The modules of the graphic object collage generation system 10 may be co-located on a single apparatus or they may be distributed across multiple apparatus; if distributed across multiple apparatus, these modules and the display 24 may communicate with each other over local wired or wireless connections, or they may communicate over global network connections (e.g., communications over the internet).
In some implementations, process instructions (e.g., machine-readable code, such as computer software) for implementing the methods that are executed by the embodiments of the graphic object collage generation system 10, as well as the data it generates, are stored in one or more machine-readable media. Storage devices suitable for tangibly embodying these instructions and data include all forms of non-volatile computer-readable memory, including, for example, semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices, magnetic disks such as internal hard disks and removable hard disks, magneto-optical disks, DVD-ROM/RAM, and CD-ROM/RAM.
In general, embodiments of the graphic object collage generation system 10 may be implemented in any one of a wide variety of electronic devices, including desktop computers, workstation computers, and server computers.
A user may interact (e.g., enter commands or data) with the computer 140 using one or more input devices 150 (e.g., a keyboard, a computer mouse, a microphone, joystick, and touch pad). Information may be presented through the user interface 28 (see
As shown in
The embodiments that are described herein provide systems and methods that enable graphic object collages that conform to a non-rectangular layout boundary to be generated.
Other embodiments are within the scope of the claims.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2009/056336 | 9/9/2009 | WO | 00 | 9/25/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2011/031257 | 3/17/2011 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7403211 | Sheasby et al. | Jul 2008 | B2 |
20020122067 | Geigel et al. | Sep 2002 | A1 |
20040179026 | Rix | Sep 2004 | A1 |
20070208996 | Berkner et al. | Sep 2007 | A1 |
20080222560 | Harrison | Sep 2008 | A1 |
20090148064 | Schulz | Jun 2009 | A1 |
20100321405 | MacInnes et al. | Dec 2010 | A1 |
Number | Date | Country |
---|---|---|
10-232945 | Sep 1998 | JP |
2004-104161 | Apr 2004 | JP |
10-2004-0073870 | Aug 2004 | KR |
Entry |
---|
Christian Spielberger et al., “Inner Approximation of Polygons and Polyhedra by Unions of Boxes,” EWCG 2006, Delphi, Mar. 27-29, 2006. |
International Search Report and Written Opinion dated May 11, 2010, issued on PCT Patent Application No. PCT/US/2009/056336 filed on Sep. 9, 2009, The State Intellectual Property Office, Korea. |
Number | Date | Country | |
---|---|---|---|
20120147046 A1 | Jun 2012 | US |