Reference is made to commonly assigned U.S. patent application Ser. No. 13/598,260 filed concurrently herewith, entitled “System For Generating Tag Layouts”, the disclosure of which is incorporated by reference herein in its entirety.
Reference is made to commonly assigned U.S. patent application Ser. No. 13/598,202 filed concurrently herewith, entitled “Method For Generating Tag Layouts”, the disclosure of which is incorporated by reference herein in its entirety.
The present invention relates to computing a scale factor for inserting a first set of shapes into a second set of shapes to form a combined image by using a technique that iteratively inserts the first set of shapes into the second set of shapes and updates the scale factor in response to a residual area or an overflow area until the first set of shapes has been inserted into the second set of shapes and the residual area in the second set of shapes is below a threshold.
In recent years, information on the World Wide Web has grown drastically and is continuing to do so at an ever increasing rate. The prevalence of social media culture has resulted in the digitization of all aspects of lives (i.e. from conversations to celebrations). In other words, human lives have largely become synonymous with the information we consume and share on the Web. Today's man is surrounded by a plethora of information of various kinds in his disparate digital devices. Ironically, the time slice devoted for consumption of a given piece of information is decreasing by the day. Therefore, the need for concise and meaningful information presentation has become critical.
Undoubtedly, text constitutes the most abundant form of information on the Web. However, the structure of text on the Web can vary from extremely un-syntactical (e.g. tags or short form sentences) to very structured (e.g. well written and edited articles). Of late, tag-clouds have gained significance as a way to visualize structured as well as unstructured text. A tag-cloud is a visual depiction of the word content of a document. A tag-cloud can provide a quick word-content summary of large documents, collections, or tag-sets. It can be constructed by tag-frequencies or derived from an ordered tag-set by using tag-weights. An appealing aspect of tag-clouds is the presentation of the relative emphasis or importance of different words or concepts in a seemingly simple manner that a human eye can quickly discern (in contrast to listing numeric weights against different words).
Prior art in generating layouts of tag-clouds for visualization limits the shape of tag layouts or does not preserve the ordering of the set of tags. In addition, prior art for generating tag layouts can result in layouts with tags repeated to fill the space, or omitted due to lack of space, described by the shape instead of scaling the tags or the shapes to achieve a good fit.
The present invention is directed to producing a combined image of a layout by placing a first set of shapes within a second set of shapes while preserving criteria including a certain ordering of the first set of shapes, amount of overlap of the first set of shapes as placed in the second set of shapes, or the scaling of the first or second set of shapes.
According to the present invention, a method for computing a scale factor to insert a first set of shapes into a second set of shapes to form a combined image comprises:
receiving the first set of shapes, including at least one shape, wherein the first set of shapes represents visual information to be placed inside the second set of shapes;
receiving the second set of shapes, including at least one shape, wherein the second set of shapes defines the boundaries of a space for inserting the first set of shapes;
using a processor to convert the first set of shapes into a set of rectangles and the second set of shapes into a set of intervals;
using the processor to determine an initial scale factor and to generate the combined image by iteratively inserting the set of rectangles into the set of intervals and updating the scale factor in response to a residual area or an overflow area until a final scale factor is reached wherein all the rectangles in the set of rectangles have been inserted into the set of intervals and the residual area in the set of intervals is below a threshold; and
storing the combined image in processor accessible memory.
An advantage of the present invention is that the generated layout is of an arbitrary given shape and compact and the ordering of the first set of shapes is preserved in the layout. This provides an improved visual representation of the first set of shapes in the layout.
It is to be understood that the attached drawings are for purposes of illustrating aspects of the invention and may not be to scale.
The present invention is directed to producing a combined image showing a layout generated by placing a first set of shapes within a second set of shapes while preserving criteria including a certain ordering of the first set of shapes, amount of overlap of the first set of shapes as placed in the second set of shapes, or the scaling of the first or second set of shapes. According to an aspect of the present invention, the first set of shapes can be a set of tags, with an ordering associated with the tags, the second set of shapes can be a set of closed shapes or polygons, and the combined image can be a generated tag layout.
The source of content or program data files 24 can include any form of electronic, optical, or magnetic storage such as optical discs, storage discs, diskettes, flash drives, etc., or other circuit or system that can supply digital data to processor 34 from which processor 34 can load software applications, image files, sets of tags, ordering of tags, and closed shapes or receive software applications, image files, sets of tags, ordering of tags, and closed shapes required to generate a tag layout. In this regard, the content and program data files can comprise, for example and without limitation, software applications, a still image data base, image sequences, a video data base, graphics, and computer generated images, sets of tags, ordering of tags, closed shapes and any other data necessary for practicing aspects of the present invention as described herein. Source of content data files 24 can optionally include devices to capture images to create content data for use in content data files by use of capture devices and/or can obtain content data files that have been prepared by or using other devices or image enhancement and editing software. In
Sensors 38 are optional for particular aspects of the present invention and can include light sensors, biometric sensors, and other sensors known in the art that can be used to detect conditions in the environment of system 26 and to convert this information into a form that can be used by processor 34 of system 26. Sensors 38 can also include one or more cameras, video sensors, scanners, microphones, PDAs, palm tops, laptops that are adapted to capture images and can be coupled to processor 34 directly by cable or by removing portable memory 39 from these devices and/or computer systems and coupling the portable memory to slot 46. Sensors 38 can also include biometric or other sensors for measuring involuntary physical and mental reactions. Such sensors include, but are not limited to, voice inflection, body movement, eye movement, pupil dilation, body temperature, and p4000 wave sensors.
Processor accessible memory 40 can include conventional digital memory devices including solid state, magnetic, optical or other data storage devices, as mentioned above. Processor accessible memory 40 can be fixed within system 26 or it can be removable and portable. In
In
Communication system 54 can comprise for example, one or more optical, radio frequency or other transducer circuits or other systems that convert data into a form that can be conveyed to a remote device such as remote memory system 52 or remote display 56 using an optical signal, radio frequency signal or other form of signal. Communication system 54 can also be used to receive a digital image and other data, as exemplified above, from a host or server computer or network (not shown), a remote memory system 52 or an online printing service 58. Communication system 54 provides processor 34 with information and instructions from signals received thereby. Typically, communication system 54 will be adapted to communicate with the remote memory system 52 by way of a communication network such as a conventional telecommunication or data transfer network such as the internet, a cellular, peer-to-peer or other form of mobile telecommunication network, a local communication network such as wired or wireless local area network or any other conventional wired or wireless data transfer system.
User input system 68 provides a way for a user of system 26 to provide instructions to processor 34, such instructions comprising automated software algorithms of particular aspects of the present invention that generate tag layouts. This software also allows a user to make a designation of content data files, such as sets of tags, closed shapes, and ordering of tags, to be used in generating a tag layout according to an aspect of the present invention and to select an output form for the output product. User controls 68a, 68b or 58a, 58b in user input system 68 and online printing service 58, respectively, can also be used for a variety of other purposes including, but not limited to, allowing a user to arrange, organize and edit content data files, such as image files, sets of tags, ordering of tags, and closed shapes to be used to generate the tag layout, for example, by incorporating image editing software in computer system 26 which can be used to edit the tag layout generated by computer system 26, to provide information about the user or audience, to provide annotation data such as voice and text data, to identify characters in the content data files, and to perform other interactions with system 26.
In this regard user input system 68 can comprise any form of device capable of receiving an input from a user and converting this input into a form that can be used by processor 34. For example, user input system 68 can comprise a touch screen input 66, a touch pad input, a 4-way switch, a 6-way switch, an 8-way switch, a stylus system, a trackball system, a joystick system, a voice recognition system, a gesture recognition system, a keyboard 68a, mouse 68b, a remote control or other such systems. In
In one aspect of the present invention, step 115 can be performed iteratively to generate the tag layout 245. In this case, the processor 34 can compute the scale factor 235 for the polygon 800 that can compactly accommodate the ordered set of tags such that the following criteria are adhered to.
The layout preserves the order of tags.
Different tags do not overlap within the polygon.
In some aspects of the present invention, the generated tag layout 245 can preserve the order of tags going left-right or top-bottom as shown in
To aid understanding of step 210,
In step 315, the set of sequences of integer points 310 denoted by S is converted to the set of line segments 320. Equation 1 can be used to represent the polygon 800 as the set of sequence of integer points 310 using n points.
$$S = \{\, p_i = (x_i, y_i) \mid 0 \le i < n \,\} \tag{1}$$
where S is the set of sequence of integer points 310, pi is an ith point on the polygon 800, and xi and yi are coordinates of the ith point. The set of sequence of integer points 310 S can be converted to the set of line segments 320 denoted by L using Equation (2) as illustrated in
$$L = \{\, l_i = \overline{p_i p_j} \mid j = (i+1)\ \%\ (n+1) \,\} \tag{2}$$
where li is a line segment joining neighboring points pi and pj and the % represents a remainder of a division operation where (i+1) is divided by (n+1). In an aspect of the present invention, there is at least one sequence of integer points 310 S, and therefore, there is at least one set of line segments 320 L.
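By way of a non-limiting illustration, the conversion of step 315 can be sketched as follows. The function name `polygon_to_segments` and the tuple-based point type are assumptions made only for this sketch. The sketch stores n points with zero-based indices 0 to n−1 and therefore wraps the neighbor index with the number of stored points so that the last point is joined back to the first; whether that matches the divisor stated above depends on the indexing convention and is an assumption.

```python
from typing import List, Tuple

Point = Tuple[int, int]
Segment = Tuple[Point, Point]


def polygon_to_segments(points: List[Point]) -> List[Segment]:
    """Join each point p_i to its neighbor p_j so the outline is closed."""
    n = len(points)
    if n < 3:
        raise ValueError("a closed polygon needs at least three points")
    # Assumption: indices run 0..n-1, so the wrap-around neighbor is (i + 1) % n.
    return [(points[i], points[(i + 1) % n]) for i in range(n)]


# A 4x4 square given as a sequence of integer points.
square = [(0, 0), (4, 0), (4, 4), (0, 4)]
segments = polygon_to_segments(square)
```

Each polygon in the set of polygons would be converted independently, giving one set of line segments per sequence of integer points.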
The set of line segments 320 can be converted to the set of intervals 220 using step 325. Equation 3 describes a set of horizontal lines LH at a plurality of integer locations y.
$$L_H = \{\, y = c \mid \lceil y_{\min} \rceil \le c \le \lfloor y_{\max} \rfloor \,\} \tag{3}$$
where ymin and ymax are a minimum and a maximum value of yi respectively, ⌈ymin⌉ and ⌊ymax⌋ denote the mathematical ceiling and floor operations on ymin and ymax respectively, and c is an integer value between ⌈ymin⌉ and ⌊ymax⌋ in one aspect of the present invention. However, ymin, ymax, and c can be real numbers in other aspects of the invention. Next, the intersections between the set of line segments 320 and LH are determined. As illustrated in
The set of intervals 220 denoted by R is constructed as described in Equation 4.
$$R = \{\, r_c(i) = [\,s_c(i),\ e_c(i)\,] \mid \lceil y_{\min} \rceil \le c \le \lfloor y_{\max} \rfloor,\ 0 \le i \le M-1 \,\} \tag{4}$$
where rc(i) is an ith interval at location c, sc(i) is a starting location of the interval rc(i), ec(i) is an ending location of the interval rc(i), M is the number of unique intervals determined by the intersections of LH and the set of line segments 320. For example,
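A minimal sketch of step 325, under stated assumptions, is given here for illustration only. It intersects each horizontal line y=c with the line segments, sorts the crossings, and pairs them into intervals. The name `segments_to_intervals`, the dictionary layout, and the half-open crossing rule (an edge counts only when its lower y is at most c and its upper y is above c) are assumptions of this sketch; with that rule a row lying exactly on the topmost edge yields no interval.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]
Interval = Tuple[float, float]


def segments_to_intervals(segments: List[Segment]) -> Dict[int, List[Interval]]:
    """Map each integer row c to the sorted [start, end] spans inside the outline."""
    ys = [p[1] for seg in segments for p in seg]
    c_lo, c_hi = math.ceil(min(ys)), math.floor(max(ys))
    rows: Dict[int, List[Interval]] = {}
    for c in range(c_lo, c_hi + 1):
        xs = []
        for (x0, y0), (x1, y1) in segments:
            if y0 == y1:
                continue  # horizontal edges never cross a horizontal line
            # Assumed half-open rule so a shared vertex is counted only once.
            if min(y0, y1) <= c < max(y0, y1):
                t = (c - y0) / (y1 - y0)
                xs.append(x0 + t * (x1 - x0))
        xs.sort()
        # Consecutive pairs of crossings bound the inside of the outline.
        rows[c] = [(xs[k], xs[k + 1]) for k in range(0, len(xs) - 1, 2)]
    return rows


square_segments = [((0, 0), (4, 0)), ((4, 0), (4, 4)), ((4, 4), (0, 4)), ((0, 4), (0, 0))]
rows = segments_to_intervals(square_segments)
```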
The flow chart of
$$R \times s(0) = \{\, r_c(i) = [\,(s_c(i) \times s(0))^{*},\ (e_c(i) \times s(0))^{*}\,] \,\} \tag{5}$$
where c is in an interval ⌈ymin×s(0)⌉≦c≦⌊ymax×s(0)⌋ and * represents a mathematical rounding operation to the closest integer. The scale factor 235 s is increased when not all of the rectangles Tk can be inserted into the polygon R×s, and the scale factor 235 s is reduced when there is a residual space after the successful insertion of all of the rectangles Tk into the polygon R×s.
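The scaling of the set of intervals can be illustrated with the following sketch, which is one possible reading and not a definitive implementation. The names `scale_intervals` and `round_half_up` are hypothetical. The sketch assumes the rows are contiguous integers, maps each scaled row c back to the nearest original row c/s, and rounds halves upward; none of these choices is specified in the description above.

```python
import math
from typing import Dict, List, Tuple

Interval = Tuple[int, int]


def round_half_up(x: float) -> int:
    """Round to the closest integer, with halves rounded upward (an assumption)."""
    return math.floor(x + 0.5)


def scale_intervals(rows: Dict[int, List[Interval]], s: float) -> Dict[int, List[Interval]]:
    """Return the intervals of the polygon scaled by factor s."""
    c_min, c_max = min(rows), max(rows)
    scaled: Dict[int, List[Interval]] = {}
    for c in range(math.ceil(c_min * s), math.floor(c_max * s) + 1):
        # Assumption: each scaled row copies the nearest original row, clamped to range.
        src = min(max(round_half_up(c / s), c_min), c_max)
        scaled[c] = [(round_half_up(a * s), round_half_up(b * s)) for a, b in rows[src]]
    return scaled


rows = {c: [(0, 4)] for c in range(4)}
doubled = scale_intervals(rows, 2.0)
```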
Equation 6 describes how to compute a residual area 675 denoted by Ar+(R′) if all of the rectangles Tk are inserted into an updated set of intervals R′.
where R′, ec′(i) and sc′(i) denote updated values of R, ec(i), and sc(i) after the insertion of the set of rectangles 225 into the set of intervals 220, respectively. To aid in understanding, an aspect of the present invention is now described with reference to
If only v number of rectangles Tk, where v is less than the total number of rectangles N in the set of rectangles 225, are inserted into the set of intervals 220, an overflow area 670 denoted by Ar−(v) can be computed as described in Equation 7.
where wk and hk are the width and the height of Tk, respectively. A result 1000 with a particular scale and a result 1010 with another particular scale are shown in
Referring back to
Equation 8 describes how to set the initial scale factor 415 denoted by s(0).
where Ar+(R) is computed by setting R′=R in Equation 6 and Ar−(0) is computed by setting v=0 in Equation 7.
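Equations 6 through 8 are not reproduced in this text, so the following sketch rests entirely on assumed forms and is offered only to make the quantities concrete. It assumes the residual area is the summed length of the intervals that remain free, the overflow area is the summed area wk×hk of the rectangles not yet inserted, and the initial scale factor is the square root of their ratio (because area grows with the square of a linear scale factor). All function names are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Interval = Tuple[int, int]
Rect = Tuple[int, int]  # (width, height)


def residual_area(rows: Dict[int, List[Interval]]) -> float:
    """Assumed form: total length of all free intervals over all rows."""
    return sum(e - s for intervals in rows.values() for s, e in intervals)


def overflow_area(rects: List[Rect], v: int) -> float:
    """Assumed form: total area of the rectangles after the first v inserted ones."""
    return sum(w * h for w, h in rects[v:])


def initial_scale(rows: Dict[int, List[Interval]], rects: List[Rect]) -> float:
    """Assumed form: scale at which polygon area equals total rectangle area."""
    return math.sqrt(overflow_area(rects, 0) / residual_area(rows))


rows = {c: [(0, 4)] for c in range(4)}   # a 4x4 square, area 16
rects = [(4, 4), (8, 4), (4, 4)]         # total area 64
s0 = initial_scale(rows, rects)
```

Under these assumptions the empty polygon plays the role of R′=R and no rectangle has been inserted (v=0), matching the substitutions described above.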
In step 420, variables used to compute the scale factor 235 are initialized. In one aspect of the present invention, these variables include f representing a goodness of fit 690 of inserting the set of rectangles 225 into the set of intervals 220 at a particular scale factor 235 s, s+ representing the scale factor 235 that satisfies f>0, f+ representing the goodness of fit 690 for s+, s− representing the scale factor 235 that satisfies f<0, f− representing the goodness of fit 690 for s−, and Δs representing a change in scale factor 235. In one aspect of the present invention, these variables can be initialized as f=1, step=0.5, s=s(0), s+=−1, s−=−1, f−=0, f+=0, and Δs=∞. In step 435, the goodness of fit, f, is computed and if there is residual area 680, then s+ is set to s and f+ is set to f. If f<0, meaning some of the tags were not inserted into the set of intervals 220, then s− is set to s and f− is set to f. Step 435 is shown in detail in
In step 425, the convergence of the root-finding algorithm is checked. If Δs>ε or f<0, the root-finding algorithm has not converged, meaning either the change in scale factor 235 is greater than a threshold ε or some tags were not inserted into the polygon 800. If Δs≦ε and f≧0, the root-finding algorithm has converged, meaning all of the rectangles Tk have been inserted into the set of polygons 200 and the change in scale factor 235 between a previous iteration and the current iteration is smaller than the threshold ε. In this case, the value of the scale factor 235 is output to the algorithm described in the flowchart of
The new scale factor is used to compute a new goodness of fit 690 as described in step 435. This process continues iteratively until the root-finding algorithm converges as described in step 425.
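The iteration can be sketched as a bisection-style search, assuming that the new scale factor is the midpoint of s− and s+ once both are known and that the scale factor is otherwise grown or shrunk by the fraction `step`. The update rule itself is not spelled out in this text, so that rule, the function name `find_scale`, the `max_iter` guard, and the toy goodness-of-fit function are all assumptions; only the convergence test (Δs≦ε and f≧0) follows the description above.

```python
from typing import Callable, Optional


def find_scale(fit: Callable[[float], float], s0: float,
               eps: float = 1e-3, step: float = 0.5, max_iter: int = 200) -> float:
    """Search for a scale factor whose goodness of fit is non-negative."""
    s = s0
    s_plus: Optional[float] = None   # last scale where every rectangle fit (f >= 0)
    s_minus: Optional[float] = None  # last scale where some rectangle overflowed (f < 0)
    for _ in range(max_iter):
        f = fit(s)
        if f >= 0:
            s_plus = s
        else:
            s_minus = s
        if s_plus is not None and s_minus is not None:
            new_s = 0.5 * (s_plus + s_minus)   # assumed update: bisect the bracket
        elif s_plus is None:
            new_s = s * (1 + step)             # nothing fits yet: enlarge the polygon
        else:
            new_s = s * (1 - step)             # everything fits: try a smaller polygon
        delta_s = abs(new_s - s)
        if delta_s <= eps and f >= 0:
            return s                           # converged on a scale that fits
        s = new_s
    raise RuntimeError("scale search did not converge")


# Toy stand-in for step 435: polygon area 16*s^2 minus total rectangle area 64.
toy_fit = lambda s: 16 * s * s - 64
scale = find_scale(toy_fit, 1.0)
```

In practice `fit` would run the rectangle insertion at the given scale and return the residual area or the negated overflow area.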
In step 650, the values of k and R are updated as k=k+1 and R=R′. In step 665, the overflow area 670 is computed as Ar−(k) by setting v=k in Equation 7. In the following step 660, the overflow area 670 multiplied by −1, that is, −1×Ar−(k), is used to determine the goodness of fit 690.
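The sign convention of the goodness of fit can be illustrated with a short sketch. The function name `goodness_of_fit` and its arguments are hypothetical, and the overflow area is assumed to be the summed area of the rectangles that were not inserted.

```python
from typing import List, Tuple

Rect = Tuple[int, int]  # (width, height)


def goodness_of_fit(num_inserted: int, rects: List[Rect], residual: float) -> float:
    """Return the residual area if all rectangles fit, else the negated overflow area."""
    if num_inserted == len(rects):
        return residual
    # Assumed overflow area: total area of rectangles k >= num_inserted.
    return -1 * sum(w * h for w, h in rects[num_inserted:])


rects = [(4, 2), (3, 2), (5, 1)]
partial = goodness_of_fit(2, rects, residual=0.0)   # one rectangle left over
complete = goodness_of_fit(3, rects, residual=6.0)  # all inserted, 6 units free
```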
In a particular aspect of the present invention, the order is loosely maintained from top left to the bottom right. In other words, the order of insertion of consecutive ordered rectangles is left to right. However, if there is a rectangle that can be fit into a blank space between two consecutive rectangles in the order specified by the ordering of the set of tags 205, the ordering can be ignored locally to achieve a good fit. In other aspects of the invention, the order is strictly maintained from top left to the bottom right and no insertion of tags is allowed out of order.
The set of rectangles 225 is inserted into the set of intervals 220 using step 635 iteratively for each rectangle. As discussed above, step 635 preserves the order of the insertion of rectangles from left to right. As the set of intervals 220 is filled with rectangles from the set of rectangles 225, the next rectangle Tk is inserted into the set of intervals 220 at the leftmost available 2D integer location (x,y) where y is set either by step 625 or step 655 and x is determined by step 635.
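A simplified sketch of finding the leftmost available location and updating the intervals follows. It takes the row y as given (the steps that choose y are not modeled), and the names `find_leftmost_x` and `insert_rectangle` and the dictionary of free intervals per row are assumptions of the sketch.

```python
from typing import Dict, List, Optional, Tuple

Interval = Tuple[int, int]
Rows = Dict[int, List[Interval]]


def find_leftmost_x(rows: Rows, y: int, w: int, h: int) -> Optional[int]:
    """Smallest x at which a w-by-h rectangle fits in rows y..y+h-1, or None."""
    span = range(y, y + h)
    if any(c not in rows for c in span):
        return None  # the rectangle would extend outside the set of intervals
    # The leftmost feasible x always coincides with the start of some free interval.
    candidates = sorted({s for c in span for s, _ in rows[c]})
    for x in candidates:
        if all(any(s <= x and x + w <= e for s, e in rows[c]) for c in span):
            return x
    return None


def insert_rectangle(rows: Rows, x: int, y: int, w: int, h: int) -> Rows:
    """Return updated intervals with [x, x+w] removed from rows y..y+h-1."""
    updated = dict(rows)
    for c in range(y, y + h):
        new_row: List[Interval] = []
        for s, e in rows[c]:
            if s <= x and x + w <= e:
                if s < x:
                    new_row.append((s, x))      # free space left of the rectangle
                if x + w < e:
                    new_row.append((x + w, e))  # free space right of the rectangle
            else:
                new_row.append((s, e))
        updated[c] = new_row
    return updated


rows = {0: [(0, 10)], 1: [(2, 10)], 2: [(0, 10)]}
x = find_leftmost_x(rows, 0, 3, 2)
rows_after = insert_rectangle(rows, x, 0, 3, 2)
```

A return value of None corresponds to the case where the rectangle cannot be placed at the current row and scale.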
It is possible that a rectangle 950 having width of w2 and height 2 cannot be inserted into a subset of intervals 955. The rectangle 950 is positioned at the leftmost available location at y=jj for insertions as shown in 960. The rectangle 950 has to be shifted down to allow for complete insertion into the subset of intervals 955. However, as shown in 965, this results in the rectangle 950 extending outside the subset of intervals 955. In this case, the rectangle 950 cannot be inserted into the subset of intervals 955 at the current scale and an indication of this is returned to the algorithm of
An aspect of the present invention has the constraint that there is an ordering associated with the set of tags 205. In one aspect of the present invention, the ordering of the set of tags 205 can be based on a semantic analysis of the set of tags 205. In this case, a co-occurrence matrix can be constructed for the set of tags 205. The co-occurrence matrix measures how often tags appear together in a given set of pictures or documents. The co-occurrence matrix can be clustered using methods well known in the art such as spectral clustering. The clustering of tags thus obtained can be used to derive an ordering for the tags such that tags within the same cluster are closer in the ordering while tags in different clusters are further in the ordering. In another aspect of the invention, page-rank, as taught in Page et al., can be computed for each tag based on the co-occurrence matrix. Tags can then be ordered directly based on their page-rank.
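As a hedged illustration of the page-rank ordering, the sketch below builds a co-occurrence matrix from a small collection of tagged documents and ranks tags by power iteration. The function names, the damping value of 0.85, the fixed iteration count, and the column normalization of the co-occurrence counts are assumptions of this sketch and are not taken from the description.

```python
from itertools import combinations
from typing import List


def cooccurrence(tags: List[str], documents: List[List[str]]) -> List[List[float]]:
    """Count how often each pair of tags appears in the same document."""
    index = {t: i for i, t in enumerate(tags)}
    n = len(tags)
    m = [[0.0] * n for _ in range(n)]
    for doc in documents:
        present = sorted({index[t] for t in doc if t in index})
        for i, j in combinations(present, 2):
            m[i][j] += 1.0
            m[j][i] += 1.0
    return m


def pagerank(m: List[List[float]], damping: float = 0.85, iters: int = 100) -> List[float]:
    """Power iteration on the column-normalized co-occurrence matrix."""
    n = len(m)
    rank = [1.0 / n] * n
    col_sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    for _ in range(iters):
        new_rank = []
        for i in range(n):
            total = 0.0
            for j in range(n):
                if col_sums[j] > 0:
                    total += m[i][j] / col_sums[j] * rank[j]
                else:
                    total += rank[j] / n  # a tag with no co-occurrences spreads evenly
            new_rank.append((1 - damping) / n + damping * total)
        rank = new_rank
    return rank


def order_tags(tags: List[str], documents: List[List[str]]) -> List[str]:
    """Order tags from highest to lowest rank."""
    ranks = pagerank(cooccurrence(tags, documents))
    return [t for _, t in sorted(zip(ranks, tags), key=lambda p: -p[0])]


docs = [["beach", "sun"], ["beach", "sun"], ["beach", "snow"]]
ordering = order_tags(["beach", "sun", "snow"], docs)
```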
The semantic analysis can also be performed using natural language understanding. Natural language understanding involves classifying tags into their respective parts of speech (nouns, adjectives, verbs, adverbs etc.). Such a classification can be performed using WordNet, a lexical database of the English language developed by the Cognitive Science Laboratory at Princeton University. WordNet is a lexical reference system that uses psycholinguistic theories of human lexical memory. The linguistic classification of tags obtained using WordNet can be used to derive an ordering for the tags such that tags within the same linguistic class are closer in the ordering while tags in different linguistic classes are further in the ordering. In another aspect of the invention, page-rank can be computed for each tag based on a pairwise tags linguistic class matrix (wherein entry for a pair of tags is 1 only if they are in the same linguistic class and otherwise the entry is 0). Tags can then be ordered directly based on their page-rank.
In another aspect of the present invention, the ordering of the set of tags 205 can be performed based on user preference. A user's picture collection can be analyzed to reveal the user's preferences. Pictures in a user's collection can be annotated with words. The image annotations obtained from user's collection can be used to derive an ordering for the tags such that tags that are related to the image annotations are ranked higher in the ordering and tags that are unrelated to the image annotations are ranked lower in the ordering.
In another aspect of the present invention, there can be more than one closed shape, represented using the set of polygons 200, for placing the set of tags 205 to generate the tag layout 245. In this case, the set of tags 205 has to be divided into subsets of tags. The number of subsets of tags is equal to the number of polygons 800 in the set of polygons 200. In one aspect of the present invention, the set of tags 205 is divided into subsets of tags based upon the relative sizes of the polygons 800 and the sizes of the tags in the set of tags. The sizes of all of the polygons 800 are added together to generate a total size for the set of polygons. A subset size for each polygon 800 in the set of polygons 200 is determined by computing a ratio between the size of the polygon 800 and the total size of all the polygons. The set of tags 205 is represented by the set of rectangles T 225, where the size of each rectangle in the set of rectangles 225 is based on the size of the corresponding tag. The sizes of all the rectangles in the set of rectangles 225 are added together to compute a total size for the rectangles to be placed into the set of polygons 200. The set of rectangles 225 can be divided into subsets based on the computed ratios between the sizes of the polygons 800 and the total size of all the polygons. The number of subsets of sets of rectangles 225 is equal to the number of polygons 800 in the set of polygons 200.
In one aspect of the present invention, the set of tags 205 is split into subsets based upon the ordering of the set of tags 205 such that a polygon 800 in the set of polygons 200 has a subset of consecutive tags in the set of tags 205. In this aspect, the ordering is maintained within each polygon 800 independently of other polygons in the set of polygons 200. In another aspect of the present invention, the tags can be split across the polygons 800 such that the ordering is maintained across all the polygons simultaneously. In this case, the subset of tags associated with a polygon 800 may not have consecutive tags from the set of tags 205.
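One way to divide the rectangles in proportion to polygon size is sketched below, under stated assumptions. The sketch keeps the rectangles in order, gives each polygon a share of the total rectangle area equal to its share of the total polygon area, and moves to the next polygon when the midpoint of a rectangle's area passes the current share. The midpoint rule and the name `split_by_area` are assumptions; the description above only states that the split is based on the computed ratios.

```python
from typing import List


def split_by_area(rect_areas: List[float], polygon_areas: List[float]) -> List[List[int]]:
    """Assign consecutive rectangle indices to polygons in proportion to polygon area."""
    total_poly = sum(polygon_areas)
    total_rect = sum(rect_areas)
    # Cumulative rectangle-area budget at the end of each polygon.
    bounds = []
    running = 0.0
    for a in polygon_areas:
        running += a / total_poly * total_rect
        bounds.append(running)
    subsets: List[List[int]] = [[] for _ in polygon_areas]
    p = 0
    used = 0.0
    for k, area in enumerate(rect_areas):
        # Assumed rule: advance when this rectangle's midpoint exceeds the budget.
        while p < len(bounds) - 1 and used + area / 2 > bounds[p]:
            p += 1
        subsets[p].append(k)
        used += area
    return subsets


# Four equal rectangles, two polygons with a 3:1 size ratio.
subsets = split_by_area([10, 10, 10, 10], [30, 10])
```

The number of returned subsets always equals the number of polygons, although a subset can be empty when a polygon is very small.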
The invention has been described in detail with particular reference to preferred embodiments thereof, but it will be understood that variations and modifications can be effected within the spirit and scope of the invention.
Number | Name | Date | Kind |
---|---|---|---|
5572235 | Mical et al. | Nov 1996 | A |
6737591 | Lapstun et al. | May 2004 | B1 |
6920608 | Davis | Jul 2005 | B1 |
7096199 | Lapstun et al. | Aug 2006 | B2 |
7105753 | Lapstun et al. | Sep 2006 | B1 |
7152942 | Walmsley et al. | Dec 2006 | B2 |
7167181 | Duluk et al. | Jan 2007 | B2 |
7574486 | Cheng et al. | Aug 2009 | B1 |
8113727 | Niwa et al. | Feb 2012 | B2 |
8400649 | Selvaraj | Mar 2013 | B2 |
8548874 | Nations et al. | Oct 2013 | B2 |
20040233163 | Lapstun et al. | Nov 2004 | A1 |
20050160065 | Seeman | Jul 2005 | A1 |
20050259818 | Silverbrook et al. | Nov 2005 | A1 |
20070253010 | Selvaraj | Nov 2007 | A1 |
20080082914 | Ueno et al. | Apr 2008 | A1 |
20080139191 | Melnyk et al. | Jun 2008 | A1 |
20080320036 | Winter | Dec 2008 | A1 |
20090300506 | Drucker et al. | Dec 2009 | A1 |
20110074824 | Srinivasan et al. | Mar 2011 | A1 |
20120036278 | Rafsky et al. | Feb 2012 | A1 |
20120158544 | Nations et al. | Jun 2012 | A1 |
Entry |
---|
Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd—(1999)—“The PageRank Citation Ranking: Bringing Order to the Web”—Technical Report—Stanford InfoLab—pp. 1-17. |
Number | Date | Country |
---|---|---|
20140063536 A1 | Mar 2014 | US |