None.
None.
None.
The invention disclosed broadly relates to the field of web page design, and more particularly relates to the field of evaluation of web page design.
State of the art methods for evaluating web page design and layouts are slow, expensive, require manual intervention from designers, and are data-intensive. They generally require data collection from several thousands of users, followed by an analysis of the click-through rate (“CTR”) in order to determine whether or not a new design/layout is effective.
Saliency refers to the state or quality of an item that makes it stand out relative to its neighboring items. In simpler terms, saliency is what catches the eye. There are a few publications on saliency models, such as “Feature Combination Strategies For Saliency-Based Visual Attention Systems,” by Laurent Itti and Christof Koch; “A Model Of Saliency-Based Visual Attention For Rapid Scene Analysis” by Itti, Koch, and Neibur IEEE PAMI 1998; and “Saliency Detection: A Spectral Residual Approach” by Xiaudi Hou and Liqing Zhang, Department of Computer Science, Shanghai Jiao Tong University, CVPR 2007. These models have only been tested on a small scale (less than hundred users) and have not been tested on web pages. There have been no attempts made to log saliency information on a large scale (on millions of users), to analyze the saliency information in relation to click logs, and to apply the saliency information to large scale web page and ad design/layout evaluation and optimization
Referring now in specific detail to the drawings and to
While eye tracking offers an excellent way to track user attention at the millisecond timescale on a display, it suffers the following shortcomings. Commercial eye trackers are expensive (costing more than $10,000 each). This eye tracking method is a good methodology for tracking user attention on a small scale, but because it requires that users be brought into a lab for observation/eye tracking, it is not scalable. Moreover, data collection from 100 users can take more than a week, and hence this methodology is time-consuming.
The toolbar method allows tracking mouse movements, which is considered to be a weak proxy for eye gaze. The pros of this methodology are that it is more scalable than eye tracking and it is also cheaper. The cons are that mouse tracks are inherently noisy (there is wide variability in mouse usage patterns across users), and the relationship between mouse and eye gaze/attention is not well understood.
Finally, click logs offer another way to track user attention on the web. The pros of this approach are that it is highly scalable, and avoids the sampling biases that may arise from eye tracking and toolbars. The cons, however, are that click logs offer poor temporal resolution of several seconds (compared to eye tracking at a millisecond resolution), and reflect user choice which is an after-effect of user attention. For example, the user may attend to several items on the page but may not click on anything. In such cases, click logs cannot offer any information about the user's attention while on the page.
These methods can advantageously mine data for interesting patterns, consider a user's state of mind, and consider his/her natural behavior. The drawbacks are that they require a great deal of user training data over a long period of time; hence they are slow and expensive. Although these current methods 106 provide a great and valuable insight into what catches the user's eye, they are not suited for large scale implementation. The greater the depth of understanding 104 they provide, the less scalable they are. The arrows in
Briefly, according to an embodiment of the invention a method for evaluating a web design includes steps or acts of: receiving input that includes page elements; deriving a plurality of key page elements from the input; running a saliency model on the input to derive hot spots representing those items that are most likely to initially grab a viewer's attention; comparing positions of the hot spots to placement of the plurality of the key page elements to determine effectiveness of the placement of the key page elements; and presenting a saliency map depicting the hot spots in the page elements.
According to another embodiment of the present invention, a system for evaluating web design includes: a memory with computer program instructions for: receiving input that includes page elements; deriving a plurality of key page elements from the input; running a saliency model on the input to derive hot spots representing those items that are most likely to initially grab a viewer's attention; comparing positions of the hot spots to placement of the plurality of the key page elements to determine effectiveness of the placement of the key page elements; and presenting a saliency map depicting the hot spots in the page elements. The system further includes a processor device for carrying out the method steps.
In another embodiment of the present invention, a computer readable storage medium includes instructions for executing the method steps above.
The method can also be implemented as machine executable instructions executed by a programmable information processing system or as hard coded logic in a specialized computing apparatus such as an application-specific integrated circuit (ASIC).
To describe the foregoing and other exemplary purposes, aspects, and advantages, we use the following detailed description of an exemplary embodiment of the invention with reference to the drawings, in which:
While the invention as claimed can be modified into alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that the drawings and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, equivalents and alternatives falling within the scope of the present invention.
Before describing in detail embodiments that are in accordance with the present invention, it should be observed that the embodiments reside primarily in combinations of method steps and system components related to systems and methods for placing computation inside a communication network. Accordingly, the system components and method steps have been represented where appropriate by conventional symbols in the drawings, showing only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein. Thus, it will be appreciated that for simplicity and clarity of illustration, common and well-understood elements that are useful or necessary in a commercially feasible embodiment may not be depicted in order to facilitate a less obstructed view of these various embodiments.
Throughout the specification and claims, the following terms take the meanings explicitly associated herein, unless the context clearly dictates otherwise. The phrase “in one embodiment” as used herein does not necessarily refer to the same embodiment, though it may. Furthermore, the phrase “in another embodiment” as used herein does not necessarily refer to a different embodiment, although it may. Thus, as described below, various embodiments of the invention may be readily combined, without departing from the scope or spirit of the invention. In addition, as used herein, the term “or” is an inclusive “or” operator, and is equivalent to the term “and/or,” unless the context clearly dictates otherwise. The term “based on” is not exclusive and allows for being based on additional factors not described, unless the context clearly dictates otherwise. In addition, throughout the specification, the meaning of” a,” “an,” and “the” includes plural references. The meaning of “in” includes “in” and “on.”
A saliency-based method.
We describe a novel saliency-based method for fast, inexpensive, automated, and scalable evaluation and optimization of webpage designs and layouts, and advertisement and brand effectiveness. This method, which we will describe in detail below, is applicable to a number of different applications, such as advertisement layout/effectiveness, webpage creative content, and optimizing webpage layout.
An embodiment of the present invention uses a model of human visual attention (Hou & Zhang, 2008); inspired by information processing in the primate visual cortex, to predict what users might see on a web page in the first few seconds. Referring now to
This saliency model we use is derived from insights in neuroscience and cognitive psychology which show that visual attention is attracted (in some cases, strongly captured) by display items whose visual properties (color, orientation, brightness, motion) are very different from the surrounding display items. Orientation here refers to whether the item on a page is horizontal or vertical or tilted along some direction. For example, a red Coca Cola can against a green background is salient and attracts attention due to the high color contrast, whereas the same red Coca Cola can in an orange/reddish background is not salient and does not attract attention due to the poor color contrast. Another example is that a tilted building surrounded by several vertical buildings is salient due to the difference in orientation.
From early visual cortex studies, eye tracking experiments have shown that saliency is an important driving factor of attention in the first few seconds, be it watching videos, static images, natural or synthetic displays. Since saliency depends on low-level visual processes, is precognitive and involves involuntary attention mechanisms, it is a basic attention-driving mechanism that can be found across species (e.g., barn owl, archer fish, cat, monkey and human), independent of gender, age, and race. Motivated by these facts, we use saliency-based methods to determine hot spots and predict what users might see on web pages in the first few seconds.
We provide a service and a system implementation to use this model in a new application domain, namely to evaluate webpage designs and layouts by testing whether the “important” page elements appear salient and attract attention. “Importance” here can be defined either in terms of importance/relevance to the user (based on user profile and interests), or importance to the publisher (based on expected revenue from various page elements, e.g., ads).
The importance of page elements can be inferred in many cases. For example, the advertisement is an important page element for the advertiser; the brand logo and content are important page elements for the publisher; content related to the users' search queries are important page elements for the user. Alternatively, the importance of page elements can be explicitly stated as an input to the model, e.g., designers may specify that the mail or search logo is an important page element. In this case, since the model does not by itself know where these important elements are on the page, the x,y coordinates of the bounding box (smallest rectangle that bounds the element) must also be provided as an input to the model, along with the web page. This can be done manually by the designer/publisher, or automatically by using computer vision tools that can provide the coordinates of elements on a rendered page. One known method to get the coordinates is to run a Javascript that provides the mouse x,y coordinates on a web page. The user simply places the mouse at one corner of the bounding box, notes the x,y coordinates and moves to the next corner to capture those coordinates, until done.
Referring now to
The advantages to this method over the current methods are that it is predictive without requiring users to submit training data; it is fast; and it is inexpensive, scalable, and automated. The model can be trained to learn user bias towards page locations, features, content, and the like. We use saliency-based methods on a large scale to evaluate page/ad effectiveness on the Web and relate it to tangibles like clicks and page dwells. In addition, the proposed system implementation of logging page screenshots and saliency of page elements, in addition to standard information in click logs, is novel. This will enable log analysis to reveal the relationship between saliency and click-through-rates (CTR), and page dwells.
Also, the internal web service that we provide to automate saliency computation on any URL or page screenshot is novel. Referring now to
Referring now to
A further breakdown of information may be provided in the form of comparative analysis of visual features that contribute to the saliency of the hotspot. For example, a red Coca Cola can against a green background is salient due to the color contrast, while orientation, brightness and other visual features may not contribute much. The comparative analysis provides feedback and insights to designers and advertisers on the efficacy of their designs. For web portals like Yahoo!, advertisements may be displayed on web pages resulting from a user-defined search based upon one or more search terms. Such advertising is most beneficial to users, advertisers and web portals when the displayed advertisements are relevant to the web portal user's interests. Thus, a variety of techniques have been developed to infer the user's interests/intent and subsequently target the most relevant advertising to that user.
The proposed method provides a novel way to improve advertisement effectiveness by enabling ad selection based on saliency in addition to the existing approach of presenting targeted advertisements to those users interested in receiving product information from various sellers by employing demographic characteristics (i.e., age, income, sex, occupation, etc.) for predicting the behavior of groups of different users. For example, given a set of advertisements that are targeted to the user based on demographics, the proposed method could be used to select the advertisement that is most salient and hence even more likely to be noticed by the user.
Method embodiments.
Referring now to
Referring now to
Referring now to
Referring now to
Benefits and advantages of the invention.
General benefits: Unlike existing methods that require a huge amount of user data for training—thus slow and expensive—our proposed method is fast, cheap and scalable as it is predicts user attention in the first few seconds on page without requiring data from even a single user, in order to evaluate page designs/layouts. Given the importance of page elements to the user and/or publisher (e.g., user preference for various topics on page, or publisher's expected revenue from the various page elements), the model can be used near run time to compare several candidate designs or layouts and present the winning design/layout to the user, that will maximize user attention on the preferred page elements.
Specific applications: 1) evaluating ad & brand effectiveness (testing whether the ad/brand appear salient on page and attract attention, if not, tuning visual properties to improve salience); 2) evaluating ad placement and layout (where should the ad be placed to attract more attention); 3) scalable evaluation of web page and advertisement designs and their impact on click-through rates (CTRs) and page dwells; and 4) predicting the top hotspots of user attention and generating summary statistics, ranking and comparative analysis by features.
Referring now to
In general, the routines which are executed when implementing these embodiments, whether implemented as part of an operating system or a specific application, component, program, object, module or sequence of instructions, will be referred to herein as computer programs, or simply programs. The computer programs typically comprise one or more instructions that are resident at various times in various memory and storage devices in an information processing or handling system such as a computer, and that, when read and executed by one or more processors, cause that system to perform the steps necessary to execute steps or elements embodying the various aspects of the invention.
Throughout the description herein, an embodiment of the invention is illustrated with aspects of the invention embodied solely on computer system 1800. As will be appreciated by those of ordinary skill in the art, aspects of the invention may be distributed amongst one or more networked computing devices which interact with computer system 1800 via one or more data networks such as, for example, network 1820. However, for ease of understanding, aspects of the invention have been embodied in a single computing device—computer system 1800.
Computer system 1800 includes processing device 1802 which communicates with an input/output subsystem 106, memory 104, storage 110 and network 110. The processor device 102 is operably coupled with a communication infrastructure 122 (e.g., a communications bus, cross-over bar, or network). The processor device 102 may be a general or special purpose microprocessor operating under control of computer program instructions 132 executed from memory 104 on program data 134. The processor 102 may include a number of special purpose sub-processors such as a comparator engine, each sub-processor for executing particular portions of the computer program instructions. Each sub-processor may be a separate circuit able to operate substantially in parallel with the other sub-processors.
Some or all of the sub-processors may be implemented as computer program processes (software) tangibly stored in a memory that perform their respective functions when executed. These may share an instruction processor, such as a general purpose integrated circuit microprocessor, or each sub-processor may have its own processor for executing instructions. Alternatively, some or all of the sub-processors may be implemented in an ASIC. RAM may be embodied in one or more memory chips.
The memory 1804 may be partitioned or otherwise mapped to reflect the boundaries of the various memory subcomponents. Memory 1804 may include both volatile and persistent memory for the storage of: operational instructions for execution by CPU 1802, data registers, application storage and the like. Memory 1804 preferably includes a combination of random access memory (RAM), read only memory (ROM) and persistent memory such as that provided by a hard disk drive. The computer instructions/applications that are stored in memory 1804 are executed by processor 1802. The computer instructions/applications and program data can also be stored in hard disk drive for execution by processor device 1802. Storage 1810 may be located within the system 1800 or off-site. Storage 1810 may contain the logs 1813 for the screenshots and saliency data.
Those skilled in the art will appreciate that the functionality implemented within the blocks illustrated in the diagram may be implemented as separate components or the functionality of several or all of the blocks may be implemented within a single component. The I/O subsystem 1816 may comprise various end user interfaces such as a display, keyboards, and a mouse. The I/O subsystem 1816 may further comprise a connection to the Internet 1890.
The computer system 1800 may also include a removable storage drive 1850, representing a floppy disk drive, a magnetic tape drive, an optical disk drive, etc. The removable storage drive 1850 reads from and/or writes to a removable storage unit 120 in a manner well known to those having ordinary skill in the art. Removable storage unit 1852, represents a floppy disk, a compact disc, magnetic tape, optical disk, CD-ROM, DVD-ROM, etc. which is read by and written to by removable storage drive 1850. As will be appreciated, the removable storage unit 1852 includes a non-transitory computer readable medium having stored therein computer software and/or data.
The computer system 1800 may also include a communications interface 1818. Communications interface 1818 allows software and data to be transferred between the computer system and external devices. Examples of communications interface 1818 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, etc. Software and data transferred via communications interface 1818 are in the form of signals which may be, for example, electronic, electromagnetic, optical, or other signals capable of being received by communications interface 1818.
In this document, the terms “computer program medium,” “computer usable medium,” and “computer readable medium” are used to generally refer to both transitory and non-transitory media such as main memory 1804, removable storage drive 1850, a hard disk installed in hard disk drive, and signals. These computer program products are means for providing software to the computer system 1800. The computer readable medium 1852 allows the computer system 1800 to read data, instructions, messages or message packets, and other computer readable information from the computer readable medium 1852.
Therefore, while there has been described what is presently considered to be the preferred embodiment, it will understood by those skilled in the art that other modifications can be made within the spirit of the invention. The above description(s) of embodiment(s) is not intended to be exhaustive or limiting in scope. The embodiment(s), as described, were chosen in order to explain the principles of the invention, show its practical application, and enable those with ordinary skill in the art to understand how to make and use the invention. It should be understood that the invention is not limited to the embodiment(s) described above, but rather should be interpreted within the full meaning and scope of the appended claims.