The present application relates generally to data processing systems and, in one specific example, to techniques for conducting A/B experimentation of online content.
The practice of A/B experimentation, also known as “A/B testing” or “split testing,” is a practice for making improvements to webpages and other online content. A/B experimentation typically involves preparing two versions (also known as variants, or treatments) of a piece of online content, such as a webpage, a landing page, an online advertisement, etc., and providing them to separate audiences to determine which variant performs better.
Some embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings in which:
Example methods and systems for conducting A/B experimentation of online content are described. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of example embodiments. It will be evident, however, to one skilled in the art that the embodiments of the present disclosure may be practiced without these specific details.
As shown in
Once registered, a member may invite other members, or be invited by other members, to connect via the social network service. A “connection” may require a bi-lateral agreement by the members, such that both members acknowledge the establishment of the connection. Similarly, with some embodiments, a member may elect to “follow” another member. In contrast to establishing a connection, the concept of “following” another member typically is a unilateral operation, and at least with some embodiments, does not require acknowledgement or approval by the member that is being followed. When one member follows another, the member who is following may receive status updates or other messages published by the member being followed, or relating to various activities undertaken by the member being followed. Similarly, when a member follows an organization, the member becomes eligible to receive messages or status updates published on behalf of the organization. For instance, messages or status updates published on behalf of an organization that a member is following will appear in the member's personalized data feed or content stream. In any case, the various associations and relationships that the members establish with other members, or with other entities and objects, are stored and maintained within the social graph, shown in
The social network service may provide a broad range of other applications and services that allow members the opportunity to share and receive information, often customized to the interests of the member. For example, with some embodiments, the social network service may include a photo sharing application that allows members to upload and share photos with other members. With some embodiments, members may be able to self-organize into groups, or interest groups, organized around a subject matter or topic of interest. With some embodiments, the social network service may host various job listings providing details of job openings with various organizations.
As members interact with the various applications, services and content made available via the social network service, the members' behavior (e.g., content viewed, links or member-interest buttons selected, etc.) may be monitored and information concerning the member's activities and behavior may be stored, for example, as indicated in
With some embodiments, the social network system 20 includes what is generally referred to herein as an A/B testing system 200. The A/B testing system 200 is described in more detail below in conjunction with
Although not shown, with some embodiments, the social network system 20 provides an application programming interface (API) module via which third-party applications can access various services and data provided by the social network service. For example, using an API, a third-party application may provide a user interface and logic that enables an authorized representative of an organization to publish messages from a third-party application to a content hosting platform of the social network service that facilitates presentation of activity or content streams maintained and presented by the social network service. Such third-party applications may be browser-based applications, or may be operating system-specific. In particular, some third-party applications may reside and execute on one or more mobile devices (e.g., phone, or tablet computing devices) having a mobile operating system.
According to various example embodiments, an A/B experimentation system is configured to enable the preparation and conduction of an A/B experiment of online content among members of an online social networking service such as LinkedIn®. The A/B experimentation system may display a targeting user interface allowing the user to specify targeting criteria statements that reference members of an online social networking service based on their member attributes (e.g., their member profile attributes displayed on their member profile page, or other member attributes that may be maintained by an online social networking service that may not be displayed on member profile pages). In some embodiments, the member attribute is any of location, role, industry, language, current job, employer, experience, skills, education, school, endorsements of skills, seniority level, company size, connections, connection count, account level, name, username, social media handle, email address, phone number, fax number, resume information, title, activities, group membership, images, photos, preferences, news, status, links or URLs on a profile page, and so forth. For example, the user can enter targeting criteria such as “role is sales”, “industry is technology”, “connection count >500”, “account is premium”, and so on, and the system will identify a targeted segment of members of an online social network service satisfying all of these criteria. The system can then target all of these users in the targeted segment for online A/B experimentation.
Once the segment of users to be targeted has been defined, the system allows the user to define different variants for the experiment, such as by uploading files, images, HTML code, webpages, data, etc., associated with each variant and providing a name for each variant. One of the variants may correspond to an existing feature or variant, also referred to as a “control” variant, while the other may correspond to a new feature being tested, also referred to as a “treatment”. For example, if the A/B experiment is testing a user response (e.g., click through rate or CTR) for a button on a homepage of an online social networking service, the different variants may correspond to different types of buttons such as a blue circle button, a blue square button with rounded corners, and so on. Thus, the user may upload an image file of the appropriate buttons and/or code (e.g., HTML code) associated with different versions of the webpage containing the different variants.
Thereafter, the system may display a user interface allowing the user to allocate different variants to different percentages of the targeted segment of users. For example, the user may allocate variant A to 10% of the targeted segment of members, variant B to 20% of the targeted segment of members, and a control variant to the remaining 70% of the targeted segment of members, via an intuitive and easy-to-use user interface. The user may also change the allocation criteria by, for example, modifying the aforementioned percentages and variants. Moreover, the user may instruct the system to execute the A/B experiment, and the system will identify the appropriate percentages of the targeted segment of members and expose them to the appropriate variants.
Turning now to
In order to run an experiment, the A/B testing system 200 allows a user to create a testKey, which is a unique identifier that represents the concept or the feature to be tested. The A/B testing system 200 then creates an actual experiment as an instantiation of the testKey. Such hierarchical structure makes it easy to manage experiments at various stages of the testing process. For example, suppose the user wants to investigate the benefits of adding a background image. The user may begin by diverting only 1% of US users to the treatment, then increasing the allocation to 50% and eventually expanding to users outside of the US market. Even though the feature being tested remains the same throughout the ramping process, it requires different experiment instances as the traffic allocations and targeting changes. In other words, an experiment acts as a realization of the testKey, and only one experiment per testKey can be active at a time.
Every experiment is comprised of one or more segments, with each segment identifying a subpopulation to experiment on. For example, a user may set up an experiment with a “whitelist” segment containing only the team members developing the product, an “internal” segment consisting of all company employees and additional segments targeting external users. Because each segment defines its own traffic allocation, the treatment can be ramped to 100% in the whitelist segment, while still running at 1% in the external segments. Note that segment ordering matters because members are only considered as part of the first eligible segment. After the experimenters input their design through an intuitive User Interface, all the information is then concisely stored by the A/B testing system 200 in a DSL (Domain Specific Language). For example, the line below indicates a single segment experiment targeting English-speaking users in the US where 10% of them are in the treatment variant while the rest in control:
(ab(=(locale)“en_US”)[treatment 10% control 90%])
In some embodiments, the A/B testing system 200 may log data every time a treatment for an experiment is called, and not simply for every request to a webpage on which the treatment might be displayed. This not only reduces the logs footprint, but also enables the A/B testing system 200 to perform triggered analysis, where only users who were actually impacted by the experiment are included in the A/B test analysis. For example, LinkedIn.com could have 20 million daily users, but only 2 million of them visited the “jobs” page where the experiment is actually on. Without such trigger information, it is difficult to isolate the real impact of the experiment from the noise, especially for experiments with low trigger rates. For example, as illustrated in diagram 300 in
According to various example embodiments, experiment may be tracked in two ways: triggered experiments and targeted experiments. In the case of tracking for triggered experiments, the A/B testing system 200 enables a user to generate and experiment with multiple variants of a piece of online content, such as multiple types of search buttons on homepage. Once the experiment is created, the A/B testing system 200 deploys the code on the service (e.g., 202) that runs that online content (e.g., a service that runs the search box). Once a member interacts with the content, the service will execute a call to an experiment server or client (e.g., 204), with the information describing the A/B test (e.g., “Test 1”), and a request of what variant to display (e.g., the blue or the red color search button). The experiment server/client will return the result of the correct variant to display. Thus, on a member basis, the A/B testing system 200 determines what to display for each experiment. Each call to experiment server or client is logged as a triggered experiment.
In the case of tracking for targeted experiments the A/B testing system 200 enables a user to create a targeted experiment that is not deployed on any service, but the user is still able to track with less precision how the experiment may have performed in a hypothetical case. For example, the user may wish to generate a dummy experiment in order to see a change in metrics (e.g., page views) for everyone that visited a homepage. In particular, since no service is calling these experiments, no triggered experiment logs are created. Thus, the A/B testing system 200 analyzes all the page visits to the underlying webpage (e.g. linkedin.com) where the experiment resides or is accessible from, and replays those visits as if they hit a service that calls the experiment server or client. For example, suppose that out of 350 million LinkedIn members, 50 million hit linkedin.com yesterday with a full page view. Of those 50 million members, the A/B testing system 200 will record an execution of the A/B experiment as if this experiment was actually executed for them on a service. In this way, the A/B testing system 200 generates a targeted log for that service. In other words, the A/B testing system 200 records, for a given experiment and a given member ID that visited the homepage, the variant that would have been returned if the experiment had been deployed for this member ID. Thus, since 200 has already determined what variant should be shown to each member, member visits to a homepage may be utilized to generate less precise tracking results about how experiments may have been executed.
In some embodiments, after the service module 202 displays the correct variant to a member associated with a member ID (see operation 405), the experiment module 204 may further store, in a database, information indicating that the correct variant of the A/B experiment has been displayed for the given member ID. Moreover, the experiment module 204 may update information in the database indicating a count of member IDs that have been exposed to the correct variant of the A/B experiment.
As described above, the database 206 may include member-variant information indicating, for each of a plurality of A/B experiments, a list of member IDs and a correct variant associated with each of the member IDs (e.g., see member-variant information 600 in
In some embodiments, the experiment module 204 may display, via a user interface displayed on the client device, a size of a triggered population for the A/B experiment and/or a size of a targeted population of the A/B experiment. The targeted population corresponds to members of the online social networking service for which a correct variant of the A/B experiment has been assigned (e.g., see operation 504 and 505 in method 500). The triggered population corresponds to members of the online social networking service for which a correct variant of the A/B experiment has actually been retrieved and displayed (e.g., see operation 405 in method 400). Instead of displaying the size of a triggered population for the A/B experiment and/or a size of a targeted population of the A/B experiment, the experiment module 204 may display a metric indicating the relationship between the sizes of the populations (e.g., a metric indicating that the triggered population is 10% of the targeted population). In some embodiments, the experiment module 204 may calculate and display various metrics as a result of the A/B experiment (e.g., percentage improvement in page views, click-through rate, unique visitors, etc., when the treatment is applied), where such statistics are generated based on the triggered population for the treatment (rather than the targeted population for the treatment), consistent with various embodiments described herein.
Certain embodiments are described herein as including logic or a number of components, modules, or mechanisms. Modules may constitute either software modules (e.g., code embodied (1) on a non-transitory machine-readable medium or (2) in a transmission signal) or hardware-implemented modules. A hardware-implemented module is a tangible unit capable of performing certain operations and may be configured or arranged in a certain manner. In example embodiments, one or more computer systems (e.g., a standalone, client or server computer system) or one or more processors may be configured by software (e.g., an application or application portion) as a hardware-implemented module that operates to perform certain operations as described herein.
In various embodiments, a hardware-implemented module may be implemented mechanically or electronically. For example, a hardware-implemented module may comprise dedicated circuitry or logic that is permanently configured (e.g., as a special-purpose processor, such as a field programmable gate array (FPGA) or an application-specific integrated circuit (ASIC)) to perform certain operations. A hardware-implemented module may also comprise programmable logic or circuitry (e.g., as encompassed within a general-purpose processor or other programmable processor) that is temporarily configured by software to perform certain operations. It will be appreciated that the decision to implement a hardware-implemented module mechanically, in dedicated and permanently configured circuitry, or in temporarily configured circuitry (e.g., configured by software) may be driven by cost and time considerations.
Accordingly, the term “hardware-implemented module” should be understood to encompass a tangible entity, be that an entity that is physically constructed, permanently configured (e.g., hardwired) or temporarily or transitorily configured (e.g., programmed) to operate in a certain manner and/or to perform certain operations described herein. Considering embodiments in which hardware-implemented modules are temporarily configured (e.g., programmed), each of the hardware-implemented modules need not be configured or instantiated at any one instance in time. For example, where the hardware-implemented modules comprise a general-purpose processor configured using software, the general-purpose processor may be configured as respective different hardware-implemented modules at different times. Software may accordingly configure a processor, for example, to constitute a particular hardware-implemented module at one instance of time and to constitute a different hardware-implemented module at a different instance of time.
Hardware-implemented modules can provide information to, and receive information from, other hardware-implemented modules. Accordingly, the described hardware-implemented modules may be regarded as being communicatively coupled. Where multiple of such hardware-implemented modules exist contemporaneously, communications may be achieved through signal transmission (e.g., over appropriate circuits and buses) that connect the hardware-implemented modules. In embodiments in which multiple hardware-implemented modules are configured or instantiated at different times, communications between such hardware-implemented modules may be achieved, for example, through the storage and retrieval of information in memory structures to which the multiple hardware-implemented modules have access. For example, one hardware-implemented module may perform an operation, and store the output of that operation in a memory device to which it is communicatively coupled. A further hardware-implemented module may then, at a later time, access the memory device to retrieve and process the stored output. Hardware-implemented modules may also initiate communications with input or output devices, and can operate on a resource (e.g., a collection of information).
The various operations of example methods described herein may be performed, at least partially, by one or more processors that are temporarily configured (e.g., by software) or permanently configured to perform the relevant operations. Whether temporarily or permanently configured, such processors may constitute processor-implemented modules that operate to perform one or more operations or functions. The modules referred to herein may, in some example embodiments, comprise processor-implemented modules.
Similarly, the methods described herein may be at least partially processor-implemented. For example, at least some of the operations of a method may be performed by one or processors or processor-implemented modules. The performance of certain of the operations may be distributed among the one or more processors, not only residing within a single machine, but deployed across a number of machines. In some example embodiments, the processor or processors may be located in a single location (e.g., within a home environment, an office environment or as a server farm), while in other embodiments the processors may be distributed across a number of locations.
The one or more processors may also operate to support performance of the relevant operations in a “cloud computing” environment or as a “software as a service” (SaaS). For example, at least some of the operations may be performed by a group of computers (as examples of machines including processors), these operations being accessible via a network (e.g., the Internet) and via one or more appropriate interfaces (e.g., Application Program Interfaces (APIs).)
Example embodiments may be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. Example embodiments may be implemented using a computer program product, e.g., a computer program tangibly embodied in an information carrier, e.g., in a machine-readable medium for execution by, or to control the operation of, data processing apparatus, e.g., a programmable processor, a computer, or multiple computers.
A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.
In example embodiments, operations may be performed by one or more programmable processors executing a computer program to perform functions by operating on input data and generating output. Method operations can also be performed by, and apparatus of example embodiments may be implemented as, special purpose logic circuitry, e.g., a field programmable gate array (FPGA) or an application-specific integrated circuit (ASIC).
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. In embodiments deploying a programmable computing system, it will be appreciated that that both hardware and software architectures require consideration. Specifically, it will be appreciated that the choice of whether to implement certain functionality in permanently configured hardware (e.g., an ASIC), in temporarily configured hardware (e.g., a combination of software and a programmable processor), or a combination of permanently and temporarily configured hardware may be a design choice. Below are set out hardware (e.g., machine) and software architectures that may be deployed, in various example embodiments.
The example computer system 800 includes a processor 802 (e.g., a central processing unit (CPU), a graphics processing unit (GPU) or both), a main memory 804 and a static memory 806, which communicate with each other via a bus 808. The computer system 800 may further include a video display unit 810 (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). The computer system 800 also includes an alphanumeric input device 812 (e.g., a keyboard or a touch-sensitive display screen), a user interface (UI) navigation device 814 (e.g., a mouse), a disk drive unit 816, a signal generation device 818 (e.g., a speaker) and a network interface device 820.
The disk drive unit 816 includes a machine-readable medium 822 on which is stored one or more sets of instructions and data structures (e.g., software) 824 embodying or utilized by any one or more of the methodologies or functions described herein. The instructions 824 may also reside, completely or at least partially, within the main memory 804 and/or within the processor 802 during execution thereof by the computer system 800, the main memory 804 and the processor 802 also constituting machine-readable media.
While the machine-readable medium 822 is shown in an example embodiment to be a single medium, the term “machine-readable medium” may include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more instructions or data structures. The term “machine-readable medium” shall also be taken to include any tangible medium that is capable of storing, encoding or carrying instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the present disclosure, or that is capable of storing, encoding or carrying data structures utilized by or associated with such instructions. The term “machine-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, and optical and magnetic media. Specific examples of machine-readable media include non-volatile memory, including by way of example semiconductor memory devices, e.g., Erasable Programmable Read-Only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.
The instructions 824 may further be transmitted or received over a communications network 826 using a transmission medium. The instructions 824 may be transmitted using the network interface device 820 and any one of a number of well-known transfer protocols (e.g., HTTP). Examples of communication networks include a local area network (“LAN”), a wide area network (“WAN”), the Internet, mobile telephone networks, Plain Old Telephone (POTS) networks, and wireless data networks (e.g., WiFi, LTE, and WiMAX networks). The term “transmission medium” shall be taken to include any intangible medium that is capable of storing, encoding or carrying instructions for execution by the machine, and includes digital or analog communications signals or other intangible media to facilitate communication of such software.
Although an embodiment has been described with reference to specific example embodiments, it will be evident that various modifications and changes may be made to these embodiments without departing from the broader spirit and scope of the invention. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense. The accompanying drawings that form a part hereof, show by way of illustration, and not of limitation, specific embodiments in which the subject matter may be practiced. The embodiments illustrated are described in sufficient detail to enable those skilled in the art to practice the teachings disclosed herein. Other embodiments may be utilized and derived therefrom, such that structural and logical substitutions and changes may be made without departing from the scope of this disclosure. This Detailed Description, therefore, is not to be taken in a limiting sense, and the scope of various embodiments is defined only by the appended claims, along with the full range of equivalents to which such claims are entitled.
Such embodiments of the inventive subject matter may be referred to herein, individually and/or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single invention or inventive concept if more than one is in fact disclosed. Thus, although specific embodiments have been illustrated and described herein, it should be appreciated that any arrangement calculated to achieve the same purpose may be substituted for the specific embodiments shown. This disclosure is intended to cover any and all adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the above description.
This application claims the benefit of priority to U.S. Provisional Application Ser. No. 62/126,169, filed Feb. 27, 2015, and U.S. Provisional Application Ser. No. 62/140,366, filed Mar. 30, 2015, which are incorporated herein by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
62126169 | Feb 2015 | US | |
62140366 | Mar 2015 | US |