When deciding which place to visit, such as dining out, finding things to do on a weekend, going shopping, etc., users use various information to help them make a decision. They may look at ratings and reviews of a place, consult their friends or family, or rely on third party rankings to formulate an opinion of a place. This process can be lengthy and time-consuming as this information is meant to be consumed by everyone, and opinions vary from one person to the next. A user may look at a rating or a review and wonder if the reviewer has similar taste to them or cares about the same things.
According to the present disclosure, a personalized score is provided for a place that the user may want to visit. The score is computed based on at least one of inferred or explicit parameters, using machine learning. The score may be displayed to the user in connection with the place, and in some examples the user may also view the underlying factors that resulted in the score. Because each user is unique, the score may be different for one person than for another. Accordingly, when a group of friends are deciding on a place to visit, such as a place to eat, the personalized score for a given restaurant may be higher for a first user than for a second user.
In addition to providing personalized scores, the systems and methods described herein provide a complete view of all the places the user might want to know about. Thus, for example, the user may look at any arbitrary place (e.g., a place they heard about from someone else, read about in an article, saw in an ad, etc.) and understand whether the place matches them well. This becomes more important when trying to make a joint decision with multiple people. Each individual user may want to know if a place would be ok for them, even though it's not the best match for them. Each user may also want to know if a place is a bad match for them, or if it violates one of their restrictions.
One aspect of the disclosure provides a method for providing a personal score for a place. The method includes identifying, with one or more processors, one or more places of potential interest to a user, identifying, with the one or more processors, user preferences, determining, with the one or more processors, a personal score for one or more of the places, the personal score being generated based on the identified user preferences, and providing for display, with the one or more processors, the personal score for the one or more of the places in association with information about the place. According to some examples, the method may further include receiving a request, matching the one or more places of potential interest to the request, and sorting the places matching the request based on the personal scores. The user preferences may include implicit preferences inferred by information passively collected from the user with the user's authorization and/or explicit preferences entered by the user through a user interface. Determining the personal score may include applying a machine learning model. According to some examples, a set of explanations may also be generated and provided for display, wherein the set of explanations indicate reasons the user may like the one or more places.
Another aspect of the disclosure provides a system for providing a personal score for a place, including one or more memories storing preferences of the user, and one or more processors in communication with the one or more memories. The one or more processors may be configured to receive a request for a place, identify one or more places matching the request, identify user preferences, determine a personal score for one or more of the place matching the request, the personal score being generated based on the identified user preferences, and provide for display the personal score for the one or more of the place matching the request in association with information about the place matching the request.
Yet another aspect of the disclosure provides a method for constructing a machine learning model to generate a personal score for a place, the personal score based on preferences of a given user. The method may include accessing data from multiple sources, generating, using the accessed data, a user table including user visit data and online place interactions, generating, using the accessed data, a place table including an identification of places matching a particular set of criteria and place-level attributes used for identifying preferences, creating a lookup table associating a user identifier to samples of places, the samples being places for which the user has indicated interest or disinterest, joining the lookup table to the user table, and training a model to predict a personal score for any given place using the joined tables. According to some examples, the method may further include computing a personal score using the model, receiving survey results relating to an accuracy of the computed personal score, and modifying the model based on the survey results.
The systems and methods described herein predict how well a place matches a users' tastes and preferences. The user's personal preferences are inferred using implicit signals. In some examples, explicit preferences are also collected directly from the users. These user preferences are then matched against the details of a place using a trained machine learning model to predict how well a place matches a user's taste in the form of a score. This score, as well as explanations for why the score is high or low, is provided to the user. The explanations may include, for example, that the place matched a preference the user liked or matched a preference the user disliked.
As just one example, the place for which the user is searching may be a restaurant. A profile for the user may be generated, and various parts of the profile may be computed. These various parts may include attributes like dietary restrictions, cuisine preferences, ambiance preferences, budget sensitivity, etc. The profile may also include any interactions the user may have had with a place, including but not limited to the users' location and web/search history, whether the user saved or bookmarked a place, if they called or navigated to the place, as well as any reviews, ratings, or photos uploaded for the place. Such information may be used to infer places the user likes or visited often.
Client computing devices which are used for collecting implicit signals each have a privacy setting, which must be set to authorize such reporting. For example, the user of the client computing devices has an option to turn such reporting on or off, and may have an option to select which types of information are reported and which types are not. By way of example only, the user may allow reporting of particular locations visited, but not all locations. Moreover, privacy protections are provided for any data transmitted by the mobile device, including, for example, anonymization of personally identifiable information, aggregation of data, filtering of sensitive information, encryption, hashing or filtering of sensitive information to remove personal attributes, time limitations on storage of information, or limitations on data use or sharing. Rather than using any personal information to uniquely identify a mobile device, a cryptographic hash of a unique identifier may be used.
For many of the attributes that are inferred, a mechanism is provided for users to provide, confirm, change, or delete the specific preference. The values can be ‘like’, ‘dislike’, ‘neutral’, ‘must have’, ‘must not have’, etc. depending on the attribute. Profiles may also be computed for each place to describe what type of place it is. For example, each restaurant may have a profile that describes it by the type of cuisine served, whether it caters to specific diets, or specific dishes it may have. The restaurant profile may also include information about its ambiance, how similar it is to another restaurant, price level, etc. A list of exemplars may be generated for the user's most favorites places based on visits, the user's ratings/reviews of a place, saving the place to their favorite's list, etc. This can be further extended to build a comprehensive user-place graph that indicates a user's affinity to each place the user has interacted with in the past. These place affinities can be used to further determine how likely a user will enjoy visiting a similar place. Place similarities can be based on similarities between the place profiles (e.g. similar menus, price, ratings, ambience, descriptions, reviews, etc.), or using collaborative filtering techniques to determine if similar types of users visit both places.
While the examples above relate to restaurants, it should be understood that personalized scores may be generated for any number of different types of places. For example, the scores may be generated for stores (e.g., clothing stores, grocery stores, electronic stores, etc.), hotels, attractions (e.g., museums, amusement parks, etc.), events (e.g., concerts, sporting events, street fairs, etc.), gas stations, or any other points of interest. Scores may also be generated for more general areas including many points of interest, such as for particular cities, malls, etc.
A machine learning (ML) model is trained to predict how much a user would like a place. The labels can be collected explicitly through survey questions, in-app feedback mechanisms, or other rating flows where we ask users directly how much they like a place. The labels can also be based on other proxy signals like place visits (Location history) or place clicks (Web/Search history.) Each training example consist of a single label which may be survey response, rating, visit, or click, etc. The input features to the ML model includes all the implicit and explicit user preferences described above. They also include details about the place, as described above. The output of the model includes a score that indicates the user's affinity to a place and how well the place matches the user's personal preferences, as well as explanations for which specific features contributed the most to the final score. For example, a place would receive a high score if the user is vegetarian and the restaurant is popular among vegetarians. Conversely, a steakhouse would receive a low score for a vegetarian user.
The advantage of solving the problem in this manner is allowing a wide range of users to benefit from the recommendation model. A user may passively provide information by authorizing reporting of their location and/or search history. The user's preferences or places they like can then be automatically inferred by observing what places they visit, click on, or get directions to. On the other hand, a user who does not authorize reporting of their history can still explicitly provide their preferences by setting their preference values, and by providing direct feedback about whether they like a place or not (e.g. ratings, reviews, starring, etc.). In addition to inferring the score, a list of personalized justifications is provided for why the user might like or dislike a place. These may be directly related to the user profile, such as “Because you like <cuisine X>” or “Similar to <place Y that the user likes>”, or they could be combined with other sources of information to make the justification more colorful, such as top lists like “Top 10 places in SF for <cuisine X>” if the user likes <cuisine X>. Such top lists could come from third-party publications or they can be generated algorithmically. These justifications will help users in understanding the reasons for which a place is recommended. They also provide the user with the option to update their profile if the inference is incorrect or their preferences have changed.
The personalized scoring system and methods may be implemented in any of a number of applications, such as search applications, map/navigation applications, scheduling applications, dining/shopping applications, etc. For example, the score will show when a user makes a categorical query (e.g. “restaurants near me”), or if they search for a specific place. The score may be shown next to other place details like user ratings, number of reviews, price info, etc. The user may also be able to click on the score to get a detail page explaining how the score was computed (e.g. the list of justifications). In other examples, such as in map applications, the score may be shown in the map itself, next to the place markers. In addition to just displaying this score, places may be ranked by this score for recommendation purposes. For example, when providing a listing responsive to the search for “restaurants near me,” places with higher personalized scores may be ranked higher.
In addition to the foregoing example implementations, the system and method may be used to send recommendations to users for any of a number of applications or other products, now known or future developed. The scores may be used in conjunction with other business logic for deciding when recommendations should be made. These scores could also be used to decide which point of interests to show to the user on a map, such as by surfacing the points of interest the user is most likely going to visit on the map view. The score could also be used for personalizing operation of an electronic assistant device. For example, the assistant device can recommend a restaurant to visit or for food ordering, or it can recommend things to do on a weekend, such as visit a park, take a pottery class, etc. The assistant can also make recommendations for helping the user explore a new area, such as “check out the local hiking trail,” or “visit the local favorite bar.” It should be understood that these are merely examples of numerous possible implementations, and should not be considered as limiting.
Image section 110 may include an image relevant to the place. For example, for Restaurant B the image may be of the inside or outside of the restaurant, a particular dish served at the restaurant, etc.
Summary information section 120 may include a variety of information describing the place. By way of example only, for Restaurant B such information includes a rating 121, a price category 122, a distance 123 from a particular location (e.g., the user's location), a categorization 124 of the type of food served, and the personal score 125. The summary section 120 may also include one or more links 126, facilitating actions by the user in connection with the place. For example, the links 126 may enable the user to call the place, get directions to the place, visit a website of the place, reserve a table at the place, save the place to one or more personal lists, etc. The foregoing are merely examples, and it should be understood that the summary information section 120 may include any of a number of other types of information. For example, the summary information section 120 may also include text, such as a listing of the operating hours of the Restaurant B.
Detailed information section 130 may include further information related to the place. In some examples, such information may correspond to information in the summary section 120. For example, reviews of the place which correspond to the rating 121 may be provided. Other examples of detailed information include descriptions, photos, and mentions of the place in other media, such as news or third party rankings.
While a number of example sections are described above in connection with
Memory 230 can also include data 232 that can be retrieved, manipulated or stored by the processor. The memory can be of any non-transitory type capable of storing information accessible by the processor, such as a hard-drive, memory card, ROM, RAM, DVD, CD-ROM, write-capable, and read-only memories.
The instructions 234 can be any set of instructions to be executed directly, such as machine code, or indirectly, such as scripts, by the one or more processors. In that regard, the terms “instructions,” “application,” “steps,” and “programs” can be used interchangeably herein. The instructions can be stored in object code format for direct processing by a processor, or in any other computing device language including scripts or collections of independent source code modules that are interpreted on demand or compiled in advance. Functions, methods, and routines of the instructions are explained in more detail below.
Data 232 may be retrieved, stored or modified by the one or more processors 220 in accordance with the instructions 234. For instance, although the subject matter described herein is not limited by any particular data structure, the data can be stored in computer registers, in a relational database as a table having many different fields and records, or XML documents. The data can also be formatted in any computing device-readable format such as, but not limited to, binary values, ASCII or Unicode. Moreover, the data can comprise any information sufficient to identify the relevant information, such as numbers, descriptive text, proprietary codes, pointers, references to data stored in other memories such as at other network locations, or information that is used by a function to calculate the relevant data.
The one or more processors 220 can be any conventional processors, such as a commercially available CPU. Alternatively, the processors can be dedicated components such as an application specific integrated circuit (“ASIC”) or other hardware-based processor. Although not necessary, one or more of computing devices 210 may include specialized hardware components to perform specific computing processes, such as decoding video, matching video frames with images, distorting videos, encoding distorted videos, etc. faster or more efficiently.
Although
Each of the computing devices 210, 260, 270 can be at different nodes of a network 250 and capable of directly and indirectly communicating with other nodes of network 250. Although only a few computing devices are depicted in
As an example, each of the computing devices 210 may include web servers capable of communicating with storage system 240 as well as computing devices 260, 270 via the network 250. For example, one or more of server computing devices 210 may use network 250 to transmit and present information to a user on a display, such as display 265 of computing device 260. In this regard, computing devices 260, 270 may be considered client computing devices and may perform all or some of the features described herein.
Each of the client computing devices 260, 270 may be configured similarly to the server computing devices 210, with one or more processors, memory and instructions as described above. Each client computing device 260, 270 may be a personal computing device intended for use by a user, and have all of the components normally used in connection with a personal computing device such as a central processing unit (CPU), memory (e.g., RAM and internal hard drives) storing data and instructions, a display such as display 265 (e.g., a monitor having a screen, a touch-screen, a projector, a television, or other device that is operable to display information), and user input device 266 (e.g., a mouse, keyboard, touch-screen, or microphone). The client computing device may also include a camera 267 for recording video streams and/or capturing images, speakers, a network interface device, and all of the components used for connecting these elements to one another. The client computing device 260 may also include a location determination system, such as a GPS 268. Other examples of location determination systems may determine location based on wireless access signal strength, images of geographic objects such as landmarks, semantic indicators such as light or noise level, etc.
Although the client computing devices 260, 270 may each comprise a full-sized personal computing device, they may alternatively comprise mobile computing devices capable of wirelessly exchanging data with a server over a network such as the Internet. By way of example only, client computing device 260 may be a mobile phone or a device such as a wireless-enabled PDA, a tablet PC, a netbook, a smart watch, a head-mounted computing system, or any other device that is capable of obtaining information via the Internet. As an example the user may input information using a small keyboard, a keypad, microphone, using visual signals with a camera, or a touch screen.
As with memory 230, storage system 240 can be any type of computerized storage capable of storing information accessible by the server computing devices 210, such as a hard-drive, memory card, ROM, RAM, DVD, CD-ROM, write-capable, and read-only memories. In addition, storage system 240 may include a distributed storage system where data is stored on a plurality of different storage devices which may be physically located at the same or different geographic locations. Storage system 240 may be connected to the computing devices via the network 250 as shown in
Storage system 240 may store data, such as maps, information associated with different places, user preferences, etc. Using the stored data, the computing devices 110 may determine personal scores for places, the personal score being tailored to each user.
Places responsive to the user's search, such as places serving dinner within a predetermined geographical range of the user's location, are identified. The search results may also be represented on the map 315, such as by marker points. Though not shown in
As shown, the search results may also be listed below the map, and the listing includes the personal scores. In this particular example, the results are shown as listed in order of highest personal score. Restaurant B most closely matches the explicit and/or implicit preferences of the user 305, and thus is listed at the top with a high personal score of 93%. A next closest match to the user's preferences is Restaurant X, with a personal score of 81%. Further matches, with descending scores, may be listed below Restaurant X and may be viewed by scrolling or the like.
While a number of example parameters are shown in
According to some examples, a combined personal score may be generated for two or more users. For example, two friends may be interested in meeting at a restaurant for dinner, but the friends may have different personal preferences. As such, a restaurant that has a high personal score for one user may have a low personal score for the other user. To accommodate the two friends, a combined personal score may take into account factors based on both users' preferences, both positive and negative. For example, a first user may identify a second user by information associated with an account of the second user. Such information may include, for example, a unique identifier, an email address, a username, or any other unique information. Once the first and second user are identified, a combined personal score may be generated. The combined personal score may be provided alongside other information, such as individual personal score for the first and/or second users. The combined personal score may be provided through any of a number of applications, such as a communication application that connects the first user and the second user.
The personal score may be computed using machine learning. According to some examples, the process is split into different phases, including feature extraction, training example generation, model training and evaluation.
As mentioned previously, the user, place, context and training data can come from various sources. User data may include inferred and explicit user preferences, their visited places, ratings and reviews they have posted, places they have bookmarked or saved, etc. Place data may include average star ratings, reviews from the general public, photos of the place, webpages mentioning the place, price level, menu items/cuisines served, etc. Context data may include time of day, day of the week, season, weather, if the user is traveling, or if the user is making plans with other people. Finally, training data can come from visit history, web and search activities, survey responses, user's ratings, etc.
The training data used for the model may include both positive and negative factors. Positive factors may include, for example, explicitly answering they like the place in a survey, the number of times the user previously visited the place or similar places, searching for the place, giving a high rating or other factors indicating a user's likely interest in the place. Negative factors may include direct signals like answering they don't like the place in a survey, giving a low rating, or inferred from the fact they never visited or interacted with the place even though it is close to other places the user has visited. It should be understood that other training data may be used in addition or in the alternative to the training data described above.
The set of feature extractors take the joined user/place/context data and outputs a set of machine learning features which could be scalars, category labels, or other suitable values for inputting into a machine learning model. The features could depend solely on user data, solely on place data, or on the combination of all the data. For example, a feature extracted from only user data could be the frequency the user dines out. This indicates a prior on how likely a user would like visiting any restaurant. A feature based solely on place data could be the places' average star rating or number of visitors, indicating the place's general popularity. A feature based on the combination of user and place data could be the user's preference for particular cuisines or menu items served at the place. Another example could be how similar the place is to one of the user's favorite restaurants. Additional context data could further refine the feature to indicate if users have different preferences in different contexts, for example preferring convenient locations when it's raining, or preferring touristy places while traveling.
The label extractor takes the training data and outputs a single label for each example which again could be a scalar, category label, or other suitable values for machine learning models. As an example, the label could be the number of times the user has visited the place, or the label could be the response the user selected when asked how much they like the place in a survey.
Depending on the set of features and labels selected, different machine learning models may be used. Such models may be trained in parallel. Some features may be common across models, while each model may have its own specific features. A shared feature extractor set may be developed. Each model may then select the desired subset of extractors. Similarly, different models may share the same label extractor or use different ones. As an example, the machine learning models may be a linear regression or deep neural network model that predicts how many times a user would visit a place. As another example, the model could be an ordinal regression model that predicts what the user would answer when asked how much they like the place in a survey.
Based on the personal score and explanations shown to the user, the user may provide feedback or use other explicit control to adjust their preferences. This could allow the user to have more fine-grained control over their own data and improve the accuracy of the predicted personal scores.
Signals for the machine learning model may include both personalized and contextual signals, such that the model can intelligently predict what the user would prefer in a particular context. Examples of such signals include cuisine preferences represented as scalar values, similar places, location, weather, time, dietary restrictions, similarity to other saved, visited, or highly rated places, budgetary category, or any of a number of other factors.
Using one or more of the models, a personal score is generated. By way of example only, the score may be computed using linear regression.
During the training and evaluation phase, the machine learning models may be evaluated using survey and/or user generated data. For example, surveys may be provided to real users to ask their opinions on places recommended by the model, as well as their opinions on personal scores generated for them for the places. User generated data, such as ratings, reviews, lists of the user's saved or favorited places, etc., may also be used by itself or in combination with the surveys.
Evaluation of the machine learning models may focus on various metrics. For example, the evaluation metrics may focus on the ranking of a particular place relative to other places. Additionally or alternatively, the evaluation metrics may focus on the personal score generated for a particular place.
Results from the machine learning model may also be tuned using, for example, explicit user preferences. For example, as mentioned above, the user may be presented with a personal score for a particular place along with an explanation of the factors that resulted in generation of that personal score. The user may edit one or more factors, for example, by changing budgetary preferences, cuisine preferences, or any of the factors. In such case, an updated score may be generated.
Users may interact with their personal scores, such as to view the underlying factors or to provide explicit preferences, through a user interface. Examples of different aspects of the user interface are provided in
Moreover, as shown in
In some examples, a “how is this calculated” section may also be provided. This section may provide an explanation of the different types of information reported and used to compute the personal score. Additionally or alternatively, this section may provide links to edit the user's preferences. For example, individual links may be provided to update discrete information, such as turning location history reporting on or off. A general link, e.g., “Update Your Preferences,” may also be provided to bring the user to a preference editing section, such as those described below in connection with
Each option may be marked by the user as a positive or negative preference, which may be reflected using a positive or negative indicator. The positive or negative indicator may include any of a number of different representations, such as coloring/shading, graphics (e.g., check mark, “x”, circle with a line through it, etc.), or other representation. As shown in the example of
While
As mentioned above in connection with
Another type of feedback includes surveys.
In
Further to the example systems described above, example methods are now described. Such methods may be performed using the systems described above, modifications thereof, or any of a variety of systems having different configurations. It should be understood that the operations involved in the following methods need not be performed in the precise order described. Rather, various operations may be handled in a different order or simultaneously, and operations may be added or omitted.
In block 2320, places matching the request are identified. By way of example, all places within a predetermined geographical range may be identified as candidates for recommendations to the user. In some examples, the places identified as matching the request may be indicated to the user on a map or in any other form.
In block 2330, user preferences are identified. The user preferences may be relevant to the type of place requested. For example, if the type of place requested is a place to eat food, the identified user preferences may relate to cuisine, budget, ambiance etc. If the type of place requested is a clothing store, the user preferences may relate to budget, style, etc. The user preferences may be identified from a larger set of user preferences that are stored. For example, if a user authorizes location reporting or web history reporting, the user preferences may include inferences based on locations or websites previously visited by the user. The user preferences may also include explicit preferences entered by the user. Such explicit preferences may be entered at any time before or after the request is entered. In some examples, the user preferences may also include restrictions. For example, if a user cannot go to a particular type of restaurant because of a food allergy, such food allergy may be identified as a restriction. Various other types of restrictions are also possible.
In block 2340, a personal score is generated for one or more of the places matching the request. The personal score is specific to the user, generated based on the identified user preferences. The personal score may be generated using a machine learning model, such as described above in connection with
In block 2350, places matching the request may optionally be sorted by the personal score. For example, the results may be sorted from highest personal score to lowest. In some examples, the sort may be based on a plurality of factors, such as personal score in combination with location. Further, matches violating a restriction may be filtered out of the results.
In block 2360, the personal scores are provided for display, such as by transmitting to the user's device. For example, the personal score for a particular result may be provided in association with other information about the particular result.
It should be understood that the method described above is merely an example, and that other methods may also be implemented. For example, recommendations may be actively sent to a user, without receiving a request from the user. For example, periodic (e.g., weekly) suggestions may be sent to the user based on places with high personal scores in areas of interest to the user.
Unless otherwise stated, the foregoing alternative examples are not mutually exclusive, but may be implemented in various combinations to achieve unique advantages. As these and other variations and combinations of the features discussed above can be utilized without departing from the subject matter defined by the claims, the foregoing description of the embodiments should be taken by way of illustration rather than by way of limitation of the subject matter defined by the claims. In addition, the provision of the examples described herein, as well as clauses phrased as “such as,” “including” and the like, should not be interpreted as limiting the subject matter of the claims to the specific examples; rather, the examples are intended to illustrate only one of many possible embodiments. Further, the same reference numbers in different drawings can identify the same or similar elements.
The present application claims the benefit of the filing date of U.S. Provisional Patent Application No. 62/667,952 filed May 7, 2018, the disclosure of which is hereby incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
62667952 | May 2018 | US |