Aspects of the presently disclosed technology relate generally to risk assessment techniques and more particularly to generating a personality profile for calculating a risk assessment based on digital footprint from a plurality of disparate sources.
Service provider platforms generally leverage end user preferences for targeting or offering services to end users. For example, some service provider platforms may utilize demographics to understand a target market of users and offer services accordingly. In some instances, a service provider platform may analyze user preferences on an individual level to offer services to a specific user. However, this information is often one-dimensional, providing limited insight into the individual. For instance, a video streaming service may generally determine which videos to recommend to a user based on a viewing history of the user and other users. As such, any understanding of the individual user is limited to a likelihood that a user will view certain videos. Such limited insight is insufficient for complex services, such as risk assessment, involving various facets of an individual. It is with these observations in mind, among others, that various aspects of the present disclosure were conceived and developed.
Implementations described and claimed herein address the foregoing by providing systems and methods for behavior assessment for an individual. In one implementation, user data for the individual is obtained from one or more digital sources. Categorized user data is created by transforming the user data into a platform independent format. The categorized user data is associated with a plurality of content-based bins. One or more behavioral insight categories are determined from the categorized user data. A plurality of behavioral metrics is determined based on the one or more behavioral insight categories and the categorized user data. A personality profile for the individual is generated by converting the plurality of behavioral metrics into one or more scores. A risk assessment for the individual is generated based on the personality profile.
Other implementations are also described and recited herein. Further, while multiple implementations are disclosed, still other implementations of the presently disclosed technology will become apparent to those skilled in the art from the following detailed description, which shows and describes illustrative implementations of the presently disclosed technology. As will be realized, the presently disclosed technology is capable of modifications in various aspects, all without departing from the spirit and scope of the presently disclosed technology. Accordingly, the drawings and detailed description are to be regarded as illustrative in nature and not limiting.
Aspects of the present disclosure involve systems and methods for generating a personality profile from a digital footprint of an individual. The digital footprint may be based on user data from a variety of disparate sources. Generally, user data is collected from a plurality of sources at varying frequencies, using differing schemas, and having various characteristics. The presently disclosed technology generates a personality profile providing an understanding of an individual, including behavior.
The user data can include multiple data files in different formats. In one aspect, the system categorizes the user data into one or more content-based bins (e.g., a location bin, a purchases bin, an activity bin, a fitness bin, a social network bin, and/or a personally identifiably information (PII) bin), standardizing the data from the different, third-party platforms. Stated differently, the system transforms and normalizes the data into a platform independent data format, permitting data from disparate sources to be aggregated and categorized. By way of example, the system recognizes that a first data format for a first platform (e.g., a social media platform) has a first data structure defining a “connection” and a second data format for a second platform (e.g., for a gaming platform) has a second data structure defining a “friend.” The system normalizes these different data formats by recognizing that the “connection” of the first data format and the “friend” of the second data format represent similar underlying information despite the different data formats. As such, the two different data structures may be transformed and normalized into a platform independent data format and categorized into one or more common content-based bins (e.g., social bin and the PII bin).
The data from different platforms may be aggregated and categorized into different formats into the content-based bins using a multi-data source personality profile generator. In one example, the multi-data source personality profile generator generates a content hierarchy (e.g., including various categories and sub-categories of the) and an activity timeline (e.g., action items in a chronological sequence) using the categorized user data. Patterns from the content hierarchy and the activity timeline can be identified, combined, and leveraged to generate a plurality of behavioral insight categories and behavioral metrics corresponding to the behavioral insight categories. The plurality of behavioral insight categories and behavioral metrics may be used to generate a knowledge graph (e.g., via a JavaScript Object Notation (JSON) data structure) providing a multi-dimensional representation of a specific individual. For instance, the behavioral insight categories and behavioral metrics are used to calculate lifestyle index values and brand personality values.
The presently disclosed technology generates a digital footprint for a specific individual in a platform independent format by transforming data obtained from disparate sources in different digital formats. The digital footprint may be used to generate a knowledge graph and/or personality profile representing a sophisticated understanding about the behavior of the specific individual. Accordingly, the presently disclosed technology provides a comprehensive understanding of how the behavior of the specific individual affects risk. For example, restaurant visit data (e.g., aggregated from different digital sources) can be used to generate the content hierarchy (e.g., based on content items of the food orders), the activity timeline (e.g., based on action items representing when the restaurant visits occurred), and/or multiple behavioral dimensions for risk assessment, such as a mobility dimension indicating a driving risk and a health dimension indicating a health risk. A JSON data structure of such categorized data provides accurate binning and fast retrieval of the data for analytics purpose (e.g., for presentation at various graphical user interfaces (GUIs)). The framework of the multi-data source personality profile generator thus provides an optimized and sophisticated understanding of and estimations regarding behavior of one or more specific individuals, thereby improving risk assessment for actuarial pricing, data privacy risk assessments, product recommendation placement, target advertisement placement, and/or the like. Additional advantages of the presently disclosed technology will become apparent from the detailed description herein.
To begin a detailed description of an example system 100 for generating a personality profile 102, reference is made to
The data aggregator 112 extracts the data from various different data types (e.g., with different purposes for different applications) and transforms and standardizes the different data types into a common data structure (i.e., a platform independent data format) by generating and associating one or more identifiers corresponding to the content-based bin(s) 114 and/or extracting the data into a previously defined data structure. Action items represented by the user data 104 can be tagged with previously defined labels and/or scoring components to assign a likelihood score to the user data 104 corresponding to the previously defined labels.
In one implementation, a behavioral insight categories generator 120 uses the user data 104 to generate interest category identifiers and content identifiers and, from the interest category identifiers and content identifiers, other identifiers and correlations between action items as part of a personality profile generation process 122. For instance, a plurality of behavior metrics 124 can be determined and used to generate scores (e.g., labels) for a plurality of behavioral dimensions 126 in a many-to-many relationship. Any particular behavior metric 124 can apply to multiple behavioral dimensions and any particular behavioral dimension can receive multiple behavior metrics 124. A health dimension 128, a finance dimension 130, a mobility dimension 132, an interests dimension 134, a sociability dimension 136, a personal identity dimension 138, and/or other dimensions may be generated from behavior metrics 124. Using the behavioral dimensions 126, the personality profile 102 may be generated. For example, the personality profile 102 may be generated by indicating a plurality of lifestyle index values, brand personality values, and/or the like. The personality profile 102 can be viewable using a GUI rendering a visual presentation of the multi-data source personality profile generator 110 at various layers of analysis and detail. For instance, the system 100 can provide a timeline view, a content view, a personality profile summary view, and/or other views as discussed herein.
The personality profile 102 may be optimized by transforming the different data received in various data formats into a highly structured, detailed, consistent, and viewable data structure that is platform agnostic. For instance, the multi-data source personality profile generator 110 can receive (e.g., at the data aggregator 112) first user data in a first data format 140 from a first digital source of the one or more digital sources 106 (e.g., a credit card service); second user data in a second data format 142 from a second digital source of the one or more digital sources 106 (e.g., a social network account); and/or third user data in a third data format 144 from a third digital source of the one or more digital sources 106 (e.g., a wearable device application). The multi-data source personality profile generator 110 can receive the user data 104 at the server device(s) 108 via one or more network connections provided by one or more network(s) 146.
The first data format 140, the second data format 142, and the third data format 144 can be different data formats, different types of data, for different purposes, and arranged in different data structures, yet the data aggregator 112 is still able to convert the user data 104 into highly-functional, inter-operable, categorized user data having a common schema. The data aggregator 112 assess the various portions of the user data 104 files (e.g., a header, a body, a timestamp, a sending/terminating location or address, etc.) and generates one or more tags, labels, categories, and/or other data structures (e.g., indicating supplemental meta-data) to associate with the user data 104 (e.g., action items represented by the user data 104). Converting the user data 104 into the categorized user data can further include associating the action items represented by the user data 104 with one or more predefined categories, such as one or more (e.g., or all) of the content-based bin(s) 114 including a location bin 148, a purchases bin 150, an activity bin 152, a fitness bin 154, a social network bin 156, and/or a PII bin 158. Accordingly, from this categorized user data, the content hierarchy 116, the activity timeline 118, the interest category identifiers, the content identifiers, and/or the behavioral dimensions 126, the personality profile 102 can be constructed.
Turning to
For instance, the user data 104 (e.g., originating from a variety of different digital sources 106) can represent a plurality of action items, and the input data scaling system 202 can generate a plurality of evidence confidence ratings (e.g., between 1 and 3; 1 and 5; between 1 and 10; 1 and 100; 1 and 1000; 0 and 3; 0 and 5, etc.) associated with the plurality of action items, such that the plurality of action items are weighted, scaled, and/or adjusted to increase or decrease the impact of the action items on the analysis downstream of the input data scaling system 202 performed by the multi-data source personality profile generator 110.
The evidence confidence rating 208 can indicate a level of interest (e.g., of a user or specific individual) associated with the user data 104. The input data scaling system 202 can generate the evidence confidence rating 208 using the origin-based interest scale 204, for instance, based on how the action item of the user data 104 originated or how a user caused the action item to occur. The origin-based interest scale 204 can generate a rating between 1 and 100 (or any range) based on what type of activity the user engaged in to perform the action item. In some instances, a search action item can receive a lowest rating (e.g., between 0 and 25); a website visit action item can receive a second lowest rating (e.g., between 25 and 50); a purchase, a location of a visit, a membership or subscription, a health record, a post, a fitness regime, a friend network, or a clicked ad action item can receive a highest rating (e.g., between 75 and 100); and/or a watched, a “liked,” a comment, or a game action item can receive a second highest rating (e.g., between 50 and 75).
Additionally or alternatively, the evidence confidence rating 208 can be generated using the category-based interest scale 206 (e.g., to generate a second evidence confidence rating 208 and/or to generate a single evidence confidence rating 208 in combination with the origin-based interest scale 204). The category-based interest scale 206 can generate the evidence confidence rating 208 based on a behavioral insight category that the input data scaling system 202 determines to be related to the action item. For instance, the category-based interest scale 206 can receive the behavioral insight category from the behavioral insight categories generator 120 in an iterative/recursive feedback loop and weigh action items associated with higher-impact behavioral insight categories higher than action items associated with lower-impact or no behavioral insight categories. In some instances, the user data 104 and/or data generated downstream from the user data 104 can be scaled or weighted at various points in the data flow of the multi-data source personality profile generator 110. For instance, the input data scaling system 202 can include additional interest scales, such as a frequency interest scale to weigh the user data 104 based on a frequency of occurrences of the action items (e.g., based on the content hierarchy 116) and/or behavioral insight categories associated with the action items (e.g., generated by the behavioral insight categories generator 120). One or more activity timeline-based interest scales can weigh or scale the user data 104 based on activity patterns corresponding to the action items represented by the activity timeline 118 (e.g., as discussed in greater detail below regarding
In some examples, the system 200 (e.g., using the data aggregator 112) can categorize the user data 104 into categorized user data by associating the action items with the content-based bin(s) 114. For instance, action items represented by the user data 104 can be categorized and/or tagged into the location bin 148, such as a location services action item, a social media tagging action item, a location visit action item (e.g., generated by an appointment or scheduling application or a food/restaurant reservation application), a ride-share/taxi service, any other application that uses location-sharing functionality (e.g., via a GPS sensor), and the like. The action items represented by the user data 104 can be categorized and/or tagged into the purchases bin 150, such as e-commerce activity action items, browsing data action item, e-receipts (e.g., emails) action items, social media marketplace action items, subscription action items (e.g., for applications, news sites, magazines, based on renewal emails, etc.), credit card service action items, bank statement action items, rides-share/taxi orders, other mobile application services, and the like.
In some examples, action items represented by the user data 104 can be categorized and/or tagged into the activity bin 152. The activity bin 152 can include one or more sub-categories indicating how action item are consumed/performed. For instance, the activity bin 152 can categorize the action item into a read sub-category, a listened sub-category, a watched sub-category, a searched sub-category, a browsed sub-category, an applications/gaming sub-category, and/or a home monitoring sub-category. Action items categorized into the activity bin 152 can include google and other web service action items, music application action items, newspaper and blog action items (e.g., subscriptions), streaming services (e.g., Netflix®, Amazon Prime®, HBO®, Hulu®, and the like), mobile application action items, Internet-of-things action items, and the like. In some examples, action items represented by the user data 104 can be categorized and/or tagged into the fitness bin 154. For instance, the fitness bin 154 can categorize the action items related to health, diet, sleep, fitness, and/or physical activity. Action items categorized into the fitness bin 154 can include fitness device data (e.g., an Android® device, an Apple® device, etc.) action item, a Fitbit® action item (e.g., a steps or daily steps action item, a biking action item, a physical challenge action item, a calories action item, a sleep duration action item, and the like), a gym subscription action item, a gym purchase action item, a diet plan subscription action item, a restaurant visit or reservation action item, and the like.
In some examples, action items represented by the user data 104 can be categorized and/or tagged into the social network bin 156. The social network bin 156 can include action items originating from actions occurring on one or more social networks (e.g. Facebook®, Twitter®, LinkedIn®, Reddit®, Bumble®, etc.), such as a post action item, a target ad placement action item, a target ad selection action item, a like action item, a comment action item, an upvote or downvote action item, a friend association action item, a friend network action item, and the like. In some examples, action items represented by the user data 104 can be categorized and/or tagged into the PII bin 158. For instance, the PII bin 158 can categorize action items related to or that include private information, identifying information, or personal interests information, such as demographic information, education information, employment information, social network account information, genre/category interest information (e.g., based on activity with digital media), services information (e.g., address, phone number, area code, email address, etc.), and the like.
It is to be understood, as indicated above, that any action item in the user data 104 can be associated with one or multiple content-based bin(s) 114, as described above. For instance, an action item corresponding to ordering a rideshare to the gym can be associated with at least the location bin 148 (e.g., arrival/destination GPS data); the purchases bin 150 (e.g., transactional data, credit card data); the activity bin 152 (e.g., mobile application activity data); and the fitness bin 154 (e.g., gym usage data, exercise time data, etc.). Moreover, any action items discussed herein can be used as a predefined action item and/or training action item for identifying matching action items from the user data 104 (e.g., based on a statistical likelihood that the action item from the user data 104 matches the predefined action item). Any action items not listed above may be recognized and categorized by the system 200 based on a similarity comparison (e.g., using supervised machine learning) to identify which of the action items explicitly discussed herein are most similar to the unlisted action item. In some instances, the system 200 can receive the action items by receiving authentication input from the user linking the multi-data source personality profile generator 110 to the various applications and services providing the user data 104 discussed herein.
In some instances, a first action item 302 can represent a first action and can include a plurality of descriptive identifiers such as “bought sports apparel” (“D1”) at a particular time (e.g., January 3) on the activity timeline 118 corresponding the content of the action item and/or the data extracted into the content-based bin(s) 114. Additional data can be associated with the first action item 302 on the activity timeline 118 such as an action frequency corresponding to identical action items, the values generated by the input data scaling system 202, and the like. A second action item (“D2”) can be “searched for live Super Bowl telecast; and a third action item (“D3”) can be “ordered wings.” The activity timeline 118 can also generate account actions, such as a first account action 304 (“AA1”) being “changed Facebook relationships status;” a second account action (“AA2”) being “created LinkedIn® profile;” a third account action (“AA3”) being “became Amazon Prime® member;” a fourth account action (“AA4”) being “subscribed to HBO®;” and/or a fifth account action (“AA5”) being “subscribed to Hulu®.” Moreover, the activity timeline 118 can include additional action items interspersed with the account actions, such as a fourth action item (“D4”) being “searched for roof leaks;” a fifth action item (“D5”) being “posted for lawn mower suggestions,” a sixth action item (“D6”) being “posted for lawn mower suggestions;” a seventh action item (“D7”) being “clicked ads for lawn mower suggestions;” an eight action item (“D8”) being “bought lawn mower;” a ninth action item (“D9”) being “reserved hotel;” a tenth action item (“D10”) being “booked flight;” an eleventh action item (“D11”) being “bought Christmas lights;” a twelfth action item (“D12”) being “bought wall painting;” and/or a thirteenth action item (“D13”) being “posted Merry Christmas and Happy New Year.”
Additionally, in some instances, the action items can include (starting again at a beginning of the activity timeline 118) a fourteenth action item (“D14”) being “search reviews for ‘The Escape Room’;” a fifteenth action item (“D15”) being “watched trailer for ‘The Escape Room’;” a sixteenth action item (“D16”) being “searched for ‘earthquake’,” a seventeenth action item (“D17”) being “donated for earthquake fund;” an eighteenth action item (“D18”) being “searched reviews for “Five Feet Apart’;” a nineteenth action item (“D19”) being “searched reviews for ‘Maiden’;” a twentieth action item (“D20”) being “bought weights;” a twenty-first action item (“D21”) being “searched reviews for ‘Ford vs Ferrari;’” and the like. Moreover, in some examples, the activity timeline 118 includes one or more social network statistics 306, such as aggregations of data values from one or more social network applications (e.g., a number of Facebook® friends; a post frequency rating such as high, medium, or low; a number of LinkedIn® friends; a title of a liked or followed page, and the like).
The action items organized according to the activity timeline 118 and/or the category-based interest scale 206 as discussed herein can be used to identify action patterns, from which behavioral insight categories are determined, as discussed below regarding
Turning to
Furthermore, the behavioral insight categories generator 120 can identify a second action cluster 406 including the fourth action item D4 “searched for roof leaks” and the fifth action item D5 “searched for roof contractor.” The second action cluster 406 is based on both D4 and D5 being related to home renovations at a particular time on the activity timeline 118, and a home renovations behavior insight category is generated and/or used to generate one or more home renovations-related behavior metrics 124. A third action cluster 408 can be based on the sixth action item D6 “posted for lawn mower suggestions;” the seventh action item D7 “clicked adds for lawn mower suggestions;” and the eighth action item Dg “bought lawn mower.” The third action cluster 408 is identified based on D6, D7, and D8, and can correspond to generating a home, home renovation, lawn, and/or lawn mower behavioral insight category. A fourth action cluster 410 can be identified based on the ninth action item D9 “reserved hotel” and the tenth action item D10 “booked flight,” generating a travel behavioral insight category. A fifth action cluster 412 can be identified based on the eleventh action item D11 “bought Christmas lights;” the twelfth action item D12 “bought wall painting;” and the thirteenth action item D13 “posted or commented about merry Christmas and Happy New Year,” generating a home or holiday behavioral insight category. A sixth action cluster 414 can be identified based on the sixteenth action item D16 “searched for earthquake” and the seventeenth action item D17 “donated for earthquake fund,” generating a philanthropy or geography behavioral insight category.
Turning to
The behavioral insight categories generator 120 can generate behavioral insight categories by detecting activity patterns 402 from the activity timeline 118 generated from the categorized user data. The behavioral insight categories generator 120 can detect an action item recurrence 506, such as an action item recurrence amount, an action item frequency, an action item periodicity, and/or a total number of occurrences. For instance, the action item recurrence 506 can include the fourteenth action item D14 “searched reviews for ‘The Escape Room’ and the fifteenth action item D15 “watched trailer for ‘The Escape Room,’ which can be associated with the activity timeline 118 at a first time (e.g., relatively early in the year). The action item recurrence 506 can also be based on the eighteenth action item D18 “searched reviews for ‘Five Feet Apart;’ the fourth account action AA4 “subscribed to HBO®;” the nineteenth action item D19 “searched for “Maiden;’ and/or the twenty-first action item D21 “searched reviews for “Ford vs Ferrari.” In some instances, the multi-data source personality profile generator 110 can identify one or more activity patterns 402, such as an action frequency, an action recurrence amount, an action periodicity, an action-reaction occurrence, a total number of action occurrences, or an action cluster. Any of the behavioral insight categories discussed herein (e.g., regarding
As noted above, the timeline view 504 representing the activity timeline can be presented using the GUI 502 (e.g., of a computing device 1102 as discussed regarding
For instance, turning to
In some examples, the content hierarchy 116 can include a first content hierarchy level 602 that can be a top or a highest hierarchy level. As depicted in
In some instances, the content hierarchy 116 can be presented as the content view 518 at the GUI 502. For example, a selectable GUI element presented at the 502 may cause the system 700 to toggle between presenting the content view 518 and the timeline view 504. The content hierarchy 116, in addition or alternatively to the activity timeline 118, can be a data structure generated by the multi-data source personality profile generator 110 to represent the user data 104, received from the different digital source(s) 106, in a standardized or common format. The content hierarchy 116 can be used by downstream analytics of the multi-data source personality profile generator 110 (e.g., by the personality profile generation process 122) to determine the behavioral insight categories and/or the behavior metrics 124.
Turning to
For example, the behavioral insight categories generator 120 can determine, for a first action item, a first behavioral insight category identifier being a “movies/tv” identifier. The behavioral insight categories generator 120 can determine a first content identifier for the first action item being “searched for ‘Independence Day,’” and a first source being a search website (e.g., Google®). For a second action item, the behavioral insight categories generator 120 can determine a second behavioral insight category identifier is the “movies/tv” identifier; a second content identifier is a “watched ‘The Help”’ identifier; and a second source is a streaming website (e.g., Netflix) rental. For a third action item, the behavioral insight categories generator 120 can determine a third behavioral insight category identifier is the “movies/tv” identifier; a third content identifier is a “rented ‘50 First Dates’” identifier; and a third source is a streaming website (e.g., Amazon® Prime®) purchase.
The system 700 can determine the one or more behavior metrics 124 based on the one or more interest category identifiers 702, the content identifier(s) 704, and/or the sources 706 associated with the action items of the user data 104. For the example depicted in
Turning to
The system 800 can determine, based on the interest category identifiers 802, the content identifiers 804, and/or the sources 806, that the fourth through eighth action items indicate a behavior metric 124 of highly healthy or high interest in maintaining health. The multi-data source personality profile generator 110 can aggregate the multiple action items from across multiple different content-based bins 114 to generate the interest category identifiers 802 and/or the behavior metrics 124. The system 800 can determine the behavior metrics 124 by detecting relations to common concepts or ideas, or to predefined behavior metric templates (e.g., including one or more threshold values and/or (Y/N) data structures). For instance, the personality profile generation process 122 can determine that the fourth through eight content identifiers of the fourth through eight action items relate to health/fitness (e.g., and/or a positive inclination towards health/fitness) and generate the behavior metrics 124 based on this determination.
It should be understood that the various interest category identifiers 702 assessed or aggregated to determine the behavior metrics 124 can be based on action items and/or tags spanning across different content-based bin(s) 114 (e.g., a first action item associated with the purchases bin 150, a second action item associated with the fitness bin 154, etc.). Moreover, a single action item can be associated with multiple content-based bin(s) 114 and/or generate multiple interest category identifiers 702 and/or content identifiers 704. Additionally, a particular behavior metrics 124 can be generated based on action items corresponding to multiple, different interest category identifiers 702, having multiple, different content identifiers 704, and/or multiple different sources 706 (e.g., as depicted in
In some examples, the one or more interest category identifiers 702 corresponding to behavioral insight categories can include one or more of art, culture, entertainment, automobiles, vehicles, news, family, parenting, sports, recreation, hobbies, interests, geography, travel, home, garden, health, fitness, law, government, politics, food, drink, pets, style, fashion, cosmetics, personal care, history, events, human activities, philosophy, finance, education, careers, business, industrial, real estate, religion, spirituality, science, shopping, society, technology, computing, kids, combinations thereof, and the like. The behavior metric 124 can be generated for a particular behavioral insight category upon determining that sufficient evidence to support the behavior metrics 124 exists (e.g., based on supervised machine learning statistical analysis). Once one or more behavior metrics 124 are generated to a sufficient degree of confidence indicating an affinity or aversion, or another relationship, of the specific individual to the behavioral insight categories. The behavior metrics 124 can be used to generate or define the behavioral dimensions 126 of the personality profile 102, as discussed below.
In some examples, the health dimension 128 can be generated based on one or more behavior metrics 124 indicating physical health, mental health, eating habits, and/or fitness. For instance, behavior metrics 124 influencing the physical health component of the health dimension 128 can include a search, like or post of diseases or symptoms, a subscription to a page or group, a health or wellness appointment, a genetic risk calculation, a genetic test, alcohol or drug abuse, medicine purchase, health application usage, and the like. The health dimension 128 behavior metrics 124 influencing the mental health component can include internet and/or mobile addition, sleeping habits, shopping addiction, searches, likes, and posts, subscriptions to groups and/or pages, and the like. The behavior metrics 124 influencing the physical eating habits component of the health dimension 128 can include food orders, restaurant visits, diet plan searches, diet plan subscription and/or page follows, food purchases, organic purchase, search for food, cuisine, or recipe terms, diet plan searches, food purchases on a food delivery service, and the like. The health dimension 128 behavior metrics 124 influencing the fitness component can include mobility data from fitness applications and/or computing devices (e.g., discussed below regarding
Moreover, the finance dimension 130 can be generated based on one or more behavior metrics 124 indicating online purchases, retail purchases, and/or an income estimate. For instance, the behavior metrics 124 influencing the online purchases component of the finance dimension 130 can include purchase at large retail sites (e.g., Amazon®, eBay®, PayPal, and the like), apparel purchase, electronics purchase, online marketplace purchase, and/or food orders (e.g., via mobile food ordering applications). The behavior metrics 124 influencing the retail purchase component of the finance dimension 130 can include store visits and/or store receipts (e.g., based on credit card statements, bank statements, and/or personal finance management applications). The behavior metrics 124 influencing the income estimate component of the finance dimension 130 can include savings account data, checking account data (e.g., including monthly credits and debits), education level, household composition, job description, earnings-to-debt ratio, and the like.
In some instances, the mobility dimension 132 can be generated based on behavior metrics 124 indicating a daily schedule and/or travel. For instance, the behavior metrics 124 influencing the daily schedule component of the mobility dimension 132 can include one or more of a travel time, an idle time, a schedule for a visit, location hopping, day mobility behavior and night mobility behavior, traffic courses, and the like. Additionally, the behavior metrics 124 influencing the travel component of the mobility dimension 132 can include one or more of a vacation plan, a tourist destination, service-based housing, an extended drive, and/or a frequency of visits. Moreover, the system 1000 can determine the mobility dimension 132 based on searches for vacation gear, flight tickets, rental bookings, and/or searches for rental places.
In some examples, the interests dimension 134 can be generated based on google searches, social media posts, page likes, comments, and subscriptions, purchases, and activities (e.g., games). For instance, the interests dimension 134 can be based on the content identifier(s) 704 of the content hierarchy 116. The interests dimension 134 can indicate consumer traits/interests related to one or more of art, culture, entertainment, automobiles, vehicles, news, family, parenting, sports, recreation, hobbies, interests, geography, travel, home, garden, health, fitness, law, government, politics, food, drink, pets, style, fashion, cosmetics, personal care, history, events, human activities, philosophy, finance, education, careers, business, industrial, real estate, religion, spirituality, science, shopping, society, technology, computing, kids, combinations thereof, and the like.
In some instances, the lifestyle index values 1002 can include a variety of indexes with a score, label, or value on a scale or in a range (e.g., Y/N or between 1 and 10, 1 and 100, etc.) associated with a particular lifestyle component. The supervised machine-learning systems can calculate the lifestyle index values 1002 and/or brand personality values 1004 based on analyzing the behavioral dimensions 126. For instance, the system 1000 can generate one or more values corresponding to a sleep index, a fitness index, a health index, an openness index (e.g., indicating awareness/adaptability to new experiences), an online activity index, a mobility index, a sociability index, a financial management index, a vacation index, a career prosects index, an entertainment index, combinations thereof, and the like.
Moreover, the multi-data source personality profile generator 110 can use the behavioral dimensions 126 to generate the brand personality values 1004, which can include one or more scores indicating a sincerity value (e.g., indicating down to earth and/or honest); an excited value (e.g., indicating daringness and/or adventurousness); a competence value (e.g., indicating reliability, integrity, and/or sincerity); a sophistication value (e.g., indicating an upper/middle/lower class status, education level, and/or charming); and/or a ruggedness value The personality profile 102 can be formed of the lifestyle index values 1002 and the brand personality values 1004, as well as the other data structures and/or interfaces discussed herein (e.g., the categorized user data, the content hierarchy 116, the activity timeline 118, the content view 518, and/or the timeline view 504). As noted above the personality profile 102 can be used to generate a risk assessment for insurance policy pricing; a data privacy assessment for a data privacy service; and/or or a product recommendation for a commerce website.
The network environment 1100 also include at least one server device 108 hosting software, application(s), websites, and the like for receiving input data and analyzing the input data to generate the personality profile 102. The multi-data source personality profile generator 110 can receive inputs from various computing devices and transform the received input data into other unique types of data. The server(s) 108 may be a single server, a plurality of servers with each such server being a physical server or a virtual machine, or a collection of both physical servers and virtual machines. In another implementation, a cloud hosts one or more components of the systems 100-1000. The server(s) 108 may represent an instance among large instances of application servers in a cloud computing environment, a data center, or other computing environment. The server(s) 108 can access data stored at one or more database(s) (e.g., including any of the values or identifiers discussed herein). The systems 100-1000, the server(s) 108, and/or other resources connected to the network(s) 146 may access one or more other servers to access other websites, applications, web services interfaces, GUIs, storage devices, APIs, computing devices, or the like to perform the techniques discussed herein. The server(s) can include one or more computing device(s) 1102, as discussed in greater detail below.
For instance, the network environment 1100 can include the one or more computing device(s) 1102 for executing the multi-data source personality profile generator 110 and/or generating the personality profile 102. In one implementation, the one or more computing device(s) 1102 include the one or more server device(s) 108 executing the multi-data source personality profile generator 110 as a software application and/or a module or algorithmic component of software.
In some instances, the computing device(s) 1102 can include a computer, a personal computer, a desktop computer, a laptop computer, a terminal, a workstation, a server device, a cellular or mobile phone, a mobile device, a smart mobile device a tablet, a wearable device (e.g., a smart watch, smart glasses, a smart epidermal device, etc.) a multimedia console, a television, an Internet-of-Things (IoT) device, a smart home device, a medical device, a virtual reality (VR) or augmented reality (AR) device, a vehicle (e.g., a smart bicycle, an automobile computer, etc.), and/or the like. The computing device(s) 1102 may be integrated with, form a part of, or otherwise be associated with the systems 100-1000. It will be appreciated that specific implementations of these devices may be of differing possible specific computing architectures not all of which are specifically discussed herein but will be understood by those of ordinary skill in the art.
The computing device 1102 may be a computing system capable of executing a computer program product to execute a computer process. Data and program files may be input to the computing device 1102, which reads the files and executes the programs therein. Some of the elements of the computing device 1102 include one or more hardware processors 1104, one or more memory devices 1106, and/or one or more ports, such as input/output (IO) port(s) 1108 and communication port(s) 1110. Additionally, other elements that will be recognized by those skilled in the art may be included in the computing device 1102 but are not explicitly depicted in
The processor 1104 may include, for example, a central processing unit (CPU), a microprocessor, a microcontroller, a digital signal processor (DSP), and/or one or more internal levels of cache. There may be one or more processors 1104, such that the processor 1104 comprises a single central-processing unit, or a plurality of processing units capable of executing instructions and performing operations in parallel with each other, commonly referred to as a parallel processing environment.
The computing device 1102 may be a conventional computer, a distributed computer, or any other type of computer, such as one or more external computers made available via a cloud computing architecture. The presently described technology is optionally implemented in software stored on the data storage device(s) such as the memory device(s) 1106, and/or communicated via one or more of the I/O port(s) 1108 and the communication port(s) 1110, thereby transforming the computing device 1102 in
The one or more memory device(s) 1106 may include any non-volatile data storage device capable of storing data generated or employed within the computing device 1102, such as computer executable instructions for performing a computer process, which may include instructions of both application programs and an operating system (OS) that manages the various components of the computing device 1102. The memory device(s) 1106 may include, without limitation, magnetic disk drives, optical disk drives, solid state drives (SSDs), flash drives, and the like. The memory device(s) 1106 may include removable data storage media, non-removable data storage media, and/or external storage devices made available via a wired or wireless network architecture with such computer program products, including one or more database management products, web server products, application server products, and/or other additional software components. Examples of removable data storage media include Compact Disc Read-Only Memory (CD-ROM), Digital Versatile Disc Read-Only Memory (DVD-ROM), magneto-optical disks, flash drives, and the like. Examples of non-removable data storage media include internal magnetic hard disks, SSDs, and the like. The one or more memory device(s) 1106 may include volatile memory (e.g., dynamic random-access memory (DRAM), static random-access memory (SRAM), etc.) and/or non-volatile memory (e.g., read-only memory (ROM), flash memory, etc.).
Computer program products containing mechanisms to effectuate the systems and methods in accordance with the presently described technology may reside in the memory device(s) 1106 which may be referred to as machine-readable media. It will be appreciated that machine-readable media may include any tangible non-transitory medium that is capable of storing or encoding instructions to perform any one or more of the operations of the present disclosure for execution by a machine or that is capable of storing or encoding data structures and/or modules utilized by or associated with such instructions. Machine-readable media may include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more executable instructions or data structures.
In some implementations, the computing device 1102 includes one or more ports, such as the I/O port(s) 1108 and the communication port(s) 1110, for communicating with other computing or network devices. It will be appreciated that the I/O port 1108 and the communication port 1110 may be combined or separate and that more or fewer ports may be included in the computing device 1102.
The I/O port 1108 may be connected to an I/O device, or other device, by which information is input to or output from the computing device 1102. Such I/O devices may include, without limitation, one or more input devices, output devices, and/or environment transducer devices.
In one implementation, the input devices convert a human-generated signal, such as, human voice, physical movement, physical touch or pressure, and/or the like, into electrical signals as input data into the computing device 1102 via the I/O port 1108. Similarly, the output devices may convert electrical signals received from the computing device 1102 via the I/O port 1108 into signals that may be sensed as output by a human, such as sound, light, and/or touch. The input device may be an alphanumeric input device, including alphanumeric and other keys for communicating information and/or command selections to the processor 1104 via the I/O port 1108. The input device may be another type of user input device including, but not limited to: direction and selection control devices, such as a mouse, a trackball, cursor direction keys, a joystick, and/or a wheel; one or more sensors, such as a camera, a microphone, a positional sensor, an orientation sensor, an inertial sensor, and/or an accelerometer; and/or a touch-sensitive display screen (“touchscreen”). The output devices may include, without limitation, a display, a touchscreen, a speaker, a tactile and/or haptic output device, and/or the like. In some implementations, the input device and the output device may be the same device, for example, in the case of a touchscreen.
In one implementation, the communication port 1110 is connected to the network 146 so the computing device 1102 can receive network data useful in executing the methods and systems set out herein as well as transmitting information and network configuration changes determined thereby. Stated differently, the communication port 1110 connects the computing device 1102 to one or more communication interface devices configured to transmit and/or receive information between the computing device 1102 and other devices (e.g., network devices of the network(s) 146) by way of one or more wired or wireless communication networks or connections. Examples of such networks or connections include, without limitation, Universal Serial Bus (USB), Ethernet, Wi-Fi, Bluetooth®, Near Field Communication (NFC), and so on. One or more such communication interface devices may be utilized via the communication port 1110 to communicate with one or more other machines, either directly over a point-to-point communication path, over a wide area network (WAN) (e.g., the Internet), over a local area network (LAN), over a cellular network (e.g., third generation (3G), fourth generation (4G), Long-Term Evolution (LTE), fifth generation (5G), etc.) or over another communication means. Further, the communication port 1110 may communicate with an antenna or other link for electromagnetic signal transmission and/or reception.
In an example the multi-data source personality profile generator 110, the data aggregator 112, the personality profile generation process 122 and/or other software, modules, services, and operations discussed herein may be embodied by instructions stored on the memory devices 1106 and executed by the processor 1104.
The system set forth in
It is to be understood that the specific order or hierarchy of operations in the methods depicted in
Furthermore, any term of degree such as, but not limited to, “substantially,” as used in the description and the appended claims, should be understood to include an exact, or a similar, but not exact configuration. Similarly, the terms “about” or “approximately,” as used in the description and the appended claims, should be understood to include the recited values or a value that is three times greater or one third of the recited values. For example, about 3 mm includes all values from 1 mm to 9 mm, and approximately 50 degrees includes all values from 16.6 degrees to 150 degrees.
Lastly, the terms “or” and “and/or,” as used herein, are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B, or C” or “A, B, and/or C” mean any of the following: “A,” “B,” or “C”; “A and B”; “A and C”; “B and C”; “A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.
While the present disclosure has been described with reference to various implementations, it will be understood that these implementations are illustrative and that the scope of the present disclosure is not limited to them. Many variations, modifications, additions, and improvements are possible. More generally, implementations in accordance with the present disclosure have been described in the context of particular implementations. Functionality may be separated or combined differently in various implementations of the disclosure or described with different terminology. These and other variations, modifications, additions, and improvements may fall within the scope of the disclosure as defined in the claims that follow.