This invention relates generally to systems and methods for facilitating creation of personal genetic profile products and assessments.
Genomes hold much valuable information that can be used to better understand biological characteristics and traits of humans and animals. Personal genetic profiles can be used by individuals to learn about their genome or genomes of related persons or animals. Biological characteristics and traits influenced or determined by an individual's genome include from easily observable traits such as eye color and height as well as harder to observe or quantify traits and characteristics such as food sensitivities and intellectual and physical aptitudes. Developing an understanding of a genome can provide valuable information that may be used in making decisions about one's behavior and habits.
Until recently, characterizing a genome was prohibitively expensive such that very few individual genomes had been fully or partially characterized. Techniques utilized in genotyping a genome required significant resources that limited genotyping to laboratory use in scientific research and related areas. Developments of cost-effective equipment and procedures for genotyping have made personal genotyping feasible. The output of genetic information from such genotyping procedures still requires expertise in the biological sciences to understand.
At present, much research is being conducted to establish relationships between the human genome and biological characteristics and traits, in particular. Single nucleotide polymorphisms (SNPs) are specific sites identified in particular genes that influence biological characteristics and traits different depending on the particular polymorphism of an individual. Different polymorphisms of the nucleotides at a specific site influence the relevant characteristic or trait differently. The influence of an individual's polymorphism on his/her trait can be positive or negative. Many relationships between the variants of SNPs and their corresponding biological characteristics and traits have been established and many more possible relationships are currently undiscovered and under investigation.
Data from genotyping tests may lack organization and may be difficult to conveniently and meaningfully assemble for presentation to users, particularly where data from large numbers of users are involved. Accordingly, there is a need for systems and methods that facilitate the creation and presentation of personal genetic profile assessments that not only provide an individual with their particular genetic profile assessment, but also convey information related to the particular traits and characteristics that are influenced by the specific SNP variants present in their genetic material. Furthermore, there is a need for systems that streamline the creation of user-friendly products that present such genetic profile assessments.
Presented herein are systems and methods related to a scalable and platform-agnostic framework that facilitates the creation of personal genetic profile products. In particular, in certain embodiments, the systems and methods described herein facilitate the creation, storage, organization, and maintenance (e.g. updating) of data corresponding to the relationships between particular variations that may occur in an individual's DNA (e.g. variants of different specific SNPs) and the health-related phenotypes that they influence. The data stored in this manner can be used as a template not only for presenting personal genetic profile assessments to an individual in an informative and user-friendly manner, but also for the rapid, automated creation of personal genetic profile assessments of a plurality of individuals from data corresponding to genotyping measurements of biological samples provided by the individuals.
In certain embodiments, the systems and methods described herein provide for creation, storage, and updating of a first class of data structures, referred to herein as products, which are used to represent different general classes of health-related traits and characteristics. For example, different particular products are used to represent different classes of traits such as those corresponding to (i) the way in which an individual's body processes different foods and nutrients, (ii) skin health, and (iii) physical fitness.
In certain embodiments, each product is associated with one or more of a second class of data structures, referred to as categories. In certain embodiments, each category corresponds to a particular health-related trait or characteristic (e.g. food sensitivity, food breakdown, hunger and weight, vitamins, skin uv sensitivity, endurance, metabolism, joint health, muscle strength, intelligence). In certain embodiments, the categories with which a particular product is associated each correspond to different health-related traits or characteristics that are related to the general class of health-related traits or characteristics to which the particular product corresponds (e.g. the general class of health-related traits or characteristics that the product represents).
In turn, each category is associated with one or more SNP objects and/or gene objects, wherein each SNP object is a data structure that represents a specific, physical SNP, and each gene object is a data structure that represents a specific gene. SNP objects and gene objects are associated with a particular category based on the specific health-related phenotypes the SNPs and genes to which they correspond influence. In particular, SNP objects and gene objects corresponding to SNPs and genes, respectively, that influence specific health related phenotypes related to the trait or characteristic to which a particular category corresponds are associated with that particular category.
In certain embodiments, the systems and methods described herein provide a convenient user interface (e.g. a graphical user interface (GUI) for creation and/or updating of products, categories, SNP objects, and gene objects, as well as the respective associations between them. The data stored within the framework described herein can thus be updated in a flexible manner in order to account for changes in the body of scientific knowledge regarding the relationships between an individual's genomic profile and their overall health, as well as changes in the desires of individuals to learn about different aspects of their personal genetic profile.
In certain embodiments, the hierarchical organization of product, category, SNP object and gene object data structures serves as a flexible template that facilitates both the rapid creation of individual personal genetic profile assessments from genotyping measurements taken from a plurality of individuals, as well as the presentation of an individual's personal genetic profile assessment. In particular, an individual may purchase assessments corresponding to different products, in order to gain insight into the manner in which their personal genome influences the different general classes of health-related traits and characteristics to which each different product corresponds. Accordingly, an individual's personal genetic profile assessment corresponding to one or more products comprises, for each specific SNP associated with each category that is associated with each of the one or more products, an identification of the particular variant of the specific SNP that the individual has. Typically, the identification is obtained via one or more genotyping measurements performed on a biological sample taken from the individual (e.g. a blood sample, e.g. a cheek swab sample, e.g. a saliva sample).
In certain embodiments, an individual may purchase a first assessment corresponding to a first product, and provide a biological sample for genotyping. The individual's biological sample may be stored (e.g. cryogenically frozen). After a period of time, the individual may choose to purchase additional assessments corresponding to other products, and the individual's previously stored biological sample may be taken from storage for additional genotyping measurements of the additional SNPs that are associated with the new products. Moreover, in certain embodiments, additional new products may be created over time, and new assessments corresponding to new products offered to and purchased by individuals. In certain embodiments, as new information related to the influence of new and/or existing SNPs on different specific health related phenotypes is elucidated, new SNP objects and gene objects may be created, and new associations between them and new or existing categories and/or products established. In certain embodiments, existing personal genetic profile assessments of individuals are automatically updated to reflect such new information.
Thus, by providing a framework comprising a hierarchical organization of data structures corresponding to products, categories, SNP objects, and gene objects, the systems and methods described herein provide an intuitive and flexible approach to storing, updating, and creating new associations between different classes of health-related traits and characteristics, and the underlying genetic variations corresponding to different specific SNPs that influence them.
In one aspect, the invention is directed to a method for creating personal genetic profile products that correspond to an assessment of an individual's genetic profile via a graphical user interface (GUI), the method comprising: (a) presenting, by a processor of a computing device, a graphical user interface element (e.g., widget) for creation of a gene object corresponding to a specific gene (e.g., of the human genome), wherein the gene object is a data structure comprising: one or more SNP objects, each SNP object corresponding to a specific SNP occurring in or nearby (e.g. in a promotor region that influences transcription of the specific gene, e.g. occurring within 5 kb upstream or downstream of the specific gene, e.g. occurring within 100 kb upstream or downstream of the specific gene, e.g. occurring within 500 kb upstream or downstream of the specific gene, e.g. occurring within 1 Mb upstream or downstream of the specific gene) the specific gene with which the gene object is associated, and each SNP object comprising: (i) a SNP reference (e.g. a number or code that identifies the specific SNP corresponding with the SNP object); and (ii) for each of one or more variants of the specific associated SNP: a measurement outcome associated with the variant; a qualifier associated with the variant; and optionally, additional information associated with the variant; and (iii) optionally, additional information associated with the SNP object; and (b) receiving, by the processor, via the graphical user interface element, a SNP object associated with the gene object; (c) associating, by the processor, the received SNP object with the gene object; and (d) storing, by the processor, the gene object for further retrieval and/or updating by the processor via the GUI.
In certain embodiments, the gene object comprises a gene identifier (e.g. an alphanumeric code, e.g. an accepted name of the gene) that identifies the specific gene (e.g. of the human genome) with which the gene object is associated.
In certain embodiments, the SNP reference is an alphanumeric code (e.g. a National Center for Biotechnology Information (NCBI) database reference number) that serves as a unique identifier of the specific SNP with which the SNP object is associated.
In certain embodiments, for each variant, the measurement outcome associated with the variant corresponds to an identification of a result of a genotyping measurement of the specific SNP with which the SNP object that comprises the variant is associated. In certain embodiments, the measurement outcome associated with the variant comprises an identification of a first nucleotide associated with a first copy (e.g. from a first of two sets of chromosomes) of an individual's genetic material, and an identification of a second nucleotide of a second copy (e.g. from a second of two sets of chromosomes) of an individual's genetic material (e.g. the measurement outcome comprises a two letter sequence, each letter identifying a particular nucleotide, e.g. the letter ‘A’ identifies adenine, the letter ‘G’ identifies guanine, e.g. the letter ‘T’ identifies thymine, e.g. the letter ‘C’ identifies cytosine).
In certain embodiments, the SNP object comprises, for each of three physically viable variants of the specific SNP with which the SNP object is associated: a measurement outcome associated with the variant; and a qualifier associated with the variant. In certain embodiments, for each variant, the qualifier associated with the variant corresponds to at least one of (i) and (ii) as follows: (i) a classification of the variant based on prevalence of the variant within a population; and (ii) a classification of the variant based on a health-related phenotype associated with the variant. In certain embodiments, for each variant, the particular qualifier associated with the variant is a specific qualifier of a countable set (e.g. finite number) of predefined qualifiers (e.g. one of three predefined qualifiers, e.g. selected from the group consisting of “Adapt”, “Normal”, “Gifted”).
In certain embodiments, the SNP object comprises additional information associated with the SNP object, the additional information comprising at least one of: (i) a short description (e.g. 2 word or less, 3 word or less) of the specific SNP with which the SNP object is associated (e.g. a short characterization of a health-related trait that the specific SNP influences, e.g. “Alcohol Tolerance”, e.g. “Caffeine Metabolism”, e.g. “Lactose Intolerance”, e.g. “Blood Flow Regulation”); (ii) a description (e.g. a paragraph describing results of published studies directed to health-related traits influenced by the specific SNP with which the SNP object is associated); and (iii) one or more references, each reference identifying a source of information (e.g. a citation identifying a published study, e.g. a weblink).
In certain embodiments, the SNP object comprises, for each variant, additional information associated with the variant, the additional information comprising a description of the variant (e.g. a description of a specific health-related phenotype that an individual with the particular variant exhibits).
In certain embodiments, the method comprises receiving a user input to initiate creation of the gene object (e.g. comprising entering a name of the gene object into the GUI, e.g. clicking a button).
In certain embodiments, receiving the SNP object associated with the gene object comprises receiving, by the processor, via the graphical user interface element, a user input of: (i) a SNP reference; and (ii) for each of one or more variants, a measurement outcome associated with the variant and a qualifier associated with the variant.
In certain embodiments, step (a) of the method comprises: presenting a SNP reference graphical control element for user entry of a SNP reference (e.g. a text box); and for each of one or more variants to be added to a SNP object, presenting: (i) a respective variant measurement graphical control element (e.g. a text box) for user entry of a measurement outcome associated with the respective variant; and (ii) a respective variant qualifier graphical control element (e.g. a drop down list that displays a set of one or more predefined qualifiers) for user selection of a qualifier associated with the respective variant, and step (b) of the method comprises: receiving, via the SNP reference graphical control element, a user input of a SNP reference; and for each of the one or more variants to be added to the SNP object: (i) receiving, via the respective variant measurement graphical control element, a user input of a measurement outcome; and (ii) receiving, via the respective variant qualifier graphical control element, a user selection of a qualifier.
In certain embodiments, the method comprises: (e) presenting, by the processor, a graphical user interface element for creation of a category corresponding to a health-related phenotype (e.g. skin uv sensitivity, e.g. endurance, e.g. metabolism, e.g. joint health, e.g. muscle strength, e.g. food sensitivity, e.g. vitamin levels, e.g. intelligence) associated with a predefined group of one or more related SNPs each of which is associated with the health-related phenotype, wherein the category is a data structure comprising one or more SNP objects; (f) receiving, by the processor, via the graphical user interface element for creation of a category, a selection of one or more SNP objects; (g) associating, by the processor, the one or more selected SNP objects with the category; and (h) storing, by the processor, the category, for further retrieval and/or updating by the processor via the GUI.
In certain embodiments, the category comprises additional information selected from the group consisting of: (i) a name of the category; (ii) a background image associated with the category; (iii) an icon associated with the category; (iv) a category order identifier associated with the category; and (v) a description of the category.
In certain embodiments, step (e) comprises presenting (i) a first gene selection graphical control element (e.g. a drop down list) for selecting a first gene object and (ii) a first SNP selection graphical control element (e.g. a drop down list) for selecting a first SNP object of the first gene object, and step (f) comprises receiving, via the first gene selection graphical control element, a user selection of a first gene object, and receiving, via the first SNP selection graphical control element, a user selection of a first SNP object of the selected first gene object.
In certain embodiments, the first gene selection graphical control element displays a list of selectable elements, each element of the list corresponding to a previously stored gene object, and following a user selection of the first gene object, the first SNP selection graphical control element displays a list of selectable elements, each element of the list corresponding to a SNP object of the previously selected first gene object.
In certain embodiments, the method comprises: receiving, by the processor, a user input to add more genes to the category; and responsive to the user input to add more genes to the category, presenting (i) a second gene selection graphical control element for selecting a second gene object and (ii) a second SNP selection graphical control element for selecting a second SNP object of the second gene object.
In certain embodiments, the method comprises: (i) presenting, by the processor, a graphical user interface element for creation of a product corresponding to a general class of health-related phenotypes (e.g. overall skin health (e.g. Aura), e.g. fitness (e.g. Fitcode), e.g. nutritional health (e.g. Fuel), e.g. exceptional qualities (e.g. Superhero) associated with a predefined group of one or more related categories, wherein the product is a data structure comprising one or more categories; (j) receiving, by the processor, via the graphical user interface element for creation of the product, a selection of one or more categories; (k) associating, by the processor, the one or more selected categories with the product; and (l) storing, by the processor, the product for further retrieval and/or updating by the GUI.
In certain embodiments, the product comprises additional information selected from the group consisting of: (i) a name of the product; (ii) an icon associated with the product; and (iii) a description of the product.
In another aspect, the invention is directed to method of creating and/or updating personal genetic profile assessments for a plurality of individuals, the method comprising: receiving, by a processor of a computing device, genotyping data collected from a plurality of biological samples, each biological sample having been taken from a particular individual of the plurality of individuals, wherein the genotyping data comprises, for each biological sample, one or more genotyping measurements of one or more SNPs, each SNP associated with one or more genes (e.g. each SNP occurring within one of the one or more genes or occurring nearby one or more genes (e.g. within a promotor region that influences transcription of one or more genes, e.g. within 5 kb upstream or downstream of one or more genes, e.g. within 100 kb upstream or downstream of one or more genes, e.g. occurring within 500 kb upstream or downstream of one or more genes, e.g. within 1 Mb upstream or downstream of one or more genes)); and for each individual associated with each biological sample, for each SNP (e.g. for each SNP that was measured for the associated biological sample): (a) identifying, by the processor, a gene object (e.g. a gene object created via any of the methods described above) corresponding to a target gene of the one or more genes with which the SNP is associated and a SNP object (e.g. a SNP object as created via any of the methods described above) of the gene object corresponding to the SNP; (b) receiving, by the processor, a determination of a particular variant of the SNP that the individual has and determining a qualifier (e.g. a qualifier as described in any of the embodiments described above) associated with the particular variant; and (c) storing, by the processor, within the personal genetic profile assessment of the individual: (i) a gene identifier (e.g. a gene identifier of the identified gene object) associated with the measured SNP, wherein the gene identifier identifies the target gene; (ii) a SNP reference (e.g. the SNP reference of the SNP object as created via any of the methods described above) that identifies the measured SNP, wherein the SNP reference is associated with the gene identifier; (iii) a variant identifier that identifies the particular variant of the SNP that the individual has, wherein the variant identifier is associated with the SNP reference; and (iv) the qualifier (e.g. a qualifier as described in any of the embodiments described above) associated with the particular variant, wherein the qualifier is associated with the SNP reference.
In certain embodiments, for each biological sample, the one or more genotyping measurements of the one or more SNPs comprise(s), for each SNP, a first measurement that identifies a first nucleotide of a first copy (e.g. from a first of two sets of chromosomes) of an individual's genetic material, and a second measurement that identifies a second nucleotide of a second copy (e.g. from a second of two sets of chromosomes) of an individual's genetic material. In certain embodiments, the genotyping data comprises data from a PCR-based SNP genotyping assay.
In certain embodiments, the method comprises measuring, via a PCR-based SNP genotyping assay, one or more SNPs for a plurality of biological samples, thereby producing genotyping data.
In certain embodiments, for each biological sample: the genotyping data comprises, for each gene with which each SNP is associated, a corresponding gene identifier, and the genotyping data comprises, for each SNP, a corresponding SNP reference, and step (a) comprises, for each measured SNP: accessing, by the processor, a plurality of previously stored gene objects (e.g. gene objects created and stored via any of the methods described above), wherein each previously stored gene object is a data structure comprising a gene identifier and one or more SNP objects, each SNP object of a previously stored gene object comprising a SNP reference; matching, by the processor, a gene identifier (e.g. of one of the one or more genes with which the measured SNP is associated) of the genetic sequencing data to a gene identifier (e.g. a gene identifier as described above) of one of the plurality of previously stored gene objects, thereby identifying the gene object associated with the gene to which the SNP belongs; and matching, by the processor, the SNP reference (corresponding to the measured SNP) of the genetic sequencing data to a SNP reference of one of the SNP objects (e.g. the SNP reference of the SNP objects created via any of the methods described above) of the plurality of previously stored gene objects, thereby identifying the SNP object of the gene object corresponding to the SNP, thereby identifying the gene object corresponding to the target gene with which the SNP is associated and the SNP object corresponding to the SNP.
In certain embodiments, the genotyping data comprises, for each SNP, a corresponding SNP reference, and for each biological sample, for each measured SNP, determining, by the processor, a particular variant of the SNP that the individual associated with the biological sample has and a qualifier associated with the variant comprises: accessing, by the processor, a plurality of previously stored gene objects (e.g. gene objects previously created and stored via any of the methods described above), wherein each gene object comprises: one or more SNP objects, each SNP object comprising: a SNP reference (e.g. a number or code that identifies a specific SNP associated with the SNP object); and for each of one or more variants of a specific SNP associated with the SNP object, a measurement outcome associated with the variant, and a qualifier associated with the variant; matching, by the processor, the SNP reference (corresponding to the measured SNP) of the genotyping data to a SNP reference of one of the SNP objects of the plurality of previously stored gene objects, thereby identifying a SNP object corresponding to the measured SNP; and matching, by the processor, the genotyping measurement of the measured SNP to a measurement outcome of identified SNP object corresponding to the measured SNP, thereby determining the particular variant of the SNP that the individual has.
In certain embodiments, the method comprises automatically determining, by the processor, for each individual of the plurality of individuals associated with the plurality of measured biological samples, one or more products (e.g. products created via any of the methods described above) associated with the individual's personal genetic profile assessment, wherein: each product corresponds to a predefined group of one or more related categories (e.g. the categories as created via any of the methods described above), and the product is a data structure comprising one or more categories, each category corresponds to a predefined group of one or more related SNPs, each of which is associated with a health-related phenotype, wherein the category is a data structure comprising one or more SNP objects (e.g. each SNP object belonging to a gene object previously created and stored via any of the methods described above), and for each individual of the plurality of individuals, automatically determining one or more products associated with the individual's personal genetic profile assessment comprises, for each product of a plurality of previously stored products: determining a list of all the different SNP objects that the product comprises, the list comprising each distinct SNP object of each category the product comprises; and matching, by the processor, the list of all the different SNP objects of the product to the measured SNPs of the genetic sequencing data collected from the biological sample associated with the individual.
In certain embodiments, the method comprises, for an individual of the plurality of individuals, causing, by the processor, display (e.g. on a computing device associated with the individual) of an assessment graphical user interface (GUI) for view of the individual's personal genetic profile assessment, the assessment GUI comprising, for each measured SNP associated with the individual, a graphical element comprising: (a) at least one of (i) or (ii) as follows: (i) a graphical representation of the associated (with the measured SNP) gene identifier of the individual's personal genetic profile assessment; (ii) a graphical representation of the SNP reference of the individual's personal genetic profile assessment that identifies the measured SNP; and (b) a graphical representation of the qualifier of the individual's personal genetic profile assessment, wherein the qualifier is the qualifier that is associated with the SNP reference that identifies the measured SNP.
In certain embodiments, the method comprises: (a) causing, by the processor, display (e.g. on a computing device associated with the individual) of one or more selectable product graphical control elements, each selectable product graphical control element corresponding to a previously stored product, wherein: each previously stored product corresponds to a predefined group of one or more related categories, and each category corresponds to a predefined group of one or more related gene-SNP combinations each of which is associated with the health-related phenotype; (b) responsive to a user selection of a specific product graphical control element, for each category of the previously stored product corresponding to the selected product graphical control element, causing display of a respective selectable category graphical control element; and (c) responsive to a user selection of a specific category graphical control element, for each gene-SNP combination of the corresponding category, causing display of a corresponding gene graphical control element, wherein each gene graphical control element comprises: at least one of (i) or (ii) as follows: (i) a graphical representation of the associated (with the measured SNP) gene identifier of the individual's personal genetic profile assessment; (ii) a graphical representation of the SNP reference of the individual's personal genetic profile assessment that identifiers the measured SNP; and a graphical representation of the qualifier of the individual's personal genetic profile assessment, wherein the qualifier is the qualifier that is associated with the SNP reference that identifies the measured SNP.
In another aspect, the invention is directed to a system for creating personal genetic profile products that correspond to an assessment of an individual's genetic profile via a graphical user interface (GUI), the system comprising: a processor; and a memory having instructions stored thereon, wherein the instructions, when executed by the processor, cause the processor to: (a) present a graphical user interface element (e.g., widget) for creation of a gene object corresponding to a specific gene (e.g., of the human genome), wherein the gene object is a data structure comprising: one or more SNP objects, each SNP object corresponding to a specific SNP occurring in or nearby (e.g. in a promotor region that influences transcription of the specific gene, e.g. occurring within 5 kb upstream or downstream of the specific gene, e.g. occurring within 100 kb upstream or downstream of the specific gene, e.g. occurring within 500 kb upstream or downstream of the specific gene, e.g. occurring within 1 Mb upstream or downstream of the specific gene) the specific gene with which the gene object is associated, and each SNP object comprising: (i) a SNP reference (e.g. a number or code that identifies the specific SNP corresponding with the SNP object); and (ii) for each of one or more variants of the specific associated SNP: a measurement outcome associated with the variant; a qualifier associated with the variant; and optionally, additional information associated with the variant; and (iii) optionally, additional information associated with the SNP object; and (b) receive, via the graphical user interface element, a SNP object associated with the gene object; (c) associate the received SNP object with the gene object; and (d) store the gene object for further retrieval and/or updating by the processor via the GUI.
In certain embodiments, the gene object comprises a gene identifier (e.g. an alphanumeric code, e.g. an accepted name of the gene) that identifies the specific gene (e.g. of the human genome) with which the gene object is associated.
In certain embodiments, the SNP reference is an alphanumeric code (e.g. a National Center for Biotechnology Information (NCBI) database reference number) that serves as a unique identifier of the specific SNP with which the SNP object is associated.
In certain embodiments, for each variant, the measurement outcome associated with the variant corresponds to an identification of a result of a genotyping measurement of the specific SNP with which the SNP object that comprises the variant is associated. In certain embodiments, the measurement outcome associated with the variant comprises an identification of a first nucleotide associated with a first copy (e.g. from a first of two sets of chromosomes) of an individual's genetic material, and an identification of a second nucleotide of a second copy (e.g. from a second of two sets of chromosomes) of an individual's genetic material (e.g. the measurement outcome comprises a two letter sequence, each letter identifying a particular nucleotide, e.g. the letter ‘A’ identifies adenine, the letter ‘G’ identifies guanine, e.g. the letter ‘T’ identifies thymine, e.g. the letter ‘C’ identifies cytosine).
In certain embodiments, the SNP object comprises, for each of three physically viable variants of the specific SNP with which the SNP object is associated: a measurement outcome associated with the variant; and a qualifier associated with the variant. In certain embodiments, for each variant, the qualifier associated with the variant corresponds to at least one of (i) and (ii) as follows: (i) a classification of the variant based on prevalence of the variant within a population; and (ii) a classification of the variant based on a health-related phenotype associated with the variant. In certain embodiments, for each variant, the particular qualifier associated with the variant is a specific qualifier of a countable set (e.g. finite number) of predefined qualifiers (e.g. one of three predefined qualifiers, e.g. selected from the group consisting of “Adapt”, “Normal”, “Gifted”).
In certain embodiments, the SNP object comprises additional information associated with the SNP object, the additional information comprising at least one of: (i) a short description (e.g. 2 word or less, 3 word or less) of the specific SNP with which the SNP object is associated (e.g. a short characterization of a health-related trait that the specific SNP influences, e.g. “Alcohol Tolerance”, e.g. “Caffeine Metabolism”, e.g. “Lactose Intolerance”, e.g. “Blood Flow Regulation”); (ii) a description (e.g. a paragraph describing results of published studies directed to health-related traits influenced by the specific SNP with which the SNP object is associated); and (iii) one or more references, each reference identifying a source of information (e.g. a citation identifying a published study, e.g. a weblink).
In certain embodiments, the SNP object comprises, for each variant, additional information associated with the variant, the additional information comprising a description of the variant (e.g. a description of a specific health-related phenotype that an individual with the particular variant exhibits).
In certain embodiments, the instructions cause the processor to receive a user input to initiate creation of the gene object (e.g. comprising entering a name of the gene object into the GUI, e.g. clicking a button).
In certain embodiments, the instructions cause the processor to receive the SNP object associated with the gene object by receiving, via the graphical user interface element, a user input of: (i) a SNP reference; and (ii) for each of one or more variants, a measurement outcome associated with the variant and a qualifier associated with the variant.
In certain embodiments, the instructions cause the processor to: at step (a): present a SNP reference graphical control element for user entry of a SNP reference (e.g. a text box); and for each of one or more variants to be added to a SNP object, present: (i) a respective variant measurement graphical control element (e.g. a text box) for user entry of a measurement outcome associated with the respective variant; and (ii) a respective variant qualifier graphical control element (e.g. a drop down list that displays a set of one or more predefined qualifiers) for user selection of a qualifier associated with the respective variant; and at step (b): receive, via the SNP reference graphical control element, a user input of a SNP reference; and for each of the one or more variants to be added to the SNP object: (i) receive, via the respective variant measurement graphical control element, a user input of a measurement outcome; and (ii) receive, via the respective variant qualifier graphical control element, a user selection of a qualifier.
In certain embodiments, the instructions cause the processor to: (e) present a graphical user interface element for creation of a category corresponding to a health-related phenotype (e.g. skin uv sensitivity, e.g. endurance, e.g. metabolism, e.g. joint health, e.g. muscle strength, e.g. food sensitivity, e.g. vitamin levels, e.g. intelligence) associated with a predefined group of one or more related SNPs each of which is associated with the health-related phenotype, wherein the category is a data structure comprising one or more SNP objects; (f) receive, via the graphical user interface element for creation of a category, a selection of one or more SNP objects; (g) associate the one or more selected SNP objects with the category; and (h) store the category for further retrieval and/or updating by the processor via the GUI.
In certain embodiments, the category comprises additional information selected from the group consisting of: (i) a name of the category; (ii) a background image associated with the category; (iii) an icon associated with the category; (iv) a category order identifier associated with the category; and (v) a description of the category.
In certain embodiments, the instructions cause the processor to: at step (e), present (i) a first gene selection graphical control element (e.g. a drop down list) for selecting a first gene object and (ii) a first SNP selection graphical control element (e.g. a drop down list) for selecting a first SNP object of the first gene object; and at step (f): receive, via the first gene selection graphical control element, a user selection of a first gene object; and receive, via the first SNP selection graphical control element, a user selection of a first SNP object of the selected first gene object.
In certain embodiments, the first gene selection graphical control element displays a list of selectable elements, each element of the list corresponding to a previously stored gene object, and following a user selection of the first gene object, the first SNP selection graphical control element displays a list of selectable elements, each element of the list corresponding to a SNP object of the previously selected first gene object.
In certain embodiments, the instructions cause the processor to: receive a user input to add more genes to the category; and responsive to the user input to add more genes to the category, present (i) a second gene selection graphical control element for selecting a second gene object and (ii) a second SNP selection graphical control element for selecting a second SNP object of the second gene object.
In certain embodiments, the instructions cause the processor to: (i) present a graphical user interface element for creation of a product corresponding to a general class of health-related phenotypes (e.g. overall skin health (e.g. Aura), e.g. fitness (e.g. Fitcode), e.g. nutritional health (e.g. Fuel), e.g. exceptional qualities (e.g. Superhero) associated with a predefined group of one or more related categories, wherein the product is a data structure comprising one or more categories; (j) receive, via the graphical user interface element for creation of the product, a selection of one or more categories; (k) associate the one or more selected categories with the product; and (l) store the product for further retrieval and/or updating by the GUI.
In certain embodiments, the product comprises additional information selected from the group consisting of: (i) a name of the product; (ii) an icon associated with the product; and (iii) a description of the product.
In another aspect, the invention is directed to a system for creating and/or updating personal genetic profile assessments for a plurality of individuals, the system comprising: a processor; and a memory having instructions stored thereon, wherein the instructions, when executed by the processor, cause the processor to: receive genotyping data collected from a plurality of biological samples, each biological sample having been taken from a particular individual of the plurality of individuals, wherein the genotyping data comprises, for each biological sample, one or more genotyping measurements of one or more SNPs, each SNP associated with one or more genes (e.g. each SNP occurring within one of the one or more genes or occurring nearby one or more genes (e.g. within a promotor region that influences transcription of one or more genes, e.g. within 5 kb upstream or downstream of one or more genes, e.g. within 100 kb upstream or downstream of one or more genes, e.g. occurring within 500 kb upstream or downstream of one or more genes, e.g. within 1 Mb upstream or downstream of one or more genes)); and for each individual associated with each biological sample, for each SNP (e.g. for each SNP that was measured for the associated biological sample): (a) identify a gene object (e.g. a gene object created via any of the systems described above) corresponding to a target gene of the one or more genes with which the SNP is associated and a SNP object (e.g. a SNP object of any of the systems described above) of the gene object corresponding to the SNP; (b) receive a determination of a particular variant of the SNP that the individual has and determining a qualifier (e.g. a qualifier as described in any of the embodiments described above) associated with the particular variant; and (c) store within the personal genetic profile assessment of the individual: (i) a gene identifier (e.g. a gene identifier of the identified gene object) associated with the measured SNP, wherein the gene identifier identifies the target gene; (ii) a SNP reference (e.g. the SNP reference of the SNP object of any of the systems described above) that identifies the measured SNP, wherein the SNP reference is associated with the gene identifier; (iii) a variant identifier that identifies the particular variant of the SNP that the individual has, wherein the variant identifier is associated with the SNP reference; and (iv) the qualifier (e.g. a qualifier as described in any of the embodiments described above) associated with the particular variant, wherein the qualifier is associated with the SNP reference.
In certain embodiments, for each biological sample, the one or more genotyping measurements of the one or more SNPs comprise(s), for each SNP, a first measurement that identifies a first nucleotide of a first copy (e.g. from a first of two sets of chromosomes) of an individual's genetic material, and a second measurement that identifies a second nucleotide of a second copy (e.g. from a second of two sets of chromosomes) of an individual's genetic material. In certain embodiments, the genotyping data comprises data from a PCR-based SNP genotyping assay.
In certain embodiments, the system comprises a reader (e.g. a PCR system reader for measuring PCR-based genotyping assays) for measuring genotyping data for one or more SNPs for a plurality of biological samples and providing the genotyping data to the processor.
In certain embodiments, for each biological sample: the genotyping data comprises, for each gene with which each SNP is associated, a corresponding gene identifier, and the genotyping data comprises, for each SNP, a corresponding SNP reference, and the instructions cause the processor to, at step (a), for each measured SNP: access a plurality of previously stored gene objects (e.g. gene objects created and stored via any of the systems described above), wherein each previously stored gene object is a data structure comprising a gene identifier and one or more SNP objects, each SNP object of a previously stored gene object comprising a SNP reference; match a gene identifier (e.g. of one of the one or more genes with which the measured SNP is associated) of the genetic sequencing data to a gene identifier (e.g. a gene identifier as described above) of one of the plurality of previously stored gene objects, thereby identifying the gene object associated with the gene to which the SNP belongs; and match the SNP reference (corresponding to the measured SNP) of the genetic sequencing data to a SNP reference of one of the SNP objects (e.g. the SNP reference of the SNP objects created via any of the systems described above) of the plurality of previously stored gene objects, thereby identifying the SNP object of the gene object corresponding to the SNP, thereby identifying the gene object corresponding to the target gene with which the SNP is associated and the SNP object corresponding to the SNP.
In certain embodiments, the genotyping data comprises, for each SNP, a corresponding SNP reference, and the instructions cause the processor to, for each biological sample, for each measured SNP, determine a particular variant of the SNP that the individual associated with the biological sample has and a qualifier associated with the variant by: accessing a plurality of previously stored gene objects (e.g. gene objects previously created and stored via any of the systems described above), wherein each gene object comprises: one or more SNP objects, each SNP object comprising: a SNP reference (e.g. a number or code that identifies a specific SNP associated with the SNP object); and for each of one or more variants of a specific SNP associated with the SNP object, a measurement outcome associated with the variant, and a qualifier associated with the variant; matching the SNP reference (corresponding to the measured SNP) of the genotyping data to a SNP reference of one of the SNP objects of the plurality of previously stored gene objects, thereby identifying a SNP object corresponding to the measured SNP; and matching the genotyping measurement of the measured SNP to a measurement outcome of identified SNP object corresponding to the measured SNP, thereby determining the particular variant of the SNP that the individual has.
In certain embodiments, the instructions cause the processor to, for each individual of the plurality of individuals associated with the plurality of measured biological samples, automatically determine one or more products (e.g. products created via any of the systems described above) associated with the individual's personal genetic profile assessment, wherein: each product corresponds to a predefined group of one or more related categories (e.g. categories created via any of the systems described above), and the product is a data structure comprising one or more categories, each category corresponds to a predefined group of one or more related SNPs, each of which is associated with a health-related phenotype, wherein the category is a data structure comprising one or more SNP objects (e.g. each SNP object belonging to a gene object previously created and stored via any of the systems described above), and for each individual of the plurality of individuals, the instructions cause the processor to determine one or more products associated with the individual's personal genetic profile assessment by, for each product of a plurality of previously stored products: determining a list of all the different SNP objects that the product comprises, the list comprising each distinct SNP object of each category the product comprises; and matching the list of all the different SNP objects of the product to the measured SNPs of the genetic sequencing data collected from the biological sample associated with the individual.
In certain embodiments, the instructions cause the processor to, for an individual of the plurality of individuals, cause display (e.g. on a computing device associated with the individual) of an assessment graphical user interface (GUI) for view of the individual's personal genetic profile assessment, the assessment GUI comprising, for each measured SNP associated with the individual, a graphical element comprising: (a) at least one of (i) or (ii) as follows: (i) a graphical representation of the associated (with the measured SNP) gene identifier of the individual's personal genetic profile assessment; (ii) a graphical representation of the SNP reference of the individual's personal genetic profile assessment that identifies the measured SNP; and (b) a graphical representation of the qualifier of the individual's personal genetic profile assessment, wherein the qualifier is the qualifier that is associated with the SNP reference that identifies the measured SNP.
In certain embodiments, the instructions cause the processor to: (a) cause display (e.g. on a computing device associated with the individual) of one or more selectable product graphical control elements, each selectable product graphical control element corresponding to a previously stored product, wherein: each previously stored product corresponds to a predefined group of one or more related categories, and each category corresponds to a predefined group of one or more related gene-SNP combinations each of which is associated with the health-related phenotype; (b) responsive to a user selection of a specific product graphical control element, for each category of the previously stored product corresponding to the selected product graphical control element, cause display of a respective selectable category graphical control element; (c) responsive to a user selection of a specific category graphical control element, for each gene-SNP combination of the corresponding category, cause display of a corresponding gene graphical control element, wherein each gene graphical control element comprises: at least one of (i) or (ii) as follows: (i) a graphical representation of the associated (with the measured SNP) gene identifier of the individual's personal genetic profile assessment; (ii) a graphical representation of the SNP reference of the individual's personal genetic profile assessment that identifiers the measured SNP; and a graphical representation of the qualifier of the individual's personal genetic profile assessment, wherein the qualifier is the qualifier that is associated with the SNP reference that identifies the measured SNP.
The foregoing and other objects, aspects, features, and advantages of the present disclosure will become more apparent and better understood by referring to the following description taken in conjunction with the accompanying drawings, in which:
The features and advantages of the present disclosure will become more apparent from the detailed description set forth below when taken in conjunction with the drawings, in which like reference characters identify corresponding elements throughout. In the drawings, like reference numbers generally indicate identical, functionally similar, and/or structurally similar elements.
Approximately: As used herein, the term “approximately” or “about,” as applied to one or more values of interest, refers to a value that is similar to a stated reference value. In certain embodiments, the term “approximately” or “about” refers to a range of values that fall within 25%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context and except where such number would exceed 100% of a possible value.
Variant: As used herein, the terms “variant” refers to a specific variation of a specific SNP occurring in the genetic material of a population. In certain embodiments, a variant is a specific combination of a first allele of a first copy of an individual's genetic material (e.g. corresponding to an individual's paternal DNA) and a second allele of a second copy of an individual's genetic material (e.g. corresponding to an individual's maternal DNA), as occurs in diploid organisms (e.g. humans).
Qualifier: As used herein, the term “qualifier” refers to a classification (e.g. a label) of a particular variant of a given SNP. The qualifier associated with a given variant is the particular classification (e.g. label) of that variant. For example, a given variant may be associated with a particular qualifier of a predefined set of possible qualifiers. For example, a given variant may be associated with a qualifier selected from a group of labels such as “Adapt,” “Normal,” and “Gifted.” In certain embodiments, for a given variant of a given SNP, a qualifier corresponds to a classification of the given variant based on (i) the prevalence of the given variant within a population (e.g. if the variant is common, e.g. if the variant is rare) and/or (ii) a health-related phenotype associated with the variant. For example, a common variant may be associated with the qualifier “Normal”. A rare variant that confers a disadvantageous phenotype, such as a predisposition to high cholesterol, may be associated with the qualifier “Adapt” (e.g. classified as rare and disadvantageous). A rare variant that confers an advantageous phenotype, such as a predisposition to lower cholesterol, may be associated with the qualifier “Gifted” (e.g. accordingly, the variant is classified as rare and advantageous).
SNP object: As used herein, the term “SNP object” refers to a data structure corresponding to (e.g. that is used to represent) a specific single nucleotide polymorphism (SNP). In certain embodiments, a SNP object comprises a SNP reference that identifies the specific SNP to which the SNP object corresponds. The SNP reference may be an alphanumeric code such as an accepted name of the SNP or other identifying mark or label capable of being stored electronically. The SNP reference may be an alphanumeric code such as a National Center for Biotechnology Information (NCBI) database reference number. Gene object: As used herein, the term “gene object” refers to a data structure corresponding to (e.g. that is used to represent) a specific physical gene within a given genome (e.g. the human genome).
Category: As used herein, the term “category” refers to a data structure corresponding to (e.g. that is used to represent) a particular health-related trait or characteristic.
Product, Genetic Profile Product, Personal Genetic Profile Product: As used herein, the terms “product,” “genetic profile product,” and “personal genetic profile product,” refer to a data structure corresponding to (e.g. that is used to represent) a general class of health-related traits and/or characteristics. In certain embodiments a product is associated with one or more categories that correspond to health-related traits and characteristics related to the general class of health-related traits and characteristics to which the product corresponds.
User: As used herein, the term “user” refers to a person, company, or organization that uses a graphical user interface to create data structures. In certain embodiments, a user also genotypes a biological sample in response to an assessment corresponding to a product being purchased or made accessible to an individual.
Individual: As used herein, the term “individual” refers to a person who uses an assessment graphical user interface in order to view information about a genome. The individual may supply one or more biological samples to be genotyped in order for a personal genetic profile assessment to be formed. The individual may purchase or be given access to one or more products in order to view a personal genetic profile assessment.
Graphical Control Element: As used herein, the term “graphical control element” refers to an element of a graphical user interface element that may be used to provide user and/or individual input. A graphical control element may be a textbox, dropdown list, radio button, data field, checkbox, button (e.g., selectable icon), list box, or slider.
Associate, Associated with: As used herein, the terms “associate,” and “associated with,” as in a first data structure is associated with a second data structure, refer to a computer representation of an association between two data structures or data elements that is stored electronically (e.g. in computer memory).
Provide: As used herein, the term “provide”, as in “providing data”, refers to a process for passing data in between different software applications, modules, systems, and/or databases. In certain embodiments, providing data comprises the execution of instructions by a process to transfer data in between software applications, or in between different modules of the same software application. In certain embodiments a software application may provide data to another application in the form of a file. In certain embodiments an application may provide data to another application on the same processor. In certain embodiments standard protocols may be used to provide data to applications on different resources. In certain embodiments a module in a software application may provide data to another module by passing arguments to that module.
It is contemplated that systems, architectures, devices, methods, and processes of the claimed invention encompass variations and adaptations developed using information from the embodiments described herein. Adaptation and/or modification of the systems, architectures, devices, methods, and processes described herein may be performed, as contemplated by this description.
Throughout the description, where articles, devices, systems, and architectures are described as having, including, or comprising specific components, or where processes and methods are described as having, including, or comprising specific steps, it is contemplated that, additionally, there are articles, devices, systems, and architectures of the present invention that consist essentially of, or consist of, the recited components, and that there are processes and methods according to the present invention that consist essentially of, or consist of, the recited processing steps.
It should be understood that the order of steps or order for performing certain action is immaterial so long as the invention remains operable. Moreover, two or more steps or actions may be conducted simultaneously.
The mention herein of any publication, for example, in the Background section, is not an admission that the publication serves as prior art with respect to any of the claims presented herein. The Background section is presented for purposes of clarity and is not meant as a description of prior art with respect to any claim.
Documents are incorporated herein by reference as noted. Where there is any discrepancy in the meaning of a particular term, the meaning provided in the Definition section above is controlling.
The systems and methods described herein relate to a scalable framework that provides for and facilitates the creation of personal genetic profile products. A graphical user interface allows a user to create new data structures, such as products, categories, gene objects, and SNP objects, as well as establish associations between existing data structures. The framework for creating personal genetic profile products allows previously stored data structures to be linked to newly created data structures and newly added data structures to be linked to existing data structures in order to create new products or expand existing products without requiring repetitive addition of pre-existing data structures. Products may be updated or created in response to increased interest from individuals in new categories of biological characteristics and/or traits or as new associations between SNPs and/or genes and biological characteristics and traits are developed in research. The use of scalable frameworks simplifies the effort required to update and create new products.
Data corresponding to a plurality of genotyping measurements comprising measured SNPs and measured genes, derived from one or more biological samples supplied by an individual, can be associated with related SNP objects and gene objects of one or more products to form a personal genetic profile assessment. An individual may then purchase or be given access to particular products in order to ascertain more information about his/her individual genome and potential implications for various biological characteristics and/or traits. For example, a product that relates generally to food can show an individual whether he/she is at risk for severe food allergies, more likely to find the taste of certain foods bitter, or likely to be lactose intolerant. In this way, the product is a generic data structure generalized to cover all individuals that may purchase that product, wherein the product can show an individual aspects of his/her genome when it becomes associated with the individual's genetic sequencing data.
In some embodiments, an individual purchasing or being given access to a product prompts genotyping of the relevant genes/SNPs using the one or more biological samples supplied by the individual. In some embodiments, genotyping of genes/SNPs corresponding to the particular product purchased or made accessible can be completed by a user in approximately or under one day (e.g., 24 hours or within one business day). Such selective genotyping can allow a reduction in the expense of products to individuals as well as a reduction in the amount of stored genetic sequencing data. Given that genetic information is highly confidential, reduced genetic sequencing data storage can reduce liabilities and/or risks to the user and the individual. In some embodiments, the genotyping is performed using a PCR-based SNP genotyping assay.
Turning to
In certain embodiments, a first class of data structures, referred to herein as products, are used to represent different general classes of health-related traits and characteristics. In certain embodiments, each product has a name (e.g. a product data structure comprises a name (e.g. text data representing the name)) that provides a convenient, and memorable way to refer to the product. For example, a particular product 112 (e.g. named “FUEL”) is used to represent a class of traits corresponding to the way in which an individual's body processes different foods and nutrients. Another product 114 (e.g. named “AURA”) is used to represent a class of traits corresponding to skin health. Another product 116 (e.g. named “FITCODE”) is used to represent a class of traits corresponding to physical fitness. Another product 118 (e.g. named “SUPERHERO”) is used to represent a class of traits corresponding to physical and intellectual performance.
In certain embodiments, each product is in turn associated with one or more of a second class of data structures, referred to as categories. In certain embodiments, each category corresponds to a particular health-related trait or characteristic (e.g. food sensitivity, food breakdown, hunger and weight, vitamins, skin uv sensitivity, endurance, metabolism, joint health, muscle strength, intelligence). In certain embodiments, the categories with which a particular product is associated each correspond to different health-related traits or characteristics that are related to the general class of health-related traits or characteristics to which the particular product corresponds (e.g. the general class of health-related traits or characteristics that the product represents). As with products, in certain embodiments, each category has a name (e.g. a category data structure comprises a name (e.g. text data representing the name)) that provides a convenient, and memorable way to refer to the product.
In turn, each category is associated with one or more SNP objects, each SNP object corresponding to a specific SNP. Each SNP object associated with a particular category corresponds to a specific SNP that influences a specific health related phenotype that relates to the trait or characteristic to which the particular category corresponds. Each SNP object may identify the specific SNP to which it corresponds via a SNP reference that the SNP object comprises. The SNP reference may be an alphanumeric code such as an accepted name of the SNP or other identifying mark or label capable of being stored electronically. The SNP reference may be an alphanumeric code such as a National Center for Biotechnology Information (NCBI) database reference number.
For example, the schematic of
The “FUEL” product 112 is associated with categories such as “Food Sensitivity” 122, “Food Breakdown” 124, “Hunger and Weight” 126, and “Vitamins” 128. Several SNP objects corresponding to specific SNPs that influence phenotypes related to an individual's sensitivity to different types of foods, and, accordingly, are associated with the “Food Sensitivity” category 122 are shown. In
For example, SNP object 132 corresponds to the rs671 SNP, which influences the manner in which an individual processes alcohol. In particular, depending on the particular variant of the rs671 SNP that an individual has, the individual may process alcohol normally, or be impaired in their ability to process alcohol, and likely suffer from adverse effects resulting from alcohol consumption, such as flushing, headaches, fatigue, and sickness. Accordingly, providing an individual with knowledge of the particular variant of the rs671 SNP that they have may allow them to modify their behavior accordingly, for example, by being mindful of the amounts of alcohol that they consume (e.g. on a regular basis, e.g. in social settings).
Other SNP objects corresponding to SNPs that influence food sensitivity related phenotypes, and, accordingly, are associated with the “Food Sensitivity” category 222 are shown. For example, SNP object 144 corresponds to the rs762551 SNP that influences caffeine metabolism, SNP object 146 corresponds to the rs4988235 SNP that influences lactose intolerance, and SNP object 148 corresponds to the rs4988235 SNP that influences an aversion to the herb Cilantro (e.g. depending on the particular variant of this SNP that an individual has, they may either perceive Cilantro as pleasant tasting, or bitter and soap-like in taste).
In certain examples, multiple SNPs are associated with a particular phenotype and, accordingly, the SNP objects to which they correspond may be grouped together. For example, three SNPS—rs713598 (corresponding to SNP object 150a), rs10246939 (corresponding to SNP object 150b), and rs1726866 (corresponding to SNP object 150c), —influence the sensitivity of an individual to bitter tasting foods (e.g. cabbage, broccoli, cauliflower, kale, brussel sprouts, and collard greens), and, accordingly, their enjoyment of or aversion to such foods.
SNPs correspond to specific locations within or nearby (e.g. a SNP may occur in a promotor region that influences transcription of a particular gene, e.g. a SNP may occur within 5 kb upstream or downstream of a particular gene, e.g. a SNP may occur within 100 kb upstream or downstream of a particular gene, e.g. a SNP may occur within 500 kb upstream or downstream of a particular gene, e.g. a SNP may occur within 1 Mb upstream or downstream of a particular gene) genes in an individual's genetic material. Accordingly, in certain embodiments, as shown in
Other SNPs objects correspond to SNPs that are nearby particular genes of interest and thereby influence phenotypes associated expression of the gene. For example, rs12696304 is a SNP that lies 1.5 kb downstream from the TERC gene, and influences biological aging associated with the TERC gene. Accordingly, in one example, a SNP object corresponding to the rs12696304 SNP is associated a gene object corresponding to the TERC gene.
In certain embodiments, multiple SNPs of interest occur within a single gene. For example, the three SNPs related to bitter taste—rs713598, rs10246939, and rs1726866—occur within the TAS2R38 gene. Accordingly, SNP objects 150a, 150b, and 150c, which correspond to the rs713598, rs10246939, and rs1726866 SNPs, respectively, are all associated with a gene object 170 corresponding to the TAS2R38 gene.
In certain embodiments, different products correspond to different general classes of health-related traits and characteristics. For example, products may be based on particular organs (e.g. product 114, named “AURA”, is related to skin health), or particular habits, activities, or bodily functions. For example, food related biological characteristics and traits may be covered by a single products or a plurality of products. A single product or a plurality of products may be based on learning and brain function characteristics and traits. A single product or a plurality of products may be based on physical fitness (e.g., cardiovascular strength, agility, flexibility, muscular strength).
For example, as shown in
In certain embodiments, a particular SNP object is associated with two or more categories. For example, the rs17782313 SNP, occurring in the FTO gene, influences an individual's appetite. Accordingly, as shown in
For example, the SNP object 154 corresponding to the rs1800795 SNP of the IL-6 gene (accordingly, SNP object 154 is associated with gene object 174, which corresponds to the IL-6 gene) is associated with the “Exercise Recovery” category 134 and the “Power Performance” category 136, both of which are associated with the “FITCODE” product 116. In addition, in certain embodiments, a category is associated with two or more products. For example, the “Power Performance” category 136 is associated with the “FITCODE” product 116, as well as the “SUPERHERO” product 118, which provides an assessment of a general class of traits related to physical and intellectual performance.
Thus, by providing a framework comprising a hierarchical organization of data structures corresponding to products, categories, SNP objects, and gene objects, the systems, methods, and architectures described herein provide an intuitive and flexible approach to storing, updating, and creating new associations between different classes of health-related traits and characteristics, and the underlying genetic variations corresponding to different specific SNPs that influence them.
In certain embodiments the hierarchical organization of product, category, SNP object and gene object data structures serves as a flexible template that facilitates both the rapid creation of individual personal genetic profile assessments from genotyping measurements taken from a plurality of individuals, and the presentation of an individual's personal genetic profile assessment. In particular, an individual may purchase assessments corresponding to different products, in order to gain insight into the manner in which their personal genome influences the different general classes of health-related traits and characteristics to which each different product corresponds. Accordingly, an individual's personal genetic profile assessment corresponding to one or more products comprises, for each specific SNP associated with each category that is associated with each of the one or more products, an identification of the particular variant of the specific SNP that the individual has. Typically, the identification is obtained via one or more genotyping measurements performed on a biological sample taken from the individual (e.g. a blood sample, e.g. a cheek swab sample, e.g. a saliva sample).
In certain embodiments, an individual may purchase a first assessment corresponding to a first product, and provide a biological sample for genotyping. The individual's biological sample may be stored (e.g. cryogenically frozen). After a period of time, the individual may choose to purchase additional assessments corresponding to other products, and the individual's previously stored biological sample may be taken from storage for additional genotyping measurements of the additional SNPs that are associated with the new products. Moreover, in certain embodiments, additional new products may be created over time, and new assessments corresponding to new products offered to and purchased by individuals. In certain embodiments, as new information related to the influence of new and/or existing SNPs on different specific health related phenotypes is elucidated, new SNP objects and gene objects may be created, and new associations between them and new or existing categories and/or products established. In certain embodiments, existing personal genetic profile assessments of individuals are automatically updated to reflect new information.
In certain embodiments, in order to facilitate the creation and presentation of individual personal genetic profile assessments (e.g. corresponding to one or more different products) based on the framework described above, the product, category, SNP object, and gene object data structures described herein store a variety of information.
Referring now to
Gene object 230a is also associated with additional information 244. Additional information 244 may comprise one or more data structures comprising information such as a unique gene identifier that corresponds gene object 230a to a specific physical gene and descriptive information about the corresponding gene. The gene identifier may be an alphanumeric code such as an accepted name of the gene or other identifying mark or label capable of being stored electronically. Additional information may be stored as a single data structure or a plurality of data structures.
SNP object 242b is associated with three variants 252a-c, SNP reference 250, and additional information 254. SNP reference 250 is a unique identifier of the SNP that corresponds the SNP object to a specific physical SNP. The SNP reference may be an alphanumeric code such as an accepted name of the gene or other identifying mark or label capable of being stored electronically. The SNP reference may be an alphanumeric code such as a National Center for Biotechnology Information (NCBI) database reference number. Additional information 254 may comprise one or more data structures with other descriptive information about the corresponding SNP.
Variants 252a-c correspond to specific alleles of the SNP that corresponds to SNP object 242b. Variant 252a is associated with measurement outcome 260, qualifier 262, and additional information 264. Measurement outcome 260 is an identifier, such as an alphanumeric code, that identifies the specific alleles corresponding to the variant 252a. Qualifier 262 is an identifier, such as an alphanumeric code, that identifies a classification of variant 252a, wherein the classification may be based on the prevalence of the variant within a population, a health-related phenotype associated with the variant, and/or other relevant classification bases. Additional information 264 comprises a description of the variant. For example, the additional information comprises a description of the specific health-related phenotype that an individual with variant 252a exhibits or an explanation of the prevalence of variant 252a.
In certain embodiments, an individual views their genomic information using an assessment graphical user interface (assessment GUI) that is populated using one or more products (e.g., one or more hierarchies of data structures, such as the exemplary hierarchy of
Referring now to
By selecting the “FUEL” product from the LifeProfile listing of
Selecting the Food Sensitivity category 312a brings the individual to a view of the assessment GUI shown in
The short description of the SNP 316 characterizes the biological characteristic or trait influenced by the corresponding SNP in the individual's genome. For example, the short description of the SNP 316 is “Alcohol Tolerance.” The individual would see that selecting the first selectable control element in the list would provide the individual with information about how the individual's genome influence his/her tolerance for alcohol consumption. The individual may select a particular selectable control element to view detailed information based on the short description of a SNP and/or the qualifier of the variant corresponding to his/her particular alleles of the SNP (as displayed by the graphical representation of the qualifier).
The graphical representation of the qualifier 324a is a graphic showing each of the qualifiers associated with the three variants corresponding to the SNP with the qualifier of the particular variant corresponding to the individual's alleles highlighted. Qualifiers may be words or short phrases that characterize the variant. For example, “adapt” may be used to characterize variants that are uncommon and/or disadvantageous; “normal” may be used to characterize variants that are common and/or neither advantageous nor disadvantage; and “gifted” may be used to characterize variants that are uncommon and/or advantageous. The graphical representation of the qualifier 324a highlights the qualifier associated with the variant corresponding to the individual's alleles in red. Different colors may be used to highlight different qualifiers in a graphical representation of a qualifier. For example, in
Some genes have multiple related SNPs. The related SNPs may influence a single biological characteristic or trait or a plurality of biological characteristics and/or traits. Each SNP may correspond to a unique selectable control element in the assessment GUI. For example, the graphical representation of the gene identifier 318 appears in two separate selectable control elements shown
Selecting the first selectable control element identified by short description 316 (“Alcohol Tolerance”) brings the individual to a view of the assessment GUI shown in
Referring now to
The assessment GUI shown in
Snapshots of an exemplary graphical user interface element used in process 400 are shown in
After selecting graphical control element 418, the user views the graphical control element state of
The graphical control elements below graphical control element 428 are used to input data associated with variants that will be associated with the SNP object. The user inputs variant identifiers to be associated with the new variants using graphical control elements 430a-b. The user inputs qualifiers to be associated with the new variants using graphical control elements 432a-b. The user inputs descriptions to be associated with the new variants using graphical control elements 434a-b. The user inputs short comments to be associated with the new variants using graphical control elements 436a-b. A graphical control element that when selected provides a further set of graphical control elements for inputting data associated with an additional variation (not shown) may additionally be provided. In certain embodiments, a SNP object has exactly three associated variants.
A graphical control element may be provided that when selected provides graphical control elements for created additional new SNP objects with associated input data (not shown in
A snapshot of an exemplary graphical user interface element used in method 500 is shown in
Graphical control element 520 provides the user two dropdown lists for selecting a SNP object to be associated with the category being created by selecting from a list of all gene objects and a list of all SNP objects. In some embodiments, the dropdown list of SNP objects only displays SNP objects associated with the gene object selected in the gene object dropdown list. The gene object and SNP object dropdown lists may display the gene identifiers and SNP references, respectively, corresponding to stored genes and SNPs in order to allow the user to select the desired data structure. In some embodiments, if the user only selects from the gene object dropdown list, all SNP objects associated with that gene object will be associated with the category being created. Only one SNP object is selected in graphical control element 520. Graphical control element 518 may be selected to provide another set of dropdown lists for adding an additional SNP object (e.g., a new copy of graphical control element 520).
Graphical control element 514 may be selected or unselected to toggle the visibility of the category being created, for example, to allow a user to develop new or existing products without interfering with the use experience of individuals viewing related interfaces. The user can save the input information using graphical control element 516.
A snapshot of an exemplary graphical user interface element used in process 600 is shown in
Processes 400, 500, and 600 may be performed using a single graphical user interface element or a set of graphical user interface elements. In some embodiments, new data structures (e.g., gene objects, SNP objects, categories, products) may be created using a minimally necessary set of user inputs (e.g., input from only a set of required graphical control elements). For example, a user may wish to create a new category without associating any gene objects or SNP objects. This may require entering a category name, background image, and icon with all other input being optional.
A user may wish to examine or otherwise review data associated with various data structures without having to navigate between different views of a graphical user interface in order to see relevant data. A data graphical user interface (data GUI) may be used to display all data associated with a particular data structure (e.g., object). Such a data GUI allows a user to quickly and efficiently view large amounts of data. In certain embodiments, the data GUI is configured to allow a user to select from all stored gene objects. Upon selection of a particular gene object, the data GUI displays data associated with each SNP object of the gene object, including information related to each of the variants (e.g. measurement outcomes, e.g. qualifiers) of each SNP object associated with the gene object.
A snapshot of an exemplary data GUI is shown in
In order to populate an assessment GUI to provide to an individual, genotyping data must be added to the individual's personal genetic profile assessment.
The genotyping data in
As shown in
The cloud computing environment 1000 may include a resource manager 1006. The resource manager 1006 may be connected to the resource providers 1002 and the computing devices 1004 over the computer network 1008. In some implementations, the resource manager 1006 may facilitate the provision of computing resources by one or more resource providers 1002 to one or more computing devices 1004. The resource manager 1006 may receive a request for a computing resource from a particular computing device 1004. The resource manager 1006 may identify one or more resource providers 1002 capable of providing the computing resource requested by the computing device 1004. The resource manager 1006 may select a resource provider 1002 to provide the computing resource. The resource manager 1006 may facilitate a connection between the resource provider 1002 and a particular computing device 1004. In some implementations, the resource manager 1006 may establish a connection between a particular resource provider 1002 and a particular computing device 1004. In some implementations, the resource manager 1006 may redirect a particular computing device 1004 to a particular resource provider 1002 with the requested computing resource.
The computing device 1100 includes a processor 1102, a memory 1104, a storage device 1106, a high-speed interface 1108 connecting to the memory 1104 and multiple high-speed expansion ports 1110, and a low-speed interface 1112 connecting to a low-speed expansion port 1114 and the storage device 1106. Each of the processor 1102, the memory 1104, the storage device 1106, the high-speed interface 1108, the high-speed expansion ports 1110, and the low-speed interface 1112, are interconnected using various busses, and may be mounted on a common motherboard or in other manners as appropriate. The processor 1102 can process instructions for execution within the computing device 1100, including instructions stored in the memory 1104 or on the storage device 1106 to display graphical information for a GUI on an external input/output device, such as a display 1116 coupled to the high-speed interface 1108. In other implementations, multiple processors and/or multiple buses may be used, as appropriate, along with multiple memories and types of memory. Also, multiple computing devices may be connected, with each device providing portions of the necessary operations (e.g., as a server bank, a group of blade servers, or a multi-processor system).
The memory 1104 stores information within the computing device 1100. In some implementations, the memory 1104 is a volatile memory unit or units. In some implementations, the memory 1104 is a non-volatile memory unit or units. The memory 1104 may also be another form of computer-readable medium, such as a magnetic or optical disk.
The storage device 1106 is capable of providing mass storage for the computing device 1100. In some implementations, the storage device 1106 may be or contain a computer-readable medium, such as a floppy disk device, a hard disk device, an optical disk device, or a tape device, a flash memory or other similar solid state memory device, or an array of devices, including devices in a storage area network or other configurations. Instructions can be stored in an information carrier. The instructions, when executed by one or more processing devices (for example, processor 1102), perform one or more methods, such as those described above. The instructions can also be stored by one or more storage devices such as computer- or machine-readable mediums (for example, the memory 1104, the storage device 1106, or memory on the processor 1102).
The high-speed interface 1108 manages bandwidth-intensive operations for the computing device 1100, while the low-speed interface 1112 manages lower bandwidth-intensive operations. Such allocation of functions is an example only. In some implementations, the high-speed interface 1108 is coupled to the memory 1104, the display 1116 (e.g., through a graphics processor or accelerator), and to the high-speed expansion ports 1110, which may accept various expansion cards (not shown). In the implementation, the low-speed interface 1112 is coupled to the storage device 1106 and the low-speed expansion port 1114. The low-speed expansion port 1114, which may include various communication ports (e.g., USB, Bluetooth®, Ethernet, wireless Ethernet) may be coupled to one or more input/output devices, such as a keyboard, a pointing device, a scanner, or a networking device such as a switch or router, e.g., through a network adapter.
The computing device 1100 may be implemented in a number of different forms, as shown in the figure. For example, it may be implemented as a standard server 1120, or multiple times in a group of such servers. In addition, it may be implemented in a personal computer such as a laptop computer 1122. It may also be implemented as part of a rack server system 1124. Alternatively, components from the computing device 1100 may be combined with other components in a mobile device (not shown), such as a mobile computing device 1150. Each of such devices may contain one or more of the computing device 1100 and the mobile computing device 1150, and an entire system may be made up of multiple computing devices communicating with each other.
The mobile computing device 1150 includes a processor 1152, a memory 1164, an input/output device such as a display 1154, a communication interface 1166, and a transceiver 1168, among other components. The mobile computing device 1150 may also be provided with a storage device, such as a micro-drive or other device, to provide additional storage. Each of the processor 1152, the memory 1164, the display 1154, the communication interface 1166, and the transceiver 1168, are interconnected using various buses, and several of the components may be mounted on a common motherboard or in other manners as appropriate.
The processor 1152 can execute instructions within the mobile computing device 1150, including instructions stored in the memory 1164. The processor 1152 may be implemented as a chipset of chips that include separate and multiple analog and digital processors. The processor 1152 may provide, for example, for coordination of the other components of the mobile computing device 1150, such as control of user interfaces, applications run by the mobile computing device 1150, and wireless communication by the mobile computing device 1150.
The processor 1152 may communicate with a user through a control interface 1158 and a display interface 1156 coupled to the display 1154. The display 1154 may be, for example, a TFT (Thin-Film-Transistor Liquid Crystal Display) display or an OLED (Organic Light Emitting Diode) display, or other appropriate display technology. The display interface 1156 may comprise appropriate circuitry for driving the display 1154 to present graphical and other information to a user. The control interface 1158 may receive commands from a user and convert them for submission to the processor 1152. In addition, an external interface 1162 may provide communication with the processor 1152, so as to enable near area communication of the mobile computing device 1150 with other devices. The external interface 1162 may provide, for example, for wired communication in some implementations, or for wireless communication in other implementations, and multiple interfaces may also be used.
The memory 1164 stores information within the mobile computing device 1150. The memory 1164 can be implemented as one or more of a computer-readable medium or media, a volatile memory unit or units, or a non-volatile memory unit or units. An expansion memory 1174 may also be provided and connected to the mobile computing device 1150 through an expansion interface 1172, which may include, for example, a SIMM (Single In Line Memory Module) card interface. The expansion memory 1174 may provide extra storage space for the mobile computing device 1150, or may also store applications or other information for the mobile computing device 1150. Specifically, the expansion memory 1174 may include instructions to carry out or supplement the processes described above, and may include secure information also. Thus, for example, the expansion memory 1174 may be provided as a security module for the mobile computing device 1150, and may be programmed with instructions that permit secure use of the mobile computing device 1150. In addition, secure applications may be provided via the SIMM cards, along with additional information, such as placing identifying information on the SIMM card in a non-hackable manner.
The memory may include, for example, flash memory and/or NVRAM memory (non-volatile random access memory), as discussed below. In some implementations, instructions are stored in an information carrier and, when executed by one or more processing devices (for example, processor 1152), perform one or more methods, such as those described above. The instructions can also be stored by one or more storage devices, such as one or more computer- or machine-readable mediums (for example, the memory 1164, the expansion memory 1174, or memory on the processor 1152). In some implementations, the instructions can be received in a propagated signal, for example, over the transceiver 1168 or the external interface 1162.
The mobile computing device 1150 may communicate wirelessly through the communication interface 1166, which may include digital signal processing circuitry where necessary. The communication interface 1166 may provide for communications under various modes or protocols, such as GSM voice calls (Global System for Mobile communications), SMS (Short Message Service), EMS (Enhanced Messaging Service), or MMS messaging (Multimedia Messaging Service), CDMA (code division multiple access), TDMA (time division multiple access), PDC (Personal Digital Cellular), WCDMA (Wideband Code Division Multiple Access), CDMA2000, or GPRS (General Packet Radio Service), among others. Such communication may occur, for example, through the transceiver 1168 using a radio-frequency. In addition, short-range communication may occur, such as using a Bluetooth®, Wi-Fi™, or other such transceiver (not shown). In addition, a GPS (Global Positioning System) receiver module 1170 may provide additional navigation- and location-related wireless data to the mobile computing device 1150, which may be used as appropriate by applications running on the mobile computing device 1150. The mobile computing device 1150 may also communicate audibly using an audio codec 1160, which may receive spoken information from a user and convert it to usable digital information. The audio codec 1160 may likewise generate audible sound for a user, such as through a speaker, e.g., in a handset of the mobile computing device 1150. Such sound may include sound from voice telephone calls, may include recorded sound (e.g., voice messages, music files, etc.) and may also include sound generated by applications operating on the mobile computing device 1150.
The mobile computing device 1150 may be implemented in a number of different forms, as shown in the figure. For example, it may be implemented as a cellular telephone 1180. It may also be implemented as part of a smart-phone 1182, personal digital assistant, or other similar mobile device.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, specially designed ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various implementations can include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, coupled to receive data and instructions from, and to transmit data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software, software applications or code) include machine instructions for a programmable processor, and can be implemented in a high-level procedural and/or object-oriented programming language, and/or in assembly/machine language. As used herein, the terms machine-readable medium and computer-readable medium refer to any computer program product, apparatus and/or device (e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term machine-readable signal refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user and a keyboard and a pointing device (e.g., a mouse or a trackball) by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user can be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front end component (e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back end, middleware, or front end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include a local area network (LAN), a wide area network (WAN), and the Internet.
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
In some implementations, the data structures (e.g. products, categories, SNP objects, gene objects) described herein can be separated, combined or incorporated into single or combined data structures. The data structures depicted in the figures are not intended to limit the systems described herein to the data structure frameworks shown therein.
Elements of different implementations described herein may be combined to form other implementations not specifically set forth above. Elements may be left out of the processes, computer programs, databases, etc. described herein without adversely affecting their operation. In addition, the logic flows depicted in the figures do not require the particular order shown, or sequential order, to achieve desirable results. Various separate elements may be combined into one or more individual elements to perform the functions described herein. In view of the structure, functions and apparatus of the systems and methods described here, in some implementations.
Throughout the description, where apparatus and systems are described as having, including, or comprising specific components, or where processes and methods are described as having, including, or comprising specific steps, it is contemplated that, additionally, there are apparatus, and systems of the present invention that consist essentially of, or consist of, the recited components, and that there are processes and methods according to the present invention that consist essentially of, or consist of, the recited processing steps.
It should be understood that the order of steps or order for performing certain action is immaterial so long as the invention remains operable. Moreover, two or more steps or actions may be conducted simultaneously.
While the invention has been particularly shown and described with reference to specific preferred embodiments, it should be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
This application claims the benefit of U.S. Provisional Application No. 62/436,947, filed on Dec. 20, 2016, the content of which is hereby incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
62436947 | Dec 2016 | US |