Aspects of the subject technology relate to computational biology, genetics, and clinical diagnostics.
Medical sequencing is an approach to discovery of genetic causes of complex disorders. Sequencing of a genome or portion thereof of individuals affected by a disease or with a trait of interest may be performed to determine the cause of common, complex traits.
Exome sequencing is a strategy by which the coding regions of a genome are selectively sequenced as an alternative to whole genome sequencing. The exome represents an enriched portion of the genome that can be used to search for variants with large effect sizes.
By sequencing the coding region, exome sequencing has the potential to be clinically relevant in genetic diagnosis due to current understanding of functional consequences in sequence variation. The functional variation that is responsible for both Mendelian and common diseases may be identified with high coverage in sequence depth.
Embodiments of computer-implemented bioinformatics programs and related methods are described herein. One such bioinformatics program annotates human genetic variants by integrating multiple sources of information. According to some embodiments, the bioinformatics program rapidly filters variants which do not play a role in the etiology of particular diseases. This filtering may be performed based on annotations or based on a family history inheritance model analysis in order to assist scientists and molecular diagnosticians to classify human variants and ultimately identify the underlying mutation leading to patients' genetic disease.
The subject technology is illustrated, for example, according to various aspects described below. Various examples of aspects of the subject technology are described as numbered clauses (1, 2, 3, etc.) for convenience. These are provided as examples and do not limit the subject technology. It is noted that any of the dependent clauses may be combined in any combination, or placed into any independent clause, e.g., clause 1 or clause 55. The other clauses can be presented in a similar manner.
1. A computer-implemented method of diagnosing a genetic influence for a condition in a proband, comprising:
2. The method of clause 1, further comprising determining a treatment plan based on the genetic influence.
3. The method of clause 1, wherein the identifying a genetic influence comprises estimating a probability that one of more of the remaining variants significantly influences at least one of clinical signs or symptoms of the proband.
4. The method of clause 1, wherein the removing variants not compatible with the Mendelian inheritance model comprises comparing a genotype from the proband to a genotype from at least one family member.
5. The method of clause 1, wherein the removing variants not compatible with the Mendelian inheritance model further comprises removing a heterozygous variant of the proband that exists in at least one unaffected family member as a homozygous variant.
6. The method of clause 1, wherein the Mendelian inheritance model is a dominant model.
7. The method of clause 6, wherein the removing variants not compatible with the Mendelian inheritance model further comprises removing a heterozygous variant of the proband if it exists in at least one unaffected family member or does not exist in at least one affected family member.
8. The method of clause 6, wherein the removing variants that are present in unaffected controls comprises removing a candidate heterozygous variant that presents in at least one unaffected control as either a heterozygous or homozygous variant.
9. The method of clause 1, wherein the Mendelian inheritance model is a recessive model.
10. The method of clause 9, wherein the removing variants not compatible with the Mendelian inheritance model comprises removing a candidate homozygous variant that at least one of (a) presents in at least one unaffected family member or unaffected control as a homozygous variant or (b) does not present in at least one affected family member.
11. The method of clause 9, wherein the removing variants that are present in unaffected controls comprises removing a candidate pair of compound heterozygous variants that present in at least one unaffected control.
12. The method of clause 1, wherein the Mendelian inheritance model is a sex-linked recessive model.
13. The method of clause 12, wherein the removing variants not compatible with the Mendelian inheritance model comprises removing a candidate variant that at least one of (a) presents in at least one unaffected male family member or male unaffected control as a hemizygous variant, (b) presents in at least one unaffected female family member or female unaffected control as a homozygous variant, or (c) does not present in at least one affected family member.
14. The method of clause 1, wherein the list is an index list, and further comprising forming the index list, from a master list of more genetic variants than are in the index list, by the following steps:
15. The method of clause 1, wherein the list is an index list, and further comprising forming the index list, from a master list of more genetic variants than are in the index list, by the following steps:
16. The method of clause 15, wherein common variants comprise single nucleotide polymorphisms (SNPs), deletions, insertions, and indels.
17. A computer implementation system for diagnosing a genetic influence for a condition in a proband, comprising:
18. The computer implementation system of clause 17, further comprising a determining module that, by a processor, determines a treatment plan based on the genetic influence.
19. The method of clause 1, wherein the identifying module is configured to identify genetic influence by estimating a probability that one of more of the remaining variants significantly influences at least one of clinical signs or symptoms of the proband.
20. The computer implementation system of clause 17, wherein the input comprises a selection between a recessive model of Mendelian inheritance and a dominant model of Mendelian inheritance.
21. The computer implementation system of clause 17, wherein the input comprises a selection between an autosomal model of Mendelian inheritance, an X-linked model of Mendelian inheritance, and a Y-linked model of Mendelian inheritance.
22. The computer implementation system of clause 17, wherein the input comprises a selection of whether to allow de novo mutations.
23. The computer implementation system of clause 17, wherein the list is an index list, the computer implementation system further comprising a forming module that, by a processor, forms the index list, from a master list of more genetic variants than are in the index list.
24. The computer implementation system of clause 23, wherein the forming module, by a processor:
25. A machine-readable medium comprising machine-readable instructions for causing a processor to execute a method comprising:
26. The machine-readable medium of clause 25, wherein the list is an index list, and wherein the method further comprises forming the index list, from a master list of more genetic variants than are in the index list.
27. The machine-readable medium of clause 26, wherein forming the index list comprises:
28. The machine-readable medium of clause 26, wherein forming the index list comprises:
29. The machine-readable medium of clause 25, wherein the input comprises:
Additional features and advantages of the subject technology will be set forth in the description below, and in part will be apparent from the description, or may be learned by practice of the subject technology. The advantages of the subject technology will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the subject technology as claimed.
The accompanying drawings, which are included to provide further understanding of the subject technology and are incorporated in and constitute a part of this specification, illustrate aspects of the subject technology and together with the description serve to explain the principles of the subject technology.
In a traditional genetic diagnosis, physicians base the selection of individual genes for testing on their examination of the patient and conduct genetic tests upon one or a few specific genes at a time. Pinpointing relevant genes based on a patient's clinical diagnosis has proven to be difficult due to a number of factors including lack of known disease genes associated with less common phenotypes, phenotypic variability, genetic heterogeneity, joint contribution of multiple genes to complex phenotypes, pleiotropy, variable penetrance, among many others.
In contrast, whole exome testing using sequencing is a much broader test targeting the exons of nearly all the genes in the human genome (over 20,000). A large number of genetic diseases are caused by mutations located in the exons, which are the regions of genes that code for proteins.
The exome is the part of the genome formed by exons, the coding portions of genes that are expressed. Providing the genetic blueprint used in the synthesis of proteins and other functional gene products, the exome is the most functionally relevant part of the genome, and, therefore, the most likely to contribute to the phenotype of an organism. The exome of the human genome consists of roughly 180,000 exons constituting about 1% of the total genome, or about 30 megabases of DNA. Though comprising a very small fraction of the genome, mutations in the exome are thought to harbor 85% of disease-causing mutations. Thus, exome sequencing is an efficient strategy to determine the genetic basis of many Mendelian or single gene disorders.
A robust approach to sequencing the complete coding region (exome) has the potential to be clinically relevant in genetic diagnosis due to current understanding of functional consequences in sequence variation. One goal of this approach is to identify the functional variation that is responsible for Mendelian diseases, such as Miller syndrome and hereditary intellectual disability, without the high costs associated with whole-genome sequencing, while maintaining high coverage in sequence depth.
Exome sequencing has the potential to locate causative genes in complex diseases, which previously has not been possible due to limitations in traditional methods. Targeted capture and massively parallel sequencing represents a cost-effective, reproducible, and robust strategy with high sensitivity and specificity to detect variants causing protein-coding changes in individual human genomes.
There are multiple technologies available to undertake methods to identify causal genetic variants associated with disease. Each technology has its own technical, financial, and throughput limitations. Microarrays, for example, require hybridization probes of known sequence and are therefore limited by probe design and thus prevent the identification of genetic changes that can be detected. Massively parallel sequencing technologies used for exome sequencing, on the other hand, make it now possible to identify the cause of many unknown diseases by screening thousands of loci at once. This technology addresses the present limitations of hybridization genotyping arrays and classical sequencing.
Exome sequencing has become increasingly practical with the falling cost and increased throughput of whole genome sequencing. Even by only sequencing the exomes of individuals, a large quantity of data and sequence information is generated which requires a significant amount of data analysis. Challenges associated with the analysis of this data include changes in programs used to align and assemble sequence reads. Various sequence technologies also have different error rates and generate various read-lengths which can pose challenges in comparing results from different sequencing platforms.
Individual whole exome sequencing can generate over 10,000-100,000 human variants. Disclosed herein are solutions for interpreting the vast number of variants and distinguishing the causative mutation(s) from those which do not play a role in the disease etiology of interest. While the present disclosure may be applied to analysis of exome sequencing results, the techings provided herein may be applied to one or more whole entire genomes or portions thereof.
Common complex diseases can have heterogeneous descriptions based on informal assembly of component phenotypes into the disease description. Given this heterogeneity of the features that can be ascribed to a disease, and because the principles of this model are not limited to “diseases” as that term is used in the art, the disclosed model and methods can be used in connection with “traits.” The term trait is intended to encompass observed features that may or may not constitute or be a component of an identified disease. Such traits can be medically relevant and can be associated with elements just as diseases can.
The disclosed models, and disclosed methods based on the models, can be used to generate valuable and useful information. At a basic level, identification of elements (such as genetic variants) that are associated with a trait (such as a disease or phenotype) provides greater understanding of traits, diseases, and phenotypes. Thus, the disclosed models and methods can be used as research tools. At another level, the elements associated with traits through use of the disclosed model and methods are significant targets for, for example, drug identification and/or design, therapy identification and/or design, subject and patient identification, diagnosis, prognosis as they relate to the trait. The disclosed models and methods will identify elements associated with traits that are more significant or more likely to be significant to the genesis, maintenance, severity, and/or amelioration of the trait. The display, output, cataloging, addition to databases, and the like of elements associated with traits and the association of elements to traits provides useful tools and information to those identifying, designing and validating drugs, therapies, diagnostic methods, prognostic methods in relation to traits.
According to some embodiments, various steps of a program 100 may be performed to annotate or filter variants in the process of narrowing a broad set of variants. According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, the annotation phase 120 is performed prior to the filtering phase 130. The filtering phase 130 may be informed by the annotation provided in the annotation phase 120. For example, comparison or analysis performed during the filtering phase 130 may make reference to one or more annotations provided in the annotation phase 120.
According to some embodiments, step 121 is performed prior to steps 122, 123, 124, and 125. Steps 122, 123, 124, and 125 can be performed in any order. During the annotation phase 120, an index list may be formed from a master list by implementing one or more of steps 121-125.
According to some embodiments, steps 131, 132, 133, and 134 are performed prior to step 135. For example, variants may be removed from a master list based on characteristics of each variant without respect to a particular proband. Performance of the steps of the filtering phase 130 in this manner reduces the number of variants that are to be filtered based on one or more inputs with respect to a particular proband. This order of steps may improve efficiency, cost, and speed of analysis by not requiring that every variant by filtered with respect to a particular proband. According to some embodiments, step 135 is performed prior to step 136. Steps 131, 132, 133, and 134 can be performed in any order. During the filtering phase 130, an index list may be formed from a master list by implementing one or more of steps 131-136.
According to some embodiments, the annotation phase 120 may be performed prior to or after the filtering phase 130. According to some embodiments, any number (e.g., all or less than all) of the steps 121-125 of annotation phase 120 may be performed. For example, at least one of steps 121-125 may be performed. By further example, at least two of steps 121-125 may be performed. The steps of the annotation phase 120 may be performed in any order. According to some embodiments, any number (e.g., all or less than all) of the steps 131-136 of filtering phase 130 may be performed. For example, at least one of steps 131-136 may be performed. By further example, at least two of steps 131-136 may be performed. The steps of the filtering phase 130 may be performed in any order.
According to some embodiments, as shown in step 121 of
According to some embodiments, as shown in step 122 of
According to some embodiments, as shown in step 123 of
According to some embodiments, as shown in step 124 of
According to some embodiments, as shown in step 125 of
According to aspects of the present disclosure, provided is a filtering phase 130 to reliably remove irrelevant variants. Any number (e.g., all or less than all) of the filtering steps disclosed herein may be performed. The steps of the filtering phase 130 may be performed in any order.
According to some embodiments, as shown in step 131 of
According to some embodiments, a variant may be evaluated to determine whether it has a sufficiently significant frequency. As used herein, “frequency” means the proportion of individuals in a population with a given genotype. For example, frequency may be calculated as the number of individuals in a population with a given genotype divided by the total number of individuals in the population. Frequency may be expressed as a ratio or percentage.
According to some embodiments, a variant may be evaluated to determine whether it has been observed in a minimum number of individuals among the general population (i.e. number of occurrences). As used herein, “occurrence” means a number of individuals in a population with a given genotype. For example, occurrence may be numerated as an integer number of individuals in a population with a given genotype. Qualification of a variant based on occurrence may avoid problems introduced by occurrences that are caused by errors. For example, a single computational error may incorrectly contribute to the frequency of a variant. Where the population is small such a single error may significantly impact the frequency calculation. By requiring the occurrence to be at least one (for example) or greater, the impact of such a single computational error may be avoided. By requiring satisfaction of both a frequency threshold and an occurrence threshold, variants may be more accurately and properly filtered.
According to some embodiments, each evaluation (e.g., frequency and occurrence) may be made as a comparison to a predetermined or user-selected threshold. Separate thresholds may be applied, for example, based on an inheritance model to be applied. For example, a default minimum frequency and minimum number of occurrences are 1% and 5 times respectively for the recessive model, and 0.1% and 3 respectively for the dominant model. The thresholds can be modified by a user based on an input by the user.
According to some embodiments, as shown in step 132 of
According to some embodiments, as shown in step 133 of
According to some embodiments, as shown in step 134 of
According to some embodiments, as shown in step 135 of
According to some embodiments, as shown in step 136 of
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in
In the following detailed description, numerous specific details are set forth to provide a comprehensive understanding of the subject technology. It will be apparent to one of ordinary skill in the art that the subject technology may be practiced without some of these specific details. In other instances, well-known structures and techniques have not been shown in detail so as not to obscure the subject technology.
The various aspects of the present invention mentioned above, as well as many other aspects of the invention, are described in greater details below.
According to some embodiments, as shown in
According to some embodiments, a variant report is constructed as a union for all samples instead of constructing them separately. Because not every sample would have a variant call on any particular position, the coverage information at this position would be missing in the individual variant report for these samples. Because not every sample would have a variant call passing the coverage and quality score threshold, the variant would be missing in the individual variant report, and will be considered as a wild-type mistakenly.
According to some embodiments, the program may annotate variants according to the genomic coordinates. The annotation starts with locating the position onto gene transcript regions and provides the detail description of the variant type and changes on both DNA and protein level. Once the annotation of relevant gene transcript is obtained, it will be used to search against public databases to obtain the relevant population frequency and disease-related information. The detail annotation algorithm is described in the following section.
The process of retrieving the relevant gene transcript information involves three steps to determine if a variant is within (1) intergenic region, (2) noncoding regions of gene transcripts, or (3) coding DNA sequence (CDS) regions. Each step is followed by the corresponding annotation for the position of gene transcription and translation if it applies. Besides the position annotation, types of variants are also provided. According to some embodiments, as shown in
According to some embodiments, as shown in step 510 of
According to some embodiments, once a list of gene transcript(s) within which the variant is located is obtained, the second checkup step can be performed. The program may provide DNA-level description of a variant according to its location on each gene transcript, including intronic, 5′-untranslated (5′ UTR), and 3′-untranslated regions (3′ UTR). For each gene transcript on the list, the mRNA and CDS structures are retrieved by searching the transcript RefSeq ID against the entire gene transcript public database.
According to some embodiments, the mRNA structure of each gene transcript comprises a list of chromosomal segments. Each segment represents an exonic region, and its location is indicated by a pair of genomic coordinates for mRNA transcription starting and ending position, respectively. The CDS structures are described with the same fashion as mRNA structures by listing all pairs of protein translation starting and ending position.
According to some embodiments, given the chromosomal coordinate of the variant and the mRNA structure of each gene transcript, the program can determine if the variant is located on either intronic or exonic region. This is done by scanning all pairs of transcription starting and ending coordinates and then checking if the variant position is in between the starting and ending coordinates. If the variant position is equal to or greater than the starting position and is equal to or less than the ending position, this variant is located in the exonic region. Otherwise, it is located in the intronic region.
According to some embodiments, the program can determine if the variant is located in a protein coding region or non-coding region by comparing the variant coordinate to CDS structure coordinates. There are three categories of non-coding region: (i) if a variant coordinate is less than the starting position of the 1st pair coordinates of a CDS structure, it is located in the 5′ UTR (see step 530). (ii) If a variant coordinate is greater than the ending position of the last coordinate pair of a CDS structure, it is located in the 3′ UTR (see step 560). (iii) If a variant is not classified as either one of the UTR regions, it is located in an intronic region (see steps 540 and 560). Variants belongs to these three categories have only DNA level description. The detailed annotation process is described herein. If a variant does not satisfied any of the above criteria, then it is located on CDS regions.
According to some embodiments, if a variant is in 5′ UTR region, the DNA level description is as “c.-number”, where the number is an integer number, which indicates the distance between the variant to the 1st coding nucleotide. It is obtained by counting the number of bases from the variants position (or the closest starting position of exon pair coordinates, if a variant is on the intronic region) to the ending position of the pair coordinates where the variant (or the closest starting position) is located, and then adding the length of each following coordinates pair until reaching the 1st coding nucleotide.
According to some embodiments, if a variant is on 3′ UTR region, the DNA level description is as “c.*number”, where the number is an integer number, which indicating the distance between the variant to the last coding nucleotide. It is obtained by counting the number of bases from the variants position (or the closest ending position of exon pair coordinates, if a variant is on the intronic region) to the starting position of the pair coordinates where the variant (or the closest ending position) is located, and then adding the length of each previous coordinates pair until reaching the last coding nucleotide.
According to some embodiments, if a variant is not classified as either one of the UTR regions, the DNA level description is as “c.number”, where the number is an integer number, which indicates the distance between the variant to the first coding nucleotide. It is obtained by counting the number of bases from the variant closest starting or ending position of exon pair coordinates, and adding the length of each following coordinates pair until reaching the 1st coding nucleotide.
According to some embodiments, if a variant is in an intronic region, the DNA level description will be added by a “+” or “−” sign and then followed by a number, where the number is an integer number, which indicates the distance between the variant to closest nucleotide, which is within an exonic region. It is obtained by counting the number of bases from the variants position to the closest nucleotide base, which is within any of mRNA segments. If the closest base is the starting position of an mRNA segment, the “−” sign is used for the annotation. Otherwise, the “+” sign will be used.
According to some embodiments, as shown in step 520 of
According to some embodiments, the DNA level description is as “c.number”, where the number is an integer number, which indicates the distance between the variant to the 1st coding nucleotide. It is obtained by counting the number of bases from the 1st coding nucleotide through all bases, which are within exonic regions, until the variant position is reached.
According to some embodiments, if a variant is located at exonic region and inside of CDS, the annotation includes both DNA and protein-level descriptions. The protein-level description comprises the following strings: “p.”, “letter”, “number”, and “string”. The “letter” is a single letter amino acid name, which refers to the amino acid on the wild type protein sequence. The “string” can be a single letter amino acid name, which refers to the amino acid on protein sequence after the variant happens if the protein-level variation is involved with single amino acid substitution or remains unchanged. Otherwise, it can be used to indicate the changes of deletion and/or insertion in protein level, which will be described herein. For example, “p.A100G” indicates that the 100th amino acid of the wild type protein sequence, alanine (A) mutated as glycine (G).
According to some embodiments, the reference amino acid is obtained by checking the chromosomal coordinates of the codon where the variant located. The three nucleotide bases of the codon are obtained by searching the chromosomal coordinates of the codon against the human genome build (hg19). Once the codon is retrieved, the translation of the codon from DNA to protein is the amino acid of the protein sequence before the variation.
According to some embodiments, the mutated amino acid is obtained according to the modified nucleotide sequence by substituting, deleting, and/or inserting the relevant nucleotide bases of the variant. Once the modified nucleotide sequence is obtained, the translation from DNA to protein sequence is performed. Once the comparison between original and the new protein sequence is done, the program can indicate the protein sequence changes caused by the variant changes on DNA sequence.
Variants can be classified according to the type of changes on both DNA level description (
According to some embodiments, as shown in step 620 of
According to some embodiments, as shown in step 630 of
According to some embodiments, as shown in step 640 of
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in
According to some embodiments, as shown in step 820 of
According to some embodiments, as shown in step 830 of
According to some embodiments, as shown in step 840 of
According to some embodiments, as shown in step 850 of
According to some embodiments, as shown in step 860 of
According to some embodiments, as shown in step 870 of
The second pair refers to the last amino acid, which was deleted in the deletion event. The “sequence” is the inserted amino acid sequence, comprising at least one single letter amino acid name, which is inserted between the first and second amino acid after the deletion event. The method to obtain the amino acid sequence and position has been described herein.
According to some embodiments, the program filters variants through a process comprising one or more of the following criteria: a) by population frequency and times of occurrence, b) by variant location, c) by family history inheritance pattern, and d) by normal control list. A normal control list may be a list based on a population study. The normal controls may comprise unaffected controls (i.e., individuals that are unaffected by a candidate variant). The variants survived from the process are candidate variants of proposed genetic model and provided by an output to molecular geneticists for further evaluation.
According to some embodiments, a list of variants may be filtered by population frequency and/or number of occurrences. Population information may come from database sources such as dbSNP, 1000 genome, and ESP. For each source, the frequency and number of occurrences of each variant are retrieved, if they are available, and compared independently to the minimum cutoff values, which are predetermined or user-selected. The default values of frequency and times are 1% and 5 for recessive model, and 0.1% and 3 for dominant model. According to some embodiments, to be classified as a common SNP, both frequency and occurences thresholds must be satisfied. According to some embodiments, to be classified as a common SNP, either a frequency threshold or an occurrences threshold must be satisfied. The common SNP classification from any source may lead to the elimination of this variant.
According to some embodiments, a list of variants may be filtered by variant location. Based on variant's position relative to a gene transcript, variants can be classified into one of three groups: intergenic, intronic, and exonic. All variants in intergenic region may be discarded. Variants in intronic region may be saved if it is sufficiently close (e.g., based on a predetermined of user-selected number of basepair separations) to any splicing junction, or if it has been reported before by HGMD/OMIM as a disease causative mutation. For example, two basepair or less to any splicing function is defined as sufficiently close. By further example, variants in an exonic region may be saved and delivered to the next step, e.g., filtering by family history, except synonymous mutation without HGMD/OMIM records.
According to some embodiments, a list of variants may be filtered by family history inheritance pattern. Besides filtering with population occurrence and variant location, a variant set may be further narrowed down based on proposed genetic model and family history information. For example, an autosomal model shrinks the set by only including variants in an autosome. By further example, X-linked and Y-link models limit variants to those in chromosome X and Y, respectively.
According to some embodiments, to enter a candidate pool, each variant must abide by Mendelian inheritance pattern. For example, a script may take each variant from an affected person as a seed and compare the genotype of the affected person to genotypes of his/her parents or other family members. An inheritance conflict leads to the removal of variants. According to some embodiments, an option to allow for de novo mutations may be provided. Such an allowance may be provided with respect to one allele only. According to some embodiments, the de novo mutation of two or more alleles at the same position may be prohibited. If only one parent was sequenced, the genotype of a non-sequenced person may be estimated automatically by choosing the one with the highest probability. If a person was sequenced, but there was no variant-call at some position, the genotype of this person at that position was assumed as homozygous reference (−/−).
According to some embodiments, as shown in
According to some embodiments, as shown in
All genetic models described so far are related to autosome only and no gender information is needed. However, gender information plays an important role in filtration when proposed model is X-linked or Y-linked. For example, in X-linked recessive model, all variants with hemizygous mutation (+/o) in healthy males or hemizygous reference (−/o) in affected males were removed from candidate pool. Following the pattern disclosed herein and shown in
According to some embodiments, a list of variants may be filtered by normal control list. The normal control list comprises persons assumed to be unaffected and lack of causative mutations related to the current project. The selection of normal controls may be based on reported phenotypes and personal experience. Variants from normal controls may be deposited in an internal database and contrasted to all candidate variants or variant pairs. The variants may be removed from the candidate set if at least one exactly hit can be found from the normal control set.
According to some embodiments, the system may determine a treatment plan based on a genetic influence for an identified condition. For example, upon determining a genetic influence by filtering variants, the system may correlate the genetic influence with one or more treatment plans. For example, a lookup table may be provided with a pairing of genetic influences with one or more treatment plans. By further example, a lookup table may be provided with a set of treatment plans correlated with one or more genetic influences, wherein each of the treatment plans is provided with a probability of being a proper match with a respective genetic influence. By further example, a lookup table may be provided with a range of treatment plans for selection by a user (e.g., a physician).
The processing system 1102 may include a processor for executing instructions and may further include a machine-readable medium 1119, such as a volatile or non-volatile memory, for storing data and/or instructions for software programs. The instructions, which may be stored in a machine-readable medium 1110 and/or 1119, may be executed by the processing system 1102 to control and manage access to the various networks, as well as provide other communication and processing functions. The instructions may also include instructions executed by the processing system 1102 for various user interface devices, such as a display 1112 and a keypad 1114. The processing system 1102 may include an input port 1122 and an output port 1124. Each of the input port 1122 and the output port 1124 may include one or more ports. The input port 1122 and the output port 1124 may be the same port (e.g., a bi-directional port) or may be different ports.
The processing system 1102 may be implemented using software, hardware, or a combination of both. By way of example, the processing system 1102 may be implemented with one or more processors. A processor may be a general-purpose microprocessor, a microcontroller, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Programmable Logic Device (PLD), a controller, a state machine, gated logic, discrete hardware components, or any other suitable device that can perform calculations or other manipulations of information.
A machine-readable medium can be one or more machine-readable media. Software shall be construed broadly to mean instructions, data, or any combination thereof, whether referred to as software, firmware, middleware, microcode, hardware description language, or otherwise. Instructions may include code (e.g., in source code format, binary code format, executable code format, or any other suitable format of code).
Machine-readable media (e.g., 1119) may include storage integrated into a processing system, such as might be the case with an ASIC. Machine-readable media (e.g., 1110) may also include storage external to a processing system, such as a Random Access Memory (RAM), a flash memory, a Read Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable PROM (EPROM), registers, a hard disk, a removable disk, a CD-ROM, a DVD, or any other suitable storage device. Those skilled in the art will recognize how best to implement the described functionality for the processing system 1102. According to one aspect of the disclosure, a machine-readable medium is a computer-readable medium encoded or stored with instructions and is a computing element, which defines structural and functional interrelationships between the instructions and the rest of the system, which permit the instructions' functionality to be realized. In one aspect, a machine-readable medium is a non-transitory machine-readable medium, a machine-readable storage medium, or a non-transitory machine-readable storage medium. In one aspect, a computer-readable medium is a non-transitory computer-readable medium, a computer-readable storage medium, or a non-transitory computer-readable storage medium. Instructions may be executable, for example, by a client device or server or by a processing system of a client device or server. Instructions can be, for example, a computer program including code.
An interface 1116 may be any type of interface and may reside between any of the components shown in
By way of illustration and not limitation, in one aspect of the disclosure, stated from a perspective of a server side (treating a server as a local device and treating a client device as a remote device), a server application is executed (or runs) at a server 1206. While a remote client device 1202 may receive and display a view of the server application on a display local to the remote client device 1202, the remote client device 1202 does not execute (or run) the server application at the remote client device 1202. Stated in another way from a perspective of the client side (treating a server as remote device and treating a client device as a local device), a remote application is executed (or runs) at a remote server 1206.
By way of illustration and not limitation, a client device 1202 can represent a computer, a mobile phone, a laptop computer, a thin client device, a personal digital assistant (PDA), a portable computing device, or a suitable device with a processor. In one example, a client device 1202 is a smartphone (e.g., iPhone, Android phone, Blackberry, etc.). In certain configurations, a client device 1202 can represent an audio player, a game console, a camera, a camcorder, an audio device, a video device, a multimedia device, or a device capable of supporting a connection to a remote server. In one example, a client device 1202 can be mobile. In another example, a client device 1202 can be stationary. According to one aspect of the disclosure, a client device 1202 may be a device having at least a processor and memory, where the total amount of memory of the client device 1202 could be less than the total amount of memory in a server 1206. In one example, a client device 1202 does not have a hard disk. In one aspect, a client device 1202 has a display smaller than a display supported by a server 1206. In one aspect, a client device may include one or more client devices.
In some embodiments, a server 1206 may represent a computer, a laptop computer, a computing device, a virtual machine (e.g., VMware® Virtual Machine), a desktop session (e.g., Microsoft Terminal Server), a published application (e.g., Microsoft Terminal Server) or a suitable device with a processor. In some embodiments, a server 1206 can be stationary. In some embodiments, a server 1206 can be mobile. In certain configurations, a server 1206 may be any device that can represent a client device. In some embodiments, a server 1206 may include one or more servers.
In one example, a first device is remote to a second device when the first device is not directly connected to the second device. In one example, a first remote device may be connected to a second device over a communication network such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or other network.
When a client device 1202 and a server 1206 are remote with respect to each other, a client device 1202 may connect to a server 1206 over a network 1204, for example, via a modem connection, a LAN connection including the Ethernet or a broadband WAN connection including DSL, Cable, T1, T3, Fiber Optics, Wi-Fi, or a mobile network connection including GSM, GPRS, 3G, WiMax or other network connection. A network 1204 can be a LAN network, a WAN network, a wireless network, the Internet, an intranet or other network. A network 1204 may include one or more routers for routing data between client devices and/or servers. A remote device (e.g., client device, server) on a network may be addressed by a corresponding network address, such as, but not limited to, an Internet protocol (IP) address, an Internet name, a Windows Internet name service (WINS) name, a domain name or other system name. These illustrate some examples as to how one device may be remote to another device. But the subject technology is not limited to these examples.
According to certain embodiments of the subject technology, the terms “server” and “remote server” are generally used synonymously in relation to a client device, and the word “remote” may indicate that a server is in communication with other device(s), for example, over a network connection(s).
According to certain embodiments of the subject technology, the terms “client device” and “remote client device” are generally used synonymously in relation to a server, and the word “remote” may indicate that a client device is in communication with a server(s), for example, over a network connection(s).
In some embodiments, a “client device” may be sometimes referred to as a client or vice versa. Similarly, a “server” may be sometimes referred to as a server device or vice versa.
In some embodiments, the terms “local” and “remote” are relative terms, and a client device may be referred to as a local client device or a remote client device, depending on whether a client device is described from a client side or from a server side, respectively. Similarly, a server may be referred to as a local server or a remote server, depending on whether a server is described from a server side or from a client side, respectively. Furthermore, an application running on a server may be referred to as a local application, if described from a server side, and may be referred to as a remote application, if described from a client side.
In some embodiments, devices placed on a client side (e.g., devices connected directly to a client device(s) or to one another using wires or wirelessly) may be referred to as local devices with respect to a client device and remote devices with respect to a server. Similarly, devices placed on a server side (e.g., devices connected directly to a server(s) or to one another using wires or wirelessly) may be referred to as local devices with respect to a server and remote devices with respect to a client device.
As used herein, the word “module” refers to logic embodied in hardware or firmware, or to a collection of software instructions, possibly having entry and exit points, written in a programming language, such as, for example C++. A software module may be compiled and linked into an executable program, installed in a dynamic link library, or may be written in an interpretive language such as BASIC. It will be appreciated that software modules may be callable from other modules or from themselves, and/or may be invoked in response to detected events or interrupts. Software instructions may be embedded in firmware, such as an EPROM or EEPROM. It will be further appreciated that hardware modules may be comprised of connected logic units, such as gates and flip-flops, and/or may be comprised of programmable units, such as programmable gate arrays or processors. The modules described herein are preferably implemented as software modules, but may be represented in hardware or firmware.
It is contemplated that the modules may be integrated into a fewer number of modules. One module may also be separated into multiple modules. The described modules may be implemented as hardware, software, firmware or any combination thereof. Additionally, the described modules may reside at different locations connected through a wired or wireless network, or the Internet.
In general, it will be appreciated that the processors can include, by way of example, computers, program logic, or other substrate configurations representing data and instructions, which operate as described herein. In other embodiments, the processors can include controller circuitry, processor circuitry, processors, general purpose single-chip or multi-chip microprocessors, digital signal processors, embedded microprocessors, microcontrollers and the like.
Furthermore, it will be appreciated that in one embodiment, the program logic may advantageously be implemented as one or more components. The components may advantageously be configured to execute on one or more processors. The components include, but are not limited to, software or hardware components, modules such as software modules, object-oriented software components, class components and task components, processes methods, functions, attributes, procedures, subroutines, segments of program code, drivers, firmware, microcode, circuitry, data, databases, data structures, tables, arrays, and variables.
The foregoing description is provided to enable a person skilled in the art to practice the various configurations described herein. While the subject technology has been particularly described with reference to the various figures and configurations, it should be understood that these are for illustration purposes only and should not be taken as limiting the scope of the subject technology.
There may be many other ways to implement the subject technology. Various functions and elements described herein may be partitioned differently from those shown without departing from the scope of the subject technology. Various modifications to these configurations will be readily apparent to those skilled in the art, and generic principles defined herein may be applied to other configurations. Thus, many changes and modifications may be made to the subject technology, by one having ordinary skill in the art, without departing from the scope of the subject technology.
It is understood that the specific order or hierarchy of steps in the processes disclosed is an illustration of exemplary approaches. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged. Some of the steps may be performed simultaneously. The accompanying method claims present elements of the various steps in a sample order, and are not meant to be limited to the specific order or hierarchy presented.
As used herein, the phrase “at least one of” preceding a series of items, with the terms “and” or “or” to separate any of the items, modifies the list as a whole, rather than each member of the list (i.e., each item). The phrase “at least one of” does not require selection of at least one item; rather, the phrase allows a meaning that includes at least one of any one of the items, and/or at least one of any combination of the items, and/or at least one of each of the items. By way of example, the phrases “at least one of A, B, and C” or “at least one of A, B, or C” each refer to only A, only B, or only C; any combination of A, B, and C; and/or at least one of each of A, B, and C.
Terms such as “top,” “bottom,” “front,” “rear” and the like as used in this disclosure should be understood as referring to an arbitrary frame of reference, rather than to the ordinary gravitational frame of reference. Thus, a top surface, a bottom surface, a front surface, and a rear surface may extend upwardly, downwardly, diagonally, or horizontally in a gravitational frame of reference.
A phrase such as “an aspect” does not imply that such aspect is essential to the subject technology or that such aspect applies to all configurations of the subject technology. A disclosure relating to an aspect may apply to all configurations, or one or more configurations. An aspect may provide one or more examples of the disclosure. A phrase such as “an aspect” may refer to one or more aspects and vice versa. A phrase such as “an embodiment” does not imply that such embodiment is essential to the subject technology or that such embodiment applies to all configurations of the subject technology. A disclosure relating to an embodiment may apply to all embodiments, or one or more embodiments. An embodiment may provide one or more examples of the disclosure. A phrase such “an embodiment” may refer to one or more embodiments and vice versa. A phrase such as “a configuration” does not imply that such configuration is essential to the subject technology or that such configuration applies to all configurations of the subject technology. A disclosure relating to a configuration may apply to all configurations, or one or more configurations. A configuration may provide one or more examples of the disclosure. A phrase such as “a configuration” may refer to one or more configurations and vice versa.
Furthermore, to the extent that the term “include,” “have,” or the like is used in the description or the claims, such term is intended to be inclusive in a manner similar to the term “comprise” as “comprise” is interpreted when employed as a transitional word in a claim.
The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any embodiment described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments.
A reference to an element in the singular is not intended to mean “one and only one” unless specifically stated, but rather “one or more.” Pronouns in the masculine (e.g., his) include the feminine and neuter gender (e.g., her and its) and vice versa. The term “some” refers to one or more. Underlined and/or italicized headings and subheadings are used for convenience only, do not limit the subject technology, and are not referred to in connection with the interpretation of the description of the subject technology. All structural and functional equivalents to the elements of the various configurations described throughout this disclosure that are known or later come to be known to those of ordinary skill in the art are expressly incorporated herein by reference and intended to be encompassed by the subject technology. Moreover, nothing disclosed herein is intended to be dedicated to the public regardless of whether such disclosure is explicitly recited in the above description.
While certain aspects and embodiments of the invention have been described, these have been presented by way of example only, and are not intended to limit the scope of the invention. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms without departing from the spirit thereof. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the invention.