Progress in science and technology with shifting epidemiology and demographics are creating the capabilities and demand for alternatives to symptom-driven medical models. Reducing age-related chronic diseases associated with premature mortality among adults is an urgent priority requiring new approaches and technologies.
The near-doubling of average human life expectancy over the last 150 years is a tribute to scientific advancements in medicine and public health. Most of this success—though not all—is the result of progress in control and prevention of infectious diseases particularly among young children. Eighty-five percent of children born now in the US are expected to live to 65 years of age; at least 42% will likely celebrate an 85th birthday. Because of these successes most of humanity is facing a daunting and costly new medical challenge in the form of age-related chronic diseases.
Most age-related chronic diseases have heritability, are often slowly progressive with symptom-free onset, and are associated with common risk factors. In 2015, the estimated US cumulative mortality risk among males 50 to 74 years of age was 39%; for women, the risk was lower but still substantial at 24%. The causes of these deaths are similar across genders with neoplasms and cardiovascular disease accounting for about one-third each, and diabetes and related conditions, respiratory, cirrhosis and other liver diseases, and neurologic disorders accounting for the remaining one-third.
The vast majority of primary medical interventions are informed by and implemented upon population based studies. This is a problem since a single individual cannot be adequately represented by an entire population due to genetic and environmental heterogeneity within the population.
Few examples demonstrate how genomics might be proactively incorporated into new models for medical practice, and what infrastructure will be needed to support data generation and use. The methods described herein allow the integration of disparate orthogonal health data in a quantitative way to enable disease diagnosis and the determination of an optimal treatment plan. These methods are also useful for determining an increased risk of a future diagnosis of a disease, and importantly they allow for the identification of sub symptomatic diseases allowing earlier treatment and intervention in order to increase positive health outcomes.
Genomics as currently applied has been disappointing in its ability to unravel the “missing heritability” of most age-related chronic diseases, and other common diseases. This is rapidly changing as a result of public and private efforts to expand sequencing. First, we are increasingly finding and seeing supporting evidence for the increasing recognition of rare variants with large effect sizes. Combining this with advancements in monogenic and polygenic methodologies to assess causation including Mendelian randomization methods, extension of genome-wide association study to create hazard models, and continued exploration of pleiotropy, increase clinical utility. Second, increasingly detailed mapping of molecular pathways and mechanisms associated with diseases and risk factors provide a much needed improved capability to link genotype and phenotype data. Described herein, we demonstrate the use of global metabolomics in mapping to genomic defects. This can significantly strengthen with additional experience and automation. Thirdly, we quantitatively integrate genomics with other clinical data, particularly advanced imaging data, to create point-of-care clinical decision support. For example, if one queries more than 40,000 genomes (individuals and families), genotype-phenotype associations can be described with millisecond response times. The methods described herein prioritize individual opportunities for tertiary (disease treatment), secondary (risk factor control), and primary prevention using human- and machine-driven feature extraction.
The value of evolving medical practice from disease diagnosis to risk detection is supported by the study in Example 1. For example, we recommended follow-up imaging studies for slightly more than one-third of our study participants. Some of this is the nature of screening, which drives need for more definitive imaging studies better suited to specific abnormalities. Other instances of referral were intended to identify change over a specified time period which might be suggestive of cancer such as finding a cystic pancreatic lesion or instability of a vascular lesion such an intracranial aneurysm. In some instances, we don't know enough to confidently predict the natural course of these findings, and as a result may cause unnecessary anxiety and unneeded surgery. However, the life-threatening consequences and relatively high prevalence of diseases associated with these lesions suggests that early recognition is likely to be beneficial for most individuals. Expansion of some or all of our approach to broader populations requires the methods described herein.
The methods, computer media, and systems described herein deploy genome sequencing (e.g., whole genome sequencing, exosome sequencing, SNP typing) in combination with one or more other routine and advanced diagnostic technologies including: microbiome sequencing (e.g., gut or dermal microbiome); global metabolomics; 3D/4D imaging focusing on non-contrast whole-body magnetic resonance imaging and echocardiogram; 2-week cardiac monitoring; and functional neurologic testing to detect risk for age-related chronic diseases.
In a certain aspect, described herein, is a method of detecting an undiagnosed medical condition comprising: acquiring a plurality of health metrics of the individual, wherein at least one of the plurality of health metrics comprises nucleotide sequence data; implementing a genetic risk rule that defines a genetic risk for the undiagnosed medical condition; implementing a non-genetic risk rule that defines a non-genetic risk for the undiagnosed medical condition; and generating a confidence score for the undiagnosed medical condition that comprises a function of the genetic risk rule and the non-genetic risk rule. In certain embodiments, the undiagnosed medical condition is an increased likelihood of developing a medical condition. In certain embodiments, the medical condition comprises Parkinson's disease, Alzheimer's disease, ischemic heart disease, hyperlipidemia, high blood pressure, cardiac arrhythmia, long QT syndrome, insulin resistance, Type II diabetes, non-alcoholic fatty liver disease, cirrhosis of the liver, kidney failure, heart failure, depression, bipolar disorder, schizophrenia, or a cancer. In certain embodiments, the cancer comprises breast cancer, prostate cancer, lung cancer, melanoma, pancreatic cancer, kidney cancer, skin cancer, bladder cancer, ovarian cancer, cervical cancer, colon cancer, a leukemia, a lymphoma, head and neck cancer, or brain cancer. In certain embodiments, the nucleotide sequence data comprises DNA sequence data. In certain embodiments, the nucleotide sequence data comprises a list of nucleotide sequence variants compared to a reference genome. In certain embodiments, the plurality of health metrics further comprises a phenotypic measurement, a family medical history, a personal medical history, or a gut microbiome assessment. In certain embodiments, the phenotypic measurement comprises a clinical measurement or a clinical laboratory test. In certain embodiments, the clinical measurement or the clinical laboratory test comprises a sleep apnea score, cognitive assessment, neurological test, quantitative Neuro imaging, balance assessment, gait assessment, weight, height, systolic blood pressure, diastolic blood pressure, resting pulse rate, cardiac rhythm monitoring, electrocardiogram, blood lipid levels, blood glucose level, oral glucose tolerance test, blood insulin level, body fat measurement, or whole body MRI. In certain embodiments, the whole body MRI comprises an estimate of total body fat mass or percentage, subcutaneous fat mass or percentage, visceral fat mass or percentage, muscle mass or percentage, liver fat mass or percentage, brain volume, or hippocampal volume. In certain embodiments, the genetic risk rule comprises ranking a nucleotide sequence variant based upon a score reflecting a pathogenicity of the nucleotide sequence for the undiagnosed medical condition. In certain embodiments, the pathogenicity of the nucleotide sequence for the undiagnosed medical condition is previously determined using a genome wide association study or hazard score associated therewith, presence in ClinVar database, or presence in a gene known or suspected to be causative for the undiagnosed medical condition. In certain embodiments, the second set of rules comprises ranking the non-genetic risk for the undiagnosed medical condition comprises ranking the phenotypic measurement against a plurality of phenotypic measurements derived from a population of individuals. In certain embodiments, ranking the non-genetic risk for the undiagnosed medical condition comprises assigning a quantile score to the non-genetic risk for the undiagnosed medical condition. In certain embodiments, ranking the non-genetic risk for the undiagnosed medical condition comprises assigning a quintile score to the non-genetic risk for the undiagnosed medical condition. In certain embodiments, the second set of rules comprises determining an amount of standard deviations the phenotypic measurement is away from a mean level for the undiagnosed medical condition derived from a plurality of phenotypic measurements derived from a population of individuals. In certain embodiments, the amount of standard deviations is greater than 2. In certain embodiments, the confidence score for the undiagnosed medical condition that comprises a function of the genetic risk rule and the non-genetic risk rule is more accurate than a confidence score for either the genetic risk rule or the non-genetic risk rule alone. In certain embodiments, the method further comprises delivering a report of the confidence score for the undiagnosed medical condition to a health care provider. In certain embodiments, the method further comprises delivering a report of the confidence score for the undiagnosed medical condition to an individual. In certain embodiments, the undiagnosed medical condition comprises a plurality of undiagnosed medical conditions. In certain embodiments, a non-transitory computer-readable storage media is encoded with a computer program including instructions executable by a processor to create a program to detect an undiagnosed medical condition according to the methods described herein.
A better understanding of the features and advantages of the embodiments in the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative embodiments and the accompanying drawings of which:
Unless otherwise defined, all technical terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the embodiments disclosed herein belongs. As used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise. Any reference to “or” herein is intended to encompass “and/or” unless otherwise stated.
Unless otherwise defined, scientific and technical terms used in connection with the present teachings described herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. Generally, nomenclatures utilized in connection with, and techniques of, cell and tissue culture, molecular biology, and protein and oligo- or polynucleotide chemistry and hybridization described herein are those well known and commonly used in the art. Standard techniques are used, for example, for nucleic acid purification and preparation, chemical analysis, recombinant nucleic acid, and oligonucleotide synthesis. Enzymatic reactions and purification techniques are performed according to manufacturer's specifications or as commonly accomplished in the art or as described herein. The techniques and procedures described herein are generally performed according to conventional methods well known in the art and as described in various general and more specific references that are cited and discussed throughout the instant specification. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Third ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 2000). The nomenclatures utilized in connection with, and the laboratory procedures and techniques described herein are those well known and commonly used in the art.
The phrase “next generation sequencing” (NGS) refers to sequencing technologies having increased throughput as compared to traditional Sanger- and capillary electrophoresis-based approaches, for example with the ability to generate hundreds of thousands of relatively small sequence reads at a time. Some examples of next generation sequencing techniques include, but are not limited to, sequencing by synthesis, sequencing by ligation, and sequencing by hybridization. More specifically, the MISEQ, HISEQ and NEXTSEQ Systems of Illumina and the Personal Genome Machine (PGM) and SOLiD Sequencing System of Life Technologies Corp, provide massively parallel sequencing of whole or targeted genomes. The SOLiD System and associated workflows, protocols, chemistries, etc. are described in more detail in PCT Publication No. WO 2006/084132, entitled “Reagents, Methods, and Libraries for Bead-Based Sequencing,” international filing date Feb. 1, 2006, U.S. patent application Ser. No. 12/873,190, entitled “Low-Volume Sequencing System and Method of Use,” filed on Aug. 31, 2010, and U.S. patent application Ser. No. 12/873,132, entitled “Fast-Indexing Filter Wheel and Method of Use,” filed on Aug. 31, 2010, the entirety of each of these applications being incorporated herein by reference thereto.
As used herein “genomic sequence variant” refers to any nucleotide difference in an individual's genome sequence compared to a reference genome or reference sequence. The variant can be a single nucleotide variant (SNV), insertion or deletion (Indel), or translocation. In certain embodiments, the indel comprises more than a single nucleotide. In certain embodiments, a genomic sequence variant excludes mitochondrial deoxyribonucleic acid (DNA) sequences. In certain embodiments, a genomic sequence variant excludes variants found on either of the non-autosomal human X or Y chromosomes. In certain embodiments, the genomic sequence variant is a human genomic sequence variant.
As used herein “reference genome” refers to any standard publicly available reference genome, for example GRCh38, the Genome Reference Consortium human genome (build 38). Alternatively, the reference genome can be one that is constructed de novo from sequencing a plurality of genomes. In certain embodiments, the plurality of genomes is greater than 10,000 different genomes. In certain embodiments, the plurality of genomes is greater than 100,000 different genomes.
As used herein “disease” refers to any cause of mortality or decreased quality of life that is independent of natural cause. Disease include: age-related chronic diseases; accidents and injuries; cancers; autoimmune/inflammatory diseases, infectious diseases, genetic diseases, psychological disease and maternal/fetal health.
The methods, computer media, and systems described herein are useful for diagnosing hidden, latent, or subsymptomatic conditions. Importantly this type of diagnosis is beyond the current reach of physicians relying solely on in-office examination and laboratory testing. In addition to a diagnosis of active disease, the methods described herein are useful for determining an increased risk of developing a disease. The diseases diagnosed or identified as high risk include, but are not limited to: age-related chronic diseases, such as, Parkinson's disease, Alzheimer's disease, dementia, ischemic heart disease, hyperlipidemia, high blood pressure, cardiac arrhythmia, long QT syndrome, insulin resistance, Type II diabetes, non-alcoholic fatty liver disease, cirrhosis of the liver, liver failure, kidney failure, heart failure, cardiovascular disease, congestive heart failure, emphysema, chronic obstructive pulmonary disease; accidents and injuries, such as, increased risk of alcohol related injuries, increased risk of injury due to voluntary intoxication with drugs or alcohol, self-inflicted injuries, suicide attempts, occupational injuries, sports/fitness-related injuries; infectious diseases, such as, bacterial, viral or parasitic infections, fungal infections, genetic diseases, such as inborn errors of metabolism, X-linked recessive disorders, autosomal dominant disorders, immunodeficiency, cystic fibrosis, enzyme deficiency; psychological disease, such as, depression, bipolar disorder, schizophrenia, mania, anxiety disorder; maternal/fetal health, such as, gestational diabetes, preeclampsia, miscarriage, or sudden infant death syndrome.
The methods, computer media, and systems described herein are useful for diagnosing hidden, latent, or subsymptomatic cancers. The cancer can comprise a cancer such as: lymphoma/leukemia; head and neck cancer; brain cancer; stomach cancer; pancreatic cancer; colon cancer; liver cancer; renal cancer; breast cancer; prostate cancer; cervical cancer; ovarian cancer; acute lymphoblastic leukemia, adult; acute lymphoblastic leukemia, childhood; acute myeloid leukemia, adult; acute myeloid leukemia, childhood; adreno-cortical carcinoma; AIDS-related cancers; AIDS-related lymphoma; anal cancer; appendix cancer; astrocytomas; atypical teratoid/rhabdoid tumor; basal cell carcinoma; bile duct cancer, extrahepatic; bladder cancer; bone cancer, osteosarcoma and malignant fibrous histiocytoma; brain stem glioma; brain tumor; central Nervous System embryonal tumors; astrocytomas; craniopharyngioma; ependymoblastoma; brain tumor, ependymoma; medulloblastoma; medulloepithelioma; Pineal Parenchymal tumors of Intermediate differentiation; supratentorial primitive neuro ectodermal tumors and pineoblastoma; brain and Spinal cord tumors; breast cancer; breast cancer, male; bronchial tumors; Burkitt lymphoma; carcinoid tumor; central nervous system atypical teratoid/rhabdoid tumor; central nervous system embryonal tumors; central nervous system (CNS) lymphoma, cervical cancer; primary; cervical cancer; childhood cancers; chordoma; chronic lymphocytic leukemia; chronic myelogenous leukemia; chronic myeloproliferative disorders; colon cancer; colorectal cancer; craniopharyngioma; cutaneous T-cell lymphoma; embryonal tumors, central nervous system; endometrial cancer; ependymoblastoma; ependymoma; esophageal cancer; esthesioneuroblastoma; Ewing sarcoma family of tumors; extracranial germ cell tumor; extragonadal germ cell tumor; extrahepatic bile duct cancer; eye cancer, Intraocular melanoma; eye cancer, Retinoblastoma; gallbladder cancer; gastric (stomach) cancer; gastrointestinal carcinoid tumor; gastrointestinal Stromal tumor (GIST); germ cell tumor, extracranial; germ cell tumor, extragonadal; germ cell tumor, ovarian; gestational trophoblastic tumor; glioma; hairy cell leukemia; head and Neck cancer; heart cancer; hepatocellular (liver) cancer, adult (Primary); hepatocellular (liver) cancer; histiocytosis, langerhans cell; Hodgkin lymphoma, adult; Hodgkin lymphoma, childhood; hypopharyngeal cancer; Intraocular melanoma; islet cell tumors (endocrine pancreas); Kaposi Sarcoma; kidney (renal cell) cancer; kidney cancer; langerhans cell histiocytosis; laryngeal cancer; laryngeal cancer, childhood; leukemia, acute lymphoblastic, adult; leukemia, acute lymphoblastic, childhood; leukemia, acute myeloid, adult; leukemia, acute myeloid, childhood; leukemia, chronic lymphocytic; leukemia, chronic myelogenous; leukemia, hairy cell; lip and oral cavity cancer; liver cancer, adult (Primary); liver cancer; lung cancer, non-small cell; lung cancer, small cell; lymphoma, AIDS-related; lymphoma, Burkitt; lymphoma, cutaneous T-cell; lymphoma, Hodgkin, adult; lymphoma, Hodgkin, childhood; lymphoma, non-Hodgkin, adult; lymphoma, non-Hodgkin, childhood; lymphoma, primary central nervous system (CNS); macroglobulinemia, Waldenstrom; malignant fibrous histiocytoma of bone and osteosarcoma; medulloblastoma; medulloepithelioma; melanoma; melanoma, Intraocular (eye); Merkel cell carcinoma; mesothelioma, adult malignant; mesothelioma; metastatic Squamous Neck cancer with occult primary; mouth cancer; multiple endocrine neoplasia Syndrome; multiple myeloma/Plasma cell neoplasm; mycosis fungoides; myelodysplastic Syndromes; myelodysplastic/myeloproliferative neoplasms; myelogenous leukemia, chronic; myeloid leukemia, adult acute; myeloid leukemia, childhood acute; myeloma, multiple; myeloproliferative disorders, chronic; nasal cavity and paranasal sinus cancer; nasopharyngeal cancer; neuroblastoma; non-Hodgkin lymphoma, adult; non-Hodgkin lymphoma, childhood; Non-Small cell lung cancer; oral cancer; oral cavity cancer, lip and; oropharyngeal cancer; osteosarcoma and malignant fibrous histiocytoma of bone; ovarian cancer; ovarian epithelial cancer; ovarian germ cell tumor; ovarian low malignant potential tumor; pancreatic cancer; pancreatic cancer, Islet cell tumors; papillomatosis; paranasal sinus and nasal cavity cancer; parathyroid cancer; penile cancer; pharyngeal cancer; pineal parenchymal tumors of intermediate differentiation; pituitary tumor; plasma cell neoplasm/multiple myeloma; pleuropulmonary blastoma; Pregnancy and breast cancer; primary central nervous system (CNS) lymphoma; prostate cancer; rectal cancer; renal cell (Kidney) cancer; renal pelvis and ureter, transitional cell cancer; respiratory tract cancer with chromosome 15 changes; retinoblastoma; rhabdomyosarcoma; salivary gland cancer; salivary gland cancer; sarcoma, Ewing sarcoma family of tumors; sarcoma, Kaposi; sarcoma, soft tissue, adult; Sarcoma, soft tissue, childhood; Sarcoma, uterine; Sézary syndrome; skin cancer (non-melanoma); skin cancer; skin cancer (melanoma); skin carcinoma, Merkel cell; small cell lung cancer; small intestine cancer; soft tissue sarcoma, adult; soft tissue sarcoma, childhood; squamous cell carcinoma; squamous neck cancer with occult primary, metastatic; stomach (gastric) cancer; supratentorial primitive neuroectodermal tumors; T-cell lymphoma, cutaneous; testicular cancer; throat cancer; thyoma and thymic carcinoma; thyroid cancer; transitional cell cancer of the renal pelvis and ureter; trophoblastic tumor, gestational; unknown primary site, carcinoma of; ureter and renal pelvis, transitional cell cancer; urethral cancer; uterine cancer, endometrial; uterine sarcoma; uveal melanoma; vaginal cancer; vulvar cancer; Waldenström macroglobulinemia; or Wilm's tumor.
The methods, computer media, and systems described herein are useful for diagnosing hidden, latent, or subsymptomatic autoimmune or inflammatory disorders. The autoimmune or inflammatory can comprise an autoimmune or inflammatory disorder such as: acute optic neuritis, alopecia areata, ankylosing spondylitis, antiphospholipid syndrome, autoimmune Addison's disease, autoimmune diseases of the adrenal gland, arthritis, autoimmune hemolytic anemia, autoimmune hepatitis, autoimmune oophoritis and orchitis, autoimmune thrombocytopenia, Behcet's disease, bullous pemphigoid, bronchiolitis obliterans, cardiomyopathy, celiac sprue- dermatitis, chronic fatigue immune dysfunction syndrome (CFIDS), chronic inflammatory demyelinating polyneuropathy, Crohn's disease, Churg-Strauss syndrome, cicatrical pemphigoid, CREST syndrome, cold agglutinin disease, discoid lupus, essential mixed cryoglobulinemia, fibromyalgia-fibromyositis, glomerulonephritis, Graves' disease, Guillain-Barre, Hashimoto's thyroiditis, idiopathic pulmonary fibrosis, idiopathic thrombocytopenia purpura (ITP), IgA neuropathy, inflammatory bowel disease (IBD), juvenile arthritis, lichen planus, Meniere's disease, mixed connective tissue disease, multiple sclerosis, type 1 or immune-mediated diabetes mellitus, myasthenia gravis, pemphigus vulgaris, pernicious anemia, polyarteritis nodosa, polychondritis, polyglandular syndromes, polymyalgia rheumatica, polymyositis and dermatomyositis, primary agammaglobulinemia, primary biliary cirrhosis, psoriasis, psoriatic arthritis, Raynauld's phenomenon, Reiter's syndrome, sarcoidosis, scleroderma, progressive systemic sclerosis, Sjogren's syndrome, Good pasture's syndrome, stiff-man syndrome, systemic lupus erythematosus, lupus erythematosus, takayasu arteritis, temporal arteristis/giant cell arteritis, ulcerative colitis, uveitis, vasculitides such as dermatitis herpetiformis vasculitis, vitiligo, Wegener's granulomatosis, anti-glomerular basement membrane disease, antiphospholipid syndrome, autoimmune diseases of the nervous system, familial Mediterranean fever, Lambert-Eaton Myasthenic syndrome, sympathetic ophthalmia, or polyendocrinopathies, or sepsis.
In addition to the phenotypic or clinical measurements gathered from an individual the health metric comprises nucleic acid sequence data from an individual. The nucleic acid sequence can comprise one or more DNA sequences. In certain embodiments, the DNA sequence comprises a sequence for an individual's whole genome. In certain embodiments, the DNA sequence comprises a sequence for only the high confidence regions of an individual's whole genome. For example the high confidence region of an individual's whole genome can be defined by the NA12878 Genome-In-A-Bottle call set (GiaB v2.19). In certain embodiments, the DNA sequence comprises a sequence for greater than 99%, 95%, 90%, 85%, 80%, 75%, 70%, 65%, or 60% of the high confidence region of an individual's whole genome as defined by the GiaB v2.19. The DNA sequence can comprise a sequence of a plurality of contiguous nucleotides from an individual's genome. In certain embodiments, the DNA sequence comprises a sequence of at least 100; 1,000; 10,000; 100,000; or 1,000,000 contiguous nucleotides, including increments therein, from an individual's genome. In certain embodiments, the DNA sequence does not comprise the sequence of ribonucleic acid (RNA). In certain embodiments, the DNA sequence does not comprise the sequence of cDNA generated from ribonucleic acid (RNA). The DNA sequence can be generated from an individual's healthy tissue, semen, blood, plasma, serum, saliva, or stool. Additionally envisioned are DNA sequences generated from a tumor sample whether benign or malignant.
In certain embodiments, DNA sequence data for use with the methods, systems and media, described herein, is generated by any suitable method. The DNA sequence data can be generated by Sanger sequencing or by any next-generation sequencing technology. In certain embodiments, the DNA sequence data is generated, by way of non-limiting example, by pyrosequencing, sequencing by synthesis, sequencing by ligation, ion semiconductor sequencing, or single molecule real time sequencing. In certain embodiments, the DNA sequence data is generated by any technology capable of generating 1 gigabase of nucleotide reads per 24 hour period. In certain embodiments, the DNA sequence data is obtained from a third party or from a contracted provider.
In certain embodiments, the health metric for use with the methods, systems and media, comprises described herein is a plurality of genomic sequence variants (GSV). The genomic sequence variants can be determined de novo during implementation of any of the methods either by comparing to a reference genome or a reference genome constructed on the fly from a plurality of greater than 1,000; 10,000; or 100,000 different genomes, including increments therein. In certain embodiments, GSVs are determined by a third party and received by the party performing the method. In certain embodiments, determining a GSV encompasses receiving a list or file that comprises an individual's GSVs. The health metrics utilize a plurality of GSVs. In some cases greater than 10; 50; 100; 500; 1,000; 10,000; 100,000; or 1,000,000 GSVs, including increments therein when compared to a reference sequence, are utilized for the health metric.
The genomic sequence variants of an individual are used to set that individual's genetic risk. This genetic risk is defined by a genetic risk rule. This genetic risk can be disease specific, for example, diabetes, heart failure, specific cancers, or other mortalities. GSVs are first determined and then their potential pathogenicity is determined. Pathogenicity can be determined by using for example a statistical association of the GSV with a given disease or a given gene involved in a given disease. The pathogenicity can be defined based upon a score such as a CADD score, presence in the NCBI ClinVar database, or by other methods of determining a selection pressure on the specific genomic locus. Methods for defining selective pressure through for example a context dependent tolerability score, an n-variant score, regional variation score or a protein tolerability score can be implemented such as those described in U.S. Provisional Application Ser. No. 62/333,653 or U.S. Provisional Application Ser. No. 62/410,783 and are incorporated by reference herein in their entirety. The genetic risk rule can for example rank pathogenic variants by percentile, quartile, quintile, etc. For example, only the top 99th, 98th, 97th, 96th, 95th, 94th, 93th, 92nd, 91st, 90th, 80th, 70th, 60th, or 50th percentile, including increments therein, may be used when determining a genetic risk.
Referring to
Health Metrics
The methods, computer media, and systems described herein utilize a plurality of health metrics to diagnose diseases and risk factors. The method can deploy 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, or more independent health metrics, including increments therein. A health metric is a discreet result from a diagnostic test determined by a physician, laboratory, specialist, or questionnaire. The health metrics of this disclosure comprise phenotypic or clinical measurements gathered from an individual. The health measurement can be those normally acquired in a physician's office such as height, weight, blood pressure, reflex measurements, skin fold measurements, temperature, blood oxygen saturation, resting pulse rate, urinalysis and the like. The health measurement can be those normally measured by a laboratory using a patient's blood, plasma, serum, saliva, stool, urine, semen, or pap smear and can include: analysis of blood lipids, such as, fatty acids, non-esterified fatty acids, omega-3 fatty acids, cholesterols, high-density lipoprotein (HDL), low-density lipoprotein (LDL), very low-density lipoprotein (VLDL), chylomicrons, triglycerides, diglycerides, monoglycerides; measures of carbohydrate usage, such as glucose levels (fasting or non-fasting), oral glucose tolerance test, h1Ac; liver enzymes and markers, such as, aspartate aminotransferase, alkaline phosphatase, aspartate aminotransferase, albumin, bilirubin; electrolytes, such as, calcium, sodium, potassium, magnesium, chloride; blood pH, bicarbonate, hemoglobin, red cell count, white blood cell count; specific markers for cancer such as, for example, prostate specific antigen; tests for viral infection such as antibodies against or sequences from, HIV, hepatitis A, B, or C; or bacterial or yeast cultures. The health measurement can be more specialized and comprise histological examination, MM analysis, EKG analysis, EEG analysis, cardiac stress test, psychological evaluation, gait and balance test, ultrasound, CAT scan, X-rays, bone density measurements, body composition measurements, colonoscopy, or PET scans. The health measurement can comprise demographic factors such as age, gender, race, genetic origin or ancestry, health history, or family history. The health measurement can also comprise measurements that are tracked by wearable devices, such as a wearable health monitor, including activity, sleep/wake cycle, sleep analysis, pulse rate and rhythm and the like.
The measured health metrics of an individual are used to set that individual's non-genetic risk. This non-genetic risk can be disease specific, for example, diabetes, heart failure, specific cancers, or other mortalities. This non-genetic risk is defined by a non-genetic risk rule. Physiological measurements are ranked and segmented by known distributions of the physiological measurement. For example, LDL in the 75th percentile or above may be used when determining a risk for a congestive heart failure. The non-genetic risk rule can rank physiological measurements by percentile, quartile, quintile, etc. For example, only the top 99th, 98th, 97th, 96th, 95th, 94th, 93rd, 92nd, 91st, 90th, 80th, 80th, 70th, 60th, or 50th percentile, including increments therein, may be used when determining a non-genetic risk. The non-genetic risk can also be defined in part by a binary response such as a Yes or NO on a questionnaire concerning health history, environmental factors, or family history, or by a plus or minus result from for example an MM or EKG test.
The methods described herein for determining risk for and diagnose of medical conditions comprise combining a plurality of health metrics wherein one of the health metrics comprise nucleotide sequence data and at least one other health metric, but can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, or more independent health metrics, including increments therein. The genetic and non-genetic risk rules are then combined and appropriately weighted per disease. For example the genetic risk can be assigned a weight that is 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% more, including increments therein, than the non-genetic risk. For example the non-genetic risk can be assigned a weight that is 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% more, including increments therein, than the genetic risk. For example, the genetic risk can be assigned a weight that is 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, 20-fold, 50-fold, or 100-fold more, including increments therein, than the non-genetic risk. For example, the non-genetic risk can be assigned a weight that is 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, 20-fold, 50-fold, or 100-fold more, including increments therein, than the genetic risk.
The platforms, systems, media, and methods described herein may include a digital processing device, or use of the same. In further embodiments, the digital processing device includes one or more hardware central processing units (CPUs) or general purpose graphics processing units (GPGPUs) that carry out the device's functions. In still further embodiments, the digital processing device further comprises an operating system configured to perform executable instructions. In some embodiments, the digital processing device is optionally connected to a computer network. In further embodiments, the digital processing device is optionally connected to the Internet such that it accesses the World Wide Web. In still further embodiments, the digital processing device is optionally connected to a cloud computing infrastructure. In other embodiments, the digital processing device is optionally connected to an intranet. In other embodiments, the digital processing device is optionally connected to a data storage device.
In accordance with the description herein, suitable digital processing devices include, by way of non-limiting examples, server computers, desktop computers, laptop computers, and notebook computers. Those of skill in the art will recognize that many smartphones are suitable for use in the system described herein. Those of skill in the art will also recognize that select televisions, video players, and digital music players with optional computer network connectivity are suitable for use in the system described herein. Suitable tablet computers include those with booklet, slate, and convertible configurations, known to those of skill in the art.
The digital processing device includes an operating system configured to perform executable instructions. The operating system is, for example, software, including programs and data, which manages the device's hardware and provides services for execution of applications. Those of skill in the art will recognize that suitable server operating systems include, by way of non-limiting examples, FreeBSD, OpenBSD, NetBSD®, Linux, Apple® Mac OS X Server®, Oracle® Solaris®, Windows Server®, and Novell® NetWare®. Those of skill in the art will recognize that suitable personal computer operating systems include, by way of non-limiting examples, Microsoft® Windows®, Apple® Mac OS X®, UNIX®, and UNIX-like operating systems such as GNU/Linux®. In some embodiments, the operating system is provided by cloud computing.
The digital processing device includes a storage and/or memory device. The storage and/or memory device is one or more physical apparatuses used to store data or programs on a temporary or permanent basis. In some embodiments, the device is volatile memory and requires power to maintain stored information. In some embodiments, the device is non-volatile memory and retains stored information when the digital processing device is not powered. In further embodiments, the non-volatile memory comprises flash memory. In some embodiments, the non-volatile memory comprises dynamic random-access memory (DRAM). In some embodiments, the non-volatile memory comprises ferroelectric random access memory (FRAM). In some embodiments, the non-volatile memory comprises phase-change random access memory (PRAM). In other embodiments, the device is a storage device including, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, magnetic disk drives, magnetic tapes drives, optical disk drives, and cloud computing based storage. In further embodiments, the storage and/or memory device is a combination of devices such as those disclosed herein.
The digital processing device, in some cases, includes a display to send visual information to a user. In some embodiments, the display is a liquid crystal display (LCD). In further embodiments, the display is a thin film transistor liquid crystal display (TFT-LCD). In some embodiments, the display is an organic light emitting diode (OLED) display. In various further embodiments, on OLED display is a passive-matrix OLED (PMOLED) or active-matrix OLED (AMOLED) display. In some embodiments, the display is a plasma display. In other embodiments, the display is a video projector. In yet other embodiments, the display is a head-mounted display in communication with the digital processing device, such as a VR headset. In further embodiments, suitable VR headsets include, by way of non-limiting examples, HTC Vive, Oculus Rift, Samsung Gear VR, Microsoft HoloLens, Razer OSVR, FOVE VR, Zeiss VR One, Avegant Glyph, Freefly VR headset, and the like. In still further embodiments, the display is a combination of devices such as those disclosed herein.
The digital processing device, in some cases, includes an input device to receive information from a user. In some embodiments, the input device is a keyboard. In some embodiments, the input device is a pointing device including, by way of non-limiting examples, a mouse, trackball, track pad, joystick, game controller, or stylus. In some embodiments, the input device is a touch screen or a multi-touch screen. In other embodiments, the input device is a microphone to capture voice or other sound input. In other embodiments, the input device is a video camera or other sensor to capture motion or visual input. In further embodiments, the input device is a Kinect, Leap Motion, or the like. In still further embodiments, the input device is a combination of devices such as those disclosed herein.
Referring to
Continuing to refer to
Continuing to refer to
Continuing to refer to
Methods as described herein can be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the digital processing device 201, such as, for example, on the memory 210 or electronic storage unit 215. The machine executable or machine readable code can be provided in the form of software. During use, the code can be executed by the processor 205. In some cases, the code can be retrieved from the storage unit 215 and stored on the memory 210 for ready access by the processor 205. In some situations, the electronic storage unit 215 can be precluded, and machine-executable instructions are stored on memory 210.
The platforms, systems, media, and methods disclosed herein may include one or more non-transitory computer readable storage media encoded with a program including instructions executable by the operating system of an optionally networked digital processing device. In further embodiments, a computer readable storage medium is a tangible component of a digital processing device. In still further embodiments, a computer readable storage medium is optionally removable from a digital processing device. In some embodiments, a computer readable storage medium includes, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, solid state memory, magnetic disk drives, magnetic tape drives, optical disk drives, cloud computing systems and services, and the like. In some cases, the program and instructions are permanently, substantially permanently, semi-permanently, or non-transitorily encoded on the media.
The platforms, systems, media, and methods disclosed herein may include at least one computer program, or use of the same. A computer program includes a sequence of instructions, executable in the digital processing device's CPU, written to perform a specified task. Computer readable instructions may be implemented as program modules, such as functions, objects, Application Programming Interfaces (APIs), data structures, and the like, that perform particular tasks or implement particular abstract data types. In light of the disclosure provided herein, those of skill in the art will recognize that a computer program may be written in various versions of various languages.
The functionality of the computer readable instructions may be combined or distributed as desired in various environments. In some embodiments, a computer program comprises one sequence of instructions. In some embodiments, a computer program comprises a plurality of sequences of instructions. In some embodiments, a computer program is provided from one location. In other embodiments, a computer program is provided from a plurality of locations. In various embodiments, a computer program includes one or more software modules. In various embodiments, a computer program includes, in part or in whole, one or more web applications, one or more mobile applications, one or more standalone applications, one or more web browser plug-ins, extensions, add-ins, or add-ons, or combinations thereof.
A computer program described herein may include a web application. In light of the disclosure provided herein, those of skill in the art will recognize that a web application, in various embodiments, utilizes one or more software frameworks and one or more database systems. In some embodiments, a web application is created upon a software framework such as Microsoft®.NET or Ruby on Rails (RoR). In some embodiments, a web application utilizes one or more database systems including, by way of non-limiting examples, relational, non-relational, object oriented, associative, and XML database systems. In further embodiments, suitable relational database systems include, by way of non-limiting examples, Microsoft®SQL Server, mySQL™, and Oracle®. Those of skill in the art will also recognize that a web application, in various embodiments, is written in one or more versions of one or more languages. A web application may be written in one or more markup languages, presentation definition languages, client-side scripting languages, server-side coding languages, database query languages, or combinations thereof. In some embodiments, a web application is written to some extent in a markup language such as Hypertext Markup Language (HTML), Extensible Hypertext Markup Language (XHTML), or eXtensible Markup Language (XML). In some embodiments, a web application is written to some extent in a presentation definition language such as Cascading Style Sheets (CSS). In some embodiments, a web application is written to some extent in a client-side scripting language such as Asynchronous Javascript and XML (AJAX), Flash® Actionscript, Javascript, or Silverlight. In some embodiments, a web application is written to some extent in a server-side coding language such as Active Server Pages (ASP), ColdFusion, Perl, Java™, JavaServer Pages (JSP), Hypertext Preprocessor (PHP), Python™, Ruby, Tcl, Smalltalk, WebDNA®, or Groovy. In some embodiments, a web application is written to some extent in a database query language such as Structured Query Language (SQL). In some embodiments, a web application integrates enterprise server products such as IBM® Lotus Domino. In some embodiments, a web application includes a media player element. In various further embodiments, a media player element utilizes one or more of many suitable multimedia technologies including, by way of non-limiting examples, Adobe®Flash®, HTML 5, Apple® QuickTime®, Microsoft® Silverlight®, Java™, and Unity®.
Referring to
Referring to
A computer program described herein may include a mobile application provided to a mobile digital processing device. In some embodiments, the mobile application is provided to a mobile digital processing device at the time it is manufactured. In other embodiments, the mobile application is provided to a mobile digital processing device via the computer network described herein.
In view of the disclosure provided herein, a mobile application is created by techniques known to those of skill in the art using hardware, languages, and development environments known to the art. Those of skill in the art will recognize that mobile applications are written in several languages. Suitable programming languages include, by way of non-limiting examples, C, C++, C#, Objective-C, Java™, Javascript, Pascal, Object Pascal, Python™, Ruby, VB.NET, WML, and XHTML/HTML with or without CSS, or combinations thereof.
Suitable mobile application development environments are available from several sources. Commercially available development environments include, by way of non-limiting examples, AirplaySDK, alcheMo, Appcelerator®, Celsius, Bedrock, Flash Lite, .NET Compact Framework, Rhomobile, and WorkLight Mobile Platform. Other development environments are available without cost including, by way of non-limiting examples, Lazarus, MobiFlex, MoSync, and Phonegap. Also, mobile device manufacturers distribute software developer kits including, by way of non-limiting examples, iPhone and iPad (iOS) SDK, Android™ SDK, BlackBerry® SDK, BREW SDK, Palm® OS SDK, Symbian SDK, webOS SDK, and Windows® Mobile SDK.
Those of skill in the art will recognize that several commercial forums are available for distribution of mobile applications including, by way of non-limiting examples, Apple® App Store, Google® Play, Chrome Web Store, BlackBerry® App World, App Store for Palm devices, App Catalog for webOS, Windows® Marketplace for Mobile, Ovi Store for Nokia® devices, Samsung® Apps, and Nintendo® DSi Shop.
A computer program described herein may include a standalone application, which is a program that is run as an independent computer process, not an add-on to an existing process, e.g., not a plug-in. Those of skill in the art will recognize that standalone applications are often compiled. A compiler is a computer program(s) that transforms source code written in a programming language into binary object code such as assembly language or machine code. Suitable compiled programming languages include, by way of non-limiting examples, C, C++, Objective-C, COBOL, Delphi, Eiffel, Java™, Lisp, Python™, Visual Basic, and VB .NET, or combinations thereof. Compilation is often performed, at least in part, to create an executable program. In some embodiments, a computer program includes one or more executable complied applications.
The platforms, systems, media, and methods disclosed herein may include software, server, and/or database modules, or use of the same. In view of the disclosure provided herein, software modules are created by techniques known to those of skill in the art using machines, software, and languages known to the art. The software modules disclosed herein are implemented in a multitude of ways. In various embodiments, a software module comprises a file, a section of code, a programming object, a programming structure, or combinations thereof. In further various embodiments, a software module comprises a plurality of files, a plurality of sections of code, a plurality of programming objects, a plurality of programming structures, or combinations thereof. In various embodiments, the one or more software modules comprise, by way of non-limiting examples, a web application, a mobile application, and a standalone application. Software modules can be in one computer program or application, or more than one computer program or application. Software modules can be hosted on one machine, or on more than one machine. In further embodiments, software modules are hosted on cloud computing platforms. Software modules can be hosted on one or more machines in one location, one or more machines in more than one location.
The platforms, systems, media, and methods disclosed herein may include one or more databases, or use of the same. In view of the disclosure provided herein, those of skill in the art will recognize that many databases are suitable for storage and retrieval of information, such as: genomic data, which may comprise information on genomic sequence variants in a.vcf format or other format; phenotypic data which may comprise physiological measurements, dimensional measurements, health or family histories; and clinical measurements and/or diagnoses which may comprise disease diagnoses or measurements made in a clinical setting. In various embodiments, suitable databases include, by way of non-limiting examples, relational databases, non-relational databases, object oriented databases, object databases, entity-relationship model databases, associative databases, and XML databases. Further non-limiting examples include SQL, PostgreSQL, MySQL, Oracle, DB2, and Sybase. In some embodiments, a database is internet-based. In further embodiments, a database is web-based. In still further embodiments, a database is cloud computing-based. In other embodiments, a database is based on one or more local computer storage devices.
The following illustrative examples are representative of embodiments of the software applications, systems, and methods described herein and are not meant to be limiting in any way.
Integrating Genotype and Phenotype Information Uncovers Undiagnosed Conditions and Risk Factors
We enrolled 209 study participants, median age 55 yrs., range 20-98 yrs., 34.5% female, between September 10, 2015 and May 16, 2016. Twenty-one (10%) of the 209 participants were from 7 families. Selected characteristics comparing study participants to age and gender-adjusted NHANES cohort, a US population-based sample, is shown in Table 1. Routine clinical laboratory testing was obtained on 90 study participants (43%); Quantose IR (including fasting blood glucose) was obtained on 208 participants and included fasting blood glucose. Magnetic resonance imaging-based quantitative body compartment-specific fat and muscle estimation was conducted on 126 participants (60%). Some portion of the intended 2-week cardiac rhythm monitoring was completed on 140 (67%) participants; the median duration of monitoring was 5.9 days (range 0.8-14 days) (
We identified seventeen study participants (8%) with evidence of age-related chronic diseases considered significant and highly actionable requiring prompt medical attention following confirmation of screening findings: four early stage neoplasias (thymoma, renal cell carcinoma, and two high grade prostate neoplasms), one enlarged aortic root, two newly recognized atrial fibrillation cases, two medically significant arrhythmias, one 3rd degree heart block, one primary biliary cholangitis, and one xanthinuria (see Table 2). Some individuals had no detectable genetic risk emphasizing the value of phenotyping technology.
Table 3 lists the pathogenic associations of genomic variants. 52 (25%, 1:4) participants had likely mechanistic genotype-phenotype associations (
We identified 164 (78%, >3:4) participants with evidence of age-related chronic disease or risk factors. One-hundred-and-eighteen study participants (56%) had evidence of diabetes or risk for diabetes: 15 (7%) had type 2 diabetes; 80 (38%) had pre-diabetes (38%), and 23 (11%) had insulin resistance (based on Quantose IR). Only 19 (16%) reported a history of type 2 diabetes or pre- diabetes (Table 1). One-hundred-and-twenty-four participants (59%) had evidence of atherosclerotic disease or risk. Thirty-three (16%) had evidence of metabolic syndrome. Twenty-eight participants (13%) met a screening definition for non-alcoholic fatty liver disease (NAFLD), and one had suspected non-alcoholic steatohepatitis (NASH). Many participants had multiple over-lapping conditions including: 29 with pre-diabetes and atherosclerotic disease or risk; 19 with pre-diabetes, atherosclerotic disease or risk, and metabolic syndrome and; 13 with insulin resistance and atherosclerotic disease or risk (
We identified 10 unique alleles in 14 subjects with metabolic signatures consistent with penetrance. Metabolic pathways impacted by the allelic differences included fatty acid beta oxidation, fatty acid synthesis, urea cycle, and signatures associated with oxidative stress. Strong metabolic signatures were observed for two polymorphisms matching the genes' function. Two heterozygous ACADS variants, c.1510G>A and c.1030C>T, coding for the short-chain acyl- Coenzyme A dehydrogenase (SCAD) were detected in one case. In another case, the heterozygous ACADM variant c.1456C>T coding for medium-chain acyl-Coenzyme A dehydrogenase (MCAD) was detected and interestingly both enzymes participate in fatty acid beta-oxidation by reducing different fatty acid chain length. SCAD specifically acts on the short chain fatty acid butyryl-CoA and MCAD reduces acyl-CoA chains containing 6-12 carbons. In the absence of SCAD activity, byproducts of butyryl-CoA including butyrycarnitine and ethylmalonate accumulate. Greatly elevated levels of butyrylcarnitine and ethylmalonate (Z-scores above 97.5th percentile) were observed in the plasma suggestive of combined metabolic penetrance of these variants. Moreover, greatly elevated medium chain acyl-carnitines, hexanoylcarnitine, octanoylcarnitine and decanoylcarnitine (Z-scores above 97.5th percentile) were detected suggestive of reduced MCAD activity. Large genome-wide association studies combined with metabolic profiling have previously identified associations between ACADS and MCAD and their respective metabolic substrates lending support to the metabolic penetrance observed on an individual basis in this study. We previously reported on additional metabolomic/genetic variants which are heterozygotes for known recessively inherited disorders. These studies established that “carrier” disease state does not reflect carrier for individual metabolic variation. The number of adult cases of metabolic penetrance will continue to expand using this approach.
Metabolomics analysis also detected xanthinuria in an individual with early onset (20's) recurrent renal stones (6 episodes) as well as the drug effect of xanthine oxidase inhibitors in 3 other individuals. Although hypoxanthine and especially xanthine levels were elevated in both cases, normal urate and elevated orotate and orotidine levels, due to perturbed pyrimidine synthesis , were only observed in individuals taking xanthine oxidase inhibitors (allopurinols) for their gout conditions.
We enrolled active adults >18 years old (without acute illness, activity-limiting unexplained illness or symptoms, or known active cancer) able to come for 6-8 hours of on-site data collection, were able to undergo magnetic resonance imaging without sedation, in the case of women were not pregnant or attempting to become pregnant, and were interested in undergoing a novel precision medicine screening approach for disease risk detection including genomics and other testing, as part of an institutional review board-approved clinical research protocol. Study results were returned to study participants who were encouraged to involve their primary care physicians.
Participants underwent a verbal review of the institutional review board-approved consent (Western Institutional Review Board) and were given time to ask and receive answers to questions during a one-half to one-hour sessions conducted by health professionals. Study participants underwent standardized activities related to data collection and return of results in pre-visit, visit, and post-visit phases during a 1-year study period.
Selected data were collected regarding past medical and family history, risk factors, and medical symptoms prior to or during study participant visit. Participants were instructed to stop taking supplements for 72 hours, and to fast after dinner the night before their morning appointment. On the day of visit, blood was obtained for whole genome sequencing (Human Longevity, Inc.), global metabolomics and QUANTOSE™ IR (Metabolon), and routine clinical laboratory tests (LabCorp Inc.™) Two-week cardiac rhythm monitoring (Zio XT Patch™, iRhythm Technologies, Inc.™) kits were provided with instructions for use, or monitoring was initiated during visit. Height, weight, and sitting blood pressure were obtained. Genomic variants were annotated using integrated public and proprietary annotation sources in the HLI Knowledgebase including ClinVar, and HGMD (Qiagen). Monogenic rare variants were classified as pathogenic (P), likely pathogenic (LP), or variant of uncertain significance (VUS). The HLI Knowledgebase integrates allele frequencies for variants derived from HU's database of >12,000 sequences and provides a platform for query of these variants with annotation data.
To identify potentially medically significant rare monogenic variants we used an internal version (release 0.27) of HLI Search™ in a two-step process: the first step focused on allele frequency <1% in the HLI cohort with annotation using ClinVar and HGMD as well as predicted loss of function variants; the second step focused on participant-specific phenotype-driven queries using an allele frequency of <1% based on family and individual medical history as well as abnormal clinical testing results. Global metabolic profiling was performed using ultrahigh performance liquid- phase chromatography separation coupled with tandem mass spectrometry to assess the metabolic penetrance of the variants in these subject. Z-scores were calculated for all metabolites in each subject against a reference cohort consisting of 42 fasted subjects of normal health, and metabolites with Z-scores below the 2.5th or above the 97.5th percentiles of the reference cohort were considered to be potentially indicative of metabolic abnormalities that warranted further investigation. Integration of metabolomic and gene sequence data was achieved by a proprietary pathway analysis program developed by Metabolon and HLI.
Study participants underwent whole body magnetic resonance imaging (GE Discovery MR750w 3.0T) in research mode using protocols and post-processing for volumetric brain imaging (Neuroquant™, CorTechs Laboratories™), cancer detection (using restriction spectrum imaging), neurovascular and cardiovascular visualization, liver-specific fat and iron estimation, and quantitative body compartment-specific fat and muscle estimation (AMRA); other post-processing was done by MMIS. GE Lunar iDXA with Pro Package was used for skeletal and metabolic health assessment. Magnetic resonance imaging and iDXA. GE Vivid E95 was used for echocardiography and a GE Mac 2000 was used to obtain a 12-lead resting electrocardiogram. Two-week cardiac monitoring, electrocardiogram, and echocardiography were interpreted by a physician. Participants with likely mechanistic genomic findings correlating with clinical data were identified by expert review to identify convergent genomic and clinical (or phenotype) data relationships including at least two clinical (or phenotype) data elements supporting a genomic observation, including three generation family history and metabolite level correlation based on pathway mapping. Baseline characteristics including reported past medical history for major categories of age-related chronic diseases by study participants were compared to responses from NHANES, a US population-based cohort (Table 1), adjusted for age and sex distributions. Study participants with evidence of age-related chronic diseases considered significant and highly actionable were defined as new genomic and/or other clinical findings which based on current medical practice indicated the need for medical attention to avoid potentially life-threatening consequences immediately or within 30 days from their visit. Participants with evidence of age-related chronic disease or disease risk factors were identified as including: 1) type 2 diabetes, pre-diabetes and insulin resistance (Quantose IR); 2) likely atherosclerotic disease or risk; 3) metabolic syndrome ; 4) non-alcoholic fatty liver disease and non- alcoholic steatohepatitis, based on clinical guidelines or other recent literature. Measured fasting blood glucose, hemoglobin A1C, personal medical history for diabetes, or Quantose IR was used to identify participants as having diabetes, pre-diabetes or insulin resistance. The presence of any of the following were considered to be evidence of likely atherosclerotic disease or risk: “yes” in response to any of the following questions: 1) Ever told you had coronary artery disease, 2) Ever told you had a heart attack, 3) Ever told you had congestive heart failure, 4) Taking prescription for hypertension, and 5) Taking prescription for cholesterol, or if sitting blood pressure >normal, LDL cholesterol>normal, or Lipoprotein-associated phospholipase A2 (Lp-PLA2)>normal. The presence of any three of the following 5 criteria were considered to be evidence of metabolic syndrome: 1) visceral adipose tissue measured by MM (post-processing by AMRA™)>2SD above normal, or android/gynoid fat measured by iDXA >normal; 2) triglycerides>150 mg/dL; 3) HDL cholesterol<40 mg/dL in men and <50 mg/dL in women or the participant is currently taking prescribed medicine for high cholesterol; 4) blood pressure>130/85 mmHg or the participant is currently taking prescription for hypertension; 5) Measured fasting glucose or hemoglobin A1c indicates pre-diabetes or “borderline” in response to the question - Doctor told you have diabetes. The presence of non-alcoholic fatty liver disease or nonalcoholic steatohepatitis were considered likely if: for non-alcoholic fatty liver disease MM-based estimate liver fat was <4% and did not have alcohol dependence, and for these individuals we used a formula including other demographic and laboratory data to identify likely non-alcoholic steatohepatitis.
While the preferred embodiments have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the embodiments disclosed herein. It should be understood that various alternatives to the embodiments described herein may be employed depending on the specific implementations.
This application claims the benefit of priority under 35 U.S.C. § 119 from U.S. Provisional Patent Application 62/500,426 filed May 2, 2017. The disclosure of the above identified application is incorporated herein by reference in its entirety as if set forth in full.
Number | Date | Country | |
---|---|---|---|
62500426 | May 2017 | US |